语料库资源

The Babel parallel corpus can be accessed via the ParaConc Web or MySql interface (both hosted at The Institute of Education, Singapore). Users can search in either English or Chinese texts. The concordancer returns matched whole sentences and their translations as well as the their locations. At the bottom of the resulting concordance page is a query report that indicate the query strings and distribution of matches. Users can also specify the format the output concordances as POS-tagged or plain texts.

–上海交通大学语言工程研究所目前有JDEST,LOB,BROWN,CLEC四个语料库共计700万词可供网上检索, 并可以对检索和统计数据结果下载.

http://corpus.sjtu.edu.cn/WebCast/

–The Translational English Corpus (TEC)

http://www.llc.manchester.ac.uk/ctis/research/english-corpus/

http://ronaldo.cs.tcd.ie/tec/jnlp/

–English Chinese Parallel Concordancer (E-C Concord)

The Hong Kong Institute of Education.

Project leader: Dr. Wang Lixun. Program designers: Chris Greaves, Wang Lixun

http://ec-concord.ied.edu.hk/paraconc/index.htm

–Academia Sinica Balanced Corpus of Modern Chinese 中央研究院现代汉语平衡语料库

http://www.sinica.edu.tw/SinicaCorpus/

–Lancaster Corpus of Mandarin Chinese

http://bowland-files.lancs.ac.uk/cor…gi-bin/conc.pl

may be changed to

http://www.lancs.ac.uk/fass/projects/corpus/LCMC/

–People’s Daily 2000 corpus

some related information here

http://www.lancs.ac.uk/fass/projects/corpus/pdc2000/default.htm

–A Parallel Corpus of Chinese Legal Texts 中國法律文件漢英平行語料庫

http://score.crpp.nie.edu.sg/law/index.htm

–语料库语言学与英语教育教学。华南师范大学外国语言文化学院语料库语言学研究室

http://sfs.scnu.edu.cn/corpus4u/default.aspx

语料库工具

http://www.fleric.org.cn/crg/tools.htm 北外语料库语言学沙龙

Beiwai Corpus Research Group (CRG)

Corpus tools developed by group members

Chi-square and loglikelihood Calculator, (卡方检验和对数似然率计算工具)

TreeTagger for Windows, (语料库词性标注工具TreeTagger的Windows界面)
Colligator 1.0 & 2.0, (语料库类联接分析工具)
PatternBuilder 1.0, (赋码语料库检索辅助工具)
The Edinburgh Associative Thesaurus (EAT) for Windows,(爱丁堡联想词库Windows查询工具)
Wordlist Tools 1.0 Beta,(词表分析工具)
My Good Old Blackboard,(我的电子黑板)
BFSU Stanford Parser 1.0,(英文自动句法分析工具)。
BFSU Stanford POS Tagger 1.0,(英文自动词性赋码工具)。
BFSU Sentence Collector 1.0,(例句提取工具)。
BFSU NewWord Marker 1.0,(生词标注工具)。
BFSU Sentence Segmenter 1.0,(英文自动分句工具)。
Web Colligator。
Collocator 1.0: A collocation extraction tool,(搭配分析工具)。
Log-likelihood ratio calculator,(对数似然率计算器)。
Readability Analyzer 1.0 ,(英文文本可读性分析工具)。

Other free corpus tools
AntConc: A free concordancer(跟WordSmith主要功能接近的语料库索引工具)
Range: Vocabulary coverage tools(基于底表的分级词汇测量工具)

语料库检索软件Paraconc 和Multiconcord:

Paraconc由Barlow制作(该软件的演示版可在网上下载),该软件的特点是可以进行多达四种语言的同时检索,或者是一个原文的三个译本的检索。这一软件的特点是可以灵活定义语言、索引行的大小、标注符号的隐显,而且支持通配符检索。