Difference between revisions of "Resources for Chinese"

From ACL Wiki
(Added: Araneum)
m (Move * [http://pears.lib.ohio-state.edu/China/linguist.html Chinese Linguistics] (broken link) from Uncategorized resource to Resources for Chinese)
 
Line 20:
  * [http://www.ling.lancs.ac.uk/corplang/lcmc/ Lancaster Corpus of Mandarin Chinese]
  * [http://corpus.leeds.ac.uk/query-zh.html A collection of Chinese corpora and frequency lists]  Online query with three corpora
+ * [http://pears.lib.ohio-state.edu/China/linguist.html Chinese Linguistics]

  [[Category:Resources by language|Chinese]]

Latest revision as of 17:42, 2 September 2019

Tools

Free software

  • rseg: word segmentation; written in Ruby (no compilation, no hard dependencies apart from Ruby), comes with a model (MIT license)
  • ctbparser: word segmentation, POS tagging, NER, and dependency parsing, all using Conditional Random Fields; written in C++ (LGPL license)
  • ZPar: word segmentation, POS tagging, and CFG/dependency/CCG parsing of Chinese and English; written in C++ (GPL3 license)
  • DuDuPlus: a graph-based dependency parser for English and Chinese ("Other Open Source" license?)
    • where is the source code?

Corpora

Free license

Nonfree or Unknown license