Difference between revisions of "Resources for Chinese"

From ACL Wiki
Jump to navigation Jump to search
Line 1: Line 1:
 
==Tools==
 
==Tools==
 
===Free software===
 
===Free software===
* [https://github.com/yzhang/rseg rseg] word segmentation, in ruby (no compilation, no hard dependencies apart from ruby), comes with a model (MIT license)
+
* [https://github.com/yzhang/rseg rseg] word segmentation; written in ruby (no compilation, no hard dependencies apart from ruby), comes with a model (MIT license)
 
* [https://code.google.com/p/ctbparser/ ctbparser] word segmentation, POS tagging, NER, dependency parsing, all using Conditional Random Fields; written in C++ (LGPL license)
 
* [https://code.google.com/p/ctbparser/ ctbparser] word segmentation, POS tagging, NER, dependency parsing, all using Conditional Random Fields; written in C++ (LGPL license)
 
* [http://www.cl.cam.ac.uk/~yz360/zpar.html ZPar] word segmentation, POS tagging, CFG/dep/CCG parsing of Chinese and English; written in C++ (GPL3 license)
 
* [http://www.cl.cam.ac.uk/~yz360/zpar.html ZPar] word segmentation, POS tagging, CFG/dep/CCG parsing of Chinese and English; written in C++ (GPL3 license)

Revision as of 10:02, 6 December 2012

Tools

Free software

  • rseg word segmentation; written in ruby (no compilation, no hard dependencies apart from ruby), comes with a model (MIT license)
  • ctbparser word segmentation, POS tagging, NER, dependency parsing, all using Conditional Random Fields; written in C++ (LGPL license)
  • ZPar word segmentation, POS tagging, CFG/dep/CCG parsing of Chinese and English; written in C++ (GPL3 license)
  • DuDuPlus: a graph-based dependency parser for English and Chinese ("Other Open Source" license?)
    • where is the source code?

Data

Unknown license