Resources for Japanese: Difference between revisions

From ACL Wiki
Jump to navigation Jump to search
Bond (talk | contribs)
Free/Open Licence: Added wordnet
Bond (talk | contribs)
Free/Open Licence: added Kyoto University and NTT Blog Corpus
Line 4: Line 4:


===Free/Open Licence===
===Free/Open Licence===
====Multilingual====
* [http://www.edrdg.org/projects/tanaka/tanakacorpus.html Tanaka Corpus] by Jim Breen, under a CC-BY-SA 3.0 licence
* [http://www.edrdg.org/projects/tanaka/tanakacorpus.html Tanaka Corpus] by Jim Breen, under a CC-BY-SA 3.0 licence
** [http://tatoeba.org/eng/home Tatoeba] Updated version of the Tanaka Corpus;  ≈150,000 sentence pairs  (CC-BY)
** [http://tatoeba.org/eng/home Tatoeba] Updated version of the Tanaka Corpus;  ≈150,000 sentence pairs  (CC-BY)
Line 10: Line 11:
* [http://mastarpj.nict.go.jp/~mutiyama/align/index.html English-Japanese Translation Alignment Data]  aligned by [http://mastarpj.nict.go.jp/~mutiyama/ Masao Utiyama] (GFDL, CC-by-nc 1.0)
* [http://mastarpj.nict.go.jp/~mutiyama/align/index.html English-Japanese Translation Alignment Data]  aligned by [http://mastarpj.nict.go.jp/~mutiyama/ Masao Utiyama] (GFDL, CC-by-nc 1.0)
* [http://nlpwww.nict.go.jp/wn-ja/index.en.html WordNet Definitions and Glosses]  ≈180,000 sentence/phrase pairs (WordNet license, similar to BSD)
* [http://nlpwww.nict.go.jp/wn-ja/index.en.html WordNet Definitions and Glosses]  ≈180,000 sentence/phrase pairs (WordNet license, similar to BSD)
====Monolingual====
* [http://www-lab25.kuee.kyoto-u.ac.jp/NLP_Portal/lr-cat-e.html#jp:knb_corpus Kyoto University and NTT Blog Corpus]


== Grammars ==
== Grammars ==

Revision as of 00:54, 4 May 2011

Corpora

Proprietary

Free/Open Licence

Multilingual

Monolingual

Grammars

Free/Open Licence

Unknown licence

Dictionaries

Free/Open Licence

Unknown licence