Difference between revisions of "Resources for Japanese"

From ACL Wiki
Jump to navigation Jump to search
(One intermediate revision by one other user not shown)
Line 5: Line 5:
 
* [http://corpora.informatik.uni-leipzig.de/ Japanese plain text and Co-occurrences at LCC] (downloadable and web-searchable, but only for non-commercial use)
 
* [http://corpora.informatik.uni-leipzig.de/ Japanese plain text and Co-occurrences at LCC] (downloadable and web-searchable, but only for non-commercial use)
 
* [http://www.ninjal.ac.jp/english/products/bccwj/ Balanced Corpus of Contemporary Written Japanese (BCCWJ)] (subset is web searchable at Kotonoha)
 
* [http://www.ninjal.ac.jp/english/products/bccwj/ Balanced Corpus of Contemporary Written Japanese (BCCWJ)] (subset is web searchable at Kotonoha)
 +
* [http://ufal.mff.cuni.cz/hamledt HamleDT], harmonized dependency treebanks of many languages, common annotation style.
  
 
===Free/Open Licence===
 
===Free/Open Licence===
Line 29: Line 30:
 
==Dictionaries==
 
==Dictionaries==
 
===Free/Open Licence===
 
===Free/Open Licence===
* [http://www.csse.monash.edu.au/~jwb/edict.html EDICT] Japanese-English dictionary, by Jim Breen, (CC-BY-SA 3.0 licence)
+
* [http://www.edrdg.org/jmdict/edict_doc.html JMdict/EDICT] Japanese-English and Japanese-Multilanguage dictionary in text and XML formats, by EDRDG (Electronic Dictionary R&D Group) - 170,000 entries, (CC-BY-SA 3.0 licence)
* [http://www.csse.monash.edu.au/~jwb/enamdict_doc.html ENAMDICT/JMnedict] proper name dictionary, by Jim Breen, (CC-BY-SA 3.0 licence)
+
* [http://www.edrdg.org/enamdict/enamdict_doc.html ENAMDICT/JMnedict] proper name dictionary in text and XML formats - 740,000 entries, by EDRDG (Electronic Dictionary R&D Group), (CC-BY-SA 3.0 licence)
 
* [http://nlpwww.nict.go.jp/wn-ja/index.en.html Japanese version of WordNet] by NICT, (WordNet license, like BSD)
 
* [http://nlpwww.nict.go.jp/wn-ja/index.en.html Japanese version of WordNet] by NICT, (WordNet license, like BSD)
 +
* [http://www.edrdg.org/kanjidic/kanjidic.html Kanjidic]/[http://www.edrdg.org/kanjidic/kanjd2index.html Kanjidic2] Kanji dictionaries in text and XML formats covering about 13,000 characters, by EDRDG (Electronic Dictionary R&D Group),  (CC-BY-SA 3.0 licence)
  
 
===Unknown licence===
 
===Unknown licence===

Revision as of 17:45, 5 September 2014

There is a very good list at JAIST: Catalogue of Language Resources and Tools in Japan

Corpora

Proprietary

Free/Open Licence

Multilingual

Monolingual

Grammars

Free/Open Licence

Unknown licence

Dictionaries

Free/Open Licence

  • JMdict/EDICT Japanese-English and Japanese-Multilanguage dictionary in text and XML formats, by EDRDG (Electronic Dictionary R&D Group) - 170,000 entries, (CC-BY-SA 3.0 licence)
  • ENAMDICT/JMnedict proper name dictionary in text and XML formats - 740,000 entries, by EDRDG (Electronic Dictionary R&D Group), (CC-BY-SA 3.0 licence)
  • Japanese version of WordNet by NICT, (WordNet license, like BSD)
  • Kanjidic/Kanjidic2 Kanji dictionaries in text and XML formats covering about 13,000 characters, by EDRDG (Electronic Dictionary R&D Group), (CC-BY-SA 3.0 licence)

Unknown licence