Resources for Portugese: Difference between revisions

From ACL Wiki
Jump to navigation Jump to search
added sections; +Europarl corpus
Zeman (talk | contribs)
HamleDT
Line 2: Line 2:
==Corpora==
==Corpora==
* [http://www.statmt.org/europarl Europarl corpus], sentence aligned with English
* [http://www.statmt.org/europarl Europarl corpus], sentence aligned with English
 
* [http://ufal.mff.cuni.cz/hamledt HamleDT], harmonized dependency treebanks of many languages, common annotation style.


==Software==
==Software==

Revision as of 15:50, 26 May 2014

Corpora

  • Europarl corpus, sentence aligned with English
  • HamleDT, harmonized dependency treebanks of many languages, common annotation style.

Software

  • CEPRIL - Portugese Segmenter
  • Corpógrafo - a Web-based environment for corpora research