Difference between revisions of "Resources for Dutch"

From ACL Wiki
(→‎Corpora: +Europarl corpus)
(Added: Araneum)
(2 intermediate revisions by 2 users not shown)
Line 1:
  == Corpora ==
+ * [http://ucts.uniba.sk/aranea_about/ Araneum Nederlandicum], Gigaword Dutch web corpus
  * [http://corpora.informatik.uni-leipzig.de/ Dutch Plain text and Co-occurrences at LCC]
- * [http://www.let.rug.nl/~vannoord/alp/Alpino/ Dutch HPSG-based parser. Includes the Alpino treebank (7137 sentences, newspaper, manually corrected).]
+ * [http://www.statmt.org/europarl Europarl corpus] - sentence-aligned with English
- * [http://www.statmt.org/europarl Europarl corpus], sentence aligned with English
+ * [http://www.clips.uantwerpen.be/datasets/csi-corpus CLiPS Stylometry Investigation (CSI) corpus] - multi-purpose text corpus, main use in stylometry
+ * [http://ufal.mff.cuni.cz/hamledt HamleDT], harmonized dependency treebanks of many languages, common annotation style.
+
+ == Tools ==
+ * [http://www.let.rug.nl/~vannoord/alp/Alpino/ Dutch HPSG-based parser] Includes the Alpino treebank (7137 sentences, newspaper, manually corrected)

  == Grammars ==

Revision as of 13:12, 8 March 2015

Corpora

Tools

Grammars