Difference between revisions of "Resources for Arabic"

From ACL Wiki
Jump to navigation Jump to search
(→‎Proprietary: some details added)
Line 17: Line 17:
 
==Corpora==
 
==Corpora==
 
===Proprietary===
 
===Proprietary===
*[http://www.ldc.upenn.edu/Catalog/LDC2001T55.html Arabic Newswire Part 1]
+
*[http://www.ldc.upenn.edu/Catalog/LDC2001T55.html Arabic Newswire Part 1], 76 million tokens, annotation: paragraphs
  
 
===Free/open licence===
 
===Free/open licence===

Revision as of 14:55, 5 October 2011

Morphology

Free software

  • AraMorph - Perl - An Arabic morphological analyzer and part-of-speech tagger written in Perl (originally by Tim Buckwalter)
  • AraMorph - Java - An Arabic morphological analyzer and part-of-speech tagger rewritten in Java for Lucene

Proprietary

Parsers

Free software

Corpora

Proprietary

Free/open licence

Bibliography

External links