<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://www.aclweb.org/aclwiki/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Alishir</id>
	<title>ACL Wiki - User contributions [en]</title>
	<link rel="self" type="application/atom+xml" href="https://www.aclweb.org/aclwiki/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Alishir"/>
	<link rel="alternate" type="text/html" href="https://www.aclweb.org/aclwiki/Special:Contributions/Alishir"/>
	<updated>2026-04-30T12:44:58Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.43.6</generator>
	<entry>
		<id>https://www.aclweb.org/aclwiki/index.php?title=Resources_for_Arabic&amp;diff=11350</id>
		<title>Resources for Arabic</title>
		<link rel="alternate" type="text/html" href="https://www.aclweb.org/aclwiki/index.php?title=Resources_for_Arabic&amp;diff=11350"/>
		<updated>2016-01-09T19:11:39Z</updated>

		<summary type="html">&lt;p&gt;Alishir: /* Free software */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Morphology==&lt;br /&gt;
&lt;br /&gt;
===Free software===&lt;br /&gt;
*[https://sourceforge.net/projects/aramorph/ AraMorph - Perl] - An Arabic morphological analyzer and part-of-speech tagger written in Perl (originally by Tim Buckwalter)&lt;br /&gt;
*[http://www.nongnu.org/aramorph/ AraMorph - Java] - An Arabic morphological analyzer and part-of-speech tagger rewritten in Java for [http://lucene.apache.org/ Lucene]&lt;br /&gt;
*[http://sourceforge.net/projects/aracomlex/ AraComLex] - An open source finite state morphology for Modern Standard Arabic. The source files can be compiled by the open source compiler, foma, or Xerox xfst.&lt;br /&gt;
&lt;br /&gt;
===Proprietary===&lt;br /&gt;
*[http://www.arabic-morphology.com Xerox Arabic Morphological Analyzer and Generator]&lt;br /&gt;
&lt;br /&gt;
==WordNets==&lt;br /&gt;
&lt;br /&gt;
===Free software===&lt;br /&gt;
* http://compling.hss.ntu.edu.sg/omw/ Hebrew Wordnet with links to all the other Open Multilingual Wordnets&lt;br /&gt;
&lt;br /&gt;
===Proprietary===&lt;br /&gt;
* http://babelnet.org/ (available for download for &amp;quot;Non-Commercial&amp;quot; use)&lt;br /&gt;
&lt;br /&gt;
==Parsers==&lt;br /&gt;
===Free software===&lt;br /&gt;
* [http://www.cis.upenn.edu/~dbikel/software.html#stat-parser Bikel&#039;s implementation of Collins Parser] by [http://www.cis.upenn.edu/~dbikel/ Dan Bikel].&lt;br /&gt;
* [http://www.ling.ohio-state.edu/~jonsafari/arabiclg/arabiclg.20060829.tar.bz2 Arabic dictionaries], by [http://www.ling.ohio-state.edu/~jonsafari/ Jon Dehdari], for the [http://www.abisource.com/projects/link-grammar/ Link-Grammar parser]. These require the Aramorph stemming package, above. &lt;br /&gt;
* [https://sourceforge.net/apps/trac/elixir-fm/wiki ElixirFM] ([http://quest.ms.mff.cuni.cz/cgi-bin/elixir/index.fcgi online interface here]) is a Functional Arabic Morphology written in Haskell and Perl; the lexicon is a &amp;quot;re-processed&amp;quot; version of the Buckwalter analyser.&lt;br /&gt;
* [http://sourceforge.net/projects/sarf Sarf] - Arabic Morphology System (all in Java)&lt;br /&gt;
&lt;br /&gt;
==Corpora==&lt;br /&gt;
===Proprietary===&lt;br /&gt;
*[http://www.ldc.upenn.edu/Catalog/LDC2001T55.html Arabic Newswire Part 1], 76 million tokens, annotation: paragraphs&lt;br /&gt;
&lt;br /&gt;
===Free/open licence===&lt;br /&gt;
* [http://github.com/anastaw/Meedan-Memory Meedan-Memory], Arabic-English TMX (sentence-aligned), ~467,000 words on the English side, [http://www.opendatacommons.org/licenses/odbl/ Open Database Licence]&lt;br /&gt;
* [http://quran.uk.net/ Quranic Arabic Corpus], 77,430 words of Quranic Arabic, with manually verified contextual POS, inflection, derivation; [[dependency grammar]] annotation is planned.&lt;br /&gt;
* [http://www1.ccls.columbia.edu/~ybenajiba/downloads.html Arabic NER corpora] by [http://www1.ccls.columbia.edu/~ybenajiba/ Yassine Benajiba], 150,000+ words.&lt;br /&gt;
* [http://www.euromatrixplus.net/multi-un/ UN parallel corpora]&lt;br /&gt;
* [http://ufal.mff.cuni.cz/hamledt HamleDT], harmonized dependency treebanks of many languages, common annotation style.&lt;br /&gt;
&lt;br /&gt;
==Bibliography==&lt;br /&gt;
&lt;br /&gt;
==External links==&lt;br /&gt;
*[http://www.elsnet.org/acl2001-arabic.html ACL/EACL 2001 Workshop on Arabic NLP]&lt;br /&gt;
*[http://www1.cs.columbia.edu/~mdiab/software/ASVMTools_2.0.tar.gz Basic Arabic Processing Tools]&lt;br /&gt;
*[http://acl.ldc.upenn.edu/coling2004/W5/index.html COLING 2004 Workshop on computational approaches to Arabic script-based languages]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Category:Resources by language|Arabic]]&lt;/div&gt;</summary>
		<author><name>Alishir</name></author>
	</entry>
</feed>