<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://www.aclweb.org/aclwiki/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Mightyscience</id>
	<title>ACL Wiki - User contributions [en]</title>
	<link rel="self" type="application/atom+xml" href="https://www.aclweb.org/aclwiki/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Mightyscience"/>
	<link rel="alternate" type="text/html" href="https://www.aclweb.org/aclwiki/Special:Contributions/Mightyscience"/>
	<updated>2026-04-24T02:46:37Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.43.6</generator>
	<entry>
		<id>https://www.aclweb.org/aclwiki/index.php?title=Resources_for_Persian&amp;diff=11418</id>
		<title>Resources for Persian</title>
		<link rel="alternate" type="text/html" href="https://www.aclweb.org/aclwiki/index.php?title=Resources_for_Persian&amp;diff=11418"/>
		<updated>2016-02-23T15:58:11Z</updated>

		<summary type="html">&lt;p&gt;Mightyscience: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Corpora ==&lt;br /&gt;
===Free===&lt;br /&gt;
*[http://www.ling.ohio-state.edu/~jonsafari/corpora VOA Persian Corpus 2003-2008] (public domain)&lt;br /&gt;
*[https://www.clarin.si/repository/xmlui/handle/11356/1042 Orwell&#039;s 1984 Corpus in MULTEXT-EAST] (public domain)&lt;br /&gt;
&lt;br /&gt;
===Proprietary===&lt;br /&gt;
&amp;lt;!-- Please keep this list in alphabetical order --&amp;gt;&lt;br /&gt;
*[http://ece.ut.ac.ir/DBRG/Bijankhan/ Bijankhan corpus] (gratis for research/non-commercial purposes)&lt;br /&gt;
*[http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC96S50 CALLFRIEND Farsi (speech)], LDC&lt;br /&gt;
*[http://ece.ut.ac.ir/dbrg/hamshahri/ Hamshahri corpus] (gratis for research/non-commercial purposes)&lt;br /&gt;
*[http://www.elda.org/catalogue/en/speech/S0112.html Persian speech database Farsdat], ELRA&lt;br /&gt;
&lt;br /&gt;
== Online Concordance Tools ==&lt;br /&gt;
*[http://pars.ie/lr/corpora/run.cgi/corp_info?corpname=multext_east_farsi Orwell&#039;s 1984 Corpus] (public domain)&lt;br /&gt;
==Lexical resources==&lt;br /&gt;
===Free===&lt;br /&gt;
*[http://www.ling.ohio-state.edu/~jonsafari/corpora/wikipedia_fa-en_20120217.txt.xz Persian - English dictionary], derived from Wikipedia article names.  Retains Wikipedia&#039;s CC-BY-SA 3.0 license.&lt;br /&gt;
&lt;br /&gt;
===Proprietary===&lt;br /&gt;
*[http://pwn.ir Persian WordNet]&lt;br /&gt;
&lt;br /&gt;
*[http://catalog.elra.info/product_info.php?products_id=1126 ELRA Persian Lexicon, ISLRN : 547-614-436-004-7]&lt;br /&gt;
&lt;br /&gt;
==Machine translation==&lt;br /&gt;
===Free===&lt;br /&gt;
*[http://ece.ut.ac.ir/node/100869?destination=node%2F100869 Tehran English-Persian Parallel Corpus] by Mohammad Taher Pilevar, NLP Lab, University of Tehran. For research or non-commercial use.&lt;br /&gt;
&lt;br /&gt;
===Proprietary===&lt;br /&gt;
*[http://crl.nmsu.edu/Research/Projects/shiraz/index.html The Shiraz project] (Persian -&amp;gt; English)&lt;br /&gt;
&lt;br /&gt;
==Morphology tools==&lt;br /&gt;
===Free===&lt;br /&gt;
*[http://sourceforge.net/projects/perstem Perstem] - Persian stemmer, light morphological analyzer, and character set converter.&lt;br /&gt;
*[http://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-tg-fa/apertium-tg-fa.fa.dix Morphological dictionary] &amp;amp;mdash; compiled using [[lttoolbox]].&lt;br /&gt;
*[http://stp.lingfil.uu.se/~mojgan/ BLARK by Mojgan Seraji] – normaliser, tokeniser, segmentation, hunpos model for PoS-tagging and (Java) dependency parser, all GPL&lt;br /&gt;
&lt;br /&gt;
==Parsing==&lt;br /&gt;
===Free===&lt;br /&gt;
* [http://ufal.mff.cuni.cz/hamledt HamleDT], harmonized dependency treebanks of many languages, common annotation style.&lt;br /&gt;
* [http://www.ling.ohio-state.edu/~jonsafari/persianlg/ Persian dictionaries] for the [http://www.abisource.com/projects/link-grammar/ Link-Grammar parser]. By [http://www.ling.ohio-state.edu/~jonsafari/ Jon Dehdari]. These require the Perstem stemming package, above.&lt;br /&gt;
* [http://stp.lingfil.uu.se/~mojgan/UPDT.html Uppsala Persian Dependency Treebank], Creative Commons Attribution 3.0 License&lt;br /&gt;
&lt;br /&gt;
===Proprietary===&lt;br /&gt;
*[http://dadegan.ir/en/persiandependencytreebank Dadegan Dependency Treebank] for research purposes only.&lt;br /&gt;
*[http://hpsg.fu-berlin.de/~ghayoomi/PTB.html HPSG Persian Treebank (PerTreeBank)] for academic research purposes only.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Bibliography==&lt;br /&gt;
* Dehdari, Jon, and Deryle Lonsdale. 2008. [http://www.ling.ohio-state.edu/~jonsafari/papers/dehdari_lonsdale_2005.pdf A link grammar parser for Persian]. In Karimi, S., Samiian, V., and Stilo, D., editors, &#039;&#039;Aspects of Iranian Linguistics&#039;&#039;, volume 1. Cambridge Scholars Press. ISBN: 978-18-471-8639-3 ([http://www.ling.ohio-state.edu/~jonsafari/bib/dehdarilonsdale2005.bib.txt BIB])&lt;br /&gt;
&lt;br /&gt;
* QasemiZadeh, Behrang and Saeed Rahimi (2006). [http://pars.ie/publications/papers/pre-prints/persian-in-multext-east.pdf Persian in MULTEXT-East Framework]. FinTAL 2006, pp. 541-551.&lt;br /&gt;
&lt;br /&gt;
*  Feili, H. and G. Ghassem-Sani (2004) &amp;quot;[http://sharif.edu/~sani/papers/Feili_SaniE2.pdf An Application of Lexicalized Grammars in English-Persian Translation]&amp;quot;. &#039;&#039;Proceedings of the 16th European Conference on Artificial Intelligence (ECAI 2004)&#039;&#039;, 24-27 Aug. 2004, Universidad Politecnica de Valencia, Valencia, Spain, pp. 596-600.&lt;br /&gt;
* Megerdoomian, K. (2000) &amp;quot;[http://crl.nmsu.edu/Research/Projects/shiraz/publications/papers/Cicling.pdf Unification-Based Persian Morphology]&amp;quot;. &#039;&#039;Proceedings of CICLing 2000&#039;&#039;, Alexander Gelbukh, Center of Investigation on Computation-IPN, Mexico, 2000.&lt;br /&gt;
* Megerdoomian, K. (2004) &amp;quot;[http://acl.ldc.upenn.edu/coling2004/W5/pdf/W5-7.pdf Finite-State Morphological Analysis of Persian]&amp;quot;. &#039;&#039;COLING 2004 Computational Approaches to Arabic Script-based Languages&#039;&#039;. Ali Farghaly and Karine Megerdoomian, editors, Geneva, Switzerland, 2004, pp. 35-41.&lt;br /&gt;
* Mohammad Amin Farajian (2011). [http://world-comp.org/p2011/ICA4953.pdf PEN: Parallel English-Persian News Corpus]. Proceedings of 2011 International Conference on Artificial Intelligence (ICAI&#039;11), Nevada, USA.&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
*[[Resources for Kurdish]]&lt;br /&gt;
*[[Resources for Tajik]]&lt;br /&gt;
&lt;br /&gt;
==External links==&lt;br /&gt;
*[https://wiki.iranianlinguistics.org/wiki/Main_Page NLP Resources for Persian]&lt;br /&gt;
*[http://www.ling.ohio-state.edu/~jonsafari/persian_nlp.html the Jon safari] (link parser, small lexicon, stemmer, morphological analysis tools)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Category:Resources by language|Persian]]&lt;/div&gt;</summary>
		<author><name>Mightyscience</name></author>
	</entry>
	<entry>
		<id>https://www.aclweb.org/aclwiki/index.php?title=Resources_for_Persian&amp;diff=11415</id>
		<title>Resources for Persian</title>
		<link rel="alternate" type="text/html" href="https://www.aclweb.org/aclwiki/index.php?title=Resources_for_Persian&amp;diff=11415"/>
		<updated>2016-02-20T07:50:15Z</updated>

		<summary type="html">&lt;p&gt;Mightyscience: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Corpora ==&lt;br /&gt;
===Free===&lt;br /&gt;
*[http://www.ling.ohio-state.edu/~jonsafari/corpora VOA Persian Corpus 2003-2008] (public domain)&lt;br /&gt;
*[https://www.clarin.si/repository/xmlui/handle/11356/1042 Orwell&#039;s 1984 Corpus in MULTEXT-EAST] (public domain)&lt;br /&gt;
&lt;br /&gt;
===Proprietary===&lt;br /&gt;
&amp;lt;!-- Please keep this list in alphabetical order --&amp;gt;&lt;br /&gt;
*[http://ece.ut.ac.ir/DBRG/Bijankhan/ Bijankhan corpus] (gratis for research/non-commercial purposes)&lt;br /&gt;
*[http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC96S50 CALLFRIEND Farsi (speech)], LDC&lt;br /&gt;
*[http://ece.ut.ac.ir/dbrg/hamshahri/ Hamshahri corpus] (gratis for research/non-commercial purposes)&lt;br /&gt;
*[http://www.elda.org/catalogue/en/speech/S0112.html Persian speech database Farsdat], ELRA&lt;br /&gt;
&lt;br /&gt;
== Online Concordance Tools ==&lt;br /&gt;
*[http://pars.ie/lr/corpora/run.cgi/corp_info?corpname=multext_east_farsi Orwell&#039;s 1984 Corpus] (public domain)&lt;br /&gt;
==Lexical resources==&lt;br /&gt;
===Free===&lt;br /&gt;
*[http://www.ling.ohio-state.edu/~jonsafari/corpora/wikipedia_fa-en_20120217.txt.xz Persian - English dictionary], derived from Wikipedia article names.  Retains Wikipedia&#039;s CC-BY-SA 3.0 license.&lt;br /&gt;
&lt;br /&gt;
===Proprietary===&lt;br /&gt;
*[http://pwn.ir Persian WordNet]&lt;br /&gt;
&lt;br /&gt;
*[http://catalog.elra.info/product_info.php?products_id=1126 ELRA Persian Lexicon, ISLRN : 547-614-436-004-7]&lt;br /&gt;
&lt;br /&gt;
==Machine translation==&lt;br /&gt;
===Free===&lt;br /&gt;
*[http://ece.ut.ac.ir/node/100869?destination=node%2F100869 Tehran English-Persian Parallel Corpus] by Mohammad Taher Pilevar, NLP Lab, University of Tehran. For research or non-commercial use.&lt;br /&gt;
&lt;br /&gt;
===Proprietary===&lt;br /&gt;
*[http://crl.nmsu.edu/Research/Projects/shiraz/index.html The Shiraz project] (Persian -&amp;gt; English)&lt;br /&gt;
&lt;br /&gt;
==Morphology tools==&lt;br /&gt;
===Free===&lt;br /&gt;
*[http://sourceforge.net/projects/perstem Perstem] - Persian stemmer, light morphological analyzer, and character set converter.&lt;br /&gt;
*[http://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-tg-fa/apertium-tg-fa.fa.dix Morphological dictionary] &amp;amp;mdash; compiled using [[lttoolbox]].&lt;br /&gt;
*[http://stp.lingfil.uu.se/~mojgan/ BLARK by Mojgan Seraji] – normaliser, tokeniser, segmentation, hunpos model for PoS-tagging and (Java) dependency parser, all GPL&lt;br /&gt;
&lt;br /&gt;
==Parsing==&lt;br /&gt;
===Free===&lt;br /&gt;
* [http://ufal.mff.cuni.cz/hamledt HamleDT], harmonized dependency treebanks of many languages, common annotation style.&lt;br /&gt;
* [http://www.ling.ohio-state.edu/~jonsafari/persianlg/ Persian dictionaries] for the [http://www.abisource.com/projects/link-grammar/ Link-Grammar parser]. By [http://www.ling.ohio-state.edu/~jonsafari/ Jon Dehdari]. These require the Perstem stemming package, above.&lt;br /&gt;
* [http://stp.lingfil.uu.se/~mojgan/UPDT.html Uppsala Persian Dependency Treebank], Creative Commons Attribution 3.0 License&lt;br /&gt;
&lt;br /&gt;
===Proprietary===&lt;br /&gt;
*[http://dadegan.ir/en/persiandependencytreebank Dadegan Dependency Treebank] for research purposes only.&lt;br /&gt;
*[http://hpsg.fu-berlin.de/~ghayoomi/PTB.html HPSG Persian Treebank (PerTreeBank)] for academic research purposes only.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Bibliography==&lt;br /&gt;
* Dehdari, Jon, and Deryle Lonsdale. 2008. [http://www.ling.ohio-state.edu/~jonsafari/papers/dehdari_lonsdale_2005.pdf A link grammar parser for Persian]. In Karimi, S., Samiian, V., and Stilo, D., editors, &#039;&#039;Aspects of Iranian Linguistics&#039;&#039;, volume 1. Cambridge Scholars Press. ISBN: 978-18-471-8639-3 ([http://www.ling.ohio-state.edu/~jonsafari/bib/dehdarilonsdale2005.bib.txt BIB])&lt;br /&gt;
&lt;br /&gt;
* QasemiZadeh, Behrang and Rahimi Saeed, FinTAL, 2006, pp. 541-551 ([http://pars.ie/publications/papers/pre-prints/persian-in-multext-east.pdf PDF]).&lt;br /&gt;
&lt;br /&gt;
*  Feili, H. and G. Ghassem-Sani (2004) &amp;quot;[http://sharif.edu/~sani/papers/Feili_SaniE2.pdf An Application of Lexicalized Grammars in English-Persian Translation]&amp;quot;. &#039;&#039;Proceedings of the 16th European Conference on Artificial Intelligence (ECAI 2004)&#039;&#039;, 24-27 Aug. 2004, Universidad Politecnica de Valencia, Valencia, Spain, pp. 596-600.&lt;br /&gt;
* Megerdoomian, K. (2000) &amp;quot;[http://crl.nmsu.edu/Research/Projects/shiraz/publications/papers/Cicling.pdf Unification-Based Persian Morphology]&amp;quot;. &#039;&#039;Proceedings of CICLing 2000&#039;&#039;, Alexander Gelbukh, Center of Investigation on Computation-IPN, Mexico, 2000.&lt;br /&gt;
* Megerdoomian, K. (2004) &amp;quot;[http://acl.ldc.upenn.edu/coling2004/W5/pdf/W5-7.pdf Finite-State Morphological Analysis of Persian]&amp;quot;. &#039;&#039;COLING 2004 Computational Approaches to Arabic Script-based Languages&#039;&#039;. Ali Farghaly and Karine Megerdoomian, editors, Geneva, Switzerland, 2004, pp. 35-41.&lt;br /&gt;
* Mohammad Amin Farajian (2011). [http://world-comp.org/p2011/ICA4953.pdf PEN: Parallel English-Persian News Corpus]. Proceedings of 2011 International Conference on Artificial Intelligence (ICAI&#039;11), Nevada, USA.&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
*[[Resources for Kurdish]]&lt;br /&gt;
*[[Resources for Tajik]]&lt;br /&gt;
&lt;br /&gt;
==External links==&lt;br /&gt;
*[https://wiki.iranianlinguistics.org/wiki/Main_Page NLP Resources for Persian]&lt;br /&gt;
*[http://www.ling.ohio-state.edu/~jonsafari/persian_nlp.html the Jon safari] (link parser, small lexicon, stemmer, morphological analysis tools)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Category:Resources by language|Persian]]&lt;/div&gt;</summary>
		<author><name>Mightyscience</name></author>
	</entry>
	<entry>
		<id>https://www.aclweb.org/aclwiki/index.php?title=Resources_for_Persian&amp;diff=11414</id>
		<title>Resources for Persian</title>
		<link rel="alternate" type="text/html" href="https://www.aclweb.org/aclwiki/index.php?title=Resources_for_Persian&amp;diff=11414"/>
		<updated>2016-02-20T07:43:56Z</updated>

		<summary type="html">&lt;p&gt;Mightyscience: /* Proprietary */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Corpora ==&lt;br /&gt;
===Free===&lt;br /&gt;
*[http://www.ling.ohio-state.edu/~jonsafari/corpora VOA Persian Corpus 2003-2008] (public domain)&lt;br /&gt;
*[https://www.clarin.si/repository/xmlui/handle/11356/1042 Orwell&#039;s 1984 Corpus in MULTEXT-EAST] (public domain)&lt;br /&gt;
&lt;br /&gt;
===Proprietary===&lt;br /&gt;
&amp;lt;!-- Please keep this list in alphabetical order --&amp;gt;&lt;br /&gt;
*[http://ece.ut.ac.ir/DBRG/Bijankhan/ Bijankhan corpus] (gratis for research/non-commercial purposes)&lt;br /&gt;
*[http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC96S50 CALLFRIEND Farsi (speech)], LDC&lt;br /&gt;
*[http://ece.ut.ac.ir/dbrg/hamshahri/ Hamshahri corpus] (gratis for research/non-commercial purposes)&lt;br /&gt;
*[http://www.elda.org/catalogue/en/speech/S0112.html Persian speech database Farsdat], ELRA&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Lexical resources==&lt;br /&gt;
===Free===&lt;br /&gt;
*[http://www.ling.ohio-state.edu/~jonsafari/corpora/wikipedia_fa-en_20120217.txt.xz Persian - English dictionary], derived from Wikipedia article names.  Retains Wikipedia&#039;s CC-BY-SA 3.0 license.&lt;br /&gt;
&lt;br /&gt;
===Proprietary===&lt;br /&gt;
*[http://pwn.ir Persian WordNet]&lt;br /&gt;
&lt;br /&gt;
*[http://catalog.elra.info/product_info.php?products_id=1126 ELRA Persian Lexicon, ISLRN : 547-614-436-004-7]&lt;br /&gt;
&lt;br /&gt;
==Machine translation==&lt;br /&gt;
===Free===&lt;br /&gt;
*[http://ece.ut.ac.ir/node/100869?destination=node%2F100869 Tehran English-Persian Parallel Corpus] by Mohammad Taher Pilevar, NLP Lab, University of Tehran. For research or non-commercial use.&lt;br /&gt;
&lt;br /&gt;
===Proprietary===&lt;br /&gt;
*[http://crl.nmsu.edu/Research/Projects/shiraz/index.html The Shiraz project] (Persian -&amp;gt; English)&lt;br /&gt;
&lt;br /&gt;
==Morphology tools==&lt;br /&gt;
===Free===&lt;br /&gt;
*[http://sourceforge.net/projects/perstem Perstem] - Persian stemmer, light morphological analyzer, and character set converter.&lt;br /&gt;
*[http://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-tg-fa/apertium-tg-fa.fa.dix Morphological dictionary] &amp;amp;mdash; compiled using [[lttoolbox]].&lt;br /&gt;
*[http://stp.lingfil.uu.se/~mojgan/ BLARK by Mojgan Seraji] – normaliser, tokeniser, segmentation, hunpos model for PoS-tagging and (Java) dependency parser, all GPL&lt;br /&gt;
&lt;br /&gt;
==Parsing==&lt;br /&gt;
===Free===&lt;br /&gt;
* [http://ufal.mff.cuni.cz/hamledt HamleDT], harmonized dependency treebanks of many languages, common annotation style.&lt;br /&gt;
* [http://www.ling.ohio-state.edu/~jonsafari/persianlg/ Persian dictionaries] for the [http://www.abisource.com/projects/link-grammar/ Link-Grammar parser]. By [http://www.ling.ohio-state.edu/~jonsafari/ Jon Dehdari]. These require the Perstem stemming package, above.&lt;br /&gt;
* [http://stp.lingfil.uu.se/~mojgan/UPDT.html Uppsala Persian Dependency Treebank], Creative Commons Attribution 3.0 License&lt;br /&gt;
&lt;br /&gt;
===Proprietary===&lt;br /&gt;
*[http://dadegan.ir/en/persiandependencytreebank Dadegan Dependency Treebank] for research purposes only.&lt;br /&gt;
*[http://hpsg.fu-berlin.de/~ghayoomi/PTB.html HPSG Persian Treebank (PerTreeBank)] for academic research purposes only.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Bibliography==&lt;br /&gt;
* Dehdari, Jon, and Deryle Lonsdale. 2008. [http://www.ling.ohio-state.edu/~jonsafari/papers/dehdari_lonsdale_2005.pdf A link grammar parser for Persian]. In Karimi, S., Samiian, V., and Stilo, D., editors, &#039;&#039;Aspects of Iranian Linguistics&#039;&#039;, volume 1. Cambridge Scholars Press. ISBN: 978-18-471-8639-3 ([http://www.ling.ohio-state.edu/~jonsafari/bib/dehdarilonsdale2005.bib.txt BIB])&lt;br /&gt;
&lt;br /&gt;
* QasemiZadeh, Behrang and Rahimi Saeed, FinTAL, 2006, pp. 541-551 ([http://pars.ie/publications/papers/pre-prints/persian-in-multext-east.pdf PDF]).&lt;br /&gt;
&lt;br /&gt;
*  Feili, H. and G. Ghassem-Sani (2004) &amp;quot;[http://sharif.edu/~sani/papers/Feili_SaniE2.pdf An Application of Lexicalized Grammars in English-Persian Translation]&amp;quot;. &#039;&#039;Proceedings of the 16th European Conference on Artificial Intelligence (ECAI 2004)&#039;&#039;, 24-27 Aug. 2004, Universidad Politecnica de Valencia, Valencia, Spain, pp. 596-600.&lt;br /&gt;
* Megerdoomian, K. (2000) &amp;quot;[http://crl.nmsu.edu/Research/Projects/shiraz/publications/papers/Cicling.pdf Unification-Based Persian Morphology]&amp;quot;. &#039;&#039;Proceedings of CICLing 2000&#039;&#039;, Alexander Gelbukh, Center of Investigation on Computation-IPN, Mexico, 2000.&lt;br /&gt;
* Megerdoomian, K. (2004) &amp;quot;[http://acl.ldc.upenn.edu/coling2004/W5/pdf/W5-7.pdf Finite-State Morphological Analysis of Persian]&amp;quot;. &#039;&#039;COLING 2004 Computational Approaches to Arabic Script-based Languages&#039;&#039;. Ali Farghaly and Karine Megerdoomian, editors, Geneva, Switzerland, 2004, pp. 35-41.&lt;br /&gt;
* Mohammad Amin Farajian (2011). [http://world-comp.org/p2011/ICA4953.pdf PEN: Parallel English-Persian News Corpus]. Proceedings of 2011 International Conference on Artificial Intelligence (ICAI&#039;11), Nevada, USA.&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
*[[Resources for Kurdish]]&lt;br /&gt;
*[[Resources for Tajik]]&lt;br /&gt;
&lt;br /&gt;
==External links==&lt;br /&gt;
*[https://wiki.iranianlinguistics.org/wiki/Main_Page NLP Resources for Persian]&lt;br /&gt;
*[http://www.ling.ohio-state.edu/~jonsafari/persian_nlp.html the Jon safari] (link parser, small lexicon, stemmer, morphological analysis tools)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Category:Resources by language|Persian]]&lt;/div&gt;</summary>
		<author><name>Mightyscience</name></author>
	</entry>
	<entry>
		<id>https://www.aclweb.org/aclwiki/index.php?title=Resources_for_Persian&amp;diff=11413</id>
		<title>Resources for Persian</title>
		<link rel="alternate" type="text/html" href="https://www.aclweb.org/aclwiki/index.php?title=Resources_for_Persian&amp;diff=11413"/>
		<updated>2016-02-20T07:39:19Z</updated>

		<summary type="html">&lt;p&gt;Mightyscience: /* Bibliography */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Corpora ==&lt;br /&gt;
===Free===&lt;br /&gt;
*[http://www.ling.ohio-state.edu/~jonsafari/corpora VOA Persian Corpus 2003-2008] (public domain)&lt;br /&gt;
*[https://www.clarin.si/repository/xmlui/handle/11356/1042 Orwell&#039;s 1984 Corpus in MULTEXT-EAST] (public domain)&lt;br /&gt;
&lt;br /&gt;
===Proprietary===&lt;br /&gt;
&amp;lt;!-- Please keep this list in alphabetical order --&amp;gt;&lt;br /&gt;
*[http://ece.ut.ac.ir/DBRG/Bijankhan/ Bijankhan corpus] (gratis for research/non-commercial purposes)&lt;br /&gt;
*[http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC96S50 CALLFRIEND Farsi (speech)], LDC&lt;br /&gt;
*[http://ece.ut.ac.ir/dbrg/hamshahri/ Hamshahri corpus] (gratis for research/non-commercial purposes)&lt;br /&gt;
*[http://www.elda.org/catalogue/en/speech/S0112.html Persian speech database Farsdat], ELRA&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Lexical resources==&lt;br /&gt;
===Free===&lt;br /&gt;
*[http://www.ling.ohio-state.edu/~jonsafari/corpora/wikipedia_fa-en_20120217.txt.xz Persian - English dictionary], derived from Wikipedia article names.  Retains Wikipedia&#039;s CC-BY-SA 3.0 license.&lt;br /&gt;
&lt;br /&gt;
===Proprietary===&lt;br /&gt;
*[http://pwn.ir Persian WordNet]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Machine translation==&lt;br /&gt;
===Free===&lt;br /&gt;
*[http://ece.ut.ac.ir/node/100869?destination=node%2F100869 Tehran English-Persian Parallel Corpus] by Mohammad Taher Pilevar, NLP Lab, University of Tehran. For research or non-commercial use.&lt;br /&gt;
&lt;br /&gt;
===Proprietary===&lt;br /&gt;
*[http://crl.nmsu.edu/Research/Projects/shiraz/index.html The Shiraz project] (Persian -&amp;gt; English)&lt;br /&gt;
&lt;br /&gt;
==Morphology tools==&lt;br /&gt;
===Free===&lt;br /&gt;
*[http://sourceforge.net/projects/perstem Perstem] - Persian stemmer, light morphological analyzer, and character set converter.&lt;br /&gt;
*[http://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-tg-fa/apertium-tg-fa.fa.dix Morphological dictionary] &amp;amp;mdash; compiled using [[lttoolbox]].&lt;br /&gt;
*[http://stp.lingfil.uu.se/~mojgan/ BLARK by Mojgan Seraji] – normaliser, tokeniser, segmentation, hunpos model for PoS-tagging and (Java) dependency parser, all GPL&lt;br /&gt;
&lt;br /&gt;
==Parsing==&lt;br /&gt;
===Free===&lt;br /&gt;
* [http://ufal.mff.cuni.cz/hamledt HamleDT], harmonized dependency treebanks of many languages, common annotation style.&lt;br /&gt;
* [http://www.ling.ohio-state.edu/~jonsafari/persianlg/ Persian dictionaries] for the [http://www.abisource.com/projects/link-grammar/ Link-Grammar parser]. By [http://www.ling.ohio-state.edu/~jonsafari/ Jon Dehdari]. These require the Perstem stemming package, above.&lt;br /&gt;
* [http://stp.lingfil.uu.se/~mojgan/UPDT.html Uppsala Persian Dependency Treebank], Creative Commons Attribution 3.0 License&lt;br /&gt;
&lt;br /&gt;
===Proprietary===&lt;br /&gt;
*[http://dadegan.ir/en/persiandependencytreebank Dadegan Dependency Treebank] for research purposes only.&lt;br /&gt;
*[http://hpsg.fu-berlin.de/~ghayoomi/PTB.html HPSG Persian Treebank (PerTreeBank)] for academic research purposes only.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Bibliography==&lt;br /&gt;
* Dehdari, Jon, and Deryle Lonsdale. 2008. [http://www.ling.ohio-state.edu/~jonsafari/papers/dehdari_lonsdale_2005.pdf A link grammar parser for Persian]. In Karimi, S., Samiian, V., and Stilo, D., editors, &#039;&#039;Aspects of Iranian Linguistics&#039;&#039;, volume 1. Cambridge Scholars Press. ISBN: 978-18-471-8639-3 ([http://www.ling.ohio-state.edu/~jonsafari/bib/dehdarilonsdale2005.bib.txt BIB])&lt;br /&gt;
&lt;br /&gt;
* QasemiZadeh, Behrang and Rahimi Saeed, FinTAL, 2006, pp. 541-551 ([http://pars.ie/publications/papers/pre-prints/persian-in-multext-east.pdf PDF]).&lt;br /&gt;
&lt;br /&gt;
*  Feili, H. and G. Ghassem-Sani (2004) &amp;quot;[http://sharif.edu/~sani/papers/Feili_SaniE2.pdf An Application of Lexicalized Grammars in English-Persian Translation]&amp;quot;. &#039;&#039;Proceedings of the 16th European Conference on Artificial Intelligence (ECAI 2004)&#039;&#039;, 24-27 Aug. 2004, Universidad Politecnica de Valencia, Valencia, Spain, pp. 596-600.&lt;br /&gt;
* Megerdoomian, K. (2000) &amp;quot;[http://crl.nmsu.edu/Research/Projects/shiraz/publications/papers/Cicling.pdf Unification-Based Persian Morphology]&amp;quot;. &#039;&#039;Proceedings of CICLing 2000&#039;&#039;, Alexander Gelbukh, Center of Investigation on Computation-IPN, Mexico, 2000.&lt;br /&gt;
* Megerdoomian, K. (2004) &amp;quot;[http://acl.ldc.upenn.edu/coling2004/W5/pdf/W5-7.pdf Finite-State Morphological Analysis of Persian]&amp;quot;. &#039;&#039;COLING 2004 Computational Approaches to Arabic Script-based Languages&#039;&#039;. Ali Farghaly and Karine Megerdoomian, editors, Geneva, Switzerland, 2004, pp. 35-41.&lt;br /&gt;
* Mohammad Amin Farajian (2011). [http://world-comp.org/p2011/ICA4953.pdf PEN: Parallel English-Persian News Corpus]. Proceedings of 2011 International Conference on Artificial Intelligence (ICAI&#039;11), Nevada, USA.&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
*[[Resources for Kurdish]]&lt;br /&gt;
*[[Resources for Tajik]]&lt;br /&gt;
&lt;br /&gt;
==External links==&lt;br /&gt;
*[https://wiki.iranianlinguistics.org/wiki/Main_Page NLP Resources for Persian]&lt;br /&gt;
*[http://www.ling.ohio-state.edu/~jonsafari/persian_nlp.html the Jon safari] (link parser, small lexicon, stemmer, morphological analysis tools)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Category:Resources by language|Persian]]&lt;/div&gt;</summary>
		<author><name>Mightyscience</name></author>
	</entry>
	<entry>
		<id>https://www.aclweb.org/aclwiki/index.php?title=Resources_for_Persian&amp;diff=11412</id>
		<title>Resources for Persian</title>
		<link rel="alternate" type="text/html" href="https://www.aclweb.org/aclwiki/index.php?title=Resources_for_Persian&amp;diff=11412"/>
		<updated>2016-02-20T07:36:19Z</updated>

		<summary type="html">&lt;p&gt;Mightyscience: /* Free */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Corpora ==&lt;br /&gt;
===Free===&lt;br /&gt;
*[http://www.ling.ohio-state.edu/~jonsafari/corpora VOA Persian Corpus 2003-2008] (public domain)&lt;br /&gt;
*[https://www.clarin.si/repository/xmlui/handle/11356/1042 Orwell&#039;s 1984 Corpus in MULTEXT-EAST] (public domain)&lt;br /&gt;
&lt;br /&gt;
===Proprietary===&lt;br /&gt;
&amp;lt;!-- Please keep this list in alphabetical order --&amp;gt;&lt;br /&gt;
*[http://ece.ut.ac.ir/DBRG/Bijankhan/ Bijankhan corpus] (gratis for research/non-commercial purposes)&lt;br /&gt;
*[http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC96S50 CALLFRIEND Farsi (speech)], LDC&lt;br /&gt;
*[http://ece.ut.ac.ir/dbrg/hamshahri/ Hamshahri corpus] (gratis for research/non-commercial purposes)&lt;br /&gt;
*[http://www.elda.org/catalogue/en/speech/S0112.html Persian speech database Farsdat], ELRA&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Lexical resources==&lt;br /&gt;
===Free===&lt;br /&gt;
*[http://www.ling.ohio-state.edu/~jonsafari/corpora/wikipedia_fa-en_20120217.txt.xz Persian - English dictionary], derived from Wikipedia article names.  Retains Wikipedia&#039;s CC-BY-SA 3.0 license.&lt;br /&gt;
&lt;br /&gt;
===Proprietary===&lt;br /&gt;
*[http://pwn.ir Persian WordNet]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Machine translation==&lt;br /&gt;
===Free===&lt;br /&gt;
*[http://ece.ut.ac.ir/node/100869?destination=node%2F100869 Tehran English-Persian Parallel Corpus] by Mohammad Taher Pilevar, NLP Lab, University of Tehran. For research or non-commercial use.&lt;br /&gt;
&lt;br /&gt;
===Proprietary===&lt;br /&gt;
*[http://crl.nmsu.edu/Research/Projects/shiraz/index.html The Shiraz project] (Persian -&amp;gt; English)&lt;br /&gt;
&lt;br /&gt;
==Morphology tools==&lt;br /&gt;
===Free===&lt;br /&gt;
*[http://sourceforge.net/projects/perstem Perstem] - Persian stemmer, light morphological analyzer, and character set converter.&lt;br /&gt;
*[http://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-tg-fa/apertium-tg-fa.fa.dix Morphological dictionary] &amp;amp;mdash; compiled using [[lttoolbox]].&lt;br /&gt;
*[http://stp.lingfil.uu.se/~mojgan/ BLARK by Mojgan Seraji] – normaliser, tokeniser, segmentation, hunpos model for PoS-tagging and (Java) dependency parser, all GPL&lt;br /&gt;
&lt;br /&gt;
==Parsing==&lt;br /&gt;
===Free===&lt;br /&gt;
* [http://ufal.mff.cuni.cz/hamledt HamleDT], harmonized dependency treebanks of many languages, common annotation style.&lt;br /&gt;
* [http://www.ling.ohio-state.edu/~jonsafari/persianlg/ Persian dictionaries] for the [http://www.abisource.com/projects/link-grammar/ Link-Grammar parser]. By [http://www.ling.ohio-state.edu/~jonsafari/ Jon Dehdari]. These require the Perstem stemming package, above.&lt;br /&gt;
* [http://stp.lingfil.uu.se/~mojgan/UPDT.html Uppsala Persian Dependency Treebank], Creative Commons Attribution 3.0 License&lt;br /&gt;
&lt;br /&gt;
===Proprietary===&lt;br /&gt;
*[http://dadegan.ir/en/persiandependencytreebank Dadegan Dependency Treebank] for research purposes only.&lt;br /&gt;
*[http://hpsg.fu-berlin.de/~ghayoomi/PTB.html HPSG Persian Treebank (PerTreeBank)] for academic research purposes only.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Bibliography==&lt;br /&gt;
* Dehdari, Jon, and Deryle Lonsdale. 2008. [http://www.ling.ohio-state.edu/~jonsafari/papers/dehdari_lonsdale_2005.pdf A link grammar parser for Persian]. In Karimi, S., Samiian, V., and Stilo, D., editors, &#039;&#039;Aspects of Iranian Linguistics&#039;&#039;, volume 1. Cambridge Scholars Press. ISBN: 978-18-471-8639-3 ([http://www.ling.ohio-state.edu/~jonsafari/bib/dehdarilonsdale2005.bib.txt BIB])&lt;br /&gt;
&lt;br /&gt;
*  Feili, H. and G. Ghassem-Sani (2004) &amp;quot;[http://sharif.edu/~sani/papers/Feili_SaniE2.pdf An Application of Lexicalized Grammars in English-Persian Translation]&amp;quot;. &#039;&#039;Proceedings of the 16th European Conference on Artificial Intelligence (ECAI 2004)&#039;&#039;, 24-27 Aug. 2004, Universidad Politecnica de Valencia, Valencia, Spain, pp. 596-600.&lt;br /&gt;
* Megerdoomian, K. (2000) &amp;quot;[http://crl.nmsu.edu/Research/Projects/shiraz/publications/papers/Cicling.pdf Unification-Based Persian Morphology]&amp;quot;. &#039;&#039;Proceedings of CICLing 2000&#039;&#039;, Alexander Gelbukh, Center of Investigation on Computation-IPN, Mexico, 2000.&lt;br /&gt;
* Megerdoomian, K. (2004) &amp;quot;[http://acl.ldc.upenn.edu/coling2004/W5/pdf/W5-7.pdf Finite-State Morphological Analysis of Persian]&amp;quot;. &#039;&#039;COLING 2004 Computational Approaches to Arabic Script-based Languages&#039;&#039;. Ali Farghaly and Karine Megerdoomian, editors, Geneva, Switzerland, 2004, pp. 35-41.&lt;br /&gt;
* Mohammad Amin Farajian (2011). [http://world-comp.org/p2011/ICA4953.pdf PEN: Parallel English-Persian News Corpus]. Proceedings of 2011 International Conference on Artificial Intelligence (ICAI&#039;11), Nevada, USA.&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
*[[Resources for Kurdish]]&lt;br /&gt;
*[[Resources for Tajik]]&lt;br /&gt;
&lt;br /&gt;
==External links==&lt;br /&gt;
*[https://wiki.iranianlinguistics.org/wiki/Main_Page NLP Resources for Persian]&lt;br /&gt;
*[http://www.ling.ohio-state.edu/~jonsafari/persian_nlp.html the Jon safari] (link parser, small lexicon, stemmer, morphological analysis tools)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Category:Resources by language|Persian]]&lt;/div&gt;</summary>
		<author><name>Mightyscience</name></author>
	</entry>
</feed>