Resources for Persian: Difference between revisions

From ACL Wiki
Jump to navigation Jump to search
Zeman (talk | contribs)
HamleDT
No edit summary
 
(8 intermediate revisions by 3 users not shown)
Line 2: Line 2:
===Free===
===Free===
*[http://www.ling.ohio-state.edu/~jonsafari/corpora VOA Persian Corpus 2003-2008] (public domain)
*[http://www.ling.ohio-state.edu/~jonsafari/corpora VOA Persian Corpus 2003-2008] (public domain)
*[https://www.clarin.si/repository/xmlui/handle/11356/1042 Orwell's 1984 Corpus in MULTEXT-EAST] (public domain)


===Proprietary===
===Proprietary===
Line 10: Line 11:
*[http://www.elda.org/catalogue/en/speech/S0112.html Persian speech database Farsdat], ELRA
*[http://www.elda.org/catalogue/en/speech/S0112.html Persian speech database Farsdat], ELRA


 
== Online Concordance Tools ==
*[http://pars.ie/lr/corpora/run.cgi/corp_info?corpname=multext_east_farsi Orwell's 1984 Corpus] (public domain)
==Lexical resources==
==Lexical resources==
===Free===
===Free===
Line 18: Line 20:
*[http://pwn.ir Persian WordNet]
*[http://pwn.ir Persian WordNet]


*[http://catalog.elra.info/product_info.php?products_id=1126 ELRA Persian Lexicon, ISLRN : 547-614-436-004-7]


==Machine translation==
==Machine translation==
===Free===
===Free===
*[http://ece.ut.ac.ir/node/100869?destination=node%2F100869 Tehran English-Persian Parallel Corpus] by Mohammad Taher Pilevar, NLP Lab, University of Tehran. For research or non-commercial use.


===Proprietary===
===Proprietary===
*[http://crl.nmsu.edu/Research/Projects/shiraz/index.html The Shiraz project] (Persian -> English)
*[http://crl.nmsu.edu/Research/Projects/shiraz/index.html The Shiraz project] (Persian -> English)
*[http://ece.ut.ac.ir/NLP/resources.htm Tehran English-Persian Parallel Corpus] by Mohammad Taher Pilevar, NLP Lab, University of Tehran. For research or non-commercial use.


==Morphology tools==
==Morphology tools==
Line 31: Line 33:
*[http://sourceforge.net/projects/perstem Perstem] - Persian stemmer, light morphological analyzer, and character set converter.
*[http://sourceforge.net/projects/perstem Perstem] - Persian stemmer, light morphological analyzer, and character set converter.
*[http://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-tg-fa/apertium-tg-fa.fa.dix Morphological dictionary] — compiled using [[lttoolbox]].
*[http://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-tg-fa/apertium-tg-fa.fa.dix Morphological dictionary] — compiled using [[lttoolbox]].
 
*[http://stp.lingfil.uu.se/~mojgan/ BLARK by Mojgan Seraji] – normaliser, tokeniser, segmentation, hunpos model for PoS-tagging and (java) dependency parser, all GPL


==Parsing==
==Parsing==
Line 47: Line 49:
==Bibliography==
==Bibliography==
* Dehdari, Jon, and Deryle Lonsdale. 2008. [http://www.ling.ohio-state.edu/~jonsafari/papers/dehdari_lonsdale_2005.pdf A link grammar parser for Persian]. In Karimi, S., Samiian, V., and Stilo, D., editors, ''Aspects of Iranian Linguistics'', volume 1. Cambridge Scholars Press. ISBN: 978-18-471-8639-3 ([http://www.ling.ohio-state.edu/~jonsafari/bib/dehdarilonsdale2005.bib.txt BIB])
* Dehdari, Jon, and Deryle Lonsdale. 2008. [http://www.ling.ohio-state.edu/~jonsafari/papers/dehdari_lonsdale_2005.pdf A link grammar parser for Persian]. In Karimi, S., Samiian, V., and Stilo, D., editors, ''Aspects of Iranian Linguistics'', volume 1. Cambridge Scholars Press. ISBN: 978-18-471-8639-3 ([http://www.ling.ohio-state.edu/~jonsafari/bib/dehdarilonsdale2005.bib.txt BIB])
* QasemiZadeh, Behrang and Rahimi Saeed. Persian in MULTEXT-East Framework, FinTAL, 2006, pp 541-551 ([http://pars.ie/publications/papers/pre-prints/persian-in-multext-east.pdf]).


*  Feili, H. and G. Ghassem-Sani (2004) "[http://sharif.edu/~sani/papers/Feili_SaniE2.pdf An Application of Lexicalized Grammars in English-Persian Translation]". ''Proceedings of the 16th European Conference on Artificial Intelligence (ECAI 2004)'', 24-27 Aug. 2004, Universidad Politecnica de Valencia, Valencia, Spain, pp. 596-600.
*  Feili, H. and G. Ghassem-Sani (2004) "[http://sharif.edu/~sani/papers/Feili_SaniE2.pdf An Application of Lexicalized Grammars in English-Persian Translation]". ''Proceedings of the 16th European Conference on Artificial Intelligence (ECAI 2004)'', 24-27 Aug. 2004, Universidad Politecnica de Valencia, Valencia, Spain, pp. 596-600.
Line 58: Line 62:


==External links==
==External links==
*[http://www.iranianlinguistics.org/wiki/index.php?title=Persian Iranian Linguistics: NLP Resources for Persian]
*https://wiki.iranianlinguistics.org/wiki/Main_Page: NLP Resources for Persian]
*[http://www.ling.ohio-state.edu/~jonsafari/persian_nlp.html the Jon safari] (link parser, small lexicon, stemmer, morphological analysis tools)
*[http://www.ling.ohio-state.edu/~jonsafari/persian_nlp.html the Jon safari] (link parser, small lexicon, stemmer, morphological analysis tools)




[[Category:Resources by language|Persian]]
[[Category:Resources by language|Persian]]

Latest revision as of 15:58, 23 February 2016

Corpora

Free

Proprietary

Online Concordance Tools

Lexical resources

Free

Proprietary

Machine translation

Free

Proprietary

Morphology tools

Free

Parsing

Free

Proprietary


Bibliography

  • Dehdari, Jon, and Deryle Lonsdale. 2008. A link grammar parser for Persian. In Karimi, S., Samiian, V., and Stilo, D., editors, Aspects of Iranian Linguistics, volume 1. Cambridge Scholars Press. ISBN: 978-18-471-8639-3 (BIB)
  • QasemiZadeh, Behrang and Rahimi Saeed. Persian in MULTEXT-East Framework, FinTAL, 2006, pp 541-551 ([1]).

See also

External links