Difference between revisions of "Resources for Persian"

From ACL Wiki
Jump to navigation Jump to search
(Alphabetize sections; add lexical resources section; add Persian WordNet entry)
(6 intermediate revisions by 3 users not shown)
Line 13: Line 13:
 
==Lexical resources==
 
==Lexical resources==
 
===Free===
 
===Free===
 +
*[http://www.ling.ohio-state.edu/~jonsafari/corpora/wikipedia_fa-en_20120217.txt.xz Persian - English dictionary], derived from Wikipedia article names.  Retains Wikipedia's CC-BY-SA 3.0 license.
  
 
===Proprietary===
 
===Proprietary===
Line 20: Line 21:
 
==Machine translation==
 
==Machine translation==
 
===Free===
 
===Free===
 +
*[http://ece.ut.ac.ir/node/100869?destination=node%2F100869 Tehran English-Persian Parallel Corpus] by Mohammad Taher Pilevar, NLP Lab, University of Tehran. For research or non-commercial use.
  
 
===Proprietary===
 
===Proprietary===
 
*[http://crl.nmsu.edu/Research/Projects/shiraz/index.html The Shiraz project] (Persian -> English)
 
*[http://crl.nmsu.edu/Research/Projects/shiraz/index.html The Shiraz project] (Persian -> English)
*[http://ece.ut.ac.ir/NLP/resources.htm Tehran English-Persian Parallel Corpus] by Mohammad Taher Pilevar, NLP Lab, University of Tehran. For research or non-commercial use.
 
 
  
 
==Morphology tools==
 
==Morphology tools==
Line 30: Line 30:
 
*[http://sourceforge.net/projects/perstem Perstem] - Persian stemmer, light morphological analyzer, and character set converter.
 
*[http://sourceforge.net/projects/perstem Perstem] - Persian stemmer, light morphological analyzer, and character set converter.
 
*[http://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-tg-fa/apertium-tg-fa.fa.dix Morphological dictionary] — compiled using [[lttoolbox]].
 
*[http://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-tg-fa/apertium-tg-fa.fa.dix Morphological dictionary] — compiled using [[lttoolbox]].
 
+
*[http://stp.lingfil.uu.se/~mojgan/ BLARK by Mojgan Seraji] – normaliser, tokeniser, segmentation, hunpos model for PoS-tagging and (java) dependency parser, all GPL
  
 
==Parsing==
 
==Parsing==
 
===Free===
 
===Free===
* [http://www.ling.ohio-state.edu/~jonsafari/persianlg/ Persian dictionaries] for the [http://www.abisource.com/projects/link-grammar/ Link-Grammar parser]. By [http://www.ling.ohio-state.edu/~jonsafari/ Jon Dehdari]. These require the Perstem stemming package, above.  
+
* [http://ufal.mff.cuni.cz/hamledt HamleDT], harmonized dependency treebanks of many languages, common annotation style.
 +
* [http://www.ling.ohio-state.edu/~jonsafari/persianlg/ Persian dictionaries] for the [http://www.abisource.com/projects/link-grammar/ Link-Grammar parser]. By [http://www.ling.ohio-state.edu/~jonsafari/ Jon Dehdari]. These require the Perstem stemming package, above.
 +
* [http://stp.lingfil.uu.se/~mojgan/UPDT.html Uppsala Persian Dependency Treebank], Creative Commons Attribution 3.0 License
  
 
===Proprietary===
 
===Proprietary===
 
*[http://dadegan.ir/en/persiandependencytreebank Dadegan Dependency Treebank] for research purposes only.
 
*[http://dadegan.ir/en/persiandependencytreebank Dadegan Dependency Treebank] for research purposes only.
 
*[http://hpsg.fu-berlin.de/~ghayoomi/PTB.html HPSG Persian Treebank (PerTreeBank)] for academic research purposes only.
 
*[http://hpsg.fu-berlin.de/~ghayoomi/PTB.html HPSG Persian Treebank (PerTreeBank)] for academic research purposes only.
*[http://stp.lingfil.uu.se/~mojgan/persian_dependency_treebank.pdf A soon-to-be-released Persian Dependency Treebank],  license not specified yet.
+
 
  
  
Line 55: Line 57:
  
 
==External links==
 
==External links==
*[http://www.iranianlinguistics.org/wiki/index.php?title=Persian Iranian Linguistics: NLP Resources for Persian]
+
*https://wiki.iranianlinguistics.org/wiki/Main_Page: NLP Resources for Persian]
 
*[http://www.ling.ohio-state.edu/~jonsafari/persian_nlp.html the Jon safari] (link parser, small lexicon, stemmer, morphological analysis tools)
 
*[http://www.ling.ohio-state.edu/~jonsafari/persian_nlp.html the Jon safari] (link parser, small lexicon, stemmer, morphological analysis tools)
  
  
 
[[Category:Resources by language|Persian]]
 
[[Category:Resources by language|Persian]]

Revision as of 12:45, 11 August 2015

Corpora

Free

Proprietary


Lexical resources

Free

Proprietary


Machine translation

Free

Proprietary

Morphology tools

Free

Parsing

Free

Proprietary


Bibliography

  • Dehdari, Jon, and Deryle Lonsdale. 2008. A link grammar parser for Persian. In Karimi, S., Samiian, V., and Stilo, D., editors, Aspects of Iranian Linguistics, volume 1. Cambridge Scholars Press. ISBN: 978-18-471-8639-3 (BIB)

See also

External links