Difference between revisions of "Resources for Persian"
Jump to navigation
Jump to search
(add links; update links) |
|||
Line 1: | Line 1: | ||
− | ==Machine translation | + | ==Machine translation== |
− | ===Free | + | ===Free resources=== |
− | ===Proprietary=== | + | ===Proprietary resources=== |
*[http://crl.nmsu.edu/Research/Projects/shiraz/index.html The Shiraz project] (Persian -> English) | *[http://crl.nmsu.edu/Research/Projects/shiraz/index.html The Shiraz project] (Persian -> English) | ||
+ | *[http://ece.ut.ac.ir/NLP/resources.htm Tehran English-Persian Parallel Corpus] by Mohammad Taher Pilevar, NLP Lab, University of Tehran. For research or non-commercial use. | ||
==Morphology tools== | ==Morphology tools== | ||
Line 12: | Line 13: | ||
== Corpora == | == Corpora == | ||
− | ===Free | + | ===Free=== |
− | *[http:// | + | *[http://www.ling.ohio-state.edu/~jonsafari/corpora VOA Persian Corpus 2003-2008] (public domain) |
===Proprietary=== | ===Proprietary=== | ||
Line 23: | Line 24: | ||
− | == | + | ==Parsing== |
− | ===Free | + | ===Free resources=== |
− | * [http://www.ling.ohio-state.edu/~jonsafari/persianlg/ Persian dictionaries] | + | * [http://www.ling.ohio-state.edu/~jonsafari/persianlg/ Persian dictionaries] for the [http://www.abisource.com/projects/link-grammar/ Link-Grammar parser]. By [http://www.ling.ohio-state.edu/~jonsafari/ Jon Dehdari]. These require the Perstem stemming package, above. |
+ | |||
+ | ===Proprietary=== | ||
+ | *[http://dadegan.ir/en/persiandependencytreebank Dadegan Dependency Treebank] for research purposes only. | ||
+ | *[http://hpsg.fu-berlin.de/~ghayoomi/PTB.html HPSG Persian Treebank (PerTreeBank)] for academic research purposes only. | ||
+ | *[http://stp.lingfil.uu.se/~mojgan/persian_dependency_treebank.pdf A soon-to-be-released Persian Dependency Treebank], license not specified yet. | ||
+ | |||
==Bibliography== | ==Bibliography== | ||
Line 38: | Line 45: | ||
==External links== | ==External links== | ||
− | *[http:// | + | *[http://www.iranianlinguistics.org/wiki/index.php?title=Persian Iranian Linguistics: NLP Resources for Persian] |
+ | *[http://www.ling.ohio-state.edu/~jonsafari/persian_nlp.html the Jon safari] (link parser, small lexicon, stemmer, morphological analysis tools) | ||
[[Category:Resources by language|Persian]] | [[Category:Resources by language|Persian]] |
Revision as of 11:05, 20 February 2012
Machine translation
Free resources
Proprietary resources
- The Shiraz project (Persian -> English)
- Tehran English-Persian Parallel Corpus by Mohammad Taher Pilevar, NLP Lab, University of Tehran. For research or non-commercial use.
Morphology tools
Free software
- Perstem - Persian stemmer, light morphological analyzer, and character set converter.
- Morphological dictionary — compiled using lttoolbox.
Corpora
Free
- VOA Persian Corpus 2003-2008 (public domain)
Proprietary
- Bijankhan corpus (gratis for research/non-commercial purposes)
- CALLFRIEND Farsi (speech), LDC
- Hamshahri corpus (gratis for research/non-commercial purposes)
- Persian speech database Farsdat, ELRA
Parsing
Free resources
- Persian dictionaries for the Link-Grammar parser. By Jon Dehdari. These require the Perstem stemming package, above.
Proprietary
- Dadegan Dependency Treebank for research purposes only.
- HPSG Persian Treebank (PerTreeBank) for academic research purposes only.
- A soon-to-be-released Persian Dependency Treebank, license not specified yet.
Bibliography
- Dehdari, Jon, and Deryle Lonsdale. 2008. A link grammar parser for Persian. In Karimi, S., Samiian, V., and Stilo, D., editors, Aspects of Iranian Linguistics, volume 1. Cambridge Scholars Press. ISBN: 978-18-471-8639-3 (BIB)
- Feili, H. and G. Ghassem-Sani (2004) "An Application of Lexicalized Grammars in English-Persian Translation". Proceedings of the 16th European Conference on Artificial Intelligence (ECAI 2004), 24-27 Aug. 2004, Universidad Politecnica de Valencia, Valencia, Spain, pp. 596-600.
- Megerdoomian, K. (2000) "Unification-Based Persian Morphology". Proceedings of CICLing 2000, Alexander Gelbukh, Center of Investigation on Computation-IPN, Mexico, 2000.
- Megerdoomian, K. (2004) "Finite-State Morphological Analysis of Persian". COLING 2004 Computational Approaches to Arabic Script-based Languages. Ali Farghaly and Karine Megerdoomian editors, Geneva, Switzerland, 2004, pgs. 35-41.
See also
External links
- Iranian Linguistics: NLP Resources for Persian
- the Jon safari (link parser, small lexicon, stemmer, morphological analysis tools)