Difference between revisions of "Resources for Hungarian"
Jump to navigation
Jump to search
(Split corpus section) |
(+hungarian national corpus) |
||
Line 8: | Line 8: | ||
* Hunglish parallel corpus ([http://mokk.bme.hu/resources/hunglishcorpus download], [http://hunglish.hu/search search]) | * Hunglish parallel corpus ([http://mokk.bme.hu/resources/hunglishcorpus download], [http://hunglish.hu/search search]) | ||
* [http://ufal.mff.cuni.cz/hamledt HamleDT], harmonized dependency treebanks of many languages, common annotation style. | * [http://ufal.mff.cuni.cz/hamledt HamleDT], harmonized dependency treebanks of many languages, common annotation style. | ||
+ | * [http://corpus.nytud.hu/mnsz/ Hungarian National Corpus] | ||
Latest revision as of 07:44, 26 June 2016
Corpora
Free
- Europarl corpus, sentence aligned with English
- Hungarian Webcorpus - 590 million tokens
Non-Free
- Araneum Hungaricum, Gigaword Hungarian web corpus
- Hunglish parallel corpus (download, search)
- HamleDT, harmonized dependency treebanks of many languages, common annotation style.
- Hungarian National Corpus