Difference between revisions of "Resources for Hungarian"

From ACL Wiki
(→‎Corpora: +Europarl corpus)
(+hungarian national corpus)
 
(3 intermediate revisions by 3 users not shown)
Line 1: Line 1:
  ==Corpora==
+ ===Free===
  * [http://www.statmt.org/europarl Europarl corpus], sentence aligned with English
+ * [http://mokk.bme.hu/resources/webcorpus/ Hungarian Webcorpus] - 590 million tokens
+
+ ===Non-Free===
+ * [http://ucts.uniba.sk/aranea_about/ Araneum Hungaricum], Gigaword Hungarian web corpus
  * Hunglish parallel corpus ([http://mokk.bme.hu/resources/hunglishcorpus download], [http://hunglish.hu/search search])
- * [http://mokk.bme.hu/resources/webcorpus/ Hungarian Webcorpus]
+ * [http://ufal.mff.cuni.cz/hamledt HamleDT], harmonized dependency treebanks of many languages, common annotation style.
+ * [http://corpus.nytud.hu/mnsz/ Hungarian National Corpus]
+

  == Tools ==
- * [http://code.google.com/p/hunpos/ hunpos] (open-source POS-tagger)
+ * [http://code.google.com/p/hunpos/ hunpos] - open-source POS-tagger
- * [http://mokk.bme.hu/resources/hunmorph/ hunmorph] (open-source morphological analyzer)
+ * [http://mokk.bme.hu/resources/hunmorph/ hunmorph] - open-source morphological analyzer

  [[Category:Resources by language|Hungarian]]

Latest revision as of 08:44, 26 June 2016

Corpora

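Free

  • Europarl corpus, sentence aligned with English
  • Hungarian Webcorpus - 590 million tokens

Non-Free

  • Araneum Hungaricum, Gigaword Hungarian web corpus
  • Hunglish parallel corpus (download, search)
  • HamleDT, harmonized dependency treebanks of many languages, common annotation style.
  • Hungarian National Corpus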


Tools

  • hunpos - open-source POS-tagger
  • hunmorph - open-source morphological analyzer