Difference between revisions of "Resources for Hungarian"

From ACL Wiki
Jump to navigation Jump to search
(Added: Araneum)
(Split corpus section)
Line 1: Line 1:
 
==Corpora==
 
==Corpora==
 +
===Free===
 +
* [http://www.statmt.org/europarl Europarl corpus], sentence aligned with English
 +
* [http://mokk.bme.hu/resources/webcorpus/ Hungarian Webcorpus] - 590 million tokens
 +
 +
===Non-Free===
 
* [http://ucts.uniba.sk/aranea_about/ Araneum Hungaricum], Gigaword Hungarian web corpus
 
* [http://ucts.uniba.sk/aranea_about/ Araneum Hungaricum], Gigaword Hungarian web corpus
* [http://www.statmt.org/europarl Europarl corpus], sentence aligned with English
 
 
* Hunglish parallel corpus ([http://mokk.bme.hu/resources/hunglishcorpus download], [http://hunglish.hu/search search])
 
* Hunglish parallel corpus ([http://mokk.bme.hu/resources/hunglishcorpus download], [http://hunglish.hu/search search])
* [http://mokk.bme.hu/resources/webcorpus/ Hungarian Webcorpus]
 
 
* [http://ufal.mff.cuni.cz/hamledt HamleDT], harmonized dependency treebanks of many languages, common annotation style.
 
* [http://ufal.mff.cuni.cz/hamledt HamleDT], harmonized dependency treebanks of many languages, common annotation style.
 +
  
 
== Tools ==
 
== Tools ==
* [http://code.google.com/p/hunpos/ hunpos] (open-source POS-tagger)
+
* [http://code.google.com/p/hunpos/ hunpos] - open-source POS-tagger
* [http://mokk.bme.hu/resources/hunmorph/ hunmorph] (open-source morphological analyzer)
+
* [http://mokk.bme.hu/resources/hunmorph/ hunmorph] - open-source morphological analyzer
  
  
  
 
[[Category:Resources by language|Hungarian]]
 
[[Category:Resources by language|Hungarian]]

Revision as of 07:53, 17 June 2015

Corpora

Free

Non-Free


Tools

  • hunpos - open-source POS-tagger
  • hunmorph - open-source morphological analyzer