Universal Dependencies and Morphology for Hungarian - and on the Price of Universality

Veronika Vincze, Katalin Simkó, Zsolt Szántó, Richárd Farkas


Abstract
In this paper, we present how the principles of universal dependencies and morphology have been adapted to Hungarian. We report the most challenging grammatical phenomena and our solutions to them. On the basis of the adapted guidelines, we have converted and manually corrected 1,800 sentences from the Szeged Treebank to universal dependency format. We also introduce experiments on this manually annotated corpus for evaluating automatic conversion and the added value of language-specific, i.e. non-universal, annotations. Our results reveal that converting to universal dependencies is not necessarily trivial; moreover, using language-specific morphological features may have an impact on overall performance.
Anthology ID:
E17-1034
Volume:
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers
Month:
April
Year:
2017
Address:
Valencia, Spain
Editors:
Mirella Lapata, Phil Blunsom, Alexander Koller
Venue:
EACL
Publisher:
Association for Computational Linguistics
Pages:
356–365
URL:
https://aclanthology.org/E17-1034
Cite (ACL):
Veronika Vincze, Katalin Simkó, Zsolt Szántó, and Richárd Farkas. 2017. Universal Dependencies and Morphology for Hungarian - and on the Price of Universality. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers, pages 356–365, Valencia, Spain. Association for Computational Linguistics.
Cite (Informal):
Universal Dependencies and Morphology for Hungarian - and on the Price of Universality (Vincze et al., EACL 2017)
PDF:
https://aclanthology.org/E17-1034.pdf
Data:
Szeged Corpus, Universal Dependencies