Enriching the WebNLG corpus

Thiago Castro Ferreira, Diego Moussallem, Emiel Krahmer, Sander Wubben


Abstract
This paper describes the enrichment of WebNLG corpus (Gardent et al., 2017a,b), with the aim to further extend its usefulness as a resource for evaluating common NLG tasks, including Discourse Ordering, Lexicalization and Referring Expression Generation. We also produce a silver-standard German translation of the corpus to enable the exploitation of NLG approaches to other languages than English. The enriched corpus is publicly available.
Anthology ID:
W18-6521
Volume:
Proceedings of the 11th International Conference on Natural Language Generation
Month:
November
Year:
2018
Address:
Tilburg University, The Netherlands
Editors:
Emiel Krahmer, Albert Gatt, Martijn Goudbeek
Venue:
INLG
SIG:
SIGGEN
Publisher:
Association for Computational Linguistics
Note:
Pages:
171–176
Language:
URL:
https://aclanthology.org/W18-6521
DOI:
10.18653/v1/W18-6521
Bibkey:
Cite (ACL):
Thiago Castro Ferreira, Diego Moussallem, Emiel Krahmer, and Sander Wubben. 2018. Enriching the WebNLG corpus. In Proceedings of the 11th International Conference on Natural Language Generation, pages 171–176, Tilburg University, The Netherlands. Association for Computational Linguistics.
Cite (Informal):
Enriching the WebNLG corpus (Castro Ferreira et al., INLG 2018)
Copy Citation:
PDF:
https://aclanthology.org/W18-6521.pdf
Code
 ThiagoCF05/webnlg
Data
WebNLG