Data sets for NLG
This page lists sets of structured data to be used as input for natural language generation tasks, or to inform research on NLG.
Focus on Content Selection, Aggregation
These data contain predictions for meteorological parameters such as precipitation, temperature, wind speed, and cloud cover at various altitudes, at regular intervals for various points in the area of interest.
The weather corpus currently exists as an Access database and, alternatively, in form of CSV (ASCII) files.
Download and Info: SumTime-Meteo
Project link: http://www.csd.abdn.ac.uk/research/sumtime/
Focus on Referring Expression Generation
Referring expression generation is a sub-task of NLG with an active research community.
GRE3D3: Spatial Relations in Referring Expressions
A web-based production experiment was conducted by Jette Viethen under the supervision of Robert Dale. The resulting GRE3D3 corpus contains 720 referring expressions for simple objects in simple 3D scenes.
Focus on Lexicalization
Focus on Syntax, Realization
| This page was imported semi-automatically from the NLG Resources Wiki which was run by ACL SIGGEN in the years 2005–2009. Please correct conversion errors and help update its contents.|
Now this page is associated with the Natural Language Generation Portal.