This page lists sets of structured data to be used as input for natural language generation tasks.

Focus on Content Selection, Aggregation

SumTime-Meteo

These data contain predictions for meteorological parameters such as precipitation, temperature, wind speed, and cloud cover at various altitudes, at regular intervals for various points in the area of interest.

The weather corpus is currently available as a Microsoft Access database and, alternatively, in the form of CSV (ASCII) files.
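As a rough illustration of how the CSV form could feed a content selection step, here is a minimal Python sketch. The column names (`time`, `wind_speed_kt`, `temperature_c`, `precipitation_mm`) and the sample values are hypothetical, made up for this example; the actual SumTime-Meteo files may be laid out quite differently, so check the corpus documentation before adapting it.

```python
import csv
import io

# Hypothetical sample data: the column names and values are illustrative
# assumptions, NOT the actual SumTime-Meteo schema.
SAMPLE_CSV = """time,wind_speed_kt,temperature_c,precipitation_mm
06:00,12,8.5,0.0
09:00,15,9.1,0.2
12:00,18,10.4,1.1
"""


def load_forecast(text):
    """Parse CSV text into a list of dicts, converting numeric fields to float."""
    rows = []
    for row in csv.DictReader(io.StringIO(text)):
        parsed = {"time": row["time"]}
        for key, value in row.items():
            if key != "time":
                parsed[key] = float(value)
        rows.append(parsed)
    return rows


def select_peak(rows, field):
    """Toy content selection: return the row with the largest value of `field`."""
    return max(rows, key=lambda r: r[field])


forecast = load_forecast(SAMPLE_CSV)
peak_wind = select_peak(forecast, "wind_speed_kt")
print(peak_wind["time"], peak_wind["wind_speed_kt"])
```

To read a real file instead of the inline sample, the same `csv.DictReader` call could be pointed at an open file handle, with the field names adjusted to whatever headers the file actually uses.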

Download and Info: SumTime-Meteo

Project link:

Focus on Lexicalization

Focus on Syntax, Realization

