

SUMTIME-METEO is a parallel corpus of naturally occurring weather forecast texts and the numerical data they are based on. The corpus has 1045 parallel data-text units and is available as a Microsoft Access database and as CSV (comma-separated values) text files exported from the Access database. The download zip file also includes documentation (PDF). The SUMTIME-METEO corpus is sometimes referred to simply as the SUMTIME corpus.

The textual portion of the corpus consists of human-written weather forecasts (marine forecasts intended for offshore oil rigs in the North Sea); these are actual forecasts written by professional forecasters for real clients. The data portion of the corpus consists of numerical weather predictions (of wind speed, temperature, precipitation, etc.) that the human forecasters examined when they wrote the forecasts. The forecasts were written between 26-June-2000 and 10-May-2002.

For more information on the SumTime project, see our web page, which includes a publication list.


This page was imported semi-automatically from the NLG Resources Wiki which was run by ACL SIGGEN in the years 2005–2009. Please correct conversion errors and help update its contents.

This page is now associated with the Natural Language Generation Portal.