Recent initiatives in language technology have led to the development of at least minimal
language processing toolkits for all EU-official languages, as well as for languages with a
large number of speakers worldwide, such as Chinese and Arabic. This is a major step towards
the automatic processing of text and the extraction of information from it, especially for
official documents and newspapers, where the standard, literary language is used.
Apart from those official languages, a large number of dialects or closely related language