Resources for French
Jump to navigation
Jump to search
Corpora
- 10^9 French-English corpus
- Base Textuelle de Moyen Francais
- French plain text and Co-occurrences at LCC
- French Stopword List
- Lexique Morphalou
- Lexique Verbaction
- Le Monde Diplomatique-Die Tageszeitung Translation Corpus - French-German, aligned (parallel)
- UN parallel corpora
- WMT corpora, including Europarl, News Commentary, and News Crawl
Grammars/parsers
Free software
- HPSG FroG (under the LGPLLR according to this presentation)
- WOLF – Wordnet Libre du Français, distribuée sous licence Cecill-C (compatible LGPL)
- Lefff – (Lexique des Formes Fléchies du Français) est un lexique morphologique et syntaxique à large couverture, distribué sous licence libre LGPL-LR (Lesser General Public License For Linguistic Resources), see also Alexina
- Morfette data driven PoS tagger and lemmatizer, New BSD License
- Apertium has analysers/generators in the lttoolbox format for French, along with statistical disambiguation models, see e.g. the files in fr-ca, fr-es and br-fr
Unknown licence
- KPML generation grammar
- Treetagger has some French support (gratis for research)
- MeLT, data driven pos tagger
Morphology, dictionaries
Free software
- Dicollecte LEXIQUE FRANÇAIS, LISTE DES FORMES FLÉCHIES, MPL/GPL/LGPL
- Flemmv3.1 - inflectional morphology parser for French -- perl, GPL license.