Difference between revisions of "Corpora, datasets, lexicons"
Line 21:
  * [http://www.ruscorpora.ru/ Russian National Corpus (RNK)]
  * [http://korpus.juls.savba.sk/ Slovak National Corpus (SNK)]
− * [http://www.fida.net/ Slovenian Corpus FIDA]
−
+ * [http://www.fida.net/ Slovenian Corpus FIDA] and [http://www.fidaplus.net/ FIDA+]
  * [http://www.corpusdelespanol.org/ Spanish Corpus]
  * [http://spraakbanken.gu.se/ Bank of Swedish]
Revision as of 15:26, 31 October 2006
Miscellaneous
Corpora
- American National Corpus (ANC)
- Biomedical corpora
- The Oslo Corpus of Bosnian
- British National Corpus (BNC)
- Brown Corpus
- Collins Wordbanks
- Croatian National Corpus (HNK)
- Czech National Corpus (CNC)
- David Lee's Bookmarks for Corpus-based Linguists
- Gutenberg
- Hungarian National Corpus
- IPI PAN Corpus of Polish
- Oxford English Corpus
- Portuguese Corpus
- Russian National Corpus (RNK)
- Slovak National Corpus (SNK)
- Slovenian Corpus FIDA and FIDA+
- Spanish Corpus
- Bank of Swedish
- WebCorp
Datasets
- Edinburgh Associative Thesaurus (EAT)
- Linguistic Data Consortium (LDC)
- MRC Psycholinguistic Database
- Noun Compound Repository
- Reuters-21578 Text Categorization Collection
- University of South Florida Free Association Norms
- WordSimilarity-353 Test Collection