Difference between revisions of "Uncategorized resources"
Jump to navigation
Jump to search
(merged links from Miscellaneous Pointers) |
|||
Line 410: | Line 410: | ||
* [http://www.entmp.org/linguistics/phonGIF/ phonGIF - bitpapped IPA characters (based on the wsuipa METAFONT)] | * [http://www.entmp.org/linguistics/phonGIF/ phonGIF - bitpapped IPA characters (based on the wsuipa METAFONT)] | ||
+ | |||
+ | *[http://www.ldc.upenn.edu/Catalog/LDC2001S97.html 2000 NIST Speaker Recognition Evaluation Corpus] | ||
+ | |||
+ | *[http://ixa.si.ehu.es/Ixa/resources/sensecorpus A Web Corpus and Topic Signatures for All WordNet 1.6 Nominal Senses (v 1.0)] | ||
+ | |||
+ | *[http://odur.let.rug.nl/~vannoord/trees/ Alpino Treebank] | ||
+ | |||
+ | *[http://www.aot.ru/search1.html AOT] | ||
+ | |||
+ | *[http://pioneer.chula.ac.th/~awirote/ling/corpuslst.htm Corpus Resources (Chulalongkorn University, Thailand)] | ||
+ | |||
+ | *[ftp://ftp.cs.cornell.edu/pub/smart/cran/ Cranfield collection] | ||
+ | |||
+ | *[http://corpus.rae.es/creanet.html CREA] | ||
+ | |||
+ | *[http://www.eat.rl.ac.uk/ Edinburgh Associative Thesaurus (EAT)] | ||
+ | |||
+ | *[http://www.hum.uva.nl/~ewn EuroWordNet] | ||
+ | |||
+ | *[http://rali.iro.umontreal.ca/ Hansards Corpus - Searchable] | ||
+ | |||
+ | *[http://www.hcrc.ed.ac.uk/maptask/ HCRC Map Task Corpus XML annotations] | ||
+ | |||
+ | *[http://nats-www.informatik.uni-hamburg.de/~ingo/icopost/ ICOPOST] | ||
+ | |||
+ | *[http://www.ims.uni-stuttgart.de/projekte/TC.html IMS Corpus Toolbox, Univ. of Stuttgart] | ||
+ | |||
+ | *[http://www.ims.uni-stuttgart.de/projekte/CorpusWorkbench/ IMS Corpus Workbench (CWB)] | ||
+ | |||
+ | *[http://cecl.fltr.ucl.ac.be/Cecl-Projects/Icle/icle.htm International Corpus of Learner English] | ||
+ | |||
+ | *[http://www.ipds.uni-kiel.de/links/datenmaterial.en.html Kiel University's Institute on Phonetics and Speech Procesing] | ||
+ | |||
+ | *[http://www.nilc.icmc.usp.br/lacioweb Lacio Web Corpora] | ||
+ | |||
+ | *[http://www.vuw.ac.nz/llc/ LANGUAGE LEARNING CENTER - ACADEMIC CORPUS] | ||
+ | |||
+ | *[http://www.bmanuel.org/clr2_mp.html Manuel Barbera: General Corpora and Corpus Linguistics Resources] | ||
+ | |||
+ | *[ftp://ftp.cs.cornell.edu/pub/smart/med/ Medlars collection] | ||
+ | |||
+ | *[ftp://ftp.ox.ac.uk/pub/wordlists/ Miscellaneous Word Lists from Oxford University] | ||
+ | |||
+ | *[http://www.lpl.univ-aix.fr/projects/multext/ Multilingual Text Tools and Corpora] | ||
+ | |||
+ | *[http://www.census.gov/genealogy/names Name lists from US census] | ||
+ | |||
+ | *[http://www.di.fc.ul.pt/~ahb/nexing.htm Nexing Corpus] | ||
+ | |||
+ | *[http://www.cs.cmu.edu/web/books.html On-line books at CMU] | ||
+ | |||
+ | *[http://logos.uio.no/opus/ OPUS -- An Open Source Parallel Corpus] | ||
+ | |||
+ | *[http://elex.amu.edu.pl/~przemka/PICLE_search.php Polish subcorpus of the International Corpus of Learner English] | ||
+ | |||
+ | *[http://www.cirp.es/WXN/wxn/frames/proxectos.html Ramon Piero Center for Research] | ||
+ | |||
+ | *[http://about.reuters.com/researchandstandards/corpus/ Reuters Corpus] | ||
+ | |||
+ | *[http://www.ldc.upenn.edu/Catalog/LDC2001S97.html Speech in Noisy Environments 1 (SPINE1 CODED) Coded Audio] | ||
+ | |||
+ | *[http://www.ldc.upenn.edu/Catalog/LDC2001S99.html Speech in Noisy Environments 2 (SPINE2 CODED) Coded Audio] | ||
+ | |||
+ | *[http://www.cs.cmu.edu/afs/cs/project/ai-repository/ai/areas/nlp/doc/notes/corpora.txt Survey of Electronic Corpora (by Jane A. | ||
+ | Edwards, file at CMU)] | ||
+ | |||
+ | *[http://www.ucl.ac.uk/english-usage/ Survey of English Usage, University College, London] | ||
+ | |||
+ | *[http://www.icsi.berkeley.edu/real/stp/index.html Switchboard Transcription Project] | ||
+ | |||
+ | *[http://www.tractor.de/ TELRI Research Archive of Computational Tools and Resources] | ||
+ | |||
+ | *[http://childes.psy.cmu.edu/ The Childes Corpus - Children's language] | ||
+ | |||
+ | *[http://nora.hd.uib.no/index-e.html The CORPORA DataCenter (Norway)] | ||
+ | |||
+ | *[ftp://ftp.dcs.shef.ac.uk/share/ilash/Moby/ The Moby Corpus] | ||
+ | |||
+ | *[http://www.hf.uio.no/tekstlab/prosjekter/SOFIE.htm The Sofie Treebank - A Parallel Treebank of North European Languages] |
Revision as of 13:38, 26 April 2008
- Proceedings of the Corpus Linguistics 2005 Workshop on Using Corpora for Natural Language Generation
- [http://www.cs.cmu.edu/afs/cs/project/ai-repository/ai/areas/nlp/doc/notes/corpora.txt Survey of Electronic Corpora (by Jane A.
Edwards, file at CMU)]