Search results
Jump to navigation
Jump to search
- ...Paraphrase Identification (State of the art)|Microsoft Research Paraphrase Corpus]]2 KB (270 words) - 18:21, 12 August 2019
- | Distributional || 0.56 || - || Trained on word2vec corpus, best results for pure distributional model. | Do19-corpus6 KB (733 words) - 18:34, 15 September 2019
- | [http://www.sigwac.org.uk/wiki/WAC5 WAC5] || Fifth Workshop on Web as Corpus || co-located [http://ixa2.si.ehu.es/sepln2009/ SEPLN 2009] || San Sebastia4 KB (493 words) - 08:08, 29 July 2009
- * [http://quran.uk.net/ Quranic Arabic Corpus], 77,430 words of Quranic Arabic, with manually verified contextual POS, in3 KB (449 words) - 05:36, 29 June 2020
- ...d Frequencies in Written and Spoken English: based on the British National Corpus]5 KB (700 words) - 05:46, 23 April 2020
- * [http://cleaneval.sigwac.org.uk/ CLEANEVAL 2007] - Cleaning pages for Web corpus creation5 KB (644 words) - 05:14, 25 June 2012
- ...com/bncweb/home.html BNCweb: A Web-Based Interface to the British National Corpus ] * [http://pie.usna.edu/explorec.html Chargrams Database from British National Corpus ]33 KB (5,109 words) - 05:20, 25 June 2012
- * [[Cleaneval (State of the art)| Web Corpus Cleaning]] (stub)4 KB (563 words) - 18:23, 12 August 2019
- ...ce for English words, based on frame semantics (valences) and supported by corpus evidence | Thesaurus automatically constructed using a parsed corpus, based on distributional similarity scores28 KB (3,640 words) - 05:18, 25 June 2012
- ...85.5% unknown word accuracy on a 10-fold cross-validation of the Penn WSJ corpus. ...her results). The distributed GENiA tagger is trained on a mixed training corpus and gets 96.94% on WSJ, and 98.26% on GENiA biomedical English.11 KB (1,408 words) - 12:27, 4 March 2019
- ...lde1998">Langkilde, Irene and Kevin Knight. 1998. Generation that exploits corpus-based statistical ...k (LDC2005T13), the strings of the venerable PTB Wall Street Journal (WSJ) corpus are annotated with pairs of (a) CCG syntactic derivations and (b) sets of s13 KB (1,724 words) - 09:13, 1 June 2017
- ...text-align: center;"| No acronyms of organization names extracted from the corpus.6 KB (744 words) - 08:40, 27 March 2012
- understand, and the demo shows the human corpus4 KB (593 words) - 05:16, 25 June 2012
- ...generation of natural language summaries providing historical background: corpus-based analysis, design, implementation and evaluation</i>, PhD thesis, Colu Anja Belz and Sebastian Varges (eds). <i>Proceedings of the Corpus Linguistics 2005Workshop on Using Corpora for Natural Language Generation</11 KB (1,721 words) - 05:15, 25 June 2012
- * Specialty: Corpus Creation, Text and Speech Annotation Support * Specialty: Corpus Creation, Arabic Treebank10 KB (1,397 words) - 09:10, 30 April 2009
- ...4D9-20CD-47E3-85BC-A2F65CD28042/default.aspx Microsoft Research Paraphrase Corpus] (MSRP) ...arava, C. (2006). [http://www.cse.unt.edu/~rada/papers/mihalcea.aaai06.pdf Corpus-based and knowledge-based measures of text semantic similarity], ''Proceedi9 KB (1,169 words) - 02:34, 29 November 2016
- ...ltr.ucl.ac.be/fltr/germ/etan/cecl/cecl.html UCLouvain - Centre for English Corpus Linguistics (CECL)] ...cuni.cz/ Ústav Českého národního korpusu] (Institute of the Czech National Corpus)31 KB (4,531 words) - 11:23, 19 September 2022
- ...i.ro/fcs/en/plan/plan-1-ml.html Introduction to Computational Linguistics, Corpus Linguistics ] ||Graduate ||C C++ Java Lisp Perl ||2003 |USA ||San Diego State University ||Computational Corpus Linguistics ||Both ||Python ||200640 KB (5,097 words) - 08:29, 2 December 2021
- into a corpus of Combinatory Categorial Grammar derivations, created by [http://www.cis.u The [http://gmb.let.rug.nl Groningen Meaning Bank] is an annotated corpus of public domain texts. Version 1.0 comprises 1,000 texts with CCG analyses11 KB (1,549 words) - 09:24, 26 January 2016
- |[http://www.aclweb.org/anthology/P13-3024 A corpus-based evaluation method for Distributional Semantic Models] ...D17-1323/ Men Also Like Shopping: Reducing Gender Bias Amplification using Corpus-level Constraints]14 KB (1,963 words) - 10:36, 5 July 2021