Search results

Jump to navigation Jump to search
  • ...Paraphrase Identification (State of the art)|Microsoft Research Paraphrase Corpus]]
    2 KB (270 words) - 18:21, 12 August 2019
  • | Distributional || 0.56 || - || Trained on word2vec corpus, best results for pure distributional model. | Do19-corpus
    6 KB (733 words) - 18:34, 15 September 2019
  • | [http://www.sigwac.org.uk/wiki/WAC5 WAC5] || Fifth Workshop on Web as Corpus || co-located [http://ixa2.si.ehu.es/sepln2009/ SEPLN 2009] || San Sebastia
    4 KB (493 words) - 08:08, 29 July 2009
  • * [http://quran.uk.net/ Quranic Arabic Corpus], 77,430 words of Quranic Arabic, with manually verified contextual POS, in
    3 KB (449 words) - 05:36, 29 June 2020
  • ...d Frequencies in Written and Spoken English: based on the British National Corpus]
    5 KB (700 words) - 05:46, 23 April 2020
  • * [http://cleaneval.sigwac.org.uk/ CLEANEVAL 2007] - Cleaning pages for Web corpus creation
    5 KB (644 words) - 05:14, 25 June 2012
  • ...com/bncweb/home.html BNCweb: A Web-Based Interface to the British National Corpus ] * [http://pie.usna.edu/explorec.html Chargrams Database from British National Corpus ]
    33 KB (5,109 words) - 05:20, 25 June 2012
  • * [[Cleaneval (State of the art)| Web Corpus Cleaning]] (stub)
    4 KB (563 words) - 18:23, 12 August 2019
  • ...ce for English words, based on frame semantics (valences) and supported by corpus evidence | Thesaurus automatically constructed using a parsed corpus, based on distributional similarity scores
    28 KB (3,640 words) - 05:18, 25 June 2012
  • ...85.5% unknown word accuracy on a 10-fold cross-validation of the Penn WSJ corpus. ...her results). The distributed GENiA tagger is trained on a mixed training corpus and gets 96.94% on WSJ, and 98.26% on GENiA biomedical English.
    11 KB (1,408 words) - 12:27, 4 March 2019
  • ...lde1998">Langkilde, Irene and Kevin Knight. 1998. Generation that exploits corpus-based statistical ...k (LDC2005T13), the strings of the venerable PTB Wall Street Journal (WSJ) corpus are annotated with pairs of (a) CCG syntactic derivations and (b) sets of s
    13 KB (1,724 words) - 09:13, 1 June 2017
  • ...text-align: center;"| No acronyms of organization names extracted from the corpus.
    6 KB (744 words) - 08:40, 27 March 2012
  • understand, and the demo shows the human corpus
    4 KB (593 words) - 05:16, 25 June 2012
  • ...generation of natural language summaries providing historical background: corpus-based analysis, design, implementation and evaluation</i>, PhD thesis, Colu Anja Belz and Sebastian Varges (eds). <i>Proceedings of the Corpus Linguistics 2005Workshop on Using Corpora for Natural Language Generation</
    11 KB (1,721 words) - 05:15, 25 June 2012
  • * Specialty: Corpus Creation, Text and Speech Annotation Support * Specialty: Corpus Creation, Arabic Treebank
    10 KB (1,397 words) - 09:10, 30 April 2009
  • ...4D9-20CD-47E3-85BC-A2F65CD28042/default.aspx Microsoft Research Paraphrase Corpus] (MSRP) ...arava, C. (2006). [http://www.cse.unt.edu/~rada/papers/mihalcea.aaai06.pdf Corpus-based and knowledge-based measures of text semantic similarity], ''Proceedi
    9 KB (1,169 words) - 02:34, 29 November 2016
  • ...ltr.ucl.ac.be/fltr/germ/etan/cecl/cecl.html UCLouvain - Centre for English Corpus Linguistics (CECL)] ...cuni.cz/ Ústav Českého národního korpusu] (Institute of the Czech National Corpus)
    31 KB (4,531 words) - 11:23, 19 September 2022
  • ...i.ro/fcs/en/plan/plan-1-ml.html Introduction to Computational Linguistics, Corpus Linguistics ] ||Graduate ||C C++ Java Lisp Perl ||2003 |USA ||San Diego State University ||Computational Corpus Linguistics ||Both ||Python ||2006
    40 KB (5,097 words) - 08:29, 2 December 2021
  • into a corpus of Combinatory Categorial Grammar derivations, created by [http://www.cis.u The [http://gmb.let.rug.nl Groningen Meaning Bank] is an annotated corpus of public domain texts. Version 1.0 comprises 1,000 texts with CCG analyses
    11 KB (1,549 words) - 09:24, 26 January 2016
  • |[http://www.aclweb.org/anthology/P13-3024 A corpus-based evaluation method for Distributional Semantic Models] ...D17-1323/ Men Also Like Shopping: Reducing Gender Bias Amplification using Corpus-level Constraints]
    14 KB (1,963 words) - 10:36, 5 July 2021

View (previous 20 | next 20) (20 | 50 | 100 | 250 | 500)