Uploads by Biem

Jump to navigation Jump to search

This special page shows all uploaded files.

File list
Date Name Thumbnail Size Description Versions
18:22, 1 February 2010 TWSI397.zip (file) 6.44 MB The TWSI is organized by target word: For the most frequent 397 nouns in English Wikipedia (dump used from January 3rd, 2008), all targets are organized into senses. With each sense, there are associated substitutions and sentences where the target word w 2
10:04, 7 June 2010 TWSI397 source sentences.zip (file) 4.55 MB Supplementary data for the TWSI Turk Bootstrap Word Sense Inventory TWSI 1.0 The file "wiki_title_sent.txt" in this Archive is an extended version of the file "corpus/wiki_titles.txt" in the TWSI 1.0. It contains 4 tab-separated columns: - sentence-id f 1
13:53, 18 October 2010 TWSI615 source sentences 1.zip (file) 4.61 MB Supplementary data for the TWSI Turk Bootstrap Word Sense Inventory TWSI 2.0 Part 1/2: concatenate parts to get full file. The file "wiki_title_sent.txt" in this archive contains 4 tab-separated columns: - sentence-id from corpus as referenced through 1
13:53, 18 October 2010 TWST615 source sentences 2.zip (file) 4.7 MB Supplementary data for the TWSI Turk Bootstrap Word Sense Inventory TWSI 2.0 Part 2/2: concatenate parts to get full file. The file "wiki_title_sent.txt" in this archive contains 4 tab-separated columns: - sentence-id from corpus as referenced through 1
13:56, 18 October 2010 Turkboot615 A-M.zip (file) 5.63 MB TWSI (Turk bootstrap Word Sense Inventory) version 2.0. This is the first part, target letters A-M. For the description of the process, please consult the paper for further documentation. In short, three Mturk tasks were used to yield the data provided 1
13:57, 18 October 2010 Turkboot615 N-Z.zip (file) 4.12 MB This file describes the data format of the TWSI (Turk bootstrap Word Sense Inventory) version 2.0. This is the second part, target letters N-Z. For the description of the process, please consult the paper for further documentation. In short, three Mturk 1
10:09, 10 March 2014 Disco2011-shared-task-complete-dataset.zip (file) 265 KB DISCO 2011 Complete Dataset (Training and Test Data, Eval Scripts) 2
10:16, 10 March 2014 AnnotationJudgmentsDiISCO2011.tar.gz (file) 378 KB (not used for task) sentence level judgments 2
10:17, 10 March 2014 ParticipantSubmissionsDISCO2011.tar.gz (file) 75 KB Submissions of Participants for shared task 2