Difference between revisions of "TWSI Turk bootstrap Word Sense Inventory 2.0 (Repository)"

From ACL Wiki
Line 16:

  * '''Download:'''
+ Download link for full TWSI data: http://www.ukp.tu-darmstadt.de/data/lexical-resources/twsi-lexical-substitutions
+
  - Version 1
  [http://aclweb.org/aclwiki/index.php?title=Image:TWSI397.zip] - TWSI version 1 [http://aclweb.org/aclwiki/index.php?title=Image:TWSI397_source_sentences.zip] Supplementary Data version 1;
− - Version 2: split due to file size limitations
+ - Version 2: Due to file size limitations the data is not available here. Please download it at: [http://www.ukp.tu-darmstadt.de/data/lexical-resources/twsi-lexical-substitutions]
− TWSI version 2 letters A-M: [http://aclweb.org/aclwiki/index.php?title=Image:Turkboot615_A-M.zip]  TWSI version 2 letters N-Z: [http://aclweb.org/aclwiki/index.php?title=Image:Turkboot615_N-Z.zip]
− Supplementary Data version 2 (1): [http://aclweb.org/aclwiki/index.php?title=Image:TWSI615_source_sentences_1.zip] Supplementary Data version 2 (2): [http://aclweb.org/aclwiki/index.php?title=Image:TWST615_source_sentences_2.zip]
  [[Category:Data and code repository|TWSI_Turk_bootstrap_Word_Sense_Inventory_(Repository)]]

Revision as of 06:45, 11 May 2012

  • ADCR ID: ADCR2010T006
  • Contributor: Chris Biemann, Powerset (a Microsoft company), October 18th, 2010
  • Copyright: (c) 2010, Microsoft Corp.
  • Citation: If you use the Turk bootstrap Word Sense Inventory in your research, please include the following citation in any resulting papers:
  • C. Biemann and V. Nygaard (2010): Crowdsourcing WordNet. In Proceedings of the 5th Global WordNet Conference, Mumbai, India. ACL Data and Code Repository, ADCR2010T006, http://aclweb.org/aclwiki.
  • Description: Version 1: Collection of more than 50,000 sentences for 397 frequent target nouns from Wikipedia, sense-labeled and with substitutions. Version 2: Collection of more than 118,000 sentences for an additional 615 frequent target nouns from Wikipedia, sense-labeled and with substitutions.
  • Download:

Download link for full TWSI data: http://www.ukp.tu-darmstadt.de/data/lexical-resources/twsi-lexical-substitutions


- Version 1: [1] - TWSI version 1 [2] Supplementary Data version 1;
- Version 2: Due to file size limitations, the data is not available here. Please download it at: [3]