Difference between revisions of "TWSI Turk bootstrap Word Sense Inventory (Repository)"

From ACL Wiki
Jump to: navigation, search
Line 3: Line 3:
 
* '''Name of Dataset:''' TWSI (Turk bootstrap Word Sense Inventory)  
 
* '''Name of Dataset:''' TWSI (Turk bootstrap Word Sense Inventory)  
  
* '''Contributor:''' [http://wortschatz.uni-leipzig.de/~cbiemann/ Chris Biemann], Powerset (a Microsoft company), February 1st, 2010.
+
* '''Contributor:''' [http://wortschatz.uni-leipzig.de/~cbiemann/ Chris Biemann], Powerset (a Microsoft company), February 1st, 2010 / October 18th, 2010.
  
 
* '''Copyright:''' (c) 2010, Microsoft Corp.  
 
* '''Copyright:''' (c) 2010, Microsoft Corp.  
Line 13: Line 13:
 
::* C. Biemann and V. Nygaard (2010): Crowdsourcing WordNet.  In Proceedings of the 5th Global WordNet conference, Mumbai, India. , ''ACL Data and Code Repository'', ADCR2010T005, http://aclweb.org/aclwiki.
 
::* C. Biemann and V. Nygaard (2010): Crowdsourcing WordNet.  In Proceedings of the 5th Global WordNet conference, Mumbai, India. , ''ACL Data and Code Repository'', ADCR2010T005, http://aclweb.org/aclwiki.
  
* '''Description:''' Collection of more than 50,000 sentences for 397 frequent target nouns from Wikipedia, sense-labeled and with substitutions.
+
* '''Description:''' Version 1: Collection of more than 50,000 sentences for 397 frequent target nouns from Wikipedia, sense-labeled and with substitutions. Version 2: Collection of more than 118,000 sentences for additional 615 frequent target nouns from Wikipedia, sense-labeled and with substitutions.
  
* '''Download:''' [http://aclweb.org/aclwiki/index.php?title=Image:TWSI397.zip] - TWSI version 1 [http://aclweb.org/aclwiki/index.php?title=Image:TWSI397_source_sentences.zip] Supplementary Data
+
* '''Download:''' [http://aclweb.org/aclwiki/index.php?title=Image:TWSI397.zip] - TWSI version 1 [http://aclweb.org/aclwiki/index.php?title=Image:TWSI397_source_sentences.zip] Supplementary Data version 1;
  
 
[[Category:Data and code repository|TWSI_Turk_bootstrap_Word_Sense_Inventory_(Repository)]]
 
[[Category:Data and code repository|TWSI_Turk_bootstrap_Word_Sense_Inventory_(Repository)]]

Revision as of 12:40, 18 October 2010

  • ADCR ID: ADCR2010T005
  • Name of Dataset: TWSI (Turk bootstrap Word Sense Inventory)
  • Contributor: Chris Biemann, Powerset (a Microsoft company), February 1st, 2010 / October 18th, 2010.
  • Copyright: (c) 2010, Microsoft Corp.
  • Citation: If you use the Turk bootstrap Word Sense Inventory in your research, please include the following citation in any resulting papers:
  • C. Biemann and V. Nygaard (2010): Crowdsourcing WordNet. In Proceedings of the 5th Global WordNet conference, Mumbai, India. , ACL Data and Code Repository, ADCR2010T005, http://aclweb.org/aclwiki.
  • Description: Version 1: Collection of more than 50,000 sentences for 397 frequent target nouns from Wikipedia, sense-labeled and with substitutions. Version 2: Collection of more than 118,000 sentences for additional 615 frequent target nouns from Wikipedia, sense-labeled and with substitutions.
  • Download: [1] - TWSI version 1 [2] Supplementary Data version 1;