TWSI Turk bootstrap Word Sense Inventory 2.0 (Repository)
Jump to navigation
Jump to search
- ADCR ID: ADCR2010T006
- Name of Dataset: TWSI (Turk bootstrap Word Sense Inventory) 2.0, includes TWSI Turk bootstrap Word Sense Inventory (Repository)
- Contributor: Chris Biemann, Powerset (a Microsoft company), Octber 18th, 2010
- Copyright: (c) 2010, Microsoft Corp.
- Licensing: This work is licensed under the Creative Commons Attribution-Share Alike 3.0 .
- Citation: If you use the Turk bootstrap Word Sense Inventory in your research, please include the following citation in any resulting papers:
- C. Biemann and V. Nygaard (2010): Crowdsourcing WordNet. In Proceedings of the 5th Global WordNet conference, Mumbai, India. , ACL Data and Code Repository, ADCR2010T006, http://aclweb.org/aclwiki.
- Description: Version 1: Collection of more than 50,000 sentences for 397 frequent target nouns from Wikipedia, sense-labeled and with substitutions. Version 2: Collection of more than 118,000 sentences for additional 615 frequent target nouns from Wikipedia, sense-labeled and with substitutions.
- Download:
Download link for full TWSI data: http://www.lt.informatik.tu-darmstadt.de/de/data/twsi-turk-bootstrap-word-sense-inventory/
- Version 1 [1] - TWSI version 1 [2] Supplementary Data version 1; - Version 2: Due to file size limitations the data is not available here. Please download it at: http://www.lt.informatik.tu-darmstadt.de/de/data/twsi-turk-bootstrap-word-sense-inventory/