Difference between revisions of "RTE Knowledge Resources"

From ACL Wiki
Jump to navigation Jump to search
m
Line 6: Line 6:
 
<br>
 
<br>
 
__TOC__
 
__TOC__
 +
<br>
 +
=== RTE6 - Call for Resources ===
 +
<br>
 +
<br>
 +
=== RTE5 - Ablation Tests ===
 
<br>
 
<br>
 
=== Publicly available Resources ===
 
=== Publicly available Resources ===
{|class="wikitable sortable" cellpadding="3" cellspacing="0" style="margin-left: 20px;" border="1"
+
{|class="wikitable sortable" cellpadding="3" cellspacing="0" border="1"
 +
 
 
|- bgcolor="#CDCDCD"
 
|- bgcolor="#CDCDCD"
 
! Resource
 
! Resource
Line 16: Line 22:
 
! RTE Users*  
 
! RTE Users*  
 
! class="unsortable"|Usage info
 
! class="unsortable"|Usage info
 +
 
|- bgcolor="#ECECEC" "align="left"
 
|- bgcolor="#ECECEC" "align="left"
 
| [[WordNet]]
 
| [[WordNet]]
Line 23: Line 30:
 
| style="text-align: center;"|24
 
| style="text-align: center;"|24
 
| [[WordNet - RTE Users|Users]]
 
| [[WordNet - RTE Users|Users]]
 +
 
|- bgcolor="#ECECEC" "align="left"
 
|- bgcolor="#ECECEC" "align="left"
 
| [http://verbs.colorado.edu/~mpalmer/projects/verbnet.html Verbnet]
 
| [http://verbs.colorado.edu/~mpalmer/projects/verbnet.html Verbnet]
Line 30: Line 38:
 
| style="text-align: center;"|4
 
| style="text-align: center;"|4
 
| [[Verbnet - RTE Users|Users]]
 
| [[Verbnet - RTE Users|Users]]
 +
 
|- bgcolor="#ECECEC" "align="left"
 
|- bgcolor="#ECECEC" "align="left"
 
| [[VerbOcean]]
 
| [[VerbOcean]]
Line 37: Line 46:
 
| style="text-align: center;"|5
 
| style="text-align: center;"|5
 
| [[VerbOcean - RTE Users|Users]]
 
| [[VerbOcean - RTE Users|Users]]
 +
 
|- bgcolor="#ECECEC" "align="left"
 
|- bgcolor="#ECECEC" "align="left"
 
| [http://framenet.icsi.berkeley.edu/ FrameNet]
 
| [http://framenet.icsi.berkeley.edu/ FrameNet]
Line 44: Line 54:
 
| style="text-align: center;"|2
 
| style="text-align: center;"|2
 
| [[Framenet - RTE Users|Users]]
 
| [[Framenet - RTE Users|Users]]
 +
 
|- bgcolor="#ECECEC" "align="left"
 
|- bgcolor="#ECECEC" "align="left"
 
| [http://nlp.cs.nyu.edu/meyers/NomBank.html NomBank]
 
| [http://nlp.cs.nyu.edu/meyers/NomBank.html NomBank]
Line 51: Line 62:
 
| style="text-align: center;"|3
 
| style="text-align: center;"|3
 
| [[NomBank Resource - RTE Users|Users]]  
 
| [[NomBank Resource - RTE Users|Users]]  
 +
 
|- bgcolor="#ECECEC" "align="left"
 
|- bgcolor="#ECECEC" "align="left"
 
| [http://verbs.colorado.edu/~mpalmer/projects/ace.html PropBank]
 
| [http://verbs.colorado.edu/~mpalmer/projects/ace.html PropBank]
Line 58: Line 70:
 
| style="text-align: center;"|3
 
| style="text-align: center;"|3
 
| [[PropBank Resource - RTE Users|Users]]
 
| [[PropBank Resource - RTE Users|Users]]
 +
 
|- bgcolor="#ECECEC" "align="left"
 
|- bgcolor="#ECECEC" "align="left"
 
| [http://nlp.cs.nyu.edu/nomlex/index.html Nomlex] Plus
 
| [http://nlp.cs.nyu.edu/nomlex/index.html Nomlex] Plus
Line 65: Line 78:
 
| style="text-align: center;"|1
 
| style="text-align: center;"|1
 
| [[Nomlex Plus - RTE Users|Users]]
 
| [[Nomlex Plus - RTE Users|Users]]
 +
 
|- bgcolor="#ECECEC" "align="left"
 
|- bgcolor="#ECECEC" "align="left"
 
| [http://www.wikipedia.org/ Wikipedia]
 
| [http://www.wikipedia.org/ Wikipedia]
Line 72: Line 86:
 
| style="text-align: center;"|3
 
| style="text-align: center;"|3
 
| [[Wikipedia - RTE Users|Users]]
 
| [[Wikipedia - RTE Users|Users]]
 +
 
|- bgcolor="#ECECEC" "align="left"
 
|- bgcolor="#ECECEC" "align="left"
 
| [[TEASE]] Collection
 
| [[TEASE]] Collection
Line 79: Line 94:
 
| style="text-align: center;"|0
 
| style="text-align: center;"|0
 
| [[Tease Collection - RTE Users|Users]]
 
| [[Tease Collection - RTE Users|Users]]
 +
 
|- bgcolor="#ECECEC" "align="left"
 
|- bgcolor="#ECECEC" "align="left"
 
| [http://badc.nerc.ac.uk/help/abbrevs.html BADC Acronym and Abbreviation List]
 
| [http://badc.nerc.ac.uk/help/abbrevs.html BADC Acronym and Abbreviation List]
Line 86: Line 102:
 
| style="text-align: center;"|1
 
| style="text-align: center;"|1
 
| [[BADC Acronym and Abbreviation List - RTE Users|Users]]
 
| [[BADC Acronym and Abbreviation List - RTE Users|Users]]
 +
 
|- bgcolor="#ECECEC" "align="left"
 
|- bgcolor="#ECECEC" "align="left"
 
| [http://www.acronym-guide.com/ Acronym Guide]
 
| [http://www.acronym-guide.com/ Acronym Guide]
Line 93: Line 110:
 
| style="text-align: center;"|1
 
| style="text-align: center;"|1
 
| [[Acronym Guide - RTE Users|Users]]
 
| [[Acronym Guide - RTE Users|Users]]
 +
 
|- bgcolor="#ECECEC" "align="left"
 
|- bgcolor="#ECECEC" "align="left"
 
| [http://www.cs.ualberta.ca/~lindek/downloads.htm Dekang Lin’s Thesaurus]
 
| [http://www.cs.ualberta.ca/~lindek/downloads.htm Dekang Lin’s Thesaurus]
Line 100: Line 118:
 
| style="text-align: center;"|1
 
| style="text-align: center;"|1
 
| [[Dekang Lin’s Thesaurus - RTE Users|Users]]
 
| [[Dekang Lin’s Thesaurus - RTE Users|Users]]
 +
 
|- bgcolor="#ECECEC" "align="left"
 
|- bgcolor="#ECECEC" "align="left"
 
| [http://en.wikipedia.org/wiki/Roget%27s_Thesaurus Roget's Thesaurus]
 
| [http://en.wikipedia.org/wiki/Roget%27s_Thesaurus Roget's Thesaurus]
Line 107: Line 126:
 
| style="text-align: center;"|1
 
| style="text-align: center;"|1
 
| [[Roget's Thesaurus - RTE Users|Users]]
 
| [[Roget's Thesaurus - RTE Users|Users]]
 +
 
|- bgcolor="#ECECEC" "align="left"
 
|- bgcolor="#ECECEC" "align="left"
 
| [http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC2006T13 Web1T 5-grams]
 
| [http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC2006T13 Web1T 5-grams]
Line 114: Line 134:
 
| style="text-align: center;"|1
 
| style="text-align: center;"|1
 
| [[Web1T - RTE Users|Users]]
 
| [[Web1T - RTE Users|Users]]
 +
 
|- bgcolor="#ECECEC" "align="left"
 
|- bgcolor="#ECECEC" "align="left"
 
| [http://geonames.usgs.gov/index.html GNIS - Geographic Names Information System]
 
| [http://geonames.usgs.gov/index.html GNIS - Geographic Names Information System]
Line 121: Line 142:
 
| style="text-align: center;"|1
 
| style="text-align: center;"|1
 
| [[GNIS - RTE Users|Users]]
 
| [[GNIS - RTE Users|Users]]
 +
 
|- bgcolor="#ECECEC" "align="left"
 
|- bgcolor="#ECECEC" "align="left"
 
| [http://www.geonames.org/ Geonames]
 
| [http://www.geonames.org/ Geonames]
Line 128: Line 150:
 
| style="text-align: center;"|1
 
| style="text-align: center;"|1
 
| [[Geonames - RTE Users|Users]]
 
| [[Geonames - RTE Users|Users]]
 +
 
|- bgcolor="#ECECEC" "align="left"
 
|- bgcolor="#ECECEC" "align="left"
 
| [http://nlp.cs.nyu.edu/paraphrase/ Sekine's Paraphrase Database]
 
| [http://nlp.cs.nyu.edu/paraphrase/ Sekine's Paraphrase Database]
Line 135: Line 158:
 
| style="text-align: center;"| 0
 
| style="text-align: center;"| 0
 
| [[Sekine's Paraphrase Database - RTE Users|Users]]
 
| [[Sekine's Paraphrase Database - RTE Users|Users]]
 +
 
|- bgcolor="#ECECEC" "align="left"
 
|- bgcolor="#ECECEC" "align="left"
 
| [http://research.microsoft.com/research/downloads/Details/607D14D9-20CD-47E3-85BC-A2F65CD28042/Details.aspx Microsoft Research Paraphrase Corpus]
 
| [http://research.microsoft.com/research/downloads/Details/607D14D9-20CD-47E3-85BC-A2F65CD28042/Details.aspx Microsoft Research Paraphrase Corpus]
Line 142: Line 166:
 
| style="text-align: center;"| 0
 
| style="text-align: center;"| 0
 
| [[Microsoft Research Paraphrase Corpus - RTE Users|Users]]
 
| [[Microsoft Research Paraphrase Corpus - RTE Users|Users]]
 +
 
|- bgcolor="#ECECEC" "align="left"
 
|- bgcolor="#ECECEC" "align="left"
 
| [http://www.cs.cornell.edu/~cristian/Without_a_doubt_-_Data.html Downward entailing operators]  
 
| [http://www.cs.cornell.edu/~cristian/Without_a_doubt_-_Data.html Downward entailing operators]  
Line 149: Line 174:
 
| style="text-align: center;"| 0
 
| style="text-align: center;"| 0
 
| [[Downward entailing operators - RTE Users|Users]]
 
| [[Downward entailing operators - RTE Users|Users]]
 +
 
|- bgcolor="#ECECEC" "align="left"
 
|- bgcolor="#ECECEC" "align="left"
|[http://cs.biu.ac.il/~shey/WikiRules.html WikiRules!] <br>
+
|[http://cs.biu.ac.il/~shey/WikiRules.html WikiRules!]
 
| Lexical Reference rule-base
 
| Lexical Reference rule-base
 
| Bar-Ilan University
 
| Bar-Ilan University
Line 156: Line 182:
 
| style="text-align: center;"|1  
 
| style="text-align: center;"|1  
 
| [[WikiRules! - RTE Users|Users]]
 
| [[WikiRules! - RTE Users|Users]]
 +
 
|- bgcolor="#ECECEC" "align="left"
 
|- bgcolor="#ECECEC" "align="left"
 
| ''New resource''
 
| ''New resource''
Line 163: Line 190:
 
| style="text-align: center;"|  
 
| style="text-align: center;"|  
 
| [[New Resource1 - RTE Users|Users]]
 
| [[New Resource1 - RTE Users|Users]]
 +
 
|- bgcolor="#ECECEC" "align="left"
 
|- bgcolor="#ECECEC" "align="left"
 
| ''New resource''
 
| ''New resource''
Line 170: Line 198:
 
| style="text-align: center;"|  
 
| style="text-align: center;"|  
 
| [[New Resource2 - RTE Users|Users]]
 
| [[New Resource2 - RTE Users|Users]]
 +
 
|}
 
|}
 
<br>
 
<br>
Line 177: Line 206:
 
The following table lists the unpublished resources used by RTE participants. Some of them have been developed by Users themselves specifically for RTE. Interested people may turn to authors to obtain further information.
 
The following table lists the unpublished resources used by RTE participants. Some of them have been developed by Users themselves specifically for RTE. Interested people may turn to authors to obtain further information.
 
<br>
 
<br>
{|class="wikitable sortable" cellpadding="3" cellspacing="0" style="margin-left: 20px;" border="1"
+
{|class="wikitable sortable" cellpadding="3" cellspacing="0" border="1"
 +
 
 
|- bgcolor="#CDCDCD"
 
|- bgcolor="#CDCDCD"
 
! Resource
 
! Resource
Line 185: Line 215:
 
! RTE Users*  
 
! RTE Users*  
 
! class="unsortable"|Usage info
 
! class="unsortable"|Usage info
 +
 
|- bgcolor="#ECECEC" "align="left"
 
|- bgcolor="#ECECEC" "align="left"
 
| [http://www.parc.com/ PARC] Polarity Lexicon
 
| [http://www.parc.com/ PARC] Polarity Lexicon
Line 192: Line 223:
 
| style="text-align: center;"|1
 
| style="text-align: center;"|1
 
| [[Parc Polarity Lexicon - RTE Users|Users]]
 
| [[Parc Polarity Lexicon - RTE Users|Users]]
 +
 
|- bgcolor="#ECECEC" "align="left"
 
|- bgcolor="#ECECEC" "align="left"
 
| [[DIRT Paraphrase Collection]]
 
| [[DIRT Paraphrase Collection]]
Line 199: Line 231:
 
| style="text-align: center;"|5
 
| style="text-align: center;"|5
 
| [[DIRT Paraphrase Collection - RTE Users|Users]]
 
| [[DIRT Paraphrase Collection - RTE Users|Users]]
 +
 
|- bgcolor="#ECECEC" "align="left"
 
|- bgcolor="#ECECEC" "align="left"
 
| Gazetteer from [http://trec.nist.gov/ TREC]
 
| Gazetteer from [http://trec.nist.gov/ TREC]
Line 206: Line 239:
 
| style="text-align: center;"|1  
 
| style="text-align: center;"|1  
 
| [[Gazetteer from TREC - RTE Users|Users]]
 
| [[Gazetteer from TREC - RTE Users|Users]]
 +
 
|- bgcolor="#ECECEC" "align="left"
 
|- bgcolor="#ECECEC" "align="left"
 
| DFKI Geographic Ontology<br>''(to be released)''
 
| DFKI Geographic Ontology<br>''(to be released)''
Line 213: Line 247:
 
| style="text-align: center;"|1
 
| style="text-align: center;"|1
 
| [[Geographic Ontology - RTE Users|Users]]
 
| [[Geographic Ontology - RTE Users|Users]]
 +
 
|- bgcolor="#ECECEC" "align="left"
 
|- bgcolor="#ECECEC" "align="left"
 
| Syntactic rule base <br>''(to be released)''
 
| Syntactic rule base <br>''(to be released)''
Line 220: Line 255:
 
| style="text-align: center;"|1
 
| style="text-align: center;"|1
 
| [[Syntactic rule base - RTE Users|Users]]
 
| [[Syntactic rule base - RTE Users|Users]]
 +
 
|- bgcolor="#ECECEC" "align="left"
 
|- bgcolor="#ECECEC" "align="left"
 
| Polarity rule base <br>''(to be released)''
 
| Polarity rule base <br>''(to be released)''
Line 227: Line 263:
 
| style="text-align: center;"|1  
 
| style="text-align: center;"|1  
 
| [[Polarity rule base - RTE Users|Users]]
 
| [[Polarity rule base - RTE Users|Users]]
 +
 
|- bgcolor="#ECECEC" "align="left"
 
|- bgcolor="#ECECEC" "align="left"
 
| Lexical-Syntactic rule base combining WordNet, NomLex-plus and Unary DIRT
 
| Lexical-Syntactic rule base combining WordNet, NomLex-plus and Unary DIRT
Line 234: Line 271:
 
| style="text-align: center;"|1  
 
| style="text-align: center;"|1  
 
| [[Lexical-Syntactic rule base - RTE Users|Users]]
 
| [[Lexical-Syntactic rule base - RTE Users|Users]]
 +
 
|- bgcolor="#ECECEC" "align="left"
 
|- bgcolor="#ECECEC" "align="left"
 
| OPENU Collection
 
| OPENU Collection
Line 241: Line 279:
 
| style="text-align: center;"|1
 
| style="text-align: center;"|1
 
| [[OPENU Collection - RTE Users|Users]]
 
| [[OPENU Collection - RTE Users|Users]]
 +
 
|- bgcolor="#ECECEC" "align="left"
 
|- bgcolor="#ECECEC" "align="left"
 
| ''New resource''
 
| ''New resource''
Line 248: Line 287:
 
| style="text-align: center;"|  
 
| style="text-align: center;"|  
 
| [[New Resource3 - RTE Users|Users]]
 
| [[New Resource3 - RTE Users|Users]]
 +
 
|- bgcolor="#ECECEC" "align="left"
 
|- bgcolor="#ECECEC" "align="left"
 
| ''New resource''
 
| ''New resource''
Line 255: Line 295:
 
| style="text-align: center;"|  
 
| style="text-align: center;"|  
 
| [[New Resource4 - RTE Users|Users]]
 
| [[New Resource4 - RTE Users|Users]]
 +
 
|}
 
|}
 
<br>
 
<br>

Revision as of 02:19, 24 November 2009

Knowledge resources have shown their relevance for applied semantic inference, and are extensively used by applied inference systems, such as those developed within the Textual Entailment framework.

This page presents a list of the knowledge resources used by systems that have participated in the last RTE challenges. The first table lists the publicly available resources, the second one lists unpublished resources. Both tables are sortable by Resource name, type, author and number of users.

RTE Participants are encouraged to add information about all kind of knowledge resources used, from standard existing resources (e.g. WordNet) to knowledge collections created for specific purposes, which can be made available to the community.


RTE6 - Call for Resources



RTE5 - Ablation Tests


Publicly available Resources

Resource Type Author Brief description RTE Users* Usage info
WordNet Lexical DB Princeton University Lexical database of English nouns, verbs, adjectives and adverbs 24 Users
Verbnet Lexical DB University of Colorado Boulder Lexicon for English verbs organized into classes extending Levin (1993) classes through refinement and addition of subclasses to achieve syntactic and semantic coherence among members of a class 4 Users
VerbOcean Lexical DB Information Sciences Institute, University of Southern California Broad-coverage semantic network of verbs 5 Users
FrameNet Lexical DB ICSI (International Computer Science Institute) - Berkley University Lexical resource for English words, based on frame semantics (valences) and supported by corpus evidence 2 Users
NomBank Lexical DB New York University Lexical resource containing syntactic frames for nouns, extracted from annotated corpora 3 Users
PropBank Lexical DB University of Colorado Boulder Lexical resource containing syntactic frames for verbs, extracted from annotated corpora 3 Users
Nomlex Plus Lexical DB New York University Dictionary of English nominalizations: it describes the allowed complements for a nominalization and relates the nominal complements to the arguments of the corresponding verb 1 Users
Wikipedia Encyclopedia Free encyclopedia. Used for extraction of lexical-semantic rules (from its more structured parts), named entity recognition, geographical information etc. 3 Users
TEASE Collection Collection of Entailment Rules Bar-Ilan University Output of the TEASE algorithm 0 Users
BADC Acronym and Abbreviation List Word List BADC (British Atmospheric Data Centre) Acronym and Abbreviation List 1 Users
Acronym Guide Word List Acronym-Guide.com Acronym and Abbreviation Lists for English, branched in thematic directories 1 Users
Dekang Lin’s Thesaurus Thesaurus University of Alberta Thesaurus automatically constructed using a parsed corpus, based on distributional similarity scores 1 Users
Roget's Thesaurus Thesaurus Peter Mark Roget (Electronic version distributed by University of Chicago) Roget's Thesaurus is a widely-used English thesaurus, created by Dr. Peter Mark Roget in 1805. The original edition had 15,000 words, and each new edition has been larger. The electronic edition (version 1.02) is made available by University of Chicago. 1 Users
Web1T 5-grams Word list Linguistic Data Consortium, University of Pennsylvania; Google Inc. Data set containing English word n-grams and their observed frequency counts. The n-gram counts were generated from approximately 1 trillion word tokens of text from publicly accessible Web pages 1 Users
GNIS - Geographic Names Information System Gazetteer USGS (United States Geological Survey) Database containing the Federal and national standard toponyms for USA, associated areas and Antarctica 1 Users
Geonames Gazetteer Database containing eight million geographical names. It is integrating geographical data such as names of places in various languages, elevation, population and others from various sources. 1 Users
Sekine's Paraphrase Database Collection of paraphrases Department of Computer Science, New York University Data-base created using Sekine's method, NOT cleaned up by human. It includes 19,975 sets of paraphrases with 191,572 phrases. 0 Users
Microsoft Research Paraphrase Corpus Collection of paraphrases Microsoft Research Text file containing 5800 pairs of sentences which have been extracted from news sources on the web, along with human annotations indicating whether each pair captures a paraphrase/semantic equivalence relationship. 0 Users
Downward entailing operators Collection of entailing operators Department of Computer Science, Cornell University, Ithaca NY System output of an unsupervised algorithm recovering many Downward Entailing operators, like 'doubt'. 0 Users
WikiRules! Lexical Reference rule-base Bar-Ilan University Extraction of lexical reference rules from the text body (first sentence) and from metadata (links, redirects, parentheses) of Wikipedia 1 Users
New resource Participants are encouraged to contribute Users
New resource Participants are encouraged to contribute Users



Not available Resources

The following table lists the unpublished resources used by RTE participants. Some of them have been developed by Users themselves specifically for RTE. Interested people may turn to authors to obtain further information.

Resource Type Author Brief description RTE Users* Usage info
PARC Polarity Lexicon Lexical DB PARC - Palo Alto Research Center Verbs classification with respect to semantic polarity 1 Users
DIRT Paraphrase Collection Collection of paraphrases University of Alberta Output of the DIRT algorithm 5 Users
Gazetteer from TREC Gazetteer NIST - National Institute of Standards and Technology Cities and other geographical names 1 Users
DFKI Geographic Ontology
(to be released)
Ontology DFKI - German Research Center for Artificial Intelligence Ontology containing geographic terms and two kinds of relations: the directional part-of relation, and the equal relation for synonyms and abbreviations of the same geographic area (e.g the United Kingdom, the UK, Great Britain, etc.) 1 Users
Syntactic rule base
(to be released)
Collection of Entailment Rules Bar-Ilan University; Tel-Aviv University A manually-composed collection of entailment rules which define parse tree transformations. The rules cover generic syntactic phenomena such as appositions, conjunctions, passive, relative clause, etc. (Bar-Haim et al., AAAI-07) 1 Users
Polarity rule base
(to be released)
Collection of Entailment Rules Bar-Ilan University; Tel-Aviv University A manually-composed collection of entailment rules which detect predicates whose polarity is negative (e.g. didn't dance) or unknown (e.g. plans to dance). The rules capture diverse phenomena that affect polarity, e.g. verbal negation, modal verbs, conditionals, and certain verbs that induce negative or "unknown" polarity context. The latter were taken mainly from VerbNet. Extends a resource described in (Bar-Haim et al., AAAI-07) 1 Users
Lexical-Syntactic rule base combining WordNet, NomLex-plus and Unary DIRT Collection of Entailment Rules Bar-Ilan University; Tel-Aviv University Extract lexical-syntactic entailment rules for predicates (verbal and nominal), including argument mapping. The resource is based on WordNet, Nomlex-Plus and Unary DIRT (Szpektor and Dagan, Coling 08) 1 Users
OPENU Collection Collection of Entailment Rules and Patterns Open University Collections of rules, patterns etc. for RTE purpose, extracted from Reuter corpus parsed using Minipar. 1 Users
New resource Participants are encouraged to contribute Users
New resource Participants are encouraged to contribute Users



[*] The number of Users (see "Usage Info" links for details) refers to participants in the last two RTE challenges.
RTE-3 data have been provided only by participants, whereas RTE-4 data have been integrated with information extracted from the related proceedings.