Difference between revisions of "RTE Knowledge Resources"

From ACL Wiki
Jump to navigation Jump to search
m
m
Line 40: Line 40:
 
| [[Framenet RTE Users|Users]]
 
| [[Framenet RTE Users|Users]]
 
|- bgcolor="#ECECEC" "align="left"
 
|- bgcolor="#ECECEC" "align="left"
| [http://nlp.cs.nyu.edu/meyers/NomBank.html NomBank] Resources
+
| [http://nlp.cs.nyu.edu/meyers/NomBank.html NomBank]
| Lexical
+
| Lexical DB
 
| New York University
 
| New York University
| Lexical Resources containing syntactic frames for nouns, extracted from annotated corpora  
+
| Lexical resource containing syntactic frames for nouns, extracted from annotated corpora  
 
| style="text-align: center;"|2
 
| style="text-align: center;"|2
 
| [[NomBank Resources RTE Users|Users]]  
 
| [[NomBank Resources RTE Users|Users]]  
 +
|- bgcolor="#ECECEC" "align="left"
 +
| [http://verbs.colorado.edu/~mpalmer/projects/ace.html PropBank]
 +
| Lexical DB
 +
| University of Colorado Boulder
 +
| Lexical resource containing syntactic frames for verbs, extracted from annotated corpora
 +
| style="text-align: center;"|2
 +
| [[PropBank Resources RTE Users|Users]]
 
|- bgcolor="#ECECEC" "align="left"
 
|- bgcolor="#ECECEC" "align="left"
 
| [http://nlp.cs.nyu.edu/nomlex/index.html Nomlex] Plus
 
| [http://nlp.cs.nyu.edu/nomlex/index.html Nomlex] Plus
| Lexical
+
| Lexical DB
 
| New York University
 
| New York University
 
| Dictionary of English nominalizations: it describes the allowed complements for a nominalization and relates the nominal complements to the arguments of the corresponding verb
 
| Dictionary of English nominalizations: it describes the allowed complements for a nominalization and relates the nominal complements to the arguments of the corresponding verb
Line 55: Line 62:
 
|- bgcolor="#ECECEC" "align="left"
 
|- bgcolor="#ECECEC" "align="left"
 
| Parc Polarity Lexicon
 
| Parc Polarity Lexicon
| Lexical
+
| Lexical DB
 
| [http://www.parc.com/ PARC] - Palo Alto Research Center
 
| [http://www.parc.com/ PARC] - Palo Alto Research Center
 
| Verbs classification with respect to semantic polarity
 
| Verbs classification with respect to semantic polarity
Line 62: Line 69:
 
|- bgcolor="#ECECEC" "align="left"
 
|- bgcolor="#ECECEC" "align="left"
 
| [[DIRT Paraphrase Collection]]
 
| [[DIRT Paraphrase Collection]]
| Collections of paraphrases (DIRT output)
+
| Collection of paraphrases
| Various
+
| University of Alberta
 
| Output of the DIRT algorithm  
 
| Output of the DIRT algorithm  
 
| style="text-align: center;"|4
 
| style="text-align: center;"|4
 
| [[DIRT Paraphrase Collections RTE Users|Users]]
 
| [[DIRT Paraphrase Collections RTE Users|Users]]
|- bgcolor="#ECECEC" "align="left"
 
| [http://verbs.colorado.edu/~mpalmer/projects/ace.html PropBank] Resources
 
| Lexical
 
| University of Colorado Boulder
 
| Lexical Resources containing syntactic frames for verbs, extracted from annotated corpora
 
| style="text-align: center;"|2
 
| [[PropBank Resources RTE Users|Users]]
 
 
|- bgcolor="#ECECEC" "align="left"
 
|- bgcolor="#ECECEC" "align="left"
 
| [[TEASE]] Collection
 
| [[TEASE]] Collection
| Syntactic-semantic
+
| Collection of Entailment Rules
 
| Bar Ilan University
 
| Bar Ilan University
| Collection of Entailment Rules
+
| Output of the TEASE algorithm
 
| style="text-align: center;"|0
 
| style="text-align: center;"|0
 
| [[Tease Collection RTE Users|Users]]
 
| [[Tease Collection RTE Users|Users]]
Line 105: Line 105:
 
| [http://en.wikipedia.org/wiki/Roget%27s_Thesaurus Roget's Thesaurus]
 
| [http://en.wikipedia.org/wiki/Roget%27s_Thesaurus Roget's Thesaurus]
 
| Thesaurus
 
| Thesaurus
| Peter Mark Roget
+
| Peter Mark Roget (Electronic version distributed by University of Chicago)
| Roget's Thesaurus is a widely-used English thesaurus, created by Dr. Peter Mark Roget in 1805. The original edition had 15,000 words, and each new edition has been larger. The 1911 US edition ([http://machaut.uchicago.edu/rogets version 1.02]) is made available by University of Chicago.
+
| Roget's Thesaurus is a widely-used English thesaurus, created by Dr. Peter Mark Roget in 1805. The original edition had 15,000 words, and each new edition has been larger. The electronic edition ([http://machaut.uchicago.edu/rogets version 1.02]) is made available by University of Chicago.
 
| style="text-align: center;"|1
 
| style="text-align: center;"|1
 
| [[Roget's Thesaurus RTE Users|Users]]
 
| [[Roget's Thesaurus RTE Users|Users]]
Line 124: Line 124:
 
| [[Wikipedia RTE Users|Users]]
 
| [[Wikipedia RTE Users|Users]]
 
|- bgcolor="#ECECEC" "align="left"
 
|- bgcolor="#ECECEC" "align="left"
| [http://geonames.usgs.gov/domestic/index.html GNIS - Geographic Names Information System]
+
| [http://geonames.usgs.gov/index.html GNIS - Geographic Names Information System]
| Word List
+
| Gazetteer
 
| USGS - United States Geological Survey
 
| USGS - United States Geological Survey
 
| Database containing the Federal and national standard toponyms for USA, associated areas and Antarctica.
 
| Database containing the Federal and national standard toponyms for USA, associated areas and Antarctica.
Line 132: Line 132:
 
|- bgcolor="#ECECEC" "align="left"
 
|- bgcolor="#ECECEC" "align="left"
 
| [http://www.geonames.org/ Geonames]
 
| [http://www.geonames.org/ Geonames]
| Word List
+
| Gazetteer
 
|  
 
|  
 
| Database containing eight million geographical names. It is integrating geographical data such as names of places in various languages, elevation, population and others from various sources.
 
| Database containing eight million geographical names. It is integrating geographical data such as names of places in various languages, elevation, population and others from various sources.

Revision as of 01:47, 21 April 2009

The table below lists the knowledge resources used by participants in the last RTE challenges. Other important RTE resources have been added in order to encourage people to add information about potential usage.
The table is sortable by Resource name, type, author and number of users.

Resource Type Author Brief description RTE Users* Usage info
WordNet Lexical DB Princeton University Lexical database of English nouns, verbs, adjectives and adverbs 23 Users
Verbnet Lexical DB University of Colorado Boulder Lexicon for English verbs organized into classes 3 Users
VerbOcean Lexical DB University of Southern California Broad-coverage semantic network of verbs 5 Users
FrameNet Lexical DB ICSI (International Computer Science Institute) - Berkley University Lexical resource for English words, based on frame semantics (valences) and supported by corpus evidence 2 Users
NomBank Lexical DB New York University Lexical resource containing syntactic frames for nouns, extracted from annotated corpora 2 Users
PropBank Lexical DB University of Colorado Boulder Lexical resource containing syntactic frames for verbs, extracted from annotated corpora 2 Users
Nomlex Plus Lexical DB New York University Dictionary of English nominalizations: it describes the allowed complements for a nominalization and relates the nominal complements to the arguments of the corresponding verb 1 Users
Parc Polarity Lexicon Lexical DB PARC - Palo Alto Research Center Verbs classification with respect to semantic polarity 1 Users
DIRT Paraphrase Collection Collection of paraphrases University of Alberta Output of the DIRT algorithm 4 Users
TEASE Collection Collection of Entailment Rules Bar Ilan University Output of the TEASE algorithm 0 Users
BADC Acronym and Abbreviation List Word List BADC - British Atmospheric Data Centre Acronym and Abbreviation List 1 Users
Acronym Guide Word List Acronym-Guide.com Acronym and Abbreviation Lists for English, branched in thematic directories 1 Users
Dekang Lin’s Thesaurus Thesaurus University of Alberta Thesaurus automatically constructed using a parsed corpus, based on distributional similarity scores 1 Users
Roget's Thesaurus Thesaurus Peter Mark Roget (Electronic version distributed by University of Chicago) Roget's Thesaurus is a widely-used English thesaurus, created by Dr. Peter Mark Roget in 1805. The original edition had 15,000 words, and each new edition has been larger. The electronic edition (version 1.02) is made available by University of Chicago. 1 Users
Web1T 5-grams Word list Google Inc. Data set containing English word n-grams and their observed frequency counts. The n-gram counts were generated from approximately 1 trillion word tokens of text from publicly accessible Web pages 1 Users
Wikipedia Encyclopedia Free encyclopedia. Used for extraction of lexical-semantic rules (from its more structured parts), named entity recognition, geographical information ecc. 3 Users
GNIS - Geographic Names Information System Gazetteer USGS - United States Geological Survey Database containing the Federal and national standard toponyms for USA, associated areas and Antarctica. 1 Users
Geonames Gazetteer Database containing eight million geographical names. It is integrating geographical data such as names of places in various languages, elevation, population and others from various sources. 1 Users
Gazetteer from TREC Gazetteer NIST - National Institute of Standards and Technology Cities and other geographical names 1 Users
Geographic Ontology Gazetter University of West Florida Hierarchical data structure that allows the storage of natural and man-made feature data for use in a multitude of both manual and computerized Mapping, Charting & Geodesy systems. 1 Users
Syntactic rule base Entailment Rules' Collection Bar-Ilan University A manually-composed collection of entailment rules which define parse tree transformations. The rules cover generic syntactic phenomena such as appositions, conjunctions, passive, relative clause, etc. (Bar-Haim et al., AAAI-07) 1 Users
Polarity rule base Entailment Rules' Collection Bar-Ilan University A manually-composed collection of entailment rules which detect predicates whose polarity is negative (e.g. didn't dance) or unknown (e.g. plans to dance). The rules capture diverse phenomena that affect polarity, e.g. verbal negation, modal verbs, conditionals, and certain verbs that induce negative or "unknown" polarity context. The latter were taken mainly from VerbNet, and also from the PARC polarity lexicon. It extends a resource described in (Bar-Haim et al., AAAI-07) 1 Users
Resources extracted from Parsed Corpora Collections Collections of rules, patterns etc. extracted from parsed corpora for RTE purpose. 1 Users
New resource
(Participants are encouraged to contribute)
Users
New resource
(Participants are encouraged to contribute)
Users


[*] The numbers refer to the Users in RTE4 (data extracted both from related proceedings and from RTE Knowledge Resources Questionnaire) and in RTE3 (data extracted only from RTE Knowledge Resources Questionnaire) challenges.