https://aclweb.org/aclwiki/api.php?action=feedcontributions&user=DGiampiccolo&feedformat=atomACL Wiki - User contributions [en]2024-03-28T22:36:31ZUser contributionsMediaWiki 1.35.2https://aclweb.org/aclwiki/index.php?title=RTE_Knowledge_Resources&diff=8784RTE Knowledge Resources2011-04-04T15:59:47Z<p>DGiampiccolo: </p>
<hr />
<div>Knowledge resources have shown their relevance for applied semantic inference, and are extensively used by applied inference systems, such as those developed within the Textual Entailment framework.<br><br />
<br><br />
This page presents a list of the knowledge resources used by systems that have participated in the last RTE challenges. The first table lists the publicly available resources, the second one lists unpublished resources. Both tables are sortable by Resource name, type, author and number of users.<br><br />
<br><br />
RTE Participants are encouraged to add information about all kind of knowledge resources used, from standard existing resources (e.g. WordNet) to knowledge collections created for specific purposes, which can be made available to the community.<br><br />
<br><br />
__TOC__<br />
<br><br />
=== Call for Resources ===<br />
<br />
In order to help the research, all the participants are invited to contribute, sharing their own resources with the RTE community. <br />
<br><br />
Making the resources available to be used by other systems has several advantages. On the one hand, it helps improve the TE technology; on the other hand, it offers an opportunity to further test and evaluate the resource.<br />
<br><br />
<br><br />
*[[RTE6 - Call for Resources]]<br />
*[[RTE7 - Call for Resources]]<br />
<br><br />
=== Ablation Tests ===<br />
An ablation test consists of removing one module at a time from a system, and rerunning the system on the test set with the other modules, except the one tested. <br />
<br><br />
Ablation test are meant to help better understand the relevance of the knowledge resources used by RTE systems, and evaluate the contribution of each of them to the systems' performances. In fact, comparing the results achieved in the ablation tests to those obtained by the systems as a whole allows assessing the contribution given by each single resource.<br />
<br><br />
<br><br />
*[[RTE5 - Ablation Tests]]<br />
*[[RTE6 - Ablation Tests]]<br />
<br><br />
<br />
=== Publicly available Resources ===<br />
{|class="wikitable sortable" cellpadding="3" cellspacing="0" border="1"<br />
<br />
|- bgcolor="#CDCDCD"<br />
! width="80"|Resource<br />
! width="50"|Type<br />
! width="150"|Author<br />
! class="unsortable" width="300"|Brief description<br />
! width="45"|<small>PAST Users <ref name:"rtethree">RTE-3 data have been provided by participants by means of a questionnaire.</ref></small><br />
! width="45"|<small>RTE4 Users<ref name:"rtefour">RTE-4 data have been provided by participants and have been integrated with information extracted from the related proceedings.</ref></small><br />
! width="45"|<small>RTE5 Users<ref name:"rtefive">RTE-5 data have been provided by participants and have been integrated with information extracted from the related proceedings.</ref></small><br />
! width="45"|<small>RTE6 Users<ref name:"rtesix">RTE-6 data have been provided by participants and have been integrated with information extracted from the related proceedings.</ref></small><br />
! class="unsortable" width="30"|<small>Usage info</small><br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [[WordNet]]<br />
| Lexical DB<br />
| Princeton University<br />
| Lexical database of English nouns, verbs, adjectives and adverbs<br />
| style="text-align: center;"|3<br />
| style="text-align: center;"|21<br />
| style="text-align: center;"|18<br />
|style="text-align: center;"| 22<br />
| [[WordNet - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://xwn.hlt.utdallas.edu/index.html eXtended Wordnet]<br />
| Lexical DB<br />
| Human Language Technology Research Institute, University of Texas at Dallas<br />
| Extension of WordNet based on the exploitation of the information contained in WordNet definitional glosses: the glosses are syntactically parsed, transformed into logic forms and content words are semantically disambiguated. The Extended Wordnet is an ongoing project. <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|2<br />
|style="text-align: center;"| 0<br />
| [[eXtended WordNet - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://ai.stanford.edu/~rion/swn/ Augmented Wordnet]<br />
| Lexical DB<br />
| Stanford University<br />
| The resource is the result of the application of a learning algorithm for inducing semantic taxonomies from parsed text. The algorithm automatically acquires items of world knowledge, and uses these to produce significantly enhanced versions of WordNet (up to 40,000 synsets more).<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Augmented Wordnet - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://verbs.colorado.edu/~mpalmer/projects/verbnet.html Verbnet]<br />
| Lexical DB<br />
| University of Colorado Boulder<br />
| Lexicon for English verbs organized into classes extending Levin (1993) classes through refinement and addition of subclasses to achieve syntactic and semantic coherence among members of a class<br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Verbnet - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [[VerbOcean]]<br />
| Lexical DB<br />
| Information Sciences Institute, University of Southern California<br />
| Broad-coverage semantic network of verbs<br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|3<br />
| style="text-align: center;"|6<br />
|style="text-align: center;"| 7<br />
| [[VerbOcean - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://framenet.icsi.berkeley.edu/ FrameNet]<br />
| Lexical DB<br />
| ICSI (International Computer Science Institute) - Berkley University<br />
| Lexical resource for English words, based on frame semantics (valences) and supported by corpus evidence<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|2<br />
|style="text-align: center;"| 3<br />
| [[Framenet - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://nlp.cs.nyu.edu/meyers/NomBank.html NomBank]<br />
| Lexical DB<br />
| New York University<br />
| Lexical resource containing syntactic frames for nouns, extracted from annotated corpora <br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[NomBank Resource - RTE Users|Users]] <br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://verbs.colorado.edu/~mpalmer/projects/ace.html PropBank]<br />
| Lexical DB<br />
| University of Colorado Boulder <br />
| Lexical resource containing syntactic frames for verbs, extracted from annotated corpora <br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[PropBank Resource - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://nlp.cs.nyu.edu/nomlex/index.html Nomlex] Plus<br />
| Lexical DB<br />
| New York University<br />
| Dictionary of English nominalizations: it describes the allowed complements for a nominalization and relates the nominal complements to the arguments of the corresponding verb<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[Nomlex Plus - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.cs.ualberta.ca/~lindek/downloads.htm Dekang Lin’s Thesaurus]<br />
| Thesaurus<br />
| University of Alberta<br />
| Thesaurus automatically constructed using a parsed corpus, based on distributional similarity scores<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 2<br />
| [[Dekang Lin’s Thesaurus - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://icon.shef.ac.uk/Moby/mthes.html Grady Ward's Moby Thesaurus]<br />
| Thesaurus<br />
| University of Sheffield<br />
| Thesaurus containing 30,260 root words, with 2,520,264 synonyms and related terms. Grady Ward placed this thesaurus in the public domain in 1996.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Grady Ward's Moby Thesaurus - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://en.wikipedia.org/wiki/Roget%27s_Thesaurus Roget's Thesaurus]<br />
| Thesaurus<br />
| Peter Mark Roget (Electronic version distributed by University of Chicago)<br />
| Roget's Thesaurus is a widely-used English thesaurus, created by Dr. Peter Mark Roget in 1805. The original edition had 15,000 words, and each new edition has been larger. The electronic edition ([http://machaut.uchicago.edu/rogets version 1.02]) is made available by University of Chicago.<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Roget's Thesaurus - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.wikipedia.org/ Wikipedia]<br />
| Encyclopedia<br />
| <br />
| Free encyclopedia. Used for extraction of lexical-semantic rules (from its more structured parts), named entity recognition, geographical information etc.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|3<br />
| style="text-align: center;"|6<br />
|style="text-align: center;"| 6<br />
| [[Wikipedia - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.umbel.org/ Umbel]<br />
| Ontology<br />
| Structured Dynamics LLC, Coralville, IA<br />
| UMBEL stands for Upper Mapping and Binding Exchange Layer and is a lightweight ontology structure for relating Web content and data to a standard set of subject concepts<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Umbel - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.mpi-inf.mpg.de/yago-naga/yago/ YAGO]<br />
| Ontology<br />
| Max-Planck Institute for Informatics, Saarbrücken, Germany<br />
| Light-weight and extensible ontology. It contains more than 2 million entities and 20 million facts about these entities. The facts have been automatically extracted from Wikipedia and unified with WordNet.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[YAGO - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://dbpedia.org/About DBpedia]<br />
| Ontology<br />
| Open community project<br />
| DBpedia is a community effort to extract structured information from Wikipedia and to make this information available on the Web. The DBpedia knowledge base currently describes more than 2.9 million things in 91 different languages and consists of 479 million pieces of information.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[DBpedia - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [[DIRT Paraphrase Collection]]<br />
| Collection of paraphrases<br />
| University of Alberta<br />
| DIRT (Discovery of Inference Rules from Text) is both an algorithm and a resulting knowledge collection. The DIRT knowledge collection is the output of the DIRT algorithm over a 1GB set of newspaper text.<br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|4<br />
| style="text-align: center;"|3<br />
|style="text-align: center;"| 4<br />
| [[DIRT Paraphrase Collection - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [[TEASE]] Collection<br />
| Collection of Entailment Rules<br />
| Bar-Ilan University<br />
| Output of the TEASE algorithm <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[Tease Collection - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://badc.nerc.ac.uk/help/abbrevs.html BADC Acronym and Abbreviation List]<br />
| Word List<br />
| BADC (British Atmospheric Data Centre)<br />
| Acronym and Abbreviation List<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[BADC Acronym and Abbreviation List - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://clipdemos.umiacs.umd.edu/catvar/ Catvar The Categorial Variation Database (English)]<br />
| Word List<br />
| University of Maryland<br />
| A Categorial-Variation Database (or Catvar) is a database of clusters of uninflected words (lexemes) and their categorial (i.e. part-of-speech) variants. The database was developed for English using a combination of resources and algorithms, including the LCS Verb and Preposition Databases (Dorr 2001), the Brown Corpus section of the Penn Treebank (Marcus et al. 1994), an English morphological analysis lexicon developed for PC-Kimmo (ENGLEX) (Antworth 1990), WordNet1.6 (Fellbaum 1998), an English Verb-Noun list extracted from Nomlex (Macleod et al. 1998), a similar list extracted from LDOCE (Procter 1978) and the Porter stemmer (Porter 1980).<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 1<br />
| [[Categorial-Variation Database - RTE Users|Users]]<br />
<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.acronym-guide.com/ Acronym Guide]<br />
| Word List<br />
| Acronym-Guide.com <br />
| Acronym and Abbreviation Lists for English, branched in thematic directories <br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|3<br />
|style="text-align: center;"| 0<br />
| [[Acronym Guide - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC2006T13 Web1T 5-grams]<br />
| Word list<br />
| Linguistic Data Consortium, University of Pennsylvania; Google Inc.<br />
| Data set containing English word n-grams and their observed frequency counts. The n-gram counts were generated from approximately 1 trillion word tokens of text from publicly accessible Web pages<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[Web1T - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.coli.uni-saarland.de/%7Erwang/resources/RTE3_RTE4_NGD.zip Normalized Google Distance (RTE3&RTE4)]<br />
| Word Pair Co-occurrence<br />
| Saarland University<br />
| Co-occurrence of the word pairs in RTE3 and RTE4 using Normalized Google Distance (Cilibrasi and Vitanyi, 2004). The word pairs are all the possible combinations of content words in T and H. In practice, we used [http://m.www.yahoo.com/ Yahoo!] as the search engine.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Normalized Google Distance (RTE3&RTE4)- RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.coli.uni-saarland.de/%7Erwang/resources/RTE5_NGD.zip Normalized Google Distance (RTE5)]<br />
| Word Pair Co-occurrence<br />
| Saarland University<br />
| Co-occurrence of the word pairs in RTE3 and RTE4 using Normalized Google Distance (Cilibrasi and Vitanyi, 2004). The word pairs are all the possible combinations of content words in T and H. In practice, we used [http://m.www.yahoo.com/ Yahoo!] as the search engine.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Normalized Google Distance (RTE5)- RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://geonames.usgs.gov/index.html GNIS - Geographic Names Information System]<br />
| Gazetteer<br />
| USGS (United States Geological Survey)<br />
| Database containing the Federal and national standard toponyms for USA, associated areas and Antarctica<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[GNIS - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.geonames.org/ Geonames]<br />
| Gazetteer<br />
| <br />
| Database containing eight million geographical names. It is integrating geographical data such as names of places in various languages, elevation, population and others from various sources.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[Geonames - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://nlp.cs.nyu.edu/paraphrase/ Sekine's Paraphrase Database]<br />
| Collection of paraphrases<br />
| Department of Computer Science, New York University <br />
| Data-base created using Sekine's method, NOT cleaned up by human. It includes 19,975 sets of paraphrases with 191,572 phrases.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[Sekine's Paraphrase Database - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://research.microsoft.com/research/downloads/Details/607D14D9-20CD-47E3-85BC-A2F65CD28042/Details.aspx Microsoft Research Paraphrase Corpus]<br />
| Collection of paraphrases<br />
| Microsoft Research<br />
| Text file containing 5800 pairs of sentences which have been extracted from news sources on the web, along with human annotations indicating whether each pair captures a paraphrase/semantic equivalence relationship.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[Microsoft Research Paraphrase Corpus - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.cs.cornell.edu/~cristian/Without_a_doubt_-_Data.html Downward entailing operators] <br />
| Collection of entailing operators<br />
| Department of Computer Science, Cornell University, Ithaca NY<br />
| System output of an unsupervised algorithm recovering many Downward Entailing operators, like 'doubt'.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Downward entailing operators - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://u.cs.biu.ac.il/~nlp/downloads/WikiRules.html WikiRules!]<br />
| Lexical Reference rule-base<br />
| Bar-Ilan University<br />
| Extraction of about 8 million lexical reference rules from the text body (first sentence) and from metadata (links, redirects, parentheses) of Wikipedia. Provides better performance than other automatically constructed resources and comparable performance to WordNet. Offers complementary knowledge to WordNet. <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[WikiRules! - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://www.cs.utexas.edu/users/pclark/dart/ DART]<br />
| Collection of "world knowledge" propositions<br />
| Boeing Research and Technology<br />
| 23 million tuples such as "airplanes can fly to airports", "rivers can flood" collected from abstracted parse trees.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[DART - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://cs.biu.ac.il/~nlp/download/FRED/latest/FRED.zip FRED]<br />
| FrameNet-derived entailment rule-base<br />
| Bar-Ilan University<br />
| This package contains the outputs of the FRED algorithm, an algorithm which extracts entailment rules from FrameNet.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[FRED - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://u.cs.biu.ac.il/~nlp/downloads/DIRECT.html DIRECT]<br />
| Directional Distributional Term-Similarity Resource<br />
| Bar-Ilan University<br />
| This is a resource of directional distributional term-similarity rules (mostly lexical entailment rules) automatically extracted using the inclusion relation as described in (Kotlerman et.al., ACL-09).<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[DIRECT - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://u.cs.biu.ac.il/~nlp/downloads/binaryDirt.html binaryDIRT]<br />
| Entailment rules between binary templates using DIRT algorithm<br />
| Bar-Ilan University<br />
| This resource contains entailment rules over binary templates learned over the Reuters corpus using<br />
the DIRT algorithm of Lin and Pantel.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[BinaryDIRT- RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://u.cs.biu.ac.il/~nlp/downloads/unaryBInc.html unaryBInc]<br />
| Entailment rules between unary templates using BInc algorithm<br />
| Bar-Ilan University<br />
| This resource contains entailment rules over unary templates learned over the Reuters corpus using<br />
the BInc algorithm of Szpektor and Dagan (2008).<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[unaryBInc- RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| ''New resource''<br />
| <br />
| <br />
| ''Participants are encouraged to contribute''<br />
| style="text-align: center;"| <br />
| style="text-align: center;"| <br />
| style="text-align: center;"| <br />
|style="text-align: center;"| <br />
| [[New Resource2 - RTE Users|Users]]<br />
<br />
|}<br />
<br><br />
<br><br />
<br />
=== Not available Resources ===<br />
The following table lists the unpublished resources used by RTE participants. Some of them have been developed by Users themselves specifically for RTE. Interested people may turn to authors to obtain further information.<br />
<br><br />
{|class="wikitable sortable" cellpadding="3" cellspacing="0" border="1"<br />
<br />
|- bgcolor="#CDCDCD"<br />
! width="80"|Resource<br />
! width="50"|Type<br />
! width="180"|Author<br />
! class="unsortable" width="500"|Brief description<br />
! width="30"|<small>PAST Users</small><br />
! width="30"|<small>RTE4 Users</small><br />
! width="30"|<small>RTE5 Users</small><br />
! width="30"|<small>RTE6 Users</small><br />
! class="unsortable" width="30"|<small>Usage info</small><br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.parc.com/ PARC] Polarity Lexicon<br />
| Lexical DB<br />
| PARC - Palo Alto Research Center<br />
| Verbs classification with respect to semantic polarity<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[Parc Polarity Lexicon - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| Gazetteer from [http://trec.nist.gov/ TREC]<br />
| Gazetteer<br />
| NIST - National Institute of Standards and Technology<br />
| Cities and other geographical names<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[Gazetteer from TREC - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| DFKI Geographic Ontology<br>''(to be released)''<br />
| Ontology<br />
| DFKI - German Research Center for Artificial Intelligence<br />
| Ontology containing geographic terms and two kinds of relations: the directional ''part-of'' relation, and the ''equal'' relation for synonyms and abbreviations of the same geographic area (e.g ''the United Kingdom'', ''the UK'', ''Great Britain'', etc.) <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[Geographic Ontology - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| ''Geo''<br />
| Collection of Entailment Rules<br />
| Bar-Ilan University; Tel-Aviv University<br />
| Meronymy entailment rules, based on TREC’s TIPSTER gazetteer.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[Geo - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| ''Regex''<br />
| Collection of Entailment rules <br />
| Bar-Ilan University; Tel-Aviv University<br />
| Small set of entailment rules based on regular expressions, intended to address lexical variability involving temporal phrases<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[Regex - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| Syntactic rule base <br>''(to be released)''<br />
| Collection of Entailment Rules<br />
| Bar-Ilan University; Tel-Aviv University<br />
| A manually-composed collection of entailment rules which define parse tree transformations. The rules cover generic syntactic phenomena such as appositions, conjunctions, passive, relative clause, etc. (Bar-Haim et al., AAAI-07) <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[Syntactic rule base - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| Polarity rule base <br>''(to be released)''<br />
| Collection of Entailment Rules<br />
| Bar-Ilan University; Tel-Aviv University<br />
| A manually-composed collection of entailment rules which detect predicates whose polarity is negative (e.g. didn't dance) or unknown (e.g. plans to dance). The rules capture diverse phenomena that affect polarity, e.g. verbal negation, modal verbs, conditionals, and certain verbs that induce negative or "unknown" polarity context. The latter were taken mainly from VerbNet. Extends a resource described in (Bar-Haim et al., AAAI-07) <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0 <br />
| style="text-align: center;"|0<br />
| [[Polarity rule base - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| Lexical-Syntactic rule base<br />
| Collection of Entailment Rules<br />
| Bar-Ilan University; Tel-Aviv University<br />
| Extract lexical-syntactic entailment rules for predicates (verbal and nominal), including argument mapping. The resource is based on WordNet, Nomlex-Plus and Unary DIRT (Szpektor and Dagan, Coling 08)<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[Lexical-Syntactic rule base - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| OPENU Collection<br />
| Collection of Entailment Rules and Patterns<br />
| Open University<br />
| Collections of rules, patterns etc. for RTE purpose, extracted from Reuter corpus parsed using Minipar.<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0 <br />
| [[OPENU Collection - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| ''Abbr''<br />
| Collection of rules for abbreviation<br />
| Bar-Ilan University; Tel-Aviv University<br />
| 2000 Abbreviation rules, extracted from [http://badc.nerc.ac.uk/help/abbrevs.html BADC] and [http://www.acronym-guide.com/ Acronym Guide]<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[Abbr - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| UAIC Negation_list<br />
| Negation rules<br />
| „Al. I. Cuza“ University, Iasi, Romania <br />
| List of negative terms and words (verbs, adjectives, nouns) affecting modality or factuality of a infinitive verb preceded by the particle "to" (e.g. "believe","necessary", "attempt")<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1 <br />
| style="text-align: center;"|0<br />
| [[UAIC Negation_list - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| DLSIUAES Negation_list<br />
| List of negative terms<br />
| University of Alicante<br />
| Basic list of negative terms.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0 <br />
| [[DLSIUAES Negation_list - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| UAIC Quantifier_list<br />
| List of quantifiers<br />
| „Al. I. Cuza“ University, Iasi, Romania <br />
| List of quantifiers affecting entailment judgment. The quantifiers are taken from a list which contains expressions like “more than”, “less than”, or words such as “over”, “under”, etc.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0 <br />
| [[UAIC Quantifier_list - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| FBKirst StopWord list<br />
| List of frequent words<br />
| FBK-Irst;<br/>University of Trento - Italy <br />
| A list of the 572 most frequent English words.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0 <br />
| [[FBKirst StopWord list - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| IKOMA Dictionary of Named Entities Acronyms and Synonyms<br />
| Dictionary of Named Entities Acronyms and Synonyms<br />
| IKOMA; NEC Corporation, Takayama, Ikoma, Nara, Japan<br />
| Acronym dictionary constructed automatically from the corpus and a synonym dictionary that contains geographical terms.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| [[IKOMA2 - RTE Users|Users]]<br />
<br />
|}<br />
<br><br />
<br />
==Footnotes==<br />
<references/><br />
<br />
Return to: [[Textual_Entailment_Resource_Pool]]</div>DGiampiccolohttps://aclweb.org/aclwiki/index.php?title=RTE_Knowledge_Resources&diff=8783RTE Knowledge Resources2011-04-04T15:58:45Z<p>DGiampiccolo: </p>
<hr />
<div>Knowledge resources have shown their relevance for applied semantic inference, and are extensively used by applied inference systems, such as those developed within the Textual Entailment framework.<br><br />
<br><br />
This page presents a list of the knowledge resources used by systems that have participated in the last RTE challenges. The first table lists the publicly available resources, the second one lists unpublished resources. Both tables are sortable by Resource name, type, author and number of users.<br><br />
<br><br />
RTE Participants are encouraged to add information about all kind of knowledge resources used, from standard existing resources (e.g. WordNet) to knowledge collections created for specific purposes, which can be made available to the community.<br><br />
<br><br />
__TOC__<br />
<br><br />
=== Call for Resources ===<br />
<br />
In order to help the research, all the participants are invited to contribute, sharing their own resources with the RTE community. <br />
<br><br />
Making the resources available to be used by other systems has several advantages. On the one hand, it helps improve the TE technology; on the other hand, it offers an opportunity to further test and evaluate the resource.<br />
<br><br />
<br><br />
*[[RTE6 - Call for Resources]]<br />
*[[RTE7 - Call for Resources]]<br />
<br><br />
=== Ablation Tests ===<br />
An ablation test consists of removing one module at a time from a system, and rerunning the system on the test set with the other modules, except the one tested. <br />
<br><br />
Ablation test are meant to help better understand the relevance of the knowledge resources used by RTE systems, and evaluate the contribution of each of them to the systems' performances. In fact, comparing the results achieved in the ablation tests to those obtained by the systems as a whole allows assessing the contribution given by each single resource.<br />
<br><br />
<br><br />
*[[RTE-5 - Ablation Tests]]<br />
*[[RTE-6 - Ablation Tests]]<br />
<br><br />
<br />
=== Publicly available Resources ===<br />
{|class="wikitable sortable" cellpadding="3" cellspacing="0" border="1"<br />
<br />
|- bgcolor="#CDCDCD"<br />
! width="80"|Resource<br />
! width="50"|Type<br />
! width="150"|Author<br />
! class="unsortable" width="300"|Brief description<br />
! width="45"|<small>PAST Users <ref name:"rtethree">RTE-3 data have been provided by participants by means of a questionnaire.</ref></small><br />
! width="45"|<small>RTE4 Users<ref name:"rtefour">RTE-4 data have been provided by participants and have been integrated with information extracted from the related proceedings.</ref></small><br />
! width="45"|<small>RTE5 Users<ref name:"rtefive">RTE-5 data have been provided by participants and have been integrated with information extracted from the related proceedings.</ref></small><br />
! width="45"|<small>RTE6 Users<ref name:"rtesix">RTE-6 data have been provided by participants and have been integrated with information extracted from the related proceedings.</ref></small><br />
! class="unsortable" width="30"|<small>Usage info</small><br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [[WordNet]]<br />
| Lexical DB<br />
| Princeton University<br />
| Lexical database of English nouns, verbs, adjectives and adverbs<br />
| style="text-align: center;"|3<br />
| style="text-align: center;"|21<br />
| style="text-align: center;"|18<br />
|style="text-align: center;"| 22<br />
| [[WordNet - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://xwn.hlt.utdallas.edu/index.html eXtended Wordnet]<br />
| Lexical DB<br />
| Human Language Technology Research Institute, University of Texas at Dallas<br />
| Extension of WordNet based on the exploitation of the information contained in WordNet definitional glosses: the glosses are syntactically parsed, transformed into logic forms and content words are semantically disambiguated. The Extended Wordnet is an ongoing project. <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|2<br />
|style="text-align: center;"| 0<br />
| [[eXtended WordNet - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://ai.stanford.edu/~rion/swn/ Augmented Wordnet]<br />
| Lexical DB<br />
| Stanford University<br />
| The resource is the result of the application of a learning algorithm for inducing semantic taxonomies from parsed text. The algorithm automatically acquires items of world knowledge, and uses these to produce significantly enhanced versions of WordNet (up to 40,000 synsets more).<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Augmented Wordnet - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://verbs.colorado.edu/~mpalmer/projects/verbnet.html Verbnet]<br />
| Lexical DB<br />
| University of Colorado Boulder<br />
| Lexicon for English verbs organized into classes extending Levin (1993) classes through refinement and addition of subclasses to achieve syntactic and semantic coherence among members of a class<br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Verbnet - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [[VerbOcean]]<br />
| Lexical DB<br />
| Information Sciences Institute, University of Southern California<br />
| Broad-coverage semantic network of verbs<br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|3<br />
| style="text-align: center;"|6<br />
|style="text-align: center;"| 7<br />
| [[VerbOcean - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://framenet.icsi.berkeley.edu/ FrameNet]<br />
| Lexical DB<br />
| ICSI (International Computer Science Institute) - Berkley University<br />
| Lexical resource for English words, based on frame semantics (valences) and supported by corpus evidence<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|2<br />
|style="text-align: center;"| 3<br />
| [[Framenet - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://nlp.cs.nyu.edu/meyers/NomBank.html NomBank]<br />
| Lexical DB<br />
| New York University<br />
| Lexical resource containing syntactic frames for nouns, extracted from annotated corpora <br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[NomBank Resource - RTE Users|Users]] <br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://verbs.colorado.edu/~mpalmer/projects/ace.html PropBank]<br />
| Lexical DB<br />
| University of Colorado Boulder <br />
| Lexical resource containing syntactic frames for verbs, extracted from annotated corpora <br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[PropBank Resource - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://nlp.cs.nyu.edu/nomlex/index.html Nomlex] Plus<br />
| Lexical DB<br />
| New York University<br />
| Dictionary of English nominalizations: it describes the allowed complements for a nominalization and relates the nominal complements to the arguments of the corresponding verb<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[Nomlex Plus - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.cs.ualberta.ca/~lindek/downloads.htm Dekang Lin’s Thesaurus]<br />
| Thesaurus<br />
| University of Alberta<br />
| Thesaurus automatically constructed using a parsed corpus, based on distributional similarity scores<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 2<br />
| [[Dekang Lin’s Thesaurus - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://icon.shef.ac.uk/Moby/mthes.html Grady Ward's Moby Thesaurus]<br />
| Thesaurus<br />
| University of Sheffield<br />
| Thesaurus containing 30,260 root words, with 2,520,264 synonyms and related terms. Grady Ward placed this thesaurus in the public domain in 1996.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Grady Ward's Moby Thesaurus - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://en.wikipedia.org/wiki/Roget%27s_Thesaurus Roget's Thesaurus]<br />
| Thesaurus<br />
| Peter Mark Roget (Electronic version distributed by University of Chicago)<br />
| Roget's Thesaurus is a widely-used English thesaurus, created by Dr. Peter Mark Roget in 1805. The original edition had 15,000 words, and each new edition has been larger. The electronic edition ([http://machaut.uchicago.edu/rogets version 1.02]) is made available by University of Chicago.<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Roget's Thesaurus - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.wikipedia.org/ Wikipedia]<br />
| Encyclopedia<br />
| <br />
| Free encyclopedia. Used for extraction of lexical-semantic rules (from its more structured parts), named entity recognition, geographical information etc.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|3<br />
| style="text-align: center;"|6<br />
|style="text-align: center;"| 6<br />
| [[Wikipedia - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.umbel.org/ Umbel]<br />
| Ontology<br />
| Structured Dynamics LLC, Coralville, IA<br />
| UMBEL stands for Upper Mapping and Binding Exchange Layer and is a lightweight ontology structure for relating Web content and data to a standard set of subject concepts<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Umbel - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.mpi-inf.mpg.de/yago-naga/yago/ YAGO]<br />
| Ontology<br />
| Max-Planck Institute for Informatics, Saarbrücken, Germany<br />
| Light-weight and extensible ontology. It contains more than 2 million entities and 20 million facts about these entities. The facts have been automatically extracted from Wikipedia and unified with WordNet.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[YAGO - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://dbpedia.org/About DBpedia]<br />
| Ontology<br />
| Open community project<br />
| DBpedia is a community effort to extract structured information from Wikipedia and to make this information available on the Web. The DBpedia knowledge base currently describes more than 2.9 million things in 91 different languages and consists of 479 million pieces of information.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[DBpedia - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [[DIRT Paraphrase Collection]]<br />
| Collection of paraphrases<br />
| University of Alberta<br />
| DIRT (Discovery of Inference Rules from Text) is both an algorithm and a resulting knowledge collection. The DIRT knowledge collection is the output of the DIRT algorithm over a 1GB set of newspaper text.<br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|4<br />
| style="text-align: center;"|3<br />
|style="text-align: center;"| 4<br />
| [[DIRT Paraphrase Collection - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [[TEASE]] Collection<br />
| Collection of Entailment Rules<br />
| Bar-Ilan University<br />
| Output of the TEASE algorithm <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[Tease Collection - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://badc.nerc.ac.uk/help/abbrevs.html BADC Acronym and Abbreviation List]<br />
| Word List<br />
| BADC (British Atmospheric Data Centre)<br />
| Acronym and Abbreviation List<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[BADC Acronym and Abbreviation List - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://clipdemos.umiacs.umd.edu/catvar/ Catvar The Categorial Variation Database (English)]<br />
| Word List<br />
| University of Maryland<br />
| A Categorial-Variation Database (or Catvar) is a database of clusters of uninflected words (lexemes) and their categorial (i.e. part-of-speech) variants. The database was developed for English using a combination of resources and algorithms, including the LCS Verb and Preposition Databases (Dorr 2001), the Brown Corpus section of the Penn Treebank (Marcus et al. 1994), an English morphological analysis lexicon developed for PC-Kimmo (ENGLEX) (Antworth 1990), WordNet1.6 (Fellbaum 1998), an English Verb-Noun list extracted from Nomlex (Macleod et al. 1998), a similar list extracted from LDOCE (Procter 1978) and the Porter stemmer (Porter 1980).<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 1<br />
| [[Categorial-Variation Database - RTE Users|Users]]<br />
<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.acronym-guide.com/ Acronym Guide]<br />
| Word List<br />
| Acronym-Guide.com <br />
| Acronym and Abbreviation Lists for English, branched in thematic directories <br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|3<br />
|style="text-align: center;"| 0<br />
| [[Acronym Guide - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC2006T13 Web1T 5-grams]<br />
| Word list<br />
| Linguistic Data Consortium, University of Pennsylvania; Google Inc.<br />
| Data set containing English word n-grams and their observed frequency counts. The n-gram counts were generated from approximately 1 trillion word tokens of text from publicly accessible Web pages<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[Web1T - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.coli.uni-saarland.de/%7Erwang/resources/RTE3_RTE4_NGD.zip Normalized Google Distance (RTE3&RTE4)]<br />
| Word Pair Co-occurrence<br />
| Saarland University<br />
| Co-occurrence of the word pairs in RTE3 and RTE4 using Normalized Google Distance (Cilibrasi and Vitanyi, 2004). The word pairs are all the possible combinations of content words in T and H. In practice, we used [http://m.www.yahoo.com/ Yahoo!] as the search engine.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Normalized Google Distance (RTE3&RTE4)- RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.coli.uni-saarland.de/%7Erwang/resources/RTE5_NGD.zip Normalized Google Distance (RTE5)]<br />
| Word Pair Co-occurrence<br />
| Saarland University<br />
| Co-occurrence of the word pairs in RTE3 and RTE4 using Normalized Google Distance (Cilibrasi and Vitanyi, 2004). The word pairs are all the possible combinations of content words in T and H. In practice, we used [http://m.www.yahoo.com/ Yahoo!] as the search engine.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Normalized Google Distance (RTE5)- RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://geonames.usgs.gov/index.html GNIS - Geographic Names Information System]<br />
| Gazetteer<br />
| USGS (United States Geological Survey)<br />
| Database containing the Federal and national standard toponyms for USA, associated areas and Antarctica<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[GNIS - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.geonames.org/ Geonames]<br />
| Gazetteer<br />
| <br />
| Database containing eight million geographical names. It is integrating geographical data such as names of places in various languages, elevation, population and others from various sources.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[Geonames - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://nlp.cs.nyu.edu/paraphrase/ Sekine's Paraphrase Database]<br />
| Collection of paraphrases<br />
| Department of Computer Science, New York University <br />
| Data-base created using Sekine's method, NOT cleaned up by human. It includes 19,975 sets of paraphrases with 191,572 phrases.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[Sekine's Paraphrase Database - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://research.microsoft.com/research/downloads/Details/607D14D9-20CD-47E3-85BC-A2F65CD28042/Details.aspx Microsoft Research Paraphrase Corpus]<br />
| Collection of paraphrases<br />
| Microsoft Research<br />
| Text file containing 5800 pairs of sentences which have been extracted from news sources on the web, along with human annotations indicating whether each pair captures a paraphrase/semantic equivalence relationship.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[Microsoft Research Paraphrase Corpus - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.cs.cornell.edu/~cristian/Without_a_doubt_-_Data.html Downward entailing operators] <br />
| Collection of entailing operators<br />
| Department of Computer Science, Cornell University, Ithaca NY<br />
| System output of an unsupervised algorithm recovering many Downward Entailing operators, like 'doubt'.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Downward entailing operators - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://u.cs.biu.ac.il/~nlp/downloads/WikiRules.html WikiRules!]<br />
| Lexical Reference rule-base<br />
| Bar-Ilan University<br />
| Extraction of about 8 million lexical reference rules from the text body (first sentence) and from metadata (links, redirects, parentheses) of Wikipedia. Provides better performance than other automatically constructed resources and comparable performance to WordNet. Offers complementary knowledge to WordNet. <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[WikiRules! - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://www.cs.utexas.edu/users/pclark/dart/ DART]<br />
| Collection of "world knowledge" propositions<br />
| Boeing Research and Technology<br />
| 23 million tuples such as "airplanes can fly to airports", "rivers can flood" collected from abstracted parse trees.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[DART - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://cs.biu.ac.il/~nlp/download/FRED/latest/FRED.zip FRED]<br />
| FrameNet-derived entailment rule-base<br />
| Bar-Ilan University<br />
| This package contains the outputs of the FRED algorithm, an algorithm which extracts entailment rules from FrameNet.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[FRED - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://u.cs.biu.ac.il/~nlp/downloads/DIRECT.html DIRECT]<br />
| Directional Distributional Term-Similarity Resource<br />
| Bar-Ilan University<br />
| This is a resource of directional distributional term-similarity rules (mostly lexical entailment rules) automatically extracted using the inclusion relation as described in (Kotlerman et.al., ACL-09).<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[DIRECT - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://u.cs.biu.ac.il/~nlp/downloads/binaryDirt.html binaryDIRT]<br />
| Entailment rules between binary templates using DIRT algorithm<br />
| Bar-Ilan University<br />
| This resource contains entailment rules over binary templates learned over the Reuters corpus using<br />
the DIRT algorithm of Lin and Pantel.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[BinaryDIRT- RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://u.cs.biu.ac.il/~nlp/downloads/unaryBInc.html unaryBInc]<br />
| Entailment rules between unary templates using BInc algorithm<br />
| Bar-Ilan University<br />
| This resource contains entailment rules over unary templates learned over the Reuters corpus using<br />
the BInc algorithm of Szpektor and Dagan (2008).<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[unaryBInc- RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| ''New resource''<br />
| <br />
| <br />
| ''Participants are encouraged to contribute''<br />
| style="text-align: center;"| <br />
| style="text-align: center;"| <br />
| style="text-align: center;"| <br />
|style="text-align: center;"| <br />
| [[New Resource2 - RTE Users|Users]]<br />
<br />
|}<br />
<br><br />
<br><br />
<br />
=== Not available Resources ===<br />
The following table lists the unpublished resources used by RTE participants. Some of them have been developed by Users themselves specifically for RTE. Interested people may turn to authors to obtain further information.<br />
<br><br />
{|class="wikitable sortable" cellpadding="3" cellspacing="0" border="1"<br />
<br />
|- bgcolor="#CDCDCD"<br />
! width="80"|Resource<br />
! width="50"|Type<br />
! width="180"|Author<br />
! class="unsortable" width="500"|Brief description<br />
! width="30"|<small>PAST Users</small><br />
! width="30"|<small>RTE4 Users</small><br />
! width="30"|<small>RTE5 Users</small><br />
! width="30"|<small>RTE6 Users</small><br />
! class="unsortable" width="30"|<small>Usage info</small><br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.parc.com/ PARC] Polarity Lexicon<br />
| Lexical DB<br />
| PARC - Palo Alto Research Center<br />
| Verbs classification with respect to semantic polarity<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[Parc Polarity Lexicon - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| Gazetteer from [http://trec.nist.gov/ TREC]<br />
| Gazetteer<br />
| NIST - National Institute of Standards and Technology<br />
| Cities and other geographical names<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[Gazetteer from TREC - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| DFKI Geographic Ontology<br>''(to be released)''<br />
| Ontology<br />
| DFKI - German Research Center for Artificial Intelligence<br />
| Ontology containing geographic terms and two kinds of relations: the directional ''part-of'' relation, and the ''equal'' relation for synonyms and abbreviations of the same geographic area (e.g ''the United Kingdom'', ''the UK'', ''Great Britain'', etc.) <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[Geographic Ontology - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| ''Geo''<br />
| Collection of Entailment Rules<br />
| Bar-Ilan University; Tel-Aviv University<br />
| Meronymy entailment rules, based on TREC’s TIPSTER gazetteer.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[Geo - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| ''Regex''<br />
| Collection of Entailment rules <br />
| Bar-Ilan University; Tel-Aviv University<br />
| Small set of entailment rules based on regular expressions, intended to address lexical variability involving temporal phrases<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[Regex - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| Syntactic rule base <br>''(to be released)''<br />
| Collection of Entailment Rules<br />
| Bar-Ilan University; Tel-Aviv University<br />
| A manually-composed collection of entailment rules which define parse tree transformations. The rules cover generic syntactic phenomena such as appositions, conjunctions, passive, relative clause, etc. (Bar-Haim et al., AAAI-07) <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[Syntactic rule base - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| Polarity rule base <br>''(to be released)''<br />
| Collection of Entailment Rules<br />
| Bar-Ilan University; Tel-Aviv University<br />
| A manually-composed collection of entailment rules which detect predicates whose polarity is negative (e.g. didn't dance) or unknown (e.g. plans to dance). The rules capture diverse phenomena that affect polarity, e.g. verbal negation, modal verbs, conditionals, and certain verbs that induce negative or "unknown" polarity context. The latter were taken mainly from VerbNet. Extends a resource described in (Bar-Haim et al., AAAI-07) <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0 <br />
| style="text-align: center;"|0<br />
| [[Polarity rule base - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| Lexical-Syntactic rule base<br />
| Collection of Entailment Rules<br />
| Bar-Ilan University; Tel-Aviv University<br />
| Extract lexical-syntactic entailment rules for predicates (verbal and nominal), including argument mapping. The resource is based on WordNet, Nomlex-Plus and Unary DIRT (Szpektor and Dagan, Coling 08)<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[Lexical-Syntactic rule base - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| OPENU Collection<br />
| Collection of Entailment Rules and Patterns<br />
| Open University<br />
| Collections of rules, patterns etc. for RTE purpose, extracted from Reuter corpus parsed using Minipar.<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0 <br />
| [[OPENU Collection - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| ''Abbr''<br />
| Collection of rules for abbreviation<br />
| Bar-Ilan University; Tel-Aviv University<br />
| 2000 Abbreviation rules, extracted from [http://badc.nerc.ac.uk/help/abbrevs.html BADC] and [http://www.acronym-guide.com/ Acronym Guide]<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[Abbr - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| UAIC Negation_list<br />
| Negation rules<br />
| „Al. I. Cuza“ University, Iasi, Romania <br />
| List of negative terms and words (verbs, adjectives, nouns) affecting modality or factuality of a infinitive verb preceded by the particle "to" (e.g. "believe","necessary", "attempt")<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1 <br />
| style="text-align: center;"|0<br />
| [[UAIC Negation_list - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| DLSIUAES Negation_list<br />
| List of negative terms<br />
| University of Alicante<br />
| Basic list of negative terms.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0 <br />
| [[DLSIUAES Negation_list - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| UAIC Quantifier_list<br />
| List of quantifiers<br />
| „Al. I. Cuza“ University, Iasi, Romania <br />
| List of quantifiers affecting entailment judgment. The quantifiers are taken from a list which contains expressions like “more than”, “less than”, or words such as “over”, “under”, etc.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0 <br />
| [[UAIC Quantifier_list - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| FBKirst StopWord list<br />
| List of frequent words<br />
| FBK-Irst;<br/>University of Trento - Italy <br />
| A list of the 572 most frequent English words.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0 <br />
| [[FBKirst StopWord list - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| IKOMA Dictionary of Named Entities Acronyms and Synonyms<br />
| Dictionary of Named Entities Acronyms and Synonyms<br />
| IKOMA; NEC Corporation, Takayama, Ikoma, Nara, Japan<br />
| Acronym dictionary constructed automatically from the corpus and a synonym dictionary that contains geographical terms.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| [[IKOMA2 - RTE Users|Users]]<br />
<br />
|}<br />
<br><br />
<br />
==Footnotes==<br />
<references/><br />
<br />
Return to: [[Textual_Entailment_Resource_Pool]]</div>DGiampiccolohttps://aclweb.org/aclwiki/index.php?title=RTE_Knowledge_Resources&diff=8782RTE Knowledge Resources2011-04-04T15:56:32Z<p>DGiampiccolo: Undo revision 8781 by DGiampiccolo (Talk)</p>
<hr />
<div>Knowledge resources have shown their relevance for applied semantic inference, and are extensively used by applied inference systems, such as those developed within the Textual Entailment framework.<br><br />
<br><br />
This page presents a list of the knowledge resources used by systems that have participated in the last RTE challenges. The first table lists the publicly available resources, the second one lists unpublished resources. Both tables are sortable by Resource name, type, author and number of users.<br><br />
<br><br />
RTE Participants are encouraged to add information about all kind of knowledge resources used, from standard existing resources (e.g. WordNet) to knowledge collections created for specific purposes, which can be made available to the community.<br><br />
<br><br />
__TOC__<br />
<br><br />
=== Call for Resources ===<br />
<br />
In order to help the research, all the participants are invited to contribute, sharing their own resources with the RTE community. <br />
<br><br />
Making the resources available to be used by other systems has several advantages. On the one hand, it helps improve the TE technology; on the other hand, it offers an opportunity to further test and evaluate the resource.<br />
<br><br />
<br><br />
*[[RTE-6 - Call for Resources]]<br />
*[[RTE-7 - Call for Resources]]<br />
<br><br />
=== Ablation Tests ===<br />
An ablation test consists of removing one module at a time from a system, and rerunning the system on the test set with the other modules, except the one tested. <br />
<br><br />
Ablation test are meant to help better understand the relevance of the knowledge resources used by RTE systems, and evaluate the contribution of each of them to the systems' performances. In fact, comparing the results achieved in the ablation tests to those obtained by the systems as a whole allows assessing the contribution given by each single resource.<br />
<br><br />
<br><br />
*[[RTE-5 - Ablation Tests]]<br />
*[[RTE-6 - Ablation Tests]]<br />
<br><br />
<br />
=== Publicly available Resources ===<br />
{|class="wikitable sortable" cellpadding="3" cellspacing="0" border="1"<br />
<br />
|- bgcolor="#CDCDCD"<br />
! width="80"|Resource<br />
! width="50"|Type<br />
! width="150"|Author<br />
! class="unsortable" width="300"|Brief description<br />
! width="45"|<small>PAST Users <ref name:"rtethree">RTE-3 data have been provided by participants by means of a questionnaire.</ref></small><br />
! width="45"|<small>RTE4 Users<ref name:"rtefour">RTE-4 data have been provided by participants and have been integrated with information extracted from the related proceedings.</ref></small><br />
! width="45"|<small>RTE5 Users<ref name:"rtefive">RTE-5 data have been provided by participants and have been integrated with information extracted from the related proceedings.</ref></small><br />
! width="45"|<small>RTE6 Users<ref name:"rtesix">RTE-6 data have been provided by participants and have been integrated with information extracted from the related proceedings.</ref></small><br />
! class="unsortable" width="30"|<small>Usage info</small><br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [[WordNet]]<br />
| Lexical DB<br />
| Princeton University<br />
| Lexical database of English nouns, verbs, adjectives and adverbs<br />
| style="text-align: center;"|3<br />
| style="text-align: center;"|21<br />
| style="text-align: center;"|18<br />
|style="text-align: center;"| 22<br />
| [[WordNet - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://xwn.hlt.utdallas.edu/index.html eXtended Wordnet]<br />
| Lexical DB<br />
| Human Language Technology Research Institute, University of Texas at Dallas<br />
| Extension of WordNet based on the exploitation of the information contained in WordNet definitional glosses: the glosses are syntactically parsed, transformed into logic forms and content words are semantically disambiguated. The Extended Wordnet is an ongoing project. <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|2<br />
|style="text-align: center;"| 0<br />
| [[eXtended WordNet - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://ai.stanford.edu/~rion/swn/ Augmented Wordnet]<br />
| Lexical DB<br />
| Stanford University<br />
| The resource is the result of the application of a learning algorithm for inducing semantic taxonomies from parsed text. The algorithm automatically acquires items of world knowledge, and uses these to produce significantly enhanced versions of WordNet (up to 40,000 synsets more).<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Augmented Wordnet - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://verbs.colorado.edu/~mpalmer/projects/verbnet.html Verbnet]<br />
| Lexical DB<br />
| University of Colorado Boulder<br />
| Lexicon for English verbs organized into classes extending Levin (1993) classes through refinement and addition of subclasses to achieve syntactic and semantic coherence among members of a class<br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Verbnet - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [[VerbOcean]]<br />
| Lexical DB<br />
| Information Sciences Institute, University of Southern California<br />
| Broad-coverage semantic network of verbs<br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|3<br />
| style="text-align: center;"|6<br />
|style="text-align: center;"| 7<br />
| [[VerbOcean - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://framenet.icsi.berkeley.edu/ FrameNet]<br />
| Lexical DB<br />
| ICSI (International Computer Science Institute) - Berkley University<br />
| Lexical resource for English words, based on frame semantics (valences) and supported by corpus evidence<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|2<br />
|style="text-align: center;"| 3<br />
| [[Framenet - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://nlp.cs.nyu.edu/meyers/NomBank.html NomBank]<br />
| Lexical DB<br />
| New York University<br />
| Lexical resource containing syntactic frames for nouns, extracted from annotated corpora <br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[NomBank Resource - RTE Users|Users]] <br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://verbs.colorado.edu/~mpalmer/projects/ace.html PropBank]<br />
| Lexical DB<br />
| University of Colorado Boulder <br />
| Lexical resource containing syntactic frames for verbs, extracted from annotated corpora <br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[PropBank Resource - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://nlp.cs.nyu.edu/nomlex/index.html Nomlex] Plus<br />
| Lexical DB<br />
| New York University<br />
| Dictionary of English nominalizations: it describes the allowed complements for a nominalization and relates the nominal complements to the arguments of the corresponding verb<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[Nomlex Plus - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.cs.ualberta.ca/~lindek/downloads.htm Dekang Lin’s Thesaurus]<br />
| Thesaurus<br />
| University of Alberta<br />
| Thesaurus automatically constructed using a parsed corpus, based on distributional similarity scores<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 2<br />
| [[Dekang Lin’s Thesaurus - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://icon.shef.ac.uk/Moby/mthes.html Grady Ward's Moby Thesaurus]<br />
| Thesaurus<br />
| University of Sheffield<br />
| Thesaurus containing 30,260 root words, with 2,520,264 synonyms and related terms. Grady Ward placed this thesaurus in the public domain in 1996.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Grady Ward's Moby Thesaurus - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://en.wikipedia.org/wiki/Roget%27s_Thesaurus Roget's Thesaurus]<br />
| Thesaurus<br />
| Peter Mark Roget (Electronic version distributed by University of Chicago)<br />
| Roget's Thesaurus is a widely-used English thesaurus, created by Dr. Peter Mark Roget in 1805. The original edition had 15,000 words, and each new edition has been larger. The electronic edition ([http://machaut.uchicago.edu/rogets version 1.02]) is made available by University of Chicago.<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Roget's Thesaurus - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.wikipedia.org/ Wikipedia]<br />
| Encyclopedia<br />
| <br />
| Free encyclopedia. Used for extraction of lexical-semantic rules (from its more structured parts), named entity recognition, geographical information etc.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|3<br />
| style="text-align: center;"|6<br />
|style="text-align: center;"| 6<br />
| [[Wikipedia - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.umbel.org/ Umbel]<br />
| Ontology<br />
| Structured Dynamics LLC, Coralville, IA<br />
| UMBEL stands for Upper Mapping and Binding Exchange Layer and is a lightweight ontology structure for relating Web content and data to a standard set of subject concepts<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Umbel - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.mpi-inf.mpg.de/yago-naga/yago/ YAGO]<br />
| Ontology<br />
| Max-Planck Institute for Informatics, Saarbrücken, Germany<br />
| Light-weight and extensible ontology. It contains more than 2 million entities and 20 million facts about these entities. The facts have been automatically extracted from Wikipedia and unified with WordNet.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[YAGO - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://dbpedia.org/About DBpedia]<br />
| Ontology<br />
| Open community project<br />
| DBpedia is a community effort to extract structured information from Wikipedia and to make this information available on the Web. The DBpedia knowledge base currently describes more than 2.9 million things in 91 different languages and consists of 479 million pieces of information.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[DBpedia - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [[DIRT Paraphrase Collection]]<br />
| Collection of paraphrases<br />
| University of Alberta<br />
| DIRT (Discovery of Inference Rules from Text) is both an algorithm and a resulting knowledge collection. The DIRT knowledge collection is the output of the DIRT algorithm over a 1GB set of newspaper text.<br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|4<br />
| style="text-align: center;"|3<br />
|style="text-align: center;"| 4<br />
| [[DIRT Paraphrase Collection - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [[TEASE]] Collection<br />
| Collection of Entailment Rules<br />
| Bar-Ilan University<br />
| Output of the TEASE algorithm <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[Tease Collection - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://badc.nerc.ac.uk/help/abbrevs.html BADC Acronym and Abbreviation List]<br />
| Word List<br />
| BADC (British Atmospheric Data Centre)<br />
| Acronym and Abbreviation List<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[BADC Acronym and Abbreviation List - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://clipdemos.umiacs.umd.edu/catvar/ Catvar The Categorial Variation Database (English)]<br />
| Word List<br />
| University of Maryland<br />
| A Categorial-Variation Database (or Catvar) is a database of clusters of uninflected words (lexemes) and their categorial (i.e. part-of-speech) variants. The database was developed for English using a combination of resources and algorithms, including the LCS Verb and Preposition Databases (Dorr 2001), the Brown Corpus section of the Penn Treebank (Marcus et al. 1994), an English morphological analysis lexicon developed for PC-Kimmo (ENGLEX) (Antworth 1990), WordNet1.6 (Fellbaum 1998), an English Verb-Noun list extracted from Nomlex (Macleod et al. 1998), a similar list extracted from LDOCE (Procter 1978) and the Porter stemmer (Porter 1980).<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 1<br />
| [[Categorial-Variation Database - RTE Users|Users]]<br />
<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.acronym-guide.com/ Acronym Guide]<br />
| Word List<br />
| Acronym-Guide.com <br />
| Acronym and Abbreviation Lists for English, branched in thematic directories <br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|3<br />
|style="text-align: center;"| 0<br />
| [[Acronym Guide - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC2006T13 Web1T 5-grams]<br />
| Word list<br />
| Linguistic Data Consortium, University of Pennsylvania; Google Inc.<br />
| Data set containing English word n-grams and their observed frequency counts. The n-gram counts were generated from approximately 1 trillion word tokens of text from publicly accessible Web pages<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[Web1T - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.coli.uni-saarland.de/%7Erwang/resources/RTE3_RTE4_NGD.zip Normalized Google Distance (RTE3&RTE4)]<br />
| Word Pair Co-occurrence<br />
| Saarland University<br />
| Co-occurrence of the word pairs in RTE3 and RTE4 using Normalized Google Distance (Cilibrasi and Vitanyi, 2004). The word pairs are all the possible combinations of content words in T and H. In practice, we used [http://m.www.yahoo.com/ Yahoo!] as the search engine.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Normalized Google Distance (RTE3&RTE4)- RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.coli.uni-saarland.de/%7Erwang/resources/RTE5_NGD.zip Normalized Google Distance (RTE5)]<br />
| Word Pair Co-occurrence<br />
| Saarland University<br />
| Co-occurrence of the word pairs in RTE3 and RTE4 using Normalized Google Distance (Cilibrasi and Vitanyi, 2004). The word pairs are all the possible combinations of content words in T and H. In practice, we used [http://m.www.yahoo.com/ Yahoo!] as the search engine.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Normalized Google Distance (RTE5)- RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://geonames.usgs.gov/index.html GNIS - Geographic Names Information System]<br />
| Gazetteer<br />
| USGS (United States Geological Survey)<br />
| Database containing the Federal and national standard toponyms for USA, associated areas and Antarctica<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[GNIS - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.geonames.org/ Geonames]<br />
| Gazetteer<br />
| <br />
| Database containing eight million geographical names. It is integrating geographical data such as names of places in various languages, elevation, population and others from various sources.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[Geonames - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://nlp.cs.nyu.edu/paraphrase/ Sekine's Paraphrase Database]<br />
| Collection of paraphrases<br />
| Department of Computer Science, New York University <br />
| Data-base created using Sekine's method, NOT cleaned up by human. It includes 19,975 sets of paraphrases with 191,572 phrases.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[Sekine's Paraphrase Database - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://research.microsoft.com/research/downloads/Details/607D14D9-20CD-47E3-85BC-A2F65CD28042/Details.aspx Microsoft Research Paraphrase Corpus]<br />
| Collection of paraphrases<br />
| Microsoft Research<br />
| Text file containing 5800 pairs of sentences which have been extracted from news sources on the web, along with human annotations indicating whether each pair captures a paraphrase/semantic equivalence relationship.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[Microsoft Research Paraphrase Corpus - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.cs.cornell.edu/~cristian/Without_a_doubt_-_Data.html Downward entailing operators] <br />
| Collection of entailing operators<br />
| Department of Computer Science, Cornell University, Ithaca NY<br />
| System output of an unsupervised algorithm recovering many Downward Entailing operators, like 'doubt'.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Downward entailing operators - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://u.cs.biu.ac.il/~nlp/downloads/WikiRules.html WikiRules!]<br />
| Lexical Reference rule-base<br />
| Bar-Ilan University<br />
| Extraction of about 8 million lexical reference rules from the text body (first sentence) and from metadata (links, redirects, parentheses) of Wikipedia. Provides better performance than other automatically constructed resources and comparable performance to WordNet. Offers complementary knowledge to WordNet. <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[WikiRules! - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://www.cs.utexas.edu/users/pclark/dart/ DART]<br />
| Collection of "world knowledge" propositions<br />
| Boeing Research and Technology<br />
| 23 million tuples such as "airplanes can fly to airports", "rivers can flood" collected from abstracted parse trees.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[DART - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://cs.biu.ac.il/~nlp/download/FRED/latest/FRED.zip FRED]<br />
| FrameNet-derived entailment rule-base<br />
| Bar-Ilan University<br />
| This package contains the outputs of the FRED algorithm, an algorithm which extracts entailment rules from FrameNet.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[FRED - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://u.cs.biu.ac.il/~nlp/downloads/DIRECT.html DIRECT]<br />
| Directional Distributional Term-Similarity Resource<br />
| Bar-Ilan University<br />
| This is a resource of directional distributional term-similarity rules (mostly lexical entailment rules) automatically extracted using the inclusion relation as described in (Kotlerman et.al., ACL-09).<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[DIRECT - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://u.cs.biu.ac.il/~nlp/downloads/binaryDirt.html binaryDIRT]<br />
| Entailment rules between binary templates using DIRT algorithm<br />
| Bar-Ilan University<br />
| This resource contains entailment rules over binary templates learned over the Reuters corpus using<br />
the DIRT algorithm of Lin and Pantel.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[BinaryDIRT- RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://u.cs.biu.ac.il/~nlp/downloads/unaryBInc.html unaryBInc]<br />
| Entailment rules between unary templates using BInc algorithm<br />
| Bar-Ilan University<br />
| This resource contains entailment rules over unary templates learned over the Reuters corpus using<br />
the BInc algorithm of Szpektor and Dagan (2008).<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[unaryBInc- RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| ''New resource''<br />
| <br />
| <br />
| ''Participants are encouraged to contribute''<br />
| style="text-align: center;"| <br />
| style="text-align: center;"| <br />
| style="text-align: center;"| <br />
|style="text-align: center;"| <br />
| [[New Resource2 - RTE Users|Users]]<br />
<br />
|}<br />
<br><br />
<br><br />
<br />
=== Not available Resources ===<br />
The following table lists the unpublished resources used by RTE participants. Some of them have been developed by Users themselves specifically for RTE. Interested people may turn to authors to obtain further information.<br />
<br><br />
{|class="wikitable sortable" cellpadding="3" cellspacing="0" border="1"<br />
<br />
|- bgcolor="#CDCDCD"<br />
! width="80"|Resource<br />
! width="50"|Type<br />
! width="180"|Author<br />
! class="unsortable" width="500"|Brief description<br />
! width="30"|<small>PAST Users</small><br />
! width="30"|<small>RTE4 Users</small><br />
! width="30"|<small>RTE5 Users</small><br />
! width="30"|<small>RTE6 Users</small><br />
! class="unsortable" width="30"|<small>Usage info</small><br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.parc.com/ PARC] Polarity Lexicon<br />
| Lexical DB<br />
| PARC - Palo Alto Research Center<br />
| Verbs classification with respect to semantic polarity<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[Parc Polarity Lexicon - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| Gazetteer from [http://trec.nist.gov/ TREC]<br />
| Gazetteer<br />
| NIST - National Institute of Standards and Technology<br />
| Cities and other geographical names<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[Gazetteer from TREC - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| DFKI Geographic Ontology<br>''(to be released)''<br />
| Ontology<br />
| DFKI - German Research Center for Artificial Intelligence<br />
| Ontology containing geographic terms and two kinds of relations: the directional ''part-of'' relation, and the ''equal'' relation for synonyms and abbreviations of the same geographic area (e.g ''the United Kingdom'', ''the UK'', ''Great Britain'', etc.) <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[Geographic Ontology - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| ''Geo''<br />
| Collection of Entailment Rules<br />
| Bar-Ilan University; Tel-Aviv University<br />
| Meronymy entailment rules, based on TREC’s TIPSTER gazetteer.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[Geo - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| ''Regex''<br />
| Collection of Entailment rules <br />
| Bar-Ilan University; Tel-Aviv University<br />
| Small set of entailment rules based on regular expressions, intended to address lexical variability involving temporal phrases<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[Regex - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| Syntactic rule base <br>''(to be released)''<br />
| Collection of Entailment Rules<br />
| Bar-Ilan University; Tel-Aviv University<br />
| A manually-composed collection of entailment rules which define parse tree transformations. The rules cover generic syntactic phenomena such as appositions, conjunctions, passive, relative clause, etc. (Bar-Haim et al., AAAI-07) <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[Syntactic rule base - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| Polarity rule base <br>''(to be released)''<br />
| Collection of Entailment Rules<br />
| Bar-Ilan University; Tel-Aviv University<br />
| A manually-composed collection of entailment rules which detect predicates whose polarity is negative (e.g. didn't dance) or unknown (e.g. plans to dance). The rules capture diverse phenomena that affect polarity, e.g. verbal negation, modal verbs, conditionals, and certain verbs that induce negative or "unknown" polarity context. The latter were taken mainly from VerbNet. Extends a resource described in (Bar-Haim et al., AAAI-07) <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0 <br />
| style="text-align: center;"|0<br />
| [[Polarity rule base - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| Lexical-Syntactic rule base<br />
| Collection of Entailment Rules<br />
| Bar-Ilan University; Tel-Aviv University<br />
| Extract lexical-syntactic entailment rules for predicates (verbal and nominal), including argument mapping. The resource is based on WordNet, Nomlex-Plus and Unary DIRT (Szpektor and Dagan, Coling 08)<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[Lexical-Syntactic rule base - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| OPENU Collection<br />
| Collection of Entailment Rules and Patterns<br />
| Open University<br />
| Collections of rules, patterns etc. for RTE purpose, extracted from Reuter corpus parsed using Minipar.<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0 <br />
| [[OPENU Collection - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| ''Abbr''<br />
| Collection of rules for abbreviation<br />
| Bar-Ilan University; Tel-Aviv University<br />
| 2000 Abbreviation rules, extracted from [http://badc.nerc.ac.uk/help/abbrevs.html BADC] and [http://www.acronym-guide.com/ Acronym Guide]<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[Abbr - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| UAIC Negation_list<br />
| Negation rules<br />
| „Al. I. Cuza“ University, Iasi, Romania <br />
| List of negative terms and words (verbs, adjectives, nouns) affecting modality or factuality of a infinitive verb preceded by the particle "to" (e.g. "believe","necessary", "attempt")<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1 <br />
| style="text-align: center;"|0<br />
| [[UAIC Negation_list - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| DLSIUAES Negation_list<br />
| List of negative terms<br />
| University of Alicante<br />
| Basic list of negative terms.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0 <br />
| [[DLSIUAES Negation_list - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| UAIC Quantifier_list<br />
| List of quantifiers<br />
| „Al. I. Cuza“ University, Iasi, Romania <br />
| List of quantifiers affecting entailment judgment. The quantifiers are taken from a list which contains expressions like “more than”, “less than”, or words such as “over”, “under”, etc.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0 <br />
| [[UAIC Quantifier_list - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| FBKirst StopWord list<br />
| List of frequent words<br />
| FBK-Irst;<br/>University of Trento - Italy <br />
| A list of the 572 most frequent English words.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0 <br />
| [[FBKirst StopWord list - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| IKOMA Dictionary of Named Entities Acronyms and Synonyms<br />
| Dictionary of Named Entities Acronyms and Synonyms<br />
| IKOMA; NEC Corporation, Takayama, Ikoma, Nara, Japan<br />
| Acronym dictionary constructed automatically from the corpus and a synonym dictionary that contains geographical terms.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| [[IKOMA2 - RTE Users|Users]]<br />
<br />
|}<br />
<br><br />
<br />
==Footnotes==<br />
<references/><br />
<br />
Return to: [[Textual_Entailment_Resource_Pool]]</div>DGiampiccolohttps://aclweb.org/aclwiki/index.php?title=RTE_Knowledge_Resources&diff=8781RTE Knowledge Resources2011-04-04T15:55:07Z<p>DGiampiccolo: </p>
<hr />
<div>Knowledge resources have shown their relevance for applied semantic inference, and are extensively used by applied inference systems, such as those developed within the Textual Entailment framework.<br><br />
<br><br />
This page presents a list of the knowledge resources used by systems that have participated in the last RTE challenges. The first table lists the publicly available resources, the second one lists unpublished resources. Both tables are sortable by Resource name, type, author and number of users.<br><br />
<br><br />
RTE Participants are encouraged to add information about all kind of knowledge resources used, from standard existing resources (e.g. WordNet) to knowledge collections created for specific purposes, which can be made available to the community.<br><br />
<br><br />
__TOC__<br />
<br><br />
=== Call for Resources ===<br />
<br />
In order to help the research, all the participants are invited to contribute, sharing their own resources with the RTE community. <br />
<br><br />
Making the resources available to be used by other systems has several advantages. On the one hand, it helps improve the TE technology; on the other hand, it offers an opportunity to further test and evaluate the resource.<br />
<br><br />
<br><br />
*[[RTE-6 - Call for Resources]]<br />
*[[RTE-7 - Call for Resources]]<br />
<br><br />
=== Ablation Tests ===<br />
An ablation test consists of removing one module at a time from a system, and rerunning the system on the test set with the other modules, except the one tested. <br />
<br><br />
Ablation test are meant to help better understand the relevance of the knowledge resources used by RTE systems, and evaluate the contribution of each of them to the systems' performances. In fact, comparing the results achieved in the ablation tests to those obtained by the systems as a whole allows assessing the contribution given by each single resource.<br />
<br><br />
<br><br />
*[[RTE-5 - Ablation Tests]]<br />
*[[RTE-6 - Ablation Tests]]<br />
<br><br />
<br />
=== Publicly available Resources ===<br />
{|class="wikitable sortable" cellpadding="3" cellspacing="0" border="1"<br />
<br />
|- bgcolor="#CDCDCD"<br />
! width="80"|Resource<br />
! width="50"|Type<br />
! width="150"|Author<br />
! class="unsortable" width="300"|Brief description<br />
! width="45"|<small>PAST Users <ref name:"rtethree">RTE-3 data have been provided by participants by means of a questionnaire.</ref></small><br />
! width="45"|<small>RTE-4 Users<ref name:"rtefour">RTE-4 data have been provided by participants and have been integrated with information extracted from the related proceedings.</ref></small><br />
! width="45"|<small>RTE-5 Users<ref name:"rtefive">RTE-5 data have been provided by participants and have been integrated with information extracted from the related proceedings.</ref></small><br />
! width="45"|<small>RTE-6 Users<ref name:"rtesix">RTE-6 data have been provided by participants and have been integrated with information extracted from the related proceedings.</ref></small><br />
! class="unsortable" width="30"|<small>Usage info</small><br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [[WordNet]]<br />
| Lexical DB<br />
| Princeton University<br />
| Lexical database of English nouns, verbs, adjectives and adverbs<br />
| style="text-align: center;"|3<br />
| style="text-align: center;"|21<br />
| style="text-align: center;"|18<br />
|style="text-align: center;"| 22<br />
| [[WordNet - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://xwn.hlt.utdallas.edu/index.html eXtended Wordnet]<br />
| Lexical DB<br />
| Human Language Technology Research Institute, University of Texas at Dallas<br />
| Extension of WordNet based on the exploitation of the information contained in WordNet definitional glosses: the glosses are syntactically parsed, transformed into logic forms and content words are semantically disambiguated. The Extended Wordnet is an ongoing project. <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|2<br />
|style="text-align: center;"| 0<br />
| [[eXtended WordNet - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://ai.stanford.edu/~rion/swn/ Augmented Wordnet]<br />
| Lexical DB<br />
| Stanford University<br />
| The resource is the result of the application of a learning algorithm for inducing semantic taxonomies from parsed text. The algorithm automatically acquires items of world knowledge, and uses these to produce significantly enhanced versions of WordNet (up to 40,000 synsets more).<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Augmented Wordnet - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://verbs.colorado.edu/~mpalmer/projects/verbnet.html Verbnet]<br />
| Lexical DB<br />
| University of Colorado Boulder<br />
| Lexicon for English verbs organized into classes extending Levin (1993) classes through refinement and addition of subclasses to achieve syntactic and semantic coherence among members of a class<br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Verbnet - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [[VerbOcean]]<br />
| Lexical DB<br />
| Information Sciences Institute, University of Southern California<br />
| Broad-coverage semantic network of verbs<br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|3<br />
| style="text-align: center;"|6<br />
|style="text-align: center;"| 7<br />
| [[VerbOcean - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://framenet.icsi.berkeley.edu/ FrameNet]<br />
| Lexical DB<br />
| ICSI (International Computer Science Institute) - Berkley University<br />
| Lexical resource for English words, based on frame semantics (valences) and supported by corpus evidence<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|2<br />
|style="text-align: center;"| 3<br />
| [[Framenet - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://nlp.cs.nyu.edu/meyers/NomBank.html NomBank]<br />
| Lexical DB<br />
| New York University<br />
| Lexical resource containing syntactic frames for nouns, extracted from annotated corpora <br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[NomBank Resource - RTE Users|Users]] <br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://verbs.colorado.edu/~mpalmer/projects/ace.html PropBank]<br />
| Lexical DB<br />
| University of Colorado Boulder <br />
| Lexical resource containing syntactic frames for verbs, extracted from annotated corpora <br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[PropBank Resource - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://nlp.cs.nyu.edu/nomlex/index.html Nomlex] Plus<br />
| Lexical DB<br />
| New York University<br />
| Dictionary of English nominalizations: it describes the allowed complements for a nominalization and relates the nominal complements to the arguments of the corresponding verb<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[Nomlex Plus - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.cs.ualberta.ca/~lindek/downloads.htm Dekang Lin’s Thesaurus]<br />
| Thesaurus<br />
| University of Alberta<br />
| Thesaurus automatically constructed using a parsed corpus, based on distributional similarity scores<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 2<br />
| [[Dekang Lin’s Thesaurus - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://icon.shef.ac.uk/Moby/mthes.html Grady Ward's Moby Thesaurus]<br />
| Thesaurus<br />
| University of Sheffield<br />
| Thesaurus containing 30,260 root words, with 2,520,264 synonyms and related terms. Grady Ward placed this thesaurus in the public domain in 1996.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Grady Ward's Moby Thesaurus - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://en.wikipedia.org/wiki/Roget%27s_Thesaurus Roget's Thesaurus]<br />
| Thesaurus<br />
| Peter Mark Roget (Electronic version distributed by University of Chicago)<br />
| Roget's Thesaurus is a widely-used English thesaurus, created by Dr. Peter Mark Roget in 1805. The original edition had 15,000 words, and each new edition has been larger. The electronic edition ([http://machaut.uchicago.edu/rogets version 1.02]) is made available by University of Chicago.<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Roget's Thesaurus - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.wikipedia.org/ Wikipedia]<br />
| Encyclopedia<br />
| <br />
| Free encyclopedia. Used for extraction of lexical-semantic rules (from its more structured parts), named entity recognition, geographical information etc.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|3<br />
| style="text-align: center;"|6<br />
|style="text-align: center;"| 6<br />
| [[Wikipedia - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.umbel.org/ Umbel]<br />
| Ontology<br />
| Structured Dynamics LLC, Coralville, IA<br />
| UMBEL stands for Upper Mapping and Binding Exchange Layer and is a lightweight ontology structure for relating Web content and data to a standard set of subject concepts<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Umbel - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.mpi-inf.mpg.de/yago-naga/yago/ YAGO]<br />
| Ontology<br />
| Max-Planck Institute for Informatics, Saarbrücken, Germany<br />
| Light-weight and extensible ontology. It contains more than 2 million entities and 20 million facts about these entities. The facts have been automatically extracted from Wikipedia and unified with WordNet.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[YAGO - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://dbpedia.org/About DBpedia]<br />
| Ontology<br />
| Open community project<br />
| DBpedia is a community effort to extract structured information from Wikipedia and to make this information available on the Web. The DBpedia knowledge base currently describes more than 2.9 million things in 91 different languages and consists of 479 million pieces of information.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[DBpedia - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [[DIRT Paraphrase Collection]]<br />
| Collection of paraphrases<br />
| University of Alberta<br />
| DIRT (Discovery of Inference Rules from Text) is both an algorithm and a resulting knowledge collection. The DIRT knowledge collection is the output of the DIRT algorithm over a 1GB set of newspaper text.<br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|4<br />
| style="text-align: center;"|3<br />
|style="text-align: center;"| 4<br />
| [[DIRT Paraphrase Collection - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [[TEASE]] Collection<br />
| Collection of Entailment Rules<br />
| Bar-Ilan University<br />
| Output of the TEASE algorithm <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[Tease Collection - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://badc.nerc.ac.uk/help/abbrevs.html BADC Acronym and Abbreviation List]<br />
| Word List<br />
| BADC (British Atmospheric Data Centre)<br />
| Acronym and Abbreviation List<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[BADC Acronym and Abbreviation List - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://clipdemos.umiacs.umd.edu/catvar/ Catvar The Categorial Variation Database (English)]<br />
| Word List<br />
| University of Maryland<br />
| A Categorial-Variation Database (or Catvar) is a database of clusters of uninflected words (lexemes) and their categorial (i.e. part-of-speech) variants. The database was developed for English using a combination of resources and algorithms, including the LCS Verb and Preposition Databases (Dorr 2001), the Brown Corpus section of the Penn Treebank (Marcus et al. 1994), an English morphological analysis lexicon developed for PC-Kimmo (ENGLEX) (Antworth 1990), WordNet1.6 (Fellbaum 1998), an English Verb-Noun list extracted from Nomlex (Macleod et al. 1998), a similar list extracted from LDOCE (Procter 1978) and the Porter stemmer (Porter 1980).<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 1<br />
| [[Categorial-Variation Database - RTE Users|Users]]<br />
<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.acronym-guide.com/ Acronym Guide]<br />
| Word List<br />
| Acronym-Guide.com <br />
| Acronym and Abbreviation Lists for English, branched in thematic directories <br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|3<br />
|style="text-align: center;"| 0<br />
| [[Acronym Guide - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC2006T13 Web1T 5-grams]<br />
| Word list<br />
| Linguistic Data Consortium, University of Pennsylvania; Google Inc.<br />
| Data set containing English word n-grams and their observed frequency counts. The n-gram counts were generated from approximately 1 trillion word tokens of text from publicly accessible Web pages<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[Web1T - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.coli.uni-saarland.de/%7Erwang/resources/RTE3_RTE-4_NGD.zip Normalized Google Distance (RTE3&RTE-4)]<br />
| Word Pair Co-occurrence<br />
| Saarland University<br />
| Co-occurrence of the word pairs in RTE3 and RTE-4 using Normalized Google Distance (Cilibrasi and Vitanyi, 2004). The word pairs are all the possible combinations of content words in T and H. In practice, we used [http://m.www.yahoo.com/ Yahoo!] as the search engine.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Normalized Google Distance (RTE3&RTE-4)- RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.coli.uni-saarland.de/%7Erwang/resources/RTE-5_NGD.zip Normalized Google Distance (RTE-5)]<br />
| Word Pair Co-occurrence<br />
| Saarland University<br />
| Co-occurrence of the word pairs in RTE3 and RTE-4 using Normalized Google Distance (Cilibrasi and Vitanyi, 2004). The word pairs are all the possible combinations of content words in T and H. In practice, we used [http://m.www.yahoo.com/ Yahoo!] as the search engine.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Normalized Google Distance (RTE-5)- RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://geonames.usgs.gov/index.html GNIS - Geographic Names Information System]<br />
| Gazetteer<br />
| USGS (United States Geological Survey)<br />
| Database containing the Federal and national standard toponyms for USA, associated areas and Antarctica<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[GNIS - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.geonames.org/ Geonames]<br />
| Gazetteer<br />
| <br />
| Database containing eight million geographical names. It is integrating geographical data such as names of places in various languages, elevation, population and others from various sources.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[Geonames - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://nlp.cs.nyu.edu/paraphrase/ Sekine's Paraphrase Database]<br />
| Collection of paraphrases<br />
| Department of Computer Science, New York University <br />
| Data-base created using Sekine's method, NOT cleaned up by human. It includes 19,975 sets of paraphrases with 191,572 phrases.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[Sekine's Paraphrase Database - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://research.microsoft.com/research/downloads/Details/607D14D9-20CD-47E3-85BC-A2F65CD28042/Details.aspx Microsoft Research Paraphrase Corpus]<br />
| Collection of paraphrases<br />
| Microsoft Research<br />
| Text file containing 5800 pairs of sentences which have been extracted from news sources on the web, along with human annotations indicating whether each pair captures a paraphrase/semantic equivalence relationship.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[Microsoft Research Paraphrase Corpus - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.cs.cornell.edu/~cristian/Without_a_doubt_-_Data.html Downward entailing operators] <br />
| Collection of entailing operators<br />
| Department of Computer Science, Cornell University, Ithaca NY<br />
| System output of an unsupervised algorithm recovering many Downward Entailing operators, like 'doubt'.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Downward entailing operators - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://u.cs.biu.ac.il/~nlp/downloads/WikiRules.html WikiRules!]<br />
| Lexical Reference rule-base<br />
| Bar-Ilan University<br />
| Extraction of about 8 million lexical reference rules from the text body (first sentence) and from metadata (links, redirects, parentheses) of Wikipedia. Provides better performance than other automatically constructed resources and comparable performance to WordNet. Offers complementary knowledge to WordNet. <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[WikiRules! - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://www.cs.utexas.edu/users/pclark/dart/ DART]<br />
| Collection of "world knowledge" propositions<br />
| Boeing Research and Technology<br />
| 23 million tuples such as "airplanes can fly to airports", "rivers can flood" collected from abstracted parse trees.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[DART - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://cs.biu.ac.il/~nlp/download/FRED/latest/FRED.zip FRED]<br />
| FrameNet-derived entailment rule-base<br />
| Bar-Ilan University<br />
| This package contains the outputs of the FRED algorithm, an algorithm which extracts entailment rules from FrameNet.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[FRED - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://u.cs.biu.ac.il/~nlp/downloads/DIRECT.html DIRECT]<br />
| Directional Distributional Term-Similarity Resource<br />
| Bar-Ilan University<br />
| This is a resource of directional distributional term-similarity rules (mostly lexical entailment rules) automatically extracted using the inclusion relation as described in (Kotlerman et.al., ACL-09).<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[DIRECT - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://u.cs.biu.ac.il/~nlp/downloads/binaryDirt.html binaryDIRT]<br />
| Entailment rules between binary templates using DIRT algorithm<br />
| Bar-Ilan University<br />
| This resource contains entailment rules over binary templates learned over the Reuters corpus using<br />
the DIRT algorithm of Lin and Pantel.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[BinaryDIRT- RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://u.cs.biu.ac.il/~nlp/downloads/unaryBInc.html unaryBInc]<br />
| Entailment rules between unary templates using BInc algorithm<br />
| Bar-Ilan University<br />
| This resource contains entailment rules over unary templates learned over the Reuters corpus using<br />
the BInc algorithm of Szpektor and Dagan (2008).<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[unaryBInc- RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| ''New resource''<br />
| <br />
| <br />
| ''Participants are encouraged to contribute''<br />
| style="text-align: center;"| <br />
| style="text-align: center;"| <br />
| style="text-align: center;"| <br />
|style="text-align: center;"| <br />
| [[New Resource2 - RTE Users|Users]]<br />
<br />
|}<br />
<br><br />
<br><br />
<br />
=== Not available Resources ===<br />
The following table lists the unpublished resources used by RTE participants. Some of them have been developed by Users themselves specifically for RTE. Interested people may turn to authors to obtain further information.<br />
<br><br />
{|class="wikitable sortable" cellpadding="3" cellspacing="0" border="1"<br />
<br />
|- bgcolor="#CDCDCD"<br />
! width="80"|Resource<br />
! width="50"|Type<br />
! width="180"|Author<br />
! class="unsortable" width="500"|Brief description<br />
! width="30"|<small>PAST Users</small><br />
! width="30"|<small>RTE-4 Users</small><br />
! width="30"|<small>RTE-5 Users</small><br />
! width="30"|<small>RTE-6 Users</small><br />
! class="unsortable" width="30"|<small>Usage info</small><br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.parc.com/ PARC] Polarity Lexicon<br />
| Lexical DB<br />
| PARC - Palo Alto Research Center<br />
| Verbs classification with respect to semantic polarity<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[Parc Polarity Lexicon - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| Gazetteer from [http://trec.nist.gov/ TREC]<br />
| Gazetteer<br />
| NIST - National Institute of Standards and Technology<br />
| Cities and other geographical names<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[Gazetteer from TREC - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| DFKI Geographic Ontology<br>''(to be released)''<br />
| Ontology<br />
| DFKI - German Research Center for Artificial Intelligence<br />
| Ontology containing geographic terms and two kinds of relations: the directional ''part-of'' relation, and the ''equal'' relation for synonyms and abbreviations of the same geographic area (e.g ''the United Kingdom'', ''the UK'', ''Great Britain'', etc.) <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[Geographic Ontology - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| ''Geo''<br />
| Collection of Entailment Rules<br />
| Bar-Ilan University; Tel-Aviv University<br />
| Meronymy entailment rules, based on TREC’s TIPSTER gazetteer.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[Geo - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| ''Regex''<br />
| Collection of Entailment rules <br />
| Bar-Ilan University; Tel-Aviv University<br />
| Small set of entailment rules based on regular expressions, intended to address lexical variability involving temporal phrases<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[Regex - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| Syntactic rule base <br>''(to be released)''<br />
| Collection of Entailment Rules<br />
| Bar-Ilan University; Tel-Aviv University<br />
| A manually-composed collection of entailment rules which define parse tree transformations. The rules cover generic syntactic phenomena such as appositions, conjunctions, passive, relative clause, etc. (Bar-Haim et al., AAAI-07) <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[Syntactic rule base - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| Polarity rule base <br>''(to be released)''<br />
| Collection of Entailment Rules<br />
| Bar-Ilan University; Tel-Aviv University<br />
| A manually-composed collection of entailment rules which detect predicates whose polarity is negative (e.g. didn't dance) or unknown (e.g. plans to dance). The rules capture diverse phenomena that affect polarity, e.g. verbal negation, modal verbs, conditionals, and certain verbs that induce negative or "unknown" polarity context. The latter were taken mainly from VerbNet. Extends a resource described in (Bar-Haim et al., AAAI-07) <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0 <br />
| style="text-align: center;"|0<br />
| [[Polarity rule base - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| Lexical-Syntactic rule base<br />
| Collection of Entailment Rules<br />
| Bar-Ilan University; Tel-Aviv University<br />
| Extract lexical-syntactic entailment rules for predicates (verbal and nominal), including argument mapping. The resource is based on WordNet, Nomlex-Plus and Unary DIRT (Szpektor and Dagan, Coling 08)<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[Lexical-Syntactic rule base - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| OPENU Collection<br />
| Collection of Entailment Rules and Patterns<br />
| Open University<br />
| Collections of rules, patterns etc. for RTE purpose, extracted from Reuter corpus parsed using Minipar.<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0 <br />
| [[OPENU Collection - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| ''Abbr''<br />
| Collection of rules for abbreviation<br />
| Bar-Ilan University; Tel-Aviv University<br />
| 2000 Abbreviation rules, extracted from [http://badc.nerc.ac.uk/help/abbrevs.html BADC] and [http://www.acronym-guide.com/ Acronym Guide]<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[Abbr - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| UAIC Negation_list<br />
| Negation rules<br />
| „Al. I. Cuza“ University, Iasi, Romania <br />
| List of negative terms and words (verbs, adjectives, nouns) affecting modality or factuality of a infinitive verb preceded by the particle "to" (e.g. "believe","necessary", "attempt")<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1 <br />
| style="text-align: center;"|0<br />
| [[UAIC Negation_list - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| DLSIUAES Negation_list<br />
| List of negative terms<br />
| University of Alicante<br />
| Basic list of negative terms.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0 <br />
| [[DLSIUAES Negation_list - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| UAIC Quantifier_list<br />
| List of quantifiers<br />
| „Al. I. Cuza“ University, Iasi, Romania <br />
| List of quantifiers affecting entailment judgment. The quantifiers are taken from a list which contains expressions like “more than”, “less than”, or words such as “over”, “under”, etc.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0 <br />
| [[UAIC Quantifier_list - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| FBKirst StopWord list<br />
| List of frequent words<br />
| FBK-Irst;<br/>University of Trento - Italy <br />
| A list of the 572 most frequent English words.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0 <br />
| [[FBKirst StopWord list - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| IKOMA Dictionary of Named Entities Acronyms and Synonyms<br />
| Dictionary of Named Entities Acronyms and Synonyms<br />
| IKOMA; NEC Corporation, Takayama, Ikoma, Nara, Japan<br />
| Acronym dictionary constructed automatically from the corpus and a synonym dictionary that contains geographical terms.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| [[IKOMA2 - RTE Users|Users]]<br />
<br />
|}<br />
<br><br />
<br />
==Footnotes==<br />
<references/><br />
<br />
Return to: [[Textual_Entailment_Resource_Pool]]</div>DGiampiccolohttps://aclweb.org/aclwiki/index.php?title=RTE_Knowledge_Resources&diff=8780RTE Knowledge Resources2011-04-04T15:52:24Z<p>DGiampiccolo: </p>
<hr />
<div>Knowledge resources have shown their relevance for applied semantic inference, and are extensively used by applied inference systems, such as those developed within the Textual Entailment framework.<br><br />
<br><br />
This page presents a list of the knowledge resources used by systems that have participated in the last RTE challenges. The first table lists the publicly available resources, the second one lists unpublished resources. Both tables are sortable by Resource name, type, author and number of users.<br><br />
<br><br />
RTE Participants are encouraged to add information about all kind of knowledge resources used, from standard existing resources (e.g. WordNet) to knowledge collections created for specific purposes, which can be made available to the community.<br><br />
<br><br />
__TOC__<br />
<br><br />
=== Call for Resources ===<br />
<br />
In order to help the research, all the participants are invited to contribute, sharing their own resources with the RTE community. <br />
<br><br />
Making the resources available to be used by other systems has several advantages. On the one hand, it helps improve the TE technology; on the other hand, it offers an opportunity to further test and evaluate the resource.<br />
<br><br />
<br><br />
*[[RTE-6 - Call for Resources]]<br />
*[[RTE-7 - Call for Resources]]<br />
<br><br />
=== Ablation Tests ===<br />
An ablation test consists of removing one module at a time from a system, and rerunning the system on the test set with the other modules, except the one tested. <br />
<br><br />
Ablation test are meant to help better understand the relevance of the knowledge resources used by RTE systems, and evaluate the contribution of each of them to the systems' performances. In fact, comparing the results achieved in the ablation tests to those obtained by the systems as a whole allows assessing the contribution given by each single resource.<br />
<br><br />
<br><br />
*[[RTE-5 - Ablation Tests]]<br />
*[[RTE-6 - Ablation Tests]]<br />
<br><br />
<br />
=== Publicly available Resources ===<br />
{|class="wikitable sortable" cellpadding="3" cellspacing="0" border="1"<br />
<br />
|- bgcolor="#CDCDCD"<br />
! width="80"|Resource<br />
! width="50"|Type<br />
! width="150"|Author<br />
! class="unsortable" width="300"|Brief description<br />
! width="45"|<small>PAST Users <ref name:"rtethree">RTE-3 data have been provided by participants by means of a questionnaire.</ref></small><br />
! width="45"|<small>RTE4 Users<ref name:"rtefour">RTE-4 data have been provided by participants and have been integrated with information extracted from the related proceedings.</ref></small><br />
! width="45"|<small>RTE5 Users<ref name:"rtefive">RTE-5 data have been provided by participants and have been integrated with information extracted from the related proceedings.</ref></small><br />
! width="45"|<small>RTE6 Users<ref name:"rtesix">RTE-6 data have been provided by participants and have been integrated with information extracted from the related proceedings.</ref></small><br />
! class="unsortable" width="30"|<small>Usage info</small><br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [[WordNet]]<br />
| Lexical DB<br />
| Princeton University<br />
| Lexical database of English nouns, verbs, adjectives and adverbs<br />
| style="text-align: center;"|3<br />
| style="text-align: center;"|21<br />
| style="text-align: center;"|18<br />
|style="text-align: center;"| 22<br />
| [[WordNet - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://xwn.hlt.utdallas.edu/index.html eXtended Wordnet]<br />
| Lexical DB<br />
| Human Language Technology Research Institute, University of Texas at Dallas<br />
| Extension of WordNet based on the exploitation of the information contained in WordNet definitional glosses: the glosses are syntactically parsed, transformed into logic forms and content words are semantically disambiguated. The Extended Wordnet is an ongoing project. <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|2<br />
|style="text-align: center;"| 0<br />
| [[eXtended WordNet - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://ai.stanford.edu/~rion/swn/ Augmented Wordnet]<br />
| Lexical DB<br />
| Stanford University<br />
| The resource is the result of the application of a learning algorithm for inducing semantic taxonomies from parsed text. The algorithm automatically acquires items of world knowledge, and uses these to produce significantly enhanced versions of WordNet (up to 40,000 synsets more).<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Augmented Wordnet - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://verbs.colorado.edu/~mpalmer/projects/verbnet.html Verbnet]<br />
| Lexical DB<br />
| University of Colorado Boulder<br />
| Lexicon for English verbs organized into classes extending Levin (1993) classes through refinement and addition of subclasses to achieve syntactic and semantic coherence among members of a class<br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Verbnet - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [[VerbOcean]]<br />
| Lexical DB<br />
| Information Sciences Institute, University of Southern California<br />
| Broad-coverage semantic network of verbs<br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|3<br />
| style="text-align: center;"|6<br />
|style="text-align: center;"| 7<br />
| [[VerbOcean - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://framenet.icsi.berkeley.edu/ FrameNet]<br />
| Lexical DB<br />
| ICSI (International Computer Science Institute) - Berkley University<br />
| Lexical resource for English words, based on frame semantics (valences) and supported by corpus evidence<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|2<br />
|style="text-align: center;"| 3<br />
| [[Framenet - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://nlp.cs.nyu.edu/meyers/NomBank.html NomBank]<br />
| Lexical DB<br />
| New York University<br />
| Lexical resource containing syntactic frames for nouns, extracted from annotated corpora <br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[NomBank Resource - RTE Users|Users]] <br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://verbs.colorado.edu/~mpalmer/projects/ace.html PropBank]<br />
| Lexical DB<br />
| University of Colorado Boulder <br />
| Lexical resource containing syntactic frames for verbs, extracted from annotated corpora <br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[PropBank Resource - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://nlp.cs.nyu.edu/nomlex/index.html Nomlex] Plus<br />
| Lexical DB<br />
| New York University<br />
| Dictionary of English nominalizations: it describes the allowed complements for a nominalization and relates the nominal complements to the arguments of the corresponding verb<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[Nomlex Plus - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.cs.ualberta.ca/~lindek/downloads.htm Dekang Lin’s Thesaurus]<br />
| Thesaurus<br />
| University of Alberta<br />
| Thesaurus automatically constructed using a parsed corpus, based on distributional similarity scores<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 2<br />
| [[Dekang Lin’s Thesaurus - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://icon.shef.ac.uk/Moby/mthes.html Grady Ward's Moby Thesaurus]<br />
| Thesaurus<br />
| University of Sheffield<br />
| Thesaurus containing 30,260 root words, with 2,520,264 synonyms and related terms. Grady Ward placed this thesaurus in the public domain in 1996.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Grady Ward's Moby Thesaurus - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://en.wikipedia.org/wiki/Roget%27s_Thesaurus Roget's Thesaurus]<br />
| Thesaurus<br />
| Peter Mark Roget (Electronic version distributed by University of Chicago)<br />
| Roget's Thesaurus is a widely-used English thesaurus, created by Dr. Peter Mark Roget in 1805. The original edition had 15,000 words, and each new edition has been larger. The electronic edition ([http://machaut.uchicago.edu/rogets version 1.02]) is made available by University of Chicago.<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Roget's Thesaurus - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.wikipedia.org/ Wikipedia]<br />
| Encyclopedia<br />
| <br />
| Free encyclopedia. Used for extraction of lexical-semantic rules (from its more structured parts), named entity recognition, geographical information etc.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|3<br />
| style="text-align: center;"|6<br />
|style="text-align: center;"| 6<br />
| [[Wikipedia - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.umbel.org/ Umbel]<br />
| Ontology<br />
| Structured Dynamics LLC, Coralville, IA<br />
| UMBEL stands for Upper Mapping and Binding Exchange Layer and is a lightweight ontology structure for relating Web content and data to a standard set of subject concepts<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Umbel - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.mpi-inf.mpg.de/yago-naga/yago/ YAGO]<br />
| Ontology<br />
| Max-Planck Institute for Informatics, Saarbrücken, Germany<br />
| Light-weight and extensible ontology. It contains more than 2 million entities and 20 million facts about these entities. The facts have been automatically extracted from Wikipedia and unified with WordNet.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[YAGO - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://dbpedia.org/About DBpedia]<br />
| Ontology<br />
| Open community project<br />
| DBpedia is a community effort to extract structured information from Wikipedia and to make this information available on the Web. The DBpedia knowledge base currently describes more than 2.9 million things in 91 different languages and consists of 479 million pieces of information.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[DBpedia - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [[DIRT Paraphrase Collection]]<br />
| Collection of paraphrases<br />
| University of Alberta<br />
| DIRT (Discovery of Inference Rules from Text) is both an algorithm and a resulting knowledge collection. The DIRT knowledge collection is the output of the DIRT algorithm over a 1GB set of newspaper text.<br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|4<br />
| style="text-align: center;"|3<br />
|style="text-align: center;"| 4<br />
| [[DIRT Paraphrase Collection - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [[TEASE]] Collection<br />
| Collection of Entailment Rules<br />
| Bar-Ilan University<br />
| Output of the TEASE algorithm <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[Tease Collection - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://badc.nerc.ac.uk/help/abbrevs.html BADC Acronym and Abbreviation List]<br />
| Word List<br />
| BADC (British Atmospheric Data Centre)<br />
| Acronym and Abbreviation List<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[BADC Acronym and Abbreviation List - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://clipdemos.umiacs.umd.edu/catvar/ Catvar The Categorial Variation Database (English)]<br />
| Word List<br />
| University of Maryland<br />
| A Categorial-Variation Database (or Catvar) is a database of clusters of uninflected words (lexemes) and their categorial (i.e. part-of-speech) variants. The database was developed for English using a combination of resources and algorithms, including the LCS Verb and Preposition Databases (Dorr 2001), the Brown Corpus section of the Penn Treebank (Marcus et al. 1994), an English morphological analysis lexicon developed for PC-Kimmo (ENGLEX) (Antworth 1990), WordNet1.6 (Fellbaum 1998), an English Verb-Noun list extracted from Nomlex (Macleod et al. 1998), a similar list extracted from LDOCE (Procter 1978) and the Porter stemmer (Porter 1980).<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 1<br />
| [[Categorial-Variation Database - RTE Users|Users]]<br />
<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.acronym-guide.com/ Acronym Guide]<br />
| Word List<br />
| Acronym-Guide.com <br />
| Acronym and Abbreviation Lists for English, branched in thematic directories <br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|3<br />
|style="text-align: center;"| 0<br />
| [[Acronym Guide - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC2006T13 Web1T 5-grams]<br />
| Word list<br />
| Linguistic Data Consortium, University of Pennsylvania; Google Inc.<br />
| Data set containing English word n-grams and their observed frequency counts. The n-gram counts were generated from approximately 1 trillion word tokens of text from publicly accessible Web pages<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[Web1T - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.coli.uni-saarland.de/%7Erwang/resources/RTE3_RTE4_NGD.zip Normalized Google Distance (RTE3&RTE4)]<br />
| Word Pair Co-occurrence<br />
| Saarland University<br />
| Co-occurrence of the word pairs in RTE3 and RTE4 using Normalized Google Distance (Cilibrasi and Vitanyi, 2004). The word pairs are all the possible combinations of content words in T and H. In practice, we used [http://m.www.yahoo.com/ Yahoo!] as the search engine.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Normalized Google Distance (RTE3&RTE4)- RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.coli.uni-saarland.de/%7Erwang/resources/RTE5_NGD.zip Normalized Google Distance (RTE5)]<br />
| Word Pair Co-occurrence<br />
| Saarland University<br />
| Co-occurrence of the word pairs in RTE3 and RTE4 using Normalized Google Distance (Cilibrasi and Vitanyi, 2004). The word pairs are all the possible combinations of content words in T and H. In practice, we used [http://m.www.yahoo.com/ Yahoo!] as the search engine.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Normalized Google Distance (RTE5)- RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://geonames.usgs.gov/index.html GNIS - Geographic Names Information System]<br />
| Gazetteer<br />
| USGS (United States Geological Survey)<br />
| Database containing the Federal and national standard toponyms for USA, associated areas and Antarctica<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[GNIS - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.geonames.org/ Geonames]<br />
| Gazetteer<br />
| <br />
| Database containing eight million geographical names. It is integrating geographical data such as names of places in various languages, elevation, population and others from various sources.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[Geonames - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://nlp.cs.nyu.edu/paraphrase/ Sekine's Paraphrase Database]<br />
| Collection of paraphrases<br />
| Department of Computer Science, New York University <br />
| Data-base created using Sekine's method, NOT cleaned up by human. It includes 19,975 sets of paraphrases with 191,572 phrases.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[Sekine's Paraphrase Database - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://research.microsoft.com/research/downloads/Details/607D14D9-20CD-47E3-85BC-A2F65CD28042/Details.aspx Microsoft Research Paraphrase Corpus]<br />
| Collection of paraphrases<br />
| Microsoft Research<br />
| Text file containing 5800 pairs of sentences which have been extracted from news sources on the web, along with human annotations indicating whether each pair captures a paraphrase/semantic equivalence relationship.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[Microsoft Research Paraphrase Corpus - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.cs.cornell.edu/~cristian/Without_a_doubt_-_Data.html Downward entailing operators] <br />
| Collection of entailing operators<br />
| Department of Computer Science, Cornell University, Ithaca NY<br />
| System output of an unsupervised algorithm recovering many Downward Entailing operators, like 'doubt'.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Downward entailing operators - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://u.cs.biu.ac.il/~nlp/downloads/WikiRules.html WikiRules!]<br />
| Lexical Reference rule-base<br />
| Bar-Ilan University<br />
| Extraction of about 8 million lexical reference rules from the text body (first sentence) and from metadata (links, redirects, parentheses) of Wikipedia. Provides better performance than other automatically constructed resources and comparable performance to WordNet. Offers complementary knowledge to WordNet. <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[WikiRules! - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://www.cs.utexas.edu/users/pclark/dart/ DART]<br />
| Collection of "world knowledge" propositions<br />
| Boeing Research and Technology<br />
| 23 million tuples such as "airplanes can fly to airports", "rivers can flood" collected from abstracted parse trees.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[DART - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://cs.biu.ac.il/~nlp/download/FRED/latest/FRED.zip FRED]<br />
| FrameNet-derived entailment rule-base<br />
| Bar-Ilan University<br />
| This package contains the outputs of the FRED algorithm, an algorithm which extracts entailment rules from FrameNet.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[FRED - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://u.cs.biu.ac.il/~nlp/downloads/DIRECT.html DIRECT]<br />
| Directional Distributional Term-Similarity Resource<br />
| Bar-Ilan University<br />
| This is a resource of directional distributional term-similarity rules (mostly lexical entailment rules) automatically extracted using the inclusion relation as described in (Kotlerman et.al., ACL-09).<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[DIRECT - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://u.cs.biu.ac.il/~nlp/downloads/binaryDirt.html binaryDIRT]<br />
| Entailment rules between binary templates using DIRT algorithm<br />
| Bar-Ilan University<br />
| This resource contains entailment rules over binary templates learned over the Reuters corpus using<br />
the DIRT algorithm of Lin and Pantel.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[BinaryDIRT- RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://u.cs.biu.ac.il/~nlp/downloads/unaryBInc.html unaryBInc]<br />
| Entailment rules between unary templates using BInc algorithm<br />
| Bar-Ilan University<br />
| This resource contains entailment rules over unary templates learned over the Reuters corpus using<br />
the BInc algorithm of Szpektor and Dagan (2008).<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[unaryBInc- RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| ''New resource''<br />
| <br />
| <br />
| ''Participants are encouraged to contribute''<br />
| style="text-align: center;"| <br />
| style="text-align: center;"| <br />
| style="text-align: center;"| <br />
|style="text-align: center;"| <br />
| [[New Resource2 - RTE Users|Users]]<br />
<br />
|}<br />
<br><br />
<br><br />
<br />
=== Not available Resources ===<br />
The following table lists the unpublished resources used by RTE participants. Some of them have been developed by Users themselves specifically for RTE. Interested people may turn to authors to obtain further information.<br />
<br><br />
{|class="wikitable sortable" cellpadding="3" cellspacing="0" border="1"<br />
<br />
|- bgcolor="#CDCDCD"<br />
! width="80"|Resource<br />
! width="50"|Type<br />
! width="180"|Author<br />
! class="unsortable" width="500"|Brief description<br />
! width="30"|<small>PAST Users</small><br />
! width="30"|<small>RTE4 Users</small><br />
! width="30"|<small>RTE5 Users</small><br />
! width="30"|<small>RTE6 Users</small><br />
! class="unsortable" width="30"|<small>Usage info</small><br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.parc.com/ PARC] Polarity Lexicon<br />
| Lexical DB<br />
| PARC - Palo Alto Research Center<br />
| Verbs classification with respect to semantic polarity<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[Parc Polarity Lexicon - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| Gazetteer from [http://trec.nist.gov/ TREC]<br />
| Gazetteer<br />
| NIST - National Institute of Standards and Technology<br />
| Cities and other geographical names<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[Gazetteer from TREC - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| DFKI Geographic Ontology<br>''(to be released)''<br />
| Ontology<br />
| DFKI - German Research Center for Artificial Intelligence<br />
| Ontology containing geographic terms and two kinds of relations: the directional ''part-of'' relation, and the ''equal'' relation for synonyms and abbreviations of the same geographic area (e.g ''the United Kingdom'', ''the UK'', ''Great Britain'', etc.) <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[Geographic Ontology - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| ''Geo''<br />
| Collection of Entailment Rules<br />
| Bar-Ilan University; Tel-Aviv University<br />
| Meronymy entailment rules, based on TREC’s TIPSTER gazetteer.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[Geo - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| ''Regex''<br />
| Collection of Entailment rules <br />
| Bar-Ilan University; Tel-Aviv University<br />
| Small set of entailment rules based on regular expressions, intended to address lexical variability involving temporal phrases<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[Regex - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| Syntactic rule base <br>''(to be released)''<br />
| Collection of Entailment Rules<br />
| Bar-Ilan University; Tel-Aviv University<br />
| A manually-composed collection of entailment rules which define parse tree transformations. The rules cover generic syntactic phenomena such as appositions, conjunctions, passive, relative clause, etc. (Bar-Haim et al., AAAI-07) <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[Syntactic rule base - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| Polarity rule base <br>''(to be released)''<br />
| Collection of Entailment Rules<br />
| Bar-Ilan University; Tel-Aviv University<br />
| A manually-composed collection of entailment rules which detect predicates whose polarity is negative (e.g. didn't dance) or unknown (e.g. plans to dance). The rules capture diverse phenomena that affect polarity, e.g. verbal negation, modal verbs, conditionals, and certain verbs that induce negative or "unknown" polarity context. The latter were taken mainly from VerbNet. Extends a resource described in (Bar-Haim et al., AAAI-07) <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0 <br />
| style="text-align: center;"|0<br />
| [[Polarity rule base - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| Lexical-Syntactic rule base<br />
| Collection of Entailment Rules<br />
| Bar-Ilan University; Tel-Aviv University<br />
| Extract lexical-syntactic entailment rules for predicates (verbal and nominal), including argument mapping. The resource is based on WordNet, Nomlex-Plus and Unary DIRT (Szpektor and Dagan, Coling 08)<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[Lexical-Syntactic rule base - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| OPENU Collection<br />
| Collection of Entailment Rules and Patterns<br />
| Open University<br />
| Collections of rules, patterns etc. for RTE purpose, extracted from Reuter corpus parsed using Minipar.<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0 <br />
| [[OPENU Collection - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| ''Abbr''<br />
| Collection of rules for abbreviation<br />
| Bar-Ilan University; Tel-Aviv University<br />
| 2000 Abbreviation rules, extracted from [http://badc.nerc.ac.uk/help/abbrevs.html BADC] and [http://www.acronym-guide.com/ Acronym Guide]<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[Abbr - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| UAIC Negation_list<br />
| Negation rules<br />
| „Al. I. Cuza“ University, Iasi, Romania <br />
| List of negative terms and words (verbs, adjectives, nouns) affecting modality or factuality of a infinitive verb preceded by the particle "to" (e.g. "believe","necessary", "attempt")<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1 <br />
| style="text-align: center;"|0<br />
| [[UAIC Negation_list - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| DLSIUAES Negation_list<br />
| List of negative terms<br />
| University of Alicante<br />
| Basic list of negative terms.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0 <br />
| [[DLSIUAES Negation_list - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| UAIC Quantifier_list<br />
| List of quantifiers<br />
| „Al. I. Cuza“ University, Iasi, Romania <br />
| List of quantifiers affecting entailment judgment. The quantifiers are taken from a list which contains expressions like “more than”, “less than”, or words such as “over”, “under”, etc.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0 <br />
| [[UAIC Quantifier_list - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| FBKirst StopWord list<br />
| List of frequent words<br />
| FBK-Irst;<br/>University of Trento - Italy <br />
| A list of the 572 most frequent English words.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0 <br />
| [[FBKirst StopWord list - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| IKOMA Dictionary of Named Entities Acronyms and Synonyms<br />
| Dictionary of Named Entities Acronyms and Synonyms<br />
| IKOMA; NEC Corporation, Takayama, Ikoma, Nara, Japan<br />
| Acronym dictionary constructed automatically from the corpus and a synonym dictionary that contains geographical terms.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| [[IKOMA2 - RTE Users|Users]]<br />
<br />
|}<br />
<br><br />
<br />
==Footnotes==<br />
<references/><br />
<br />
Return to: [[Textual_Entailment_Resource_Pool]]</div>DGiampiccolohttps://aclweb.org/aclwiki/index.php?title=RTE7_-_Call_for_Resources&diff=8779RTE7 - Call for Resources2011-04-04T15:50:42Z<p>DGiampiccolo: </p>
<hr />
<div>A study on the knowledge resources used in Textual Entailment was started with the ablation tests carried out on the RTE-5 Main task submissions, which gave a hint about the impact of knowledge resources on TE system performances. Given the interest raised by this new initiative, the investigation of knowledge resources, which in RTE-6 was extended to tools, will be also continued in the RTE-7 challenge.<p><br />
<br />
Making the resources available to be used by other systems has several advantages: <br />
<br />
* It helps improve the TE technology; <br />
* it offers an opportunity to further test and evaluate the resource.</p><br />
<br />
In order to help the research, all the participants are invited to contribute and share their own resources with the RTE community, adding them to the table below.<br />
<br />
<!-- ==New resources available!== --><br />
<br><br />
<br><br />
<br />
{|class="wikitable sortable" cellpadding="3" cellspacing="0" border="1"<br />
<br />
|- bgcolor="#CDCDCD"<br />
! Resource<br />
! Type<br />
! Author<br />
! class="unsortable"|Brief description<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|''New resource''<br />
| <br />
| <br />
| ''People are encouraged to contribute''<br />
|}<br />
<br><br />
<br />
Return to [[RTE Knowledge Resources]]</div>DGiampiccolohttps://aclweb.org/aclwiki/index.php?title=RTE7_-_Call_for_Resources&diff=8778RTE7 - Call for Resources2011-04-04T15:50:10Z<p>DGiampiccolo: </p>
<hr />
<div>A study on the knowledge resources used in Textual Entailment was started with the ablation tests carried out on the RTE-5 Main task submissions, which gave a hint about the impact of knowledge resources on TE system performances. Given the interest raised by this new initiative, the investigation of knowledge resources, which in RTE-6 was extended to tools, will be also continued in the RTE-7 challenge.<p><br />
<br />
Making the resources available to be used by other systems has several advantages: <br />
<br />
* It helps improve the TE technology; <br />
* it offers an opportunity to further test and evaluate the resource.</p><br />
<br />
In order to help the research, all the participants are invited to contribute and share their own resources with the RTE community, adding them to the table below.<br />
<br />
<!-- ==New resources available!== --><br />
<br />
{|class="wikitable sortable" cellpadding="3" cellspacing="0" border="1"<br />
<br />
|- bgcolor="#CDCDCD"<br />
! Resource<br />
! Type<br />
! Author<br />
! class="unsortable"|Brief description<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|''New resource''<br />
| <br />
| <br />
| ''People are encouraged to contribute''<br />
|}<br />
<br><br />
<br />
Return to [[RTE Knowledge Resources]]</div>DGiampiccolohttps://aclweb.org/aclwiki/index.php?title=RTE_Knowledge_Resources&diff=8777RTE Knowledge Resources2011-04-04T15:49:08Z<p>DGiampiccolo: </p>
<hr />
<div>Knowledge resources have shown their relevance for applied semantic inference, and are extensively used by applied inference systems, such as those developed within the Textual Entailment framework.<br><br />
<br><br />
This page presents a list of the knowledge resources used by systems that have participated in the last RTE challenges. The first table lists the publicly available resources, the second one lists unpublished resources. Both tables are sortable by Resource name, type, author and number of users.<br><br />
<br><br />
RTE Participants are encouraged to add information about all kind of knowledge resources used, from standard existing resources (e.g. WordNet) to knowledge collections created for specific purposes, which can be made available to the community.<br><br />
<br><br />
__TOC__<br />
<br><br />
=== Call for Resources ===<br />
<br />
In order to help the research, all the participants are invited to contribute, sharing their own resources with the RTE community. <br />
<br><br />
Making the resources available to be used by other systems has several advantages. On the one hand, it helps improve the TE technology; on the other hand, it offers an opportunity to further test and evaluate the resource.<br />
<br><br />
<br><br />
*[[RTE6 - Call for Resources]]<br />
*[[RTE7 - Call for Resources]]<br />
<br><br />
=== Ablation Tests ===<br />
An ablation test consists of removing one module at a time from a system, and rerunning the system on the test set with the other modules, except the one tested. <br />
<br><br />
Ablation test are meant to help better understand the relevance of the knowledge resources used by RTE systems, and evaluate the contribution of each of them to the systems' performances. In fact, comparing the results achieved in the ablation tests to those obtained by the systems as a whole allows assessing the contribution given by each single resource.<br />
<br><br />
<br><br />
*[[RTE5 - Ablation Tests]]<br />
*[[RTE6 - Ablation Tests]]<br />
<br><br />
<br />
=== Publicly available Resources ===<br />
{|class="wikitable sortable" cellpadding="3" cellspacing="0" border="1"<br />
<br />
|- bgcolor="#CDCDCD"<br />
! width="80"|Resource<br />
! width="50"|Type<br />
! width="150"|Author<br />
! class="unsortable" width="300"|Brief description<br />
! width="45"|<small>PAST Users <ref name:"rtethree">RTE-3 data have been provided by participants by means of a questionnaire.</ref></small><br />
! width="45"|<small>RTE4 Users<ref name:"rtefour">RTE-4 data have been provided by participants and have been integrated with information extracted from the related proceedings.</ref></small><br />
! width="45"|<small>RTE5 Users<ref name:"rtefive">RTE-5 data have been provided by participants and have been integrated with information extracted from the related proceedings.</ref></small><br />
! width="45"|<small>RTE6 Users<ref name:"rtesix">RTE-6 data have been provided by participants and have been integrated with information extracted from the related proceedings.</ref></small><br />
! class="unsortable" width="30"|<small>Usage info</small><br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [[WordNet]]<br />
| Lexical DB<br />
| Princeton University<br />
| Lexical database of English nouns, verbs, adjectives and adverbs<br />
| style="text-align: center;"|3<br />
| style="text-align: center;"|21<br />
| style="text-align: center;"|18<br />
|style="text-align: center;"| 22<br />
| [[WordNet - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://xwn.hlt.utdallas.edu/index.html eXtended Wordnet]<br />
| Lexical DB<br />
| Human Language Technology Research Institute, University of Texas at Dallas<br />
| Extension of WordNet based on the exploitation of the information contained in WordNet definitional glosses: the glosses are syntactically parsed, transformed into logic forms and content words are semantically disambiguated. The Extended Wordnet is an ongoing project. <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|2<br />
|style="text-align: center;"| 0<br />
| [[eXtended WordNet - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://ai.stanford.edu/~rion/swn/ Augmented Wordnet]<br />
| Lexical DB<br />
| Stanford University<br />
| The resource is the result of the application of a learning algorithm for inducing semantic taxonomies from parsed text. The algorithm automatically acquires items of world knowledge, and uses these to produce significantly enhanced versions of WordNet (up to 40,000 synsets more).<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Augmented Wordnet - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://verbs.colorado.edu/~mpalmer/projects/verbnet.html Verbnet]<br />
| Lexical DB<br />
| University of Colorado Boulder<br />
| Lexicon for English verbs organized into classes extending Levin (1993) classes through refinement and addition of subclasses to achieve syntactic and semantic coherence among members of a class<br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Verbnet - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [[VerbOcean]]<br />
| Lexical DB<br />
| Information Sciences Institute, University of Southern California<br />
| Broad-coverage semantic network of verbs<br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|3<br />
| style="text-align: center;"|6<br />
|style="text-align: center;"| 7<br />
| [[VerbOcean - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://framenet.icsi.berkeley.edu/ FrameNet]<br />
| Lexical DB<br />
| ICSI (International Computer Science Institute) - Berkley University<br />
| Lexical resource for English words, based on frame semantics (valences) and supported by corpus evidence<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|2<br />
|style="text-align: center;"| 3<br />
| [[Framenet - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://nlp.cs.nyu.edu/meyers/NomBank.html NomBank]<br />
| Lexical DB<br />
| New York University<br />
| Lexical resource containing syntactic frames for nouns, extracted from annotated corpora <br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[NomBank Resource - RTE Users|Users]] <br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://verbs.colorado.edu/~mpalmer/projects/ace.html PropBank]<br />
| Lexical DB<br />
| University of Colorado Boulder <br />
| Lexical resource containing syntactic frames for verbs, extracted from annotated corpora <br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[PropBank Resource - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://nlp.cs.nyu.edu/nomlex/index.html Nomlex] Plus<br />
| Lexical DB<br />
| New York University<br />
| Dictionary of English nominalizations: it describes the allowed complements for a nominalization and relates the nominal complements to the arguments of the corresponding verb<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[Nomlex Plus - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.cs.ualberta.ca/~lindek/downloads.htm Dekang Lin’s Thesaurus]<br />
| Thesaurus<br />
| University of Alberta<br />
| Thesaurus automatically constructed using a parsed corpus, based on distributional similarity scores<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 2<br />
| [[Dekang Lin’s Thesaurus - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://icon.shef.ac.uk/Moby/mthes.html Grady Ward's Moby Thesaurus]<br />
| Thesaurus<br />
| University of Sheffield<br />
| Thesaurus containing 30,260 root words, with 2,520,264 synonyms and related terms. Grady Ward placed this thesaurus in the public domain in 1996.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Grady Ward's Moby Thesaurus - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://en.wikipedia.org/wiki/Roget%27s_Thesaurus Roget's Thesaurus]<br />
| Thesaurus<br />
| Peter Mark Roget (Electronic version distributed by University of Chicago)<br />
| Roget's Thesaurus is a widely-used English thesaurus, created by Dr. Peter Mark Roget in 1805. The original edition had 15,000 words, and each new edition has been larger. The electronic edition ([http://machaut.uchicago.edu/rogets version 1.02]) is made available by University of Chicago.<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Roget's Thesaurus - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.wikipedia.org/ Wikipedia]<br />
| Encyclopedia<br />
| <br />
| Free encyclopedia. Used for extraction of lexical-semantic rules (from its more structured parts), named entity recognition, geographical information etc.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|3<br />
| style="text-align: center;"|6<br />
|style="text-align: center;"| 6<br />
| [[Wikipedia - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.umbel.org/ Umbel]<br />
| Ontology<br />
| Structured Dynamics LLC, Coralville, IA<br />
| UMBEL stands for Upper Mapping and Binding Exchange Layer and is a lightweight ontology structure for relating Web content and data to a standard set of subject concepts<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Umbel - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.mpi-inf.mpg.de/yago-naga/yago/ YAGO]<br />
| Ontology<br />
| Max-Planck Institute for Informatics, Saarbrücken, Germany<br />
| Light-weight and extensible ontology. It contains more than 2 million entities and 20 million facts about these entities. The facts have been automatically extracted from Wikipedia and unified with WordNet.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[YAGO - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://dbpedia.org/About DBpedia]<br />
| Ontology<br />
| Open community project<br />
| DBpedia is a community effort to extract structured information from Wikipedia and to make this information available on the Web. The DBpedia knowledge base currently describes more than 2.9 million things in 91 different languages and consists of 479 million pieces of information.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[DBpedia - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [[DIRT Paraphrase Collection]]<br />
| Collection of paraphrases<br />
| University of Alberta<br />
| DIRT (Discovery of Inference Rules from Text) is both an algorithm and a resulting knowledge collection. The DIRT knowledge collection is the output of the DIRT algorithm over a 1GB set of newspaper text.<br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|4<br />
| style="text-align: center;"|3<br />
|style="text-align: center;"| 4<br />
| [[DIRT Paraphrase Collection - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [[TEASE]] Collection<br />
| Collection of Entailment Rules<br />
| Bar-Ilan University<br />
| Output of the TEASE algorithm <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[Tease Collection - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://badc.nerc.ac.uk/help/abbrevs.html BADC Acronym and Abbreviation List]<br />
| Word List<br />
| BADC (British Atmospheric Data Centre)<br />
| Acronym and Abbreviation List<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[BADC Acronym and Abbreviation List - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://clipdemos.umiacs.umd.edu/catvar/ Catvar The Categorial Variation Database (English)]<br />
| Word List<br />
| University of Maryland<br />
| A Categorial-Variation Database (or Catvar) is a database of clusters of uninflected words (lexemes) and their categorial (i.e. part-of-speech) variants. The database was developed for English using a combination of resources and algorithms, including the LCS Verb and Preposition Databases (Dorr 2001), the Brown Corpus section of the Penn Treebank (Marcus et al. 1994), an English morphological analysis lexicon developed for PC-Kimmo (ENGLEX) (Antworth 1990), WordNet1.6 (Fellbaum 1998), an English Verb-Noun list extracted from Nomlex (Macleod et al. 1998), a similar list extracted from LDOCE (Procter 1978) and the Porter stemmer (Porter 1980).<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 1<br />
| [[Categorial-Variation Database - RTE Users|Users]]<br />
<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.acronym-guide.com/ Acronym Guide]<br />
| Word List<br />
| Acronym-Guide.com <br />
| Acronym and Abbreviation Lists for English, branched in thematic directories <br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|3<br />
|style="text-align: center;"| 0<br />
| [[Acronym Guide - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC2006T13 Web1T 5-grams]<br />
| Word list<br />
| Linguistic Data Consortium, University of Pennsylvania; Google Inc.<br />
| Data set containing English word n-grams and their observed frequency counts. The n-gram counts were generated from approximately 1 trillion word tokens of text from publicly accessible Web pages<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[Web1T - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.coli.uni-saarland.de/%7Erwang/resources/RTE3_RTE4_NGD.zip Normalized Google Distance (RTE3&RTE4)]<br />
| Word Pair Co-occurrence<br />
| Saarland University<br />
| Co-occurrence of the word pairs in RTE3 and RTE4 using Normalized Google Distance (Cilibrasi and Vitanyi, 2004). The word pairs are all the possible combinations of content words in T and H. In practice, we used [http://m.www.yahoo.com/ Yahoo!] as the search engine.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Normalized Google Distance (RTE3&RTE4)- RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.coli.uni-saarland.de/%7Erwang/resources/RTE5_NGD.zip Normalized Google Distance (RTE5)]<br />
| Word Pair Co-occurrence<br />
| Saarland University<br />
| Co-occurrence of the word pairs in RTE3 and RTE4 using Normalized Google Distance (Cilibrasi and Vitanyi, 2004). The word pairs are all the possible combinations of content words in T and H. In practice, we used [http://m.www.yahoo.com/ Yahoo!] as the search engine.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Normalized Google Distance (RTE5)- RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://geonames.usgs.gov/index.html GNIS - Geographic Names Information System]<br />
| Gazetteer<br />
| USGS (United States Geological Survey)<br />
| Database containing the Federal and national standard toponyms for USA, associated areas and Antarctica<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[GNIS - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.geonames.org/ Geonames]<br />
| Gazetteer<br />
| <br />
| Database containing eight million geographical names. It is integrating geographical data such as names of places in various languages, elevation, population and others from various sources.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[Geonames - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://nlp.cs.nyu.edu/paraphrase/ Sekine's Paraphrase Database]<br />
| Collection of paraphrases<br />
| Department of Computer Science, New York University <br />
| Data-base created using Sekine's method, NOT cleaned up by human. It includes 19,975 sets of paraphrases with 191,572 phrases.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[Sekine's Paraphrase Database - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://research.microsoft.com/research/downloads/Details/607D14D9-20CD-47E3-85BC-A2F65CD28042/Details.aspx Microsoft Research Paraphrase Corpus]<br />
| Collection of paraphrases<br />
| Microsoft Research<br />
| Text file containing 5800 pairs of sentences which have been extracted from news sources on the web, along with human annotations indicating whether each pair captures a paraphrase/semantic equivalence relationship.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[Microsoft Research Paraphrase Corpus - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.cs.cornell.edu/~cristian/Without_a_doubt_-_Data.html Downward entailing operators] <br />
| Collection of entailing operators<br />
| Department of Computer Science, Cornell University, Ithaca NY<br />
| System output of an unsupervised algorithm recovering many Downward Entailing operators, like 'doubt'.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[Downward entailing operators - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://u.cs.biu.ac.il/~nlp/downloads/WikiRules.html WikiRules!]<br />
| Lexical Reference rule-base<br />
| Bar-Ilan University<br />
| Extraction of about 8 million lexical reference rules from the text body (first sentence) and from metadata (links, redirects, parentheses) of Wikipedia. Provides better performance than other automatically constructed resources and comparable performance to WordNet. Offers complementary knowledge to WordNet. <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
|style="text-align: center;"| 0<br />
| [[WikiRules! - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://www.cs.utexas.edu/users/pclark/dart/ DART]<br />
| Collection of "world knowledge" propositions<br />
| Boeing Research and Technology<br />
| 23 million tuples such as "airplanes can fly to airports", "rivers can flood" collected from abstracted parse trees.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[DART - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://cs.biu.ac.il/~nlp/download/FRED/latest/FRED.zip FRED]<br />
| FrameNet-derived entailment rule-base<br />
| Bar-Ilan University<br />
| This package contains the outputs of the FRED algorithm, an algorithm which extracts entailment rules from FrameNet.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[FRED - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://u.cs.biu.ac.il/~nlp/downloads/DIRECT.html DIRECT]<br />
| Directional Distributional Term-Similarity Resource<br />
| Bar-Ilan University<br />
| This is a resource of directional distributional term-similarity rules (mostly lexical entailment rules) automatically extracted using the inclusion relation as described in (Kotlerman et.al., ACL-09).<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[DIRECT - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://u.cs.biu.ac.il/~nlp/downloads/binaryDirt.html binaryDIRT]<br />
| Entailment rules between binary templates using DIRT algorithm<br />
| Bar-Ilan University<br />
| This resource contains entailment rules over binary templates learned over the Reuters corpus using<br />
the DIRT algorithm of Lin and Pantel.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[BinaryDIRT- RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://u.cs.biu.ac.il/~nlp/downloads/unaryBInc.html unaryBInc]<br />
| Entailment rules between unary templates using BInc algorithm<br />
| Bar-Ilan University<br />
| This resource contains entailment rules over unary templates learned over the Reuters corpus using<br />
the BInc algorithm of Szpektor and Dagan (2008).<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
|style="text-align: center;"| 0<br />
| [[unaryBInc- RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| ''New resource''<br />
| <br />
| <br />
| ''Participants are encouraged to contribute''<br />
| style="text-align: center;"| <br />
| style="text-align: center;"| <br />
| style="text-align: center;"| <br />
|style="text-align: center;"| <br />
| [[New Resource2 - RTE Users|Users]]<br />
<br />
|}<br />
<br><br />
<br><br />
<br />
=== Not available Resources ===<br />
The following table lists the unpublished resources used by RTE participants. Some of them have been developed by Users themselves specifically for RTE. Interested people may turn to authors to obtain further information.<br />
<br><br />
{|class="wikitable sortable" cellpadding="3" cellspacing="0" border="1"<br />
<br />
|- bgcolor="#CDCDCD"<br />
! width="80"|Resource<br />
! width="50"|Type<br />
! width="180"|Author<br />
! class="unsortable" width="500"|Brief description<br />
! width="30"|<small>PAST Users</small><br />
! width="30"|<small>RTE4 Users</small><br />
! width="30"|<small>RTE5 Users</small><br />
! width="30"|<small>RTE6 Users</small><br />
! class="unsortable" width="30"|<small>Usage info</small><br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.parc.com/ PARC] Polarity Lexicon<br />
| Lexical DB<br />
| PARC - Palo Alto Research Center<br />
| Verbs classification with respect to semantic polarity<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[Parc Polarity Lexicon - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| Gazetteer from [http://trec.nist.gov/ TREC]<br />
| Gazetteer<br />
| NIST - National Institute of Standards and Technology<br />
| Cities and other geographical names<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[Gazetteer from TREC - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| DFKI Geographic Ontology<br>''(to be released)''<br />
| Ontology<br />
| DFKI - German Research Center for Artificial Intelligence<br />
| Ontology containing geographic terms and two kinds of relations: the directional ''part-of'' relation, and the ''equal'' relation for synonyms and abbreviations of the same geographic area (e.g ''the United Kingdom'', ''the UK'', ''Great Britain'', etc.) <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[Geographic Ontology - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| ''Geo''<br />
| Collection of Entailment Rules<br />
| Bar-Ilan University; Tel-Aviv University<br />
| Meronymy entailment rules, based on TREC’s TIPSTER gazetteer.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[Geo - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| ''Regex''<br />
| Collection of Entailment rules <br />
| Bar-Ilan University; Tel-Aviv University<br />
| Small set of entailment rules based on regular expressions, intended to address lexical variability involving temporal phrases<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[Regex - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| Syntactic rule base <br>''(to be released)''<br />
| Collection of Entailment Rules<br />
| Bar-Ilan University; Tel-Aviv University<br />
| A manually-composed collection of entailment rules which define parse tree transformations. The rules cover generic syntactic phenomena such as appositions, conjunctions, passive, relative clause, etc. (Bar-Haim et al., AAAI-07) <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[Syntactic rule base - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| Polarity rule base <br>''(to be released)''<br />
| Collection of Entailment Rules<br />
| Bar-Ilan University; Tel-Aviv University<br />
| A manually-composed collection of entailment rules which detect predicates whose polarity is negative (e.g. didn't dance) or unknown (e.g. plans to dance). The rules capture diverse phenomena that affect polarity, e.g. verbal negation, modal verbs, conditionals, and certain verbs that induce negative or "unknown" polarity context. The latter were taken mainly from VerbNet. Extends a resource described in (Bar-Haim et al., AAAI-07) <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0 <br />
| style="text-align: center;"|0<br />
| [[Polarity rule base - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| Lexical-Syntactic rule base<br />
| Collection of Entailment Rules<br />
| Bar-Ilan University; Tel-Aviv University<br />
| Extract lexical-syntactic entailment rules for predicates (verbal and nominal), including argument mapping. The resource is based on WordNet, Nomlex-Plus and Unary DIRT (Szpektor and Dagan, Coling 08)<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[Lexical-Syntactic rule base - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| OPENU Collection<br />
| Collection of Entailment Rules and Patterns<br />
| Open University<br />
| Collections of rules, patterns etc. for RTE purpose, extracted from Reuter corpus parsed using Minipar.<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0 <br />
| [[OPENU Collection - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| ''Abbr''<br />
| Collection of rules for abbreviation<br />
| Bar-Ilan University; Tel-Aviv University<br />
| 2000 Abbreviation rules, extracted from [http://badc.nerc.ac.uk/help/abbrevs.html BADC] and [http://www.acronym-guide.com/ Acronym Guide]<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[Abbr - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| UAIC Negation_list<br />
| Negation rules<br />
| „Al. I. Cuza“ University, Iasi, Romania <br />
| List of negative terms and words (verbs, adjectives, nouns) affecting modality or factuality of a infinitive verb preceded by the particle "to" (e.g. "believe","necessary", "attempt")<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1 <br />
| style="text-align: center;"|0<br />
| [[UAIC Negation_list - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| DLSIUAES Negation_list<br />
| List of negative terms<br />
| University of Alicante<br />
| Basic list of negative terms.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0 <br />
| [[DLSIUAES Negation_list - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| UAIC Quantifier_list<br />
| List of quantifiers<br />
| „Al. I. Cuza“ University, Iasi, Romania <br />
| List of quantifiers affecting entailment judgment. The quantifiers are taken from a list which contains expressions like “more than”, “less than”, or words such as “over”, “under”, etc.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0 <br />
| [[UAIC Quantifier_list - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| FBKirst StopWord list<br />
| List of frequent words<br />
| FBK-Irst;<br/>University of Trento - Italy <br />
| A list of the 572 most frequent English words.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0 <br />
| [[FBKirst StopWord list - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| IKOMA Dictionary of Named Entities Acronyms and Synonyms<br />
| Dictionary of Named Entities Acronyms and Synonyms<br />
| IKOMA; NEC Corporation, Takayama, Ikoma, Nara, Japan<br />
| Acronym dictionary constructed automatically from the corpus and a synonym dictionary that contains geographical terms.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| [[IKOMA2 - RTE Users|Users]]<br />
<br />
|}<br />
<br><br />
<br />
==Footnotes==<br />
<references/><br />
<br />
Return to: [[Textual_Entailment_Resource_Pool]]</div>DGiampiccolohttps://aclweb.org/aclwiki/index.php?title=RTE_Knowledge_Resources&diff=8072RTE Knowledge Resources2010-07-07T13:04:33Z<p>DGiampiccolo: </p>
<hr />
<div>Knowledge resources have shown their relevance for applied semantic inference, and are extensively used by applied inference systems, such as those developed within the Textual Entailment framework.<br><br />
<br><br />
This page presents a list of the knowledge resources used by systems that have participated in the last RTE challenges. The first table lists the publicly available resources, the second one lists unpublished resources. Both tables are sortable by Resource name, type, author and number of users.<br><br />
<br><br />
RTE Participants are encouraged to add information about all kind of knowledge resources used, from standard existing resources (e.g. WordNet) to knowledge collections created for specific purposes, which can be made available to the community.<br><br />
<br><br />
__TOC__<br />
<br><br />
=== Call for Resources ===<br />
<br />
In order to help the research, all the participants are invited to contribute, sharing their own resources with the RTE community. <br />
<br><br />
Making the resources available to be used by other systems has several advantages. On the one hand, it helps improve the TE technology; on the other hand, it offers an opportunity to further test and evaluate the resource.<br />
<br><br />
<br><br />
[[RTE6 - Call for Resources]]<br />
<br><br />
=== Ablation Tests ===<br />
An ablation test consists of removing one module at a time from a system, and rerunning the system on the test set with the other modules, except the one tested. <br />
<br><br />
Ablation test are meant to help better understand the relevance of the knowledge resources used by RTE systems, and evaluate the contribution of each of them to the systems' performances. In fact, comparing the results achieved in the ablation tests to those obtained by the systems as a whole allows assessing the contribution given by each single resource.<br />
<br><br />
<br><br />
[[RTE5 - Ablation Tests]]<br />
<br><br />
<br />
=== Publicly available Resources ===<br />
{|class="wikitable sortable" cellpadding="3" cellspacing="0" border="1"<br />
<br />
|- bgcolor="#CDCDCD"<br />
! width="80"|Resource<br />
! width="50"|Type<br />
! width="150"|Author<br />
! class="unsortable" width="300"|Brief description<br />
! width="45"|<small>PAST Users <ref name:"rtethree">RTE-3 data have been provided by participants by means of a questionnaire.</ref></small><br />
! width="45"|<small>RTE4 Users<ref name:"rtefour">RTE-4 data have been provided by participants and have been integrated with information extracted from the related proceedings.</ref></small><br />
! width="45"|<small>RTE5 Users<ref name:"rtefive">RTE-5 data have been provided by participants and have been integrated with information extracted from the related proceedings.</ref></small><br />
! class="unsortable" width="30"|<small>Usage info</small><br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [[WordNet]]<br />
| Lexical DB<br />
| Princeton University<br />
| Lexical database of English nouns, verbs, adjectives and adverbs<br />
| style="text-align: center;"|3<br />
| style="text-align: center;"|21<br />
| style="text-align: center;"|18<br />
| [[WordNet - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://xwn.hlt.utdallas.edu/index.html eXtended Wordnet]<br />
| Lexical DB<br />
| Human Language Technology Research Institute, University of Texas at Dallas<br />
| Extension of WordNet based on the exploitation of the information contained in WordNet definitional glosses: the glosses are syntactically parsed, transformed into logic forms and content words are semantically disambiguated. The Extended Wordnet is an ongoing project. <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|2<br />
| [[eXtended WordNet - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://ai.stanford.edu/~rion/swn/ Augmented Wordnet]<br />
| Lexical DB<br />
| Stanford University<br />
| The resource is the result of the application of a learning algorithm for inducing semantic taxonomies from parsed text. The algorithm automatically acquires items of world knowledge, and uses these to produce significantly enhanced versions of WordNet (up to 40,000 synsets more).<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| [[Augmented Wordnet - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://verbs.colorado.edu/~mpalmer/projects/verbnet.html Verbnet]<br />
| Lexical DB<br />
| University of Colorado Boulder<br />
| Lexicon for English verbs organized into classes extending Levin (1993) classes through refinement and addition of subclasses to achieve syntactic and semantic coherence among members of a class<br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|1<br />
| [[Verbnet - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [[VerbOcean]]<br />
| Lexical DB<br />
| Information Sciences Institute, University of Southern California<br />
| Broad-coverage semantic network of verbs<br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|3<br />
| style="text-align: center;"|6<br />
| [[VerbOcean - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://framenet.icsi.berkeley.edu/ FrameNet]<br />
| Lexical DB<br />
| ICSI (International Computer Science Institute) - Berkley University<br />
| Lexical resource for English words, based on frame semantics (valences) and supported by corpus evidence<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|2<br />
| [[Framenet - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://nlp.cs.nyu.edu/meyers/NomBank.html NomBank]<br />
| Lexical DB<br />
| New York University<br />
| Lexical resource containing syntactic frames for nouns, extracted from annotated corpora <br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[NomBank Resource - RTE Users|Users]] <br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://verbs.colorado.edu/~mpalmer/projects/ace.html PropBank]<br />
| Lexical DB<br />
| University of Colorado Boulder <br />
| Lexical resource containing syntactic frames for verbs, extracted from annotated corpora <br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
| [[PropBank Resource - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://nlp.cs.nyu.edu/nomlex/index.html Nomlex] Plus<br />
| Lexical DB<br />
| New York University<br />
| Dictionary of English nominalizations: it describes the allowed complements for a nominalization and relates the nominal complements to the arguments of the corresponding verb<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[Nomlex Plus - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.cs.ualberta.ca/~lindek/downloads.htm Dekang Lin’s Thesaurus]<br />
| Thesaurus<br />
| University of Alberta<br />
| Thesaurus automatically constructed using a parsed corpus, based on distributional similarity scores<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
| [[Dekang Lin’s Thesaurus - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://icon.shef.ac.uk/Moby/mthes.html Grady Ward's Moby Thesaurus]<br />
| Thesaurus<br />
| University of Sheffield<br />
| Thesaurus containing 30,260 root words, with 2,520,264 synonyms and related terms. Grady Ward placed this thesaurus in the public domain in 1996.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| [[Grady Ward's Moby Thesaurus - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://en.wikipedia.org/wiki/Roget%27s_Thesaurus Roget's Thesaurus]<br />
| Thesaurus<br />
| Peter Mark Roget (Electronic version distributed by University of Chicago)<br />
| Roget's Thesaurus is a widely-used English thesaurus, created by Dr. Peter Mark Roget in 1805. The original edition had 15,000 words, and each new edition has been larger. The electronic edition ([http://machaut.uchicago.edu/rogets version 1.02]) is made available by University of Chicago.<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| [[Roget's Thesaurus - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.wikipedia.org/ Wikipedia]<br />
| Encyclopedia<br />
| <br />
| Free encyclopedia. Used for extraction of lexical-semantic rules (from its more structured parts), named entity recognition, geographical information etc.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|3<br />
| style="text-align: center;"|6<br />
| [[Wikipedia - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.umbel.org/ Umbel]<br />
| Ontology<br />
| Structured Dynamics LLC, Coralville, IA<br />
| UMBEL stands for Upper Mapping and Binding Exchange Layer and is a lightweight ontology structure for relating Web content and data to a standard set of subject concepts<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| [[Umbel - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.mpi-inf.mpg.de/yago-naga/yago/ YAGO]<br />
| Ontology<br />
| Max-Planck Institute for Informatics, Saarbrücken, Germany<br />
| Light-weight and extensible ontology. It contains more than 2 million entities and 20 million facts about these entities. The facts have been automatically extracted from Wikipedia and unified with WordNet.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| [[YAGO - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://dbpedia.org/About DBpedia]<br />
| Ontology<br />
| Open community project<br />
| DBpedia is a community effort to extract structured information from Wikipedia and to make this information available on the Web. The DBpedia knowledge base currently describes more than 2.9 million things in 91 different languages and consists of 479 million pieces of information.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| [[DBpedia - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [[DIRT Paraphrase Collection]]<br />
| Collection of paraphrases<br />
| University of Alberta<br />
| DIRT (Discovery of Inference Rules from Text) is both an algorithm and a resulting knowledge collection. The DIRT knowledge collection is the output of the DIRT algorithm over a 1GB set of newspaper text.<br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|4<br />
| style="text-align: center;"|3<br />
| [[DIRT Paraphrase Collection - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [[TEASE]] Collection<br />
| Collection of Entailment Rules<br />
| Bar-Ilan University<br />
| Output of the TEASE algorithm <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[Tease Collection - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://badc.nerc.ac.uk/help/abbrevs.html BADC Acronym and Abbreviation List]<br />
| Word List<br />
| BADC (British Atmospheric Data Centre)<br />
| Acronym and Abbreviation List<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
| [[BADC Acronym and Abbreviation List - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.acronym-guide.com/ Acronym Guide]<br />
| Word List<br />
| Acronym-Guide.com <br />
| Acronym and Abbreviation Lists for English, branched in thematic directories <br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|3<br />
| [[Acronym Guide - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC2006T13 Web1T 5-grams]<br />
| Word list<br />
| Linguistic Data Consortium, University of Pennsylvania; Google Inc.<br />
| Data set containing English word n-grams and their observed frequency counts. The n-gram counts were generated from approximately 1 trillion word tokens of text from publicly accessible Web pages<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[Web1T - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.coli.uni-saarland.de/%7Erwang/resources/RTE3_RTE4_NGD.zip Normalized Google Distance (RTE3&RTE4)]<br />
| Word Pair Co-occurrence<br />
| Saarland University<br />
| Co-occurrence of the word pairs in RTE3 and RTE4 using Normalized Google Distance (Cilibrasi and Vitanyi, 2004). The word pairs are all the possible combinations of content words in T and H. In practice, we used [http://m.www.yahoo.com/ Yahoo!] as the search engine.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| [[Normalized Google Distance (RTE3&RTE4)- RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.coli.uni-saarland.de/%7Erwang/resources/RTE5_NGD.zip Normalized Google Distance (RTE5)]<br />
| Word Pair Co-occurrence<br />
| Saarland University<br />
| Co-occurrence of the word pairs in RTE3 and RTE4 using Normalized Google Distance (Cilibrasi and Vitanyi, 2004). The word pairs are all the possible combinations of content words in T and H. In practice, we used [http://m.www.yahoo.com/ Yahoo!] as the search engine.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| [[Normalized Google Distance (RTE5)- RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://geonames.usgs.gov/index.html GNIS - Geographic Names Information System]<br />
| Gazetteer<br />
| USGS (United States Geological Survey)<br />
| Database containing the Federal and national standard toponyms for USA, associated areas and Antarctica<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[GNIS - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.geonames.org/ Geonames]<br />
| Gazetteer<br />
| <br />
| Database containing eight million geographical names. It is integrating geographical data such as names of places in various languages, elevation, population and others from various sources.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[Geonames - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://nlp.cs.nyu.edu/paraphrase/ Sekine's Paraphrase Database]<br />
| Collection of paraphrases<br />
| Department of Computer Science, New York University <br />
| Data-base created using Sekine's method, NOT cleaned up by human. It includes 19,975 sets of paraphrases with 191,572 phrases.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[Sekine's Paraphrase Database - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://research.microsoft.com/research/downloads/Details/607D14D9-20CD-47E3-85BC-A2F65CD28042/Details.aspx Microsoft Research Paraphrase Corpus]<br />
| Collection of paraphrases<br />
| Microsoft Research<br />
| Text file containing 5800 pairs of sentences which have been extracted from news sources on the web, along with human annotations indicating whether each pair captures a paraphrase/semantic equivalence relationship.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[Microsoft Research Paraphrase Corpus - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.cs.cornell.edu/~cristian/Without_a_doubt_-_Data.html Downward entailing operators] <br />
| Collection of entailing operators<br />
| Department of Computer Science, Cornell University, Ithaca NY<br />
| System output of an unsupervised algorithm recovering many Downward Entailing operators, like 'doubt'.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| [[Downward entailing operators - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://u.cs.biu.ac.il/~nlp/downloads/WikiRules.html WikiRules!]<br />
| Lexical Reference rule-base<br />
| Bar-Ilan University<br />
| Extraction of about 8 million lexical reference rules from the text body (first sentence) and from metadata (links, redirects, parentheses) of Wikipedia. Provides better performance than other automatically constructed resources and comparable performance to WordNet. Offers complementary knowledge to WordNet. <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
| [[WikiRules! - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://www.cs.utexas.edu/users/pclark/dart/ DART]<br />
| Collection of "world knowledge" propositions<br />
| Boeing Research and Technology<br />
| 23 million tuples such as "airplanes can fly to airports", "rivers can flood" collected from abstracted parse trees.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[DART - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://cs.biu.ac.il/~nlp/download/FRED/latest/FRED.zip FRED]<br />
| FrameNet-derived entailment rule-base<br />
| Bar-Ilan University<br />
| This package contains the outputs of the FRED algorithm, an algorithm which extracts entailment rules from FrameNet.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[FRED - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://u.cs.biu.ac.il/~nlp/downloads/DIRECT.html DIRECT]<br />
| Directional Distributional Term-Similarity Resource<br />
| Bar-Ilan University<br />
| This is a resource of directional distributional term-similarity rules (mostly lexical entailment rules) automatically extracted using the inclusion relation as described in (Kotlerman et.al., ACL-09).<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[DIRECT - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://u.cs.biu.ac.il/~nlp/downloads/binaryDirt.html binaryDIRT]<br />
| Entailment rules between binary templates using DIRT algorithm<br />
| Bar-Ilan University<br />
| This resource contains entailment rules over binary templates learned over the Reuters corpus using<br />
the DIRT algorithm of Lin and Pantel.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[BinaryDIRT- RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://u.cs.biu.ac.il/~nlp/downloads/unaryBInc.html unaryBInc]<br />
| Entailment rules between unary templates using BInc algorithm<br />
| Bar-Ilan University<br />
| This resource contains entailment rules over unary templates learned over the Reuters corpus using<br />
the BInc algorithm of Szpektor and Dagan (2008).<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[unaryBInc- RTE Users|Users]]<br />
<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| ''New resource''<br />
| <br />
| <br />
| ''Participants are encouraged to contribute''<br />
| style="text-align: center;"| <br />
| style="text-align: center;"| <br />
| style="text-align: center;"| <br />
| [[New Resource2 - RTE Users|Users]]<br />
<br />
|}<br />
<br><br />
<br><br />
<br />
=== Not available Resources ===<br />
The following table lists the unpublished resources used by RTE participants. Some of them have been developed by Users themselves specifically for RTE. Interested people may turn to authors to obtain further information.<br />
<br><br />
{|class="wikitable sortable" cellpadding="3" cellspacing="0" border="1"<br />
<br />
|- bgcolor="#CDCDCD"<br />
! width="80"|Resource<br />
! width="50"|Type<br />
! width="180"|Author<br />
! class="unsortable" width="500"|Brief description<br />
! width="30"|<small>PAST Users</small><br />
! width="30"|<small>RTE4 Users</small><br />
! width="30"|<small>RTE5 Users</small><br />
! class="unsortable" width="30"|<small>Usage info</small><br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.parc.com/ PARC] Polarity Lexicon<br />
| Lexical DB<br />
| PARC - Palo Alto Research Center<br />
| Verbs classification with respect to semantic polarity<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[Parc Polarity Lexicon - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| Gazetteer from [http://trec.nist.gov/ TREC]<br />
| Gazetteer<br />
| NIST - National Institute of Standards and Technology<br />
| Cities and other geographical names<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[Gazetteer from TREC - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| DFKI Geographic Ontology<br>''(to be released)''<br />
| Ontology<br />
| DFKI - German Research Center for Artificial Intelligence<br />
| Ontology containing geographic terms and two kinds of relations: the directional ''part-of'' relation, and the ''equal'' relation for synonyms and abbreviations of the same geographic area (e.g ''the United Kingdom'', ''the UK'', ''Great Britain'', etc.) <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[Geographic Ontology - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| ''Geo''<br />
| Collection of Entailment Rules<br />
| Bar-Ilan University; Tel-Aviv University<br />
| Meronymy entailment rules, based on TREC’s TIPSTER gazetteer.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| [[Geo - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| ''Regex''<br />
| Collection of Entailment rules <br />
| Bar-Ilan University; Tel-Aviv University<br />
| Small set of entailment rules based on regular expressions, intended to address lexical variability involving temporal phrases<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| [[Regex - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| Syntactic rule base <br>''(to be released)''<br />
| Collection of Entailment Rules<br />
| Bar-Ilan University; Tel-Aviv University<br />
| A manually-composed collection of entailment rules which define parse tree transformations. The rules cover generic syntactic phenomena such as appositions, conjunctions, passive, relative clause, etc. (Bar-Haim et al., AAAI-07) <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
| [[Syntactic rule base - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| Polarity rule base <br>''(to be released)''<br />
| Collection of Entailment Rules<br />
| Bar-Ilan University; Tel-Aviv University<br />
| A manually-composed collection of entailment rules which detect predicates whose polarity is negative (e.g. didn't dance) or unknown (e.g. plans to dance). The rules capture diverse phenomena that affect polarity, e.g. verbal negation, modal verbs, conditionals, and certain verbs that induce negative or "unknown" polarity context. The latter were taken mainly from VerbNet. Extends a resource described in (Bar-Haim et al., AAAI-07) <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0 <br />
| [[Polarity rule base - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| Lexical-Syntactic rule base<br />
| Collection of Entailment Rules<br />
| Bar-Ilan University; Tel-Aviv University<br />
| Extract lexical-syntactic entailment rules for predicates (verbal and nominal), including argument mapping. The resource is based on WordNet, Nomlex-Plus and Unary DIRT (Szpektor and Dagan, Coling 08)<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[Lexical-Syntactic rule base - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| OPENU Collection<br />
| Collection of Entailment Rules and Patterns<br />
| Open University<br />
| Collections of rules, patterns etc. for RTE purpose, extracted from Reuter corpus parsed using Minipar.<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0 <br />
| [[OPENU Collection - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| ''Abbr''<br />
| Collection of rules for abbreviation<br />
| Bar-Ilan University; Tel-Aviv University<br />
| 2000 Abbreviation rules, extracted from [http://badc.nerc.ac.uk/help/abbrevs.html BADC] and [http://www.acronym-guide.com/ Acronym Guide]<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| [[Abbr - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| UAIC Negation_list<br />
| Negation rules<br />
| „Al. I. Cuza“ University, Iasi, Romania <br />
| List of negative terms and words (verbs, adjectives, nouns) affecting modality or factuality of a infinitive verb preceded by the particle "to" (e.g. "believe","necessary", "attempt")<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1 <br />
| [[UAIC Negation_list - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| DLSIUAES Negation_list<br />
| List of negative terms<br />
| University of Alicante<br />
| Basic list of negative terms.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1 <br />
| [[DLSIUAES Negation_list - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| UAIC Quantifier_list<br />
| List of quantifiers<br />
| „Al. I. Cuza“ University, Iasi, Romania <br />
| List of quantifiers affecting entailment judgment. The quantifiers are taken from a list which contains expressions like “more than”, “less than”, or words such as “over”, “under”, etc.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1 <br />
| [[UAIC Quantifier_list - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| FBKirst StopWord list<br />
| List of frequent words<br />
| FBK-Irst;<br/>University of Trento - Italy <br />
| A list of the 572 most frequent English words.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1 <br />
| [[FBKirst StopWord list - RTE Users|Users]]<br />
<br />
|}<br />
<br><br />
<br />
==Footnotes==<br />
<references/></div>DGiampiccolohttps://aclweb.org/aclwiki/index.php?title=RTE_Knowledge_Resources&diff=8071RTE Knowledge Resources2010-07-07T13:00:51Z<p>DGiampiccolo: </p>
<hr />
<div>Knowledge resources have shown their relevance for applied semantic inference, and are extensively used by applied inference systems, such as those developed within the Textual Entailment framework.<br><br />
<br><br />
This page presents a list of the knowledge resources used by systems that have participated in the last RTE challenges. The first table lists the publicly available resources, the second one lists unpublished resources. Both tables are sortable by Resource name, type, author and number of users.<br><br />
<br><br />
RTE Participants are encouraged to add information about all kind of knowledge resources used, from standard existing resources (e.g. WordNet) to knowledge collections created for specific purposes, which can be made available to the community.<br><br />
<br><br />
__TOC__<br />
<br><br />
=== Call for Resources ===<br />
<br />
In order to help the research, all the participants are invited to contribute, sharing their own resources with the RTE community. Making the resources available to be used by other systems has several advantages. On the one hand, it helps improve the TE technology; on the other hand, it offers an opportunity to further test and evaluate the resource.<br />
<br><br />
<br><br />
[[RTE6 - Call for Resources]]<br />
<br><br />
=== Ablation Tests ===<br />
An ablation test consists of removing one module at a time from a system, and rerunning the system on the test set with the other modules, except the one tested. <br />
<br><br />
Ablation test are meant to help better understand the relevance of the knowledge resources used by RTE systems, and evaluate the contribution of each of them to the systems' performances. In fact, comparing the results achieved in the ablation tests to those obtained by the systems as a whole allows assessing the contribution given by each single resource.<br />
<br><br />
<br><br />
[[RTE5 - Ablation Tests]]<br />
<br><br />
<br />
=== Publicly available Resources ===<br />
{|class="wikitable sortable" cellpadding="3" cellspacing="0" border="1"<br />
<br />
|- bgcolor="#CDCDCD"<br />
! width="80"|Resource<br />
! width="50"|Type<br />
! width="150"|Author<br />
! class="unsortable" width="300"|Brief description<br />
! width="45"|<small>PAST Users <ref name:"rtethree">RTE-3 data have been provided by participants by means of a questionnaire.</ref></small><br />
! width="45"|<small>RTE4 Users<ref name:"rtefour">RTE-4 data have been provided by participants and have been integrated with information extracted from the related proceedings.</ref></small><br />
! width="45"|<small>RTE5 Users<ref name:"rtefive">RTE-5 data have been provided by participants and have been integrated with information extracted from the related proceedings.</ref></small><br />
! class="unsortable" width="30"|<small>Usage info</small><br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [[WordNet]]<br />
| Lexical DB<br />
| Princeton University<br />
| Lexical database of English nouns, verbs, adjectives and adverbs<br />
| style="text-align: center;"|3<br />
| style="text-align: center;"|21<br />
| style="text-align: center;"|18<br />
| [[WordNet - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://xwn.hlt.utdallas.edu/index.html eXtended Wordnet]<br />
| Lexical DB<br />
| Human Language Technology Research Institute, University of Texas at Dallas<br />
| Extension of WordNet based on the exploitation of the information contained in WordNet definitional glosses: the glosses are syntactically parsed, transformed into logic forms and content words are semantically disambiguated. The Extended Wordnet is an ongoing project. <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|2<br />
| [[eXtended WordNet - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://ai.stanford.edu/~rion/swn/ Augmented Wordnet]<br />
| Lexical DB<br />
| Stanford University<br />
| The resource is the result of the application of a learning algorithm for inducing semantic taxonomies from parsed text. The algorithm automatically acquires items of world knowledge, and uses these to produce significantly enhanced versions of WordNet (up to 40,000 synsets more).<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| [[Augmented Wordnet - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://verbs.colorado.edu/~mpalmer/projects/verbnet.html Verbnet]<br />
| Lexical DB<br />
| University of Colorado Boulder<br />
| Lexicon for English verbs organized into classes extending Levin (1993) classes through refinement and addition of subclasses to achieve syntactic and semantic coherence among members of a class<br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|1<br />
| [[Verbnet - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [[VerbOcean]]<br />
| Lexical DB<br />
| Information Sciences Institute, University of Southern California<br />
| Broad-coverage semantic network of verbs<br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|3<br />
| style="text-align: center;"|6<br />
| [[VerbOcean - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://framenet.icsi.berkeley.edu/ FrameNet]<br />
| Lexical DB<br />
| ICSI (International Computer Science Institute) - Berkley University<br />
| Lexical resource for English words, based on frame semantics (valences) and supported by corpus evidence<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|2<br />
| [[Framenet - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://nlp.cs.nyu.edu/meyers/NomBank.html NomBank]<br />
| Lexical DB<br />
| New York University<br />
| Lexical resource containing syntactic frames for nouns, extracted from annotated corpora <br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[NomBank Resource - RTE Users|Users]] <br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://verbs.colorado.edu/~mpalmer/projects/ace.html PropBank]<br />
| Lexical DB<br />
| University of Colorado Boulder <br />
| Lexical resource containing syntactic frames for verbs, extracted from annotated corpora <br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
| [[PropBank Resource - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://nlp.cs.nyu.edu/nomlex/index.html Nomlex] Plus<br />
| Lexical DB<br />
| New York University<br />
| Dictionary of English nominalizations: it describes the allowed complements for a nominalization and relates the nominal complements to the arguments of the corresponding verb<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[Nomlex Plus - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.cs.ualberta.ca/~lindek/downloads.htm Dekang Lin’s Thesaurus]<br />
| Thesaurus<br />
| University of Alberta<br />
| Thesaurus automatically constructed using a parsed corpus, based on distributional similarity scores<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
| [[Dekang Lin’s Thesaurus - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://icon.shef.ac.uk/Moby/mthes.html Grady Ward's Moby Thesaurus]<br />
| Thesaurus<br />
| University of Sheffield<br />
| Thesaurus containing 30,260 root words, with 2,520,264 synonyms and related terms. Grady Ward placed this thesaurus in the public domain in 1996.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| [[Grady Ward's Moby Thesaurus - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://en.wikipedia.org/wiki/Roget%27s_Thesaurus Roget's Thesaurus]<br />
| Thesaurus<br />
| Peter Mark Roget (Electronic version distributed by University of Chicago)<br />
| Roget's Thesaurus is a widely-used English thesaurus, created by Dr. Peter Mark Roget in 1805. The original edition had 15,000 words, and each new edition has been larger. The electronic edition ([http://machaut.uchicago.edu/rogets version 1.02]) is made available by University of Chicago.<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| [[Roget's Thesaurus - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.wikipedia.org/ Wikipedia]<br />
| Encyclopedia<br />
| <br />
| Free encyclopedia. Used for extraction of lexical-semantic rules (from its more structured parts), named entity recognition, geographical information etc.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|3<br />
| style="text-align: center;"|6<br />
| [[Wikipedia - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.umbel.org/ Umbel]<br />
| Ontology<br />
| Structured Dynamics LLC, Coralville, IA<br />
| UMBEL stands for Upper Mapping and Binding Exchange Layer and is a lightweight ontology structure for relating Web content and data to a standard set of subject concepts<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| [[Umbel - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.mpi-inf.mpg.de/yago-naga/yago/ YAGO]<br />
| Ontology<br />
| Max-Planck Institute for Informatics, Saarbrücken, Germany<br />
| Light-weight and extensible ontology. It contains more than 2 million entities and 20 million facts about these entities. The facts have been automatically extracted from Wikipedia and unified with WordNet.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| [[YAGO - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://dbpedia.org/About DBpedia]<br />
| Ontology<br />
| Open community project<br />
| DBpedia is a community effort to extract structured information from Wikipedia and to make this information available on the Web. The DBpedia knowledge base currently describes more than 2.9 million things in 91 different languages and consists of 479 million pieces of information.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| [[DBpedia - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [[DIRT Paraphrase Collection]]<br />
| Collection of paraphrases<br />
| University of Alberta<br />
| DIRT (Discovery of Inference Rules from Text) is both an algorithm and a resulting knowledge collection. The DIRT knowledge collection is the output of the DIRT algorithm over a 1GB set of newspaper text.<br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|4<br />
| style="text-align: center;"|3<br />
| [[DIRT Paraphrase Collection - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [[TEASE]] Collection<br />
| Collection of Entailment Rules<br />
| Bar-Ilan University<br />
| Output of the TEASE algorithm <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[Tease Collection - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://badc.nerc.ac.uk/help/abbrevs.html BADC Acronym and Abbreviation List]<br />
| Word List<br />
| BADC (British Atmospheric Data Centre)<br />
| Acronym and Abbreviation List<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
| [[BADC Acronym and Abbreviation List - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.acronym-guide.com/ Acronym Guide]<br />
| Word List<br />
| Acronym-Guide.com <br />
| Acronym and Abbreviation Lists for English, branched in thematic directories <br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|3<br />
| [[Acronym Guide - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC2006T13 Web1T 5-grams]<br />
| Word list<br />
| Linguistic Data Consortium, University of Pennsylvania; Google Inc.<br />
| Data set containing English word n-grams and their observed frequency counts. The n-gram counts were generated from approximately 1 trillion word tokens of text from publicly accessible Web pages<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[Web1T - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.coli.uni-saarland.de/%7Erwang/resources/RTE3_RTE4_NGD.zip Normalized Google Distance (RTE3&RTE4)]<br />
| Word Pair Co-occurrence<br />
| Saarland University<br />
| Co-occurrence of the word pairs in RTE3 and RTE4 using Normalized Google Distance (Cilibrasi and Vitanyi, 2004). The word pairs are all the possible combinations of content words in T and H. In practice, we used [http://m.www.yahoo.com/ Yahoo!] as the search engine.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| [[Normalized Google Distance (RTE3&RTE4)- RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.coli.uni-saarland.de/%7Erwang/resources/RTE5_NGD.zip Normalized Google Distance (RTE5)]<br />
| Word Pair Co-occurrence<br />
| Saarland University<br />
| Co-occurrence of the word pairs in RTE3 and RTE4 using Normalized Google Distance (Cilibrasi and Vitanyi, 2004). The word pairs are all the possible combinations of content words in T and H. In practice, we used [http://m.www.yahoo.com/ Yahoo!] as the search engine.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| [[Normalized Google Distance (RTE5)- RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://geonames.usgs.gov/index.html GNIS - Geographic Names Information System]<br />
| Gazetteer<br />
| USGS (United States Geological Survey)<br />
| Database containing the Federal and national standard toponyms for USA, associated areas and Antarctica<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[GNIS - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.geonames.org/ Geonames]<br />
| Gazetteer<br />
| <br />
| Database containing eight million geographical names. It is integrating geographical data such as names of places in various languages, elevation, population and others from various sources.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[Geonames - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://nlp.cs.nyu.edu/paraphrase/ Sekine's Paraphrase Database]<br />
| Collection of paraphrases<br />
| Department of Computer Science, New York University <br />
| Data-base created using Sekine's method, NOT cleaned up by human. It includes 19,975 sets of paraphrases with 191,572 phrases.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[Sekine's Paraphrase Database - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://research.microsoft.com/research/downloads/Details/607D14D9-20CD-47E3-85BC-A2F65CD28042/Details.aspx Microsoft Research Paraphrase Corpus]<br />
| Collection of paraphrases<br />
| Microsoft Research<br />
| Text file containing 5800 pairs of sentences which have been extracted from news sources on the web, along with human annotations indicating whether each pair captures a paraphrase/semantic equivalence relationship.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[Microsoft Research Paraphrase Corpus - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.cs.cornell.edu/~cristian/Without_a_doubt_-_Data.html Downward entailing operators] <br />
| Collection of entailing operators<br />
| Department of Computer Science, Cornell University, Ithaca NY<br />
| System output of an unsupervised algorithm recovering many Downward Entailing operators, like 'doubt'.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| [[Downward entailing operators - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://u.cs.biu.ac.il/~nlp/downloads/WikiRules.html WikiRules!]<br />
| Lexical Reference rule-base<br />
| Bar-Ilan University<br />
| Extraction of about 8 million lexical reference rules from the text body (first sentence) and from metadata (links, redirects, parentheses) of Wikipedia. Provides better performance than other automatically constructed resources and comparable performance to WordNet. Offers complementary knowledge to WordNet. <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
| [[WikiRules! - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://www.cs.utexas.edu/users/pclark/dart/ DART]<br />
| Collection of "world knowledge" propositions<br />
| Boeing Research and Technology<br />
| 23 million tuples such as "airplanes can fly to airports", "rivers can flood" collected from abstracted parse trees.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[DART - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://cs.biu.ac.il/~nlp/download/FRED/latest/FRED.zip FRED]<br />
| FrameNet-derived entailment rule-base<br />
| Bar-Ilan University<br />
| This package contains the outputs of the FRED algorithm, an algorithm which extracts entailment rules from FrameNet.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[FRED - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://u.cs.biu.ac.il/~nlp/downloads/DIRECT.html DIRECT]<br />
| Directional Distributional Term-Similarity Resource<br />
| Bar-Ilan University<br />
| This is a resource of directional distributional term-similarity rules (mostly lexical entailment rules) automatically extracted using the inclusion relation as described in (Kotlerman et.al., ACL-09).<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[DIRECT - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://u.cs.biu.ac.il/~nlp/downloads/binaryDirt.html binaryDIRT]<br />
| Entailment rules between binary templates using DIRT algorithm<br />
| Bar-Ilan University<br />
| This resource contains entailment rules over binary templates learned over the Reuters corpus using<br />
the DIRT algorithm of Lin and Pantel.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[BinaryDIRT- RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://u.cs.biu.ac.il/~nlp/downloads/unaryBInc.html unaryBInc]<br />
| Entailment rules between unary templates using BInc algorithm<br />
| Bar-Ilan University<br />
| This resource contains entailment rules over unary templates learned over the Reuters corpus using<br />
the BInc algorithm of Szpektor and Dagan (2008).<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[unaryBInc- RTE Users|Users]]<br />
<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| ''New resource''<br />
| <br />
| <br />
| ''Participants are encouraged to contribute''<br />
| style="text-align: center;"| <br />
| style="text-align: center;"| <br />
| style="text-align: center;"| <br />
| [[New Resource2 - RTE Users|Users]]<br />
<br />
|}<br />
<br><br />
<br><br />
<br />
=== Not available Resources ===<br />
The following table lists the unpublished resources used by RTE participants. Some of them have been developed by Users themselves specifically for RTE. Interested people may turn to authors to obtain further information.<br />
<br><br />
{|class="wikitable sortable" cellpadding="3" cellspacing="0" border="1"<br />
<br />
|- bgcolor="#CDCDCD"<br />
! width="80"|Resource<br />
! width="50"|Type<br />
! width="180"|Author<br />
! class="unsortable" width="500"|Brief description<br />
! width="30"|<small>PAST Users</small><br />
! width="30"|<small>RTE4 Users</small><br />
! width="30"|<small>RTE5 Users</small><br />
! class="unsortable" width="30"|<small>Usage info</small><br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.parc.com/ PARC] Polarity Lexicon<br />
| Lexical DB<br />
| PARC - Palo Alto Research Center<br />
| Verbs classification with respect to semantic polarity<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[Parc Polarity Lexicon - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| Gazetteer from [http://trec.nist.gov/ TREC]<br />
| Gazetteer<br />
| NIST - National Institute of Standards and Technology<br />
| Cities and other geographical names<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[Gazetteer from TREC - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| DFKI Geographic Ontology<br>''(to be released)''<br />
| Ontology<br />
| DFKI - German Research Center for Artificial Intelligence<br />
| Ontology containing geographic terms and two kinds of relations: the directional ''part-of'' relation, and the ''equal'' relation for synonyms and abbreviations of the same geographic area (e.g ''the United Kingdom'', ''the UK'', ''Great Britain'', etc.) <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[Geographic Ontology - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| ''Geo''<br />
| Collection of Entailment Rules<br />
| Bar-Ilan University; Tel-Aviv University<br />
| Meronymy entailment rules, based on TREC’s TIPSTER gazetteer.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| [[Geo - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| ''Regex''<br />
| Collection of Entailment rules <br />
| Bar-Ilan University; Tel-Aviv University<br />
| Small set of entailment rules based on regular expressions, intended to address lexical variability involving temporal phrases<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| [[Regex - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| Syntactic rule base <br>''(to be released)''<br />
| Collection of Entailment Rules<br />
| Bar-Ilan University; Tel-Aviv University<br />
| A manually-composed collection of entailment rules which define parse tree transformations. The rules cover generic syntactic phenomena such as appositions, conjunctions, passive, relative clause, etc. (Bar-Haim et al., AAAI-07) <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
| [[Syntactic rule base - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| Polarity rule base <br>''(to be released)''<br />
| Collection of Entailment Rules<br />
| Bar-Ilan University; Tel-Aviv University<br />
| A manually-composed collection of entailment rules which detect predicates whose polarity is negative (e.g. didn't dance) or unknown (e.g. plans to dance). The rules capture diverse phenomena that affect polarity, e.g. verbal negation, modal verbs, conditionals, and certain verbs that induce negative or "unknown" polarity context. The latter were taken mainly from VerbNet. Extends a resource described in (Bar-Haim et al., AAAI-07) <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0 <br />
| [[Polarity rule base - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| Lexical-Syntactic rule base<br />
| Collection of Entailment Rules<br />
| Bar-Ilan University; Tel-Aviv University<br />
| Extract lexical-syntactic entailment rules for predicates (verbal and nominal), including argument mapping. The resource is based on WordNet, Nomlex-Plus and Unary DIRT (Szpektor and Dagan, Coling 08)<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[Lexical-Syntactic rule base - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| OPENU Collection<br />
| Collection of Entailment Rules and Patterns<br />
| Open University<br />
| Collections of rules, patterns etc. for RTE purpose, extracted from Reuter corpus parsed using Minipar.<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0 <br />
| [[OPENU Collection - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| ''Abbr''<br />
| Collection of rules for abbreviation<br />
| Bar-Ilan University; Tel-Aviv University<br />
| 2000 Abbreviation rules, extracted from [http://badc.nerc.ac.uk/help/abbrevs.html BADC] and [http://www.acronym-guide.com/ Acronym Guide]<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| [[Abbr - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| UAIC Negation_list<br />
| Negation rules<br />
| „Al. I. Cuza“ University, Iasi, Romania <br />
| List of negative terms and words (verbs, adjectives, nouns) affecting modality or factuality of a infinitive verb preceded by the particle "to" (e.g. "believe","necessary", "attempt")<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1 <br />
| [[UAIC Negation_list - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| DLSIUAES Negation_list<br />
| List of negative terms<br />
| University of Alicante<br />
| Basic list of negative terms.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1 <br />
| [[DLSIUAES Negation_list - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| UAIC Quantifier_list<br />
| List of quantifiers<br />
| „Al. I. Cuza“ University, Iasi, Romania <br />
| List of quantifiers affecting entailment judgment. The quantifiers are taken from a list which contains expressions like “more than”, “less than”, or words such as “over”, “under”, etc.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1 <br />
| [[UAIC Quantifier_list - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| FBKirst StopWord list<br />
| List of frequent words<br />
| FBK-Irst;<br/>University of Trento - Italy <br />
| A list of the 572 most frequent English words.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1 <br />
| [[FBKirst StopWord list - RTE Users|Users]]<br />
<br />
|}<br />
<br><br />
<br />
==Footnotes==<br />
<references/></div>DGiampiccolohttps://aclweb.org/aclwiki/index.php?title=RTE_Knowledge_Resources&diff=8070RTE Knowledge Resources2010-07-07T12:27:11Z<p>DGiampiccolo: </p>
<hr />
<div>Knowledge resources have shown their relevance for applied semantic inference, and are extensively used by applied inference systems, such as those developed within the Textual Entailment framework.<br><br />
<br><br />
This page presents a list of the knowledge resources used by systems that have participated in the last RTE challenges. The first table lists the publicly available resources, the second one lists unpublished resources. Both tables are sortable by Resource name, type, author and number of users.<br><br />
<br><br />
RTE Participants are encouraged to add information about all kind of knowledge resources used, from standard existing resources (e.g. WordNet) to knowledge collections created for specific purposes, which can be made available to the community.<br><br />
<br><br />
__TOC__<br />
<br><br />
=== Call for Resources ===<br />
[[RTE6 - Call for Resources]]<br />
<br><br />
In order to help the research, all the participants are invited to contribute, sharing their own resources with the RTE community. Making the resources available to be used by other systems has several advantages. On the one hand, it helps improve the TE technology; on the other hand, it offers an opportunity to further test and evaluate the resource.<br />
<br><br />
=== Ablation Tests ===<br />
[[RTE5 - Ablation Tests]]<br />
<br><br />
<br />
=== Publicly available Resources ===<br />
{|class="wikitable sortable" cellpadding="3" cellspacing="0" border="1"<br />
<br />
|- bgcolor="#CDCDCD"<br />
! width="80"|Resource<br />
! width="50"|Type<br />
! width="150"|Author<br />
! class="unsortable" width="300"|Brief description<br />
! width="45"|<small>PAST Users <ref name:"rtethree">RTE-3 data have been provided by participants by means of a questionnaire.</ref></small><br />
! width="45"|<small>RTE4 Users<ref name:"rtefour">RTE-4 data have been provided by participants and have been integrated with information extracted from the related proceedings.</ref></small><br />
! width="45"|<small>RTE5 Users<ref name:"rtefive">RTE-5 data have been provided by participants and have been integrated with information extracted from the related proceedings.</ref></small><br />
! class="unsortable" width="30"|<small>Usage info</small><br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [[WordNet]]<br />
| Lexical DB<br />
| Princeton University<br />
| Lexical database of English nouns, verbs, adjectives and adverbs<br />
| style="text-align: center;"|3<br />
| style="text-align: center;"|21<br />
| style="text-align: center;"|18<br />
| [[WordNet - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://xwn.hlt.utdallas.edu/index.html eXtended Wordnet]<br />
| Lexical DB<br />
| Human Language Technology Research Institute, University of Texas at Dallas<br />
| Extension of WordNet based on the exploitation of the information contained in WordNet definitional glosses: the glosses are syntactically parsed, transformed into logic forms and content words are semantically disambiguated. The Extended Wordnet is an ongoing project. <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|2<br />
| [[eXtended WordNet - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://ai.stanford.edu/~rion/swn/ Augmented Wordnet]<br />
| Lexical DB<br />
| Stanford University<br />
| The resource is the result of the application of a learning algorithm for inducing semantic taxonomies from parsed text. The algorithm automatically acquires items of world knowledge, and uses these to produce significantly enhanced versions of WordNet (up to 40,000 synsets more).<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| [[Augmented Wordnet - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://verbs.colorado.edu/~mpalmer/projects/verbnet.html Verbnet]<br />
| Lexical DB<br />
| University of Colorado Boulder<br />
| Lexicon for English verbs organized into classes extending Levin (1993) classes through refinement and addition of subclasses to achieve syntactic and semantic coherence among members of a class<br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|1<br />
| [[Verbnet - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [[VerbOcean]]<br />
| Lexical DB<br />
| Information Sciences Institute, University of Southern California<br />
| Broad-coverage semantic network of verbs<br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|3<br />
| style="text-align: center;"|6<br />
| [[VerbOcean - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://framenet.icsi.berkeley.edu/ FrameNet]<br />
| Lexical DB<br />
| ICSI (International Computer Science Institute) - Berkley University<br />
| Lexical resource for English words, based on frame semantics (valences) and supported by corpus evidence<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|2<br />
| [[Framenet - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://nlp.cs.nyu.edu/meyers/NomBank.html NomBank]<br />
| Lexical DB<br />
| New York University<br />
| Lexical resource containing syntactic frames for nouns, extracted from annotated corpora <br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[NomBank Resource - RTE Users|Users]] <br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://verbs.colorado.edu/~mpalmer/projects/ace.html PropBank]<br />
| Lexical DB<br />
| University of Colorado Boulder <br />
| Lexical resource containing syntactic frames for verbs, extracted from annotated corpora <br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
| [[PropBank Resource - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://nlp.cs.nyu.edu/nomlex/index.html Nomlex] Plus<br />
| Lexical DB<br />
| New York University<br />
| Dictionary of English nominalizations: it describes the allowed complements for a nominalization and relates the nominal complements to the arguments of the corresponding verb<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[Nomlex Plus - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.cs.ualberta.ca/~lindek/downloads.htm Dekang Lin’s Thesaurus]<br />
| Thesaurus<br />
| University of Alberta<br />
| Thesaurus automatically constructed using a parsed corpus, based on distributional similarity scores<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
| [[Dekang Lin’s Thesaurus - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://icon.shef.ac.uk/Moby/mthes.html Grady Ward's Moby Thesaurus]<br />
| Thesaurus<br />
| University of Sheffield<br />
| Thesaurus containing 30,260 root words, with 2,520,264 synonyms and related terms. Grady Ward placed this thesaurus in the public domain in 1996.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| [[Grady Ward's Moby Thesaurus - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://en.wikipedia.org/wiki/Roget%27s_Thesaurus Roget's Thesaurus]<br />
| Thesaurus<br />
| Peter Mark Roget (Electronic version distributed by University of Chicago)<br />
| Roget's Thesaurus is a widely-used English thesaurus, created by Dr. Peter Mark Roget in 1805. The original edition had 15,000 words, and each new edition has been larger. The electronic edition ([http://machaut.uchicago.edu/rogets version 1.02]) is made available by University of Chicago.<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| [[Roget's Thesaurus - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.wikipedia.org/ Wikipedia]<br />
| Encyclopedia<br />
| <br />
| Free encyclopedia. Used for extraction of lexical-semantic rules (from its more structured parts), named entity recognition, geographical information etc.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|3<br />
| style="text-align: center;"|6<br />
| [[Wikipedia - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.umbel.org/ Umbel]<br />
| Ontology<br />
| Structured Dynamics LLC, Coralville, IA<br />
| UMBEL stands for Upper Mapping and Binding Exchange Layer and is a lightweight ontology structure for relating Web content and data to a standard set of subject concepts<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| [[Umbel - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.mpi-inf.mpg.de/yago-naga/yago/ YAGO]<br />
| Ontology<br />
| Max-Planck Institute for Informatics, Saarbrücken, Germany<br />
| Light-weight and extensible ontology. It contains more than 2 million entities and 20 million facts about these entities. The facts have been automatically extracted from Wikipedia and unified with WordNet.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| [[YAGO - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://dbpedia.org/About DBpedia]<br />
| Ontology<br />
| Open community project<br />
| DBpedia is a community effort to extract structured information from Wikipedia and to make this information available on the Web. The DBpedia knowledge base currently describes more than 2.9 million things in 91 different languages and consists of 479 million pieces of information.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| [[DBpedia - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [[DIRT Paraphrase Collection]]<br />
| Collection of paraphrases<br />
| University of Alberta<br />
| DIRT (Discovery of Inference Rules from Text) is both an algorithm and a resulting knowledge collection. The DIRT knowledge collection is the output of the DIRT algorithm over a 1GB set of newspaper text.<br />
| style="text-align: center;"|2<br />
| style="text-align: center;"|4<br />
| style="text-align: center;"|3<br />
| [[DIRT Paraphrase Collection - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [[TEASE]] Collection<br />
| Collection of Entailment Rules<br />
| Bar-Ilan University<br />
| Output of the TEASE algorithm <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[Tease Collection - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://badc.nerc.ac.uk/help/abbrevs.html BADC Acronym and Abbreviation List]<br />
| Word List<br />
| BADC (British Atmospheric Data Centre)<br />
| Acronym and Abbreviation List<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
| [[BADC Acronym and Abbreviation List - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.acronym-guide.com/ Acronym Guide]<br />
| Word List<br />
| Acronym-Guide.com <br />
| Acronym and Abbreviation Lists for English, branched in thematic directories <br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|3<br />
| [[Acronym Guide - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC2006T13 Web1T 5-grams]<br />
| Word list<br />
| Linguistic Data Consortium, University of Pennsylvania; Google Inc.<br />
| Data set containing English word n-grams and their observed frequency counts. The n-gram counts were generated from approximately 1 trillion word tokens of text from publicly accessible Web pages<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[Web1T - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.coli.uni-saarland.de/%7Erwang/resources/RTE3_RTE4_NGD.zip Normalized Google Distance (RTE3&RTE4)]<br />
| Word Pair Co-occurrence<br />
| Saarland University<br />
| Co-occurrence of the word pairs in RTE3 and RTE4 using Normalized Google Distance (Cilibrasi and Vitanyi, 2004). The word pairs are all the possible combinations of content words in T and H. In practice, we used [http://m.www.yahoo.com/ Yahoo!] as the search engine.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| [[Normalized Google Distance (RTE3&RTE4)- RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.coli.uni-saarland.de/%7Erwang/resources/RTE5_NGD.zip Normalized Google Distance (RTE5)]<br />
| Word Pair Co-occurrence<br />
| Saarland University<br />
| Co-occurrence of the word pairs in RTE3 and RTE4 using Normalized Google Distance (Cilibrasi and Vitanyi, 2004). The word pairs are all the possible combinations of content words in T and H. In practice, we used [http://m.www.yahoo.com/ Yahoo!] as the search engine.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| [[Normalized Google Distance (RTE5)- RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://geonames.usgs.gov/index.html GNIS - Geographic Names Information System]<br />
| Gazetteer<br />
| USGS (United States Geological Survey)<br />
| Database containing the Federal and national standard toponyms for USA, associated areas and Antarctica<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[GNIS - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.geonames.org/ Geonames]<br />
| Gazetteer<br />
| <br />
| Database containing eight million geographical names. It is integrating geographical data such as names of places in various languages, elevation, population and others from various sources.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[Geonames - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://nlp.cs.nyu.edu/paraphrase/ Sekine's Paraphrase Database]<br />
| Collection of paraphrases<br />
| Department of Computer Science, New York University <br />
| Data-base created using Sekine's method, NOT cleaned up by human. It includes 19,975 sets of paraphrases with 191,572 phrases.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[Sekine's Paraphrase Database - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://research.microsoft.com/research/downloads/Details/607D14D9-20CD-47E3-85BC-A2F65CD28042/Details.aspx Microsoft Research Paraphrase Corpus]<br />
| Collection of paraphrases<br />
| Microsoft Research<br />
| Text file containing 5800 pairs of sentences which have been extracted from news sources on the web, along with human annotations indicating whether each pair captures a paraphrase/semantic equivalence relationship.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[Microsoft Research Paraphrase Corpus - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.cs.cornell.edu/~cristian/Without_a_doubt_-_Data.html Downward entailing operators] <br />
| Collection of entailing operators<br />
| Department of Computer Science, Cornell University, Ithaca NY<br />
| System output of an unsupervised algorithm recovering many Downward Entailing operators, like 'doubt'.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| [[Downward entailing operators - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://u.cs.biu.ac.il/~nlp/downloads/WikiRules.html WikiRules!]<br />
| Lexical Reference rule-base<br />
| Bar-Ilan University<br />
| Extraction of about 8 million lexical reference rules from the text body (first sentence) and from metadata (links, redirects, parentheses) of Wikipedia. Provides better performance than other automatically constructed resources and comparable performance to WordNet. Offers complementary knowledge to WordNet. <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
| [[WikiRules! - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://www.cs.utexas.edu/users/pclark/dart/ DART]<br />
| Collection of "world knowledge" propositions<br />
| Boeing Research and Technology<br />
| 23 million tuples such as "airplanes can fly to airports", "rivers can flood" collected from abstracted parse trees.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[DART - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://cs.biu.ac.il/~nlp/download/FRED/latest/FRED.zip FRED]<br />
| FrameNet-derived entailment rule-base<br />
| Bar-Ilan University<br />
| This package contains the outputs of the FRED algorithm, an algorithm which extracts entailment rules from FrameNet.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[FRED - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://u.cs.biu.ac.il/~nlp/downloads/DIRECT.html DIRECT]<br />
| Directional Distributional Term-Similarity Resource<br />
| Bar-Ilan University<br />
| This is a resource of directional distributional term-similarity rules (mostly lexical entailment rules) automatically extracted using the inclusion relation as described in (Kotlerman et.al., ACL-09).<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[DIRECT - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://u.cs.biu.ac.il/~nlp/downloads/binaryDirt.html binaryDIRT]<br />
| Entailment rules between binary templates using DIRT algorithm<br />
| Bar-Ilan University<br />
| This resource contains entailment rules over binary templates learned over the Reuters corpus using<br />
the DIRT algorithm of Lin and Pantel.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[BinaryDIRT- RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
|[http://u.cs.biu.ac.il/~nlp/downloads/unaryBInc.html unaryBInc]<br />
| Entailment rules between unary templates using BInc algorithm<br />
| Bar-Ilan University<br />
| This resource contains entailment rules over unary templates learned over the Reuters corpus using<br />
the BInc algorithm of Szpektor and Dagan (2008).<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[unaryBInc- RTE Users|Users]]<br />
<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| ''New resource''<br />
| <br />
| <br />
| ''Participants are encouraged to contribute''<br />
| style="text-align: center;"| <br />
| style="text-align: center;"| <br />
| style="text-align: center;"| <br />
| [[New Resource2 - RTE Users|Users]]<br />
<br />
|}<br />
<br><br />
<br><br />
<br />
=== Not available Resources ===<br />
The following table lists the unpublished resources used by RTE participants. Some of them have been developed by Users themselves specifically for RTE. Interested people may turn to authors to obtain further information.<br />
<br><br />
{|class="wikitable sortable" cellpadding="3" cellspacing="0" border="1"<br />
<br />
|- bgcolor="#CDCDCD"<br />
! width="80"|Resource<br />
! width="50"|Type<br />
! width="180"|Author<br />
! class="unsortable" width="500"|Brief description<br />
! width="30"|<small>PAST Users</small><br />
! width="30"|<small>RTE4 Users</small><br />
! width="30"|<small>RTE5 Users</small><br />
! class="unsortable" width="30"|<small>Usage info</small><br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| [http://www.parc.com/ PARC] Polarity Lexicon<br />
| Lexical DB<br />
| PARC - Palo Alto Research Center<br />
| Verbs classification with respect to semantic polarity<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[Parc Polarity Lexicon - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| Gazetteer from [http://trec.nist.gov/ TREC]<br />
| Gazetteer<br />
| NIST - National Institute of Standards and Technology<br />
| Cities and other geographical names<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| [[Gazetteer from TREC - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| DFKI Geographic Ontology<br>''(to be released)''<br />
| Ontology<br />
| DFKI - German Research Center for Artificial Intelligence<br />
| Ontology containing geographic terms and two kinds of relations: the directional ''part-of'' relation, and the ''equal'' relation for synonyms and abbreviations of the same geographic area (e.g ''the United Kingdom'', ''the UK'', ''Great Britain'', etc.) <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[Geographic Ontology - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| ''Geo''<br />
| Collection of Entailment Rules<br />
| Bar-Ilan University; Tel-Aviv University<br />
| Meronymy entailment rules, based on TREC’s TIPSTER gazetteer.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| [[Geo - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| ''Regex''<br />
| Collection of Entailment rules <br />
| Bar-Ilan University; Tel-Aviv University<br />
| Small set of entailment rules based on regular expressions, intended to address lexical variability involving temporal phrases<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| [[Regex - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| Syntactic rule base <br>''(to be released)''<br />
| Collection of Entailment Rules<br />
| Bar-Ilan University; Tel-Aviv University<br />
| A manually-composed collection of entailment rules which define parse tree transformations. The rules cover generic syntactic phenomena such as appositions, conjunctions, passive, relative clause, etc. (Bar-Haim et al., AAAI-07) <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|1<br />
| [[Syntactic rule base - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| Polarity rule base <br>''(to be released)''<br />
| Collection of Entailment Rules<br />
| Bar-Ilan University; Tel-Aviv University<br />
| A manually-composed collection of entailment rules which detect predicates whose polarity is negative (e.g. didn't dance) or unknown (e.g. plans to dance). The rules capture diverse phenomena that affect polarity, e.g. verbal negation, modal verbs, conditionals, and certain verbs that induce negative or "unknown" polarity context. The latter were taken mainly from VerbNet. Extends a resource described in (Bar-Haim et al., AAAI-07) <br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0 <br />
| [[Polarity rule base - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| Lexical-Syntactic rule base<br />
| Collection of Entailment Rules<br />
| Bar-Ilan University; Tel-Aviv University<br />
| Extract lexical-syntactic entailment rules for predicates (verbal and nominal), including argument mapping. The resource is based on WordNet, Nomlex-Plus and Unary DIRT (Szpektor and Dagan, Coling 08)<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| [[Lexical-Syntactic rule base - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| OPENU Collection<br />
| Collection of Entailment Rules and Patterns<br />
| Open University<br />
| Collections of rules, patterns etc. for RTE purpose, extracted from Reuter corpus parsed using Minipar.<br />
| style="text-align: center;"|1<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0 <br />
| [[OPENU Collection - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| ''Abbr''<br />
| Collection of rules for abbreviation<br />
| Bar-Ilan University; Tel-Aviv University<br />
| 2000 Abbreviation rules, extracted from [http://badc.nerc.ac.uk/help/abbrevs.html BADC] and [http://www.acronym-guide.com/ Acronym Guide]<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1<br />
| [[Abbr - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| UAIC Negation_list<br />
| Negation rules<br />
| „Al. I. Cuza“ University, Iasi, Romania <br />
| List of negative terms and words (verbs, adjectives, nouns) affecting modality or factuality of a infinitive verb preceded by the particle "to" (e.g. "believe","necessary", "attempt")<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1 <br />
| [[UAIC Negation_list - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| DLSIUAES Negation_list<br />
| List of negative terms<br />
| University of Alicante<br />
| Basic list of negative terms.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1 <br />
| [[DLSIUAES Negation_list - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| UAIC Quantifier_list<br />
| List of quantifiers<br />
| „Al. I. Cuza“ University, Iasi, Romania <br />
| List of quantifiers affecting entailment judgment. The quantifiers are taken from a list which contains expressions like “more than”, “less than”, or words such as “over”, “under”, etc.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1 <br />
| [[UAIC Quantifier_list - RTE Users|Users]]<br />
<br />
|- bgcolor="#ECECEC" "align="left"<br />
| FBKirst StopWord list<br />
| List of frequent words<br />
| FBK-Irst;<br/>University of Trento - Italy <br />
| A list of the 572 most frequent English words.<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|0<br />
| style="text-align: center;"|1 <br />
| [[FBKirst StopWord list - RTE Users|Users]]<br />
<br />
|}<br />
<br><br />
<br />
==Footnotes==<br />
<references/></div>DGiampiccolohttps://aclweb.org/aclwiki/index.php?title=Textual_Entailment_Resource_Pool&diff=8047Textual Entailment Resource Pool2010-06-18T14:43:50Z<p>DGiampiccolo: </p>
<hr />
<div>[[Textual Entailment|Textual entailment]] systems rely on many different types of [[Natural Language Processing|NLP]] resources, including term banks, paraphrase lists, parsers, named-entity recognizers, etc. With so many resources being continuously released and improved, it can be difficult to know which particular resource to use when developing a system.<br />
<br />
In response, the [[Recognizing Textual Entailment|Recognizing Textual Entailment (RTE)]] shared task community initiated a new activity for building this ''Textual Entailment Resource Pool''. RTE participants and any other member of the NLP community are encouraged to contribute to the pool.<br />
<br />
In an effort to determine the relative impact of the resources, RTE participants are strongly encouraged to report, whenever possible, the contribution to the overall performance of each utilized resource. Formal qualitative and quantitative results should be included in a separate section of the system report as well as posted on the talk pages of this Textual Entailment Resource Pool.<br />
<br />
'''Adding''' a new resource is very easy. See how to '''use existing templates''' to do this in [[Help:Using Templates]].<br />
<br />
== Complete RTE Systems ==<br />
<br />
* [http://project.cgm.unive.it/html/venses.html VENSES] (from Ca' Foscari University of Venice, Italy)<br />
* [http://svn.ask.it.usyd.edu.au/trac/candc/wiki/nutcracker Nutcracker] (available for download)<br />
* [http://l2r.cs.uiuc.edu/~cogcomp/kindleDemo.php Entailment Demo] (from the University of Illinois at Urbana-Champaign)<br />
* [http://edits.fbk.eu/ EDITS - Edit Distance Textual Entailment Suite] (open source software developed by [http://hlt.fbk.eu/ Human Language Technology (HLT) group at FBK-Irst])<br />
<br />
== RTE data sets ==<br />
* [http://www.coli.uni-saarland.de/projects/salsa/fate FrameNet manually annotated RTE 2006 Test Set.] Provided by [http://www.coli.uni-saarland.de/projects/salsa/ SALSA project, Saarland University.]<br />
* [http://www.cs.biu.ac.il/~nlp/files/RTE_2006_Aligned.zip Manually Word Aligned RTE 2006 Data Sets.] Provided by [http://research.microsoft.com/nlp/ the Natural Language Processing Group, Microsoft Research.]<br />
* [http://www-nlp.stanford.edu/projects/contradiction/ RTE data sets annotated for a 3-way decision: entails, contradicts, unknown.] Provided by Stanford NLP Group.<br />
* [http://www.cs.utexas.edu/~pclark/bpi-test-suite/ BPI RTE data set] - 250 pairs, focusing on world knowledge. Provided jointly by [http://www.boeing.com/phantom/math_ct/index.html Boeing], [http://wordnet.cs.princeton.edu/ Princeton], and [http://www.isi.edu ISI].<br />
* [http://hlt.fbk.eu/en/Technology/TE_Specialized_Data Textual Entailment Specialized Data Sets] - 90 RTE-5 Test Set pairs annotated with linguistic phenomena + 203 monothematic pairs (i.e. pairs where only one linguistic phenomenon is relevant to the entailment relation) created from the 90 annotated pairs. Provided jointly by [http://hlt.fbk.eu/en/home FBK-Irst], and [http://www.celct.it/ CELCT].<br />
* [http://www.nist.gov/tac/data/ RTE-5 Search Pilot Data Set annotated with anaphora and coreference information] - RTE-5 Search Data Set annotated with anaphora/coreference information + Augmented RTE-5 Search Data Set, where all the referring expressions which need to be resolved in the entailing sentences are substituted by explicit expressions on the basis of the anaphora/coreference annotation. Provided by [http://www.celct.it/ CELCT] and distributed by [http://www.nist.gov/index.html NIST] at the [http://www.nist.gov/tac/data/ Past TAC Data] web page (2009 Search Pilot, annotated test/dev data).<br />
<br />
== Knowledge Resources ==<br />
The [[RTE Knowledge Resources]] page presents: <br />
<br />
* a [[RTE Knowledge Resources#Call for Resources|call for resources]], inviting system developers to share the resources used by their own TE engines, to both help improve the TE technology and further test and evaluate such resources;<br />
* [[RTE Knowledge Resources#Ablation tests|the ablation tests]] carried out in the RTE challenges in order to evaluate the impact of knowledge resources and tools on TE system performances;<br />
* [[RTE Knowledge Resources#Publicly available Resources|lists of knowledge resources]], both publically available and unpublished, used by systems participating in the last RTE challenges. <br />
<br />
== Tools ==<br />
<br />
=== Parsers ===<br />
* [http://svn.ask.it.usyd.edu.au/trac/candc C&C parser for Combinatory Categorial Grammar]<br />
* [[Minipar]]<br />
* [http://l2r.cs.uiuc.edu/~cogcomp/asoftware.php?skey=SP Shallow Parser] - from the University of Illinois at Urbana-Champaign, see a [http://l2r.cs.uiuc.edu/~cogcomp/shallow_parse_demo.php web demo] of this tool<br />
<br />
=== Role Labelling ===<br />
* [http://cemantix.org/assert ASSERT]<br />
* [http://www.coli.uni-saarland.de/projects/salsa/shal/ Shalmaneser]<br />
* [http://l2r.cs.uiuc.edu/~cogcomp/asoftware.php?skey=SRL Semantic Role Labeler] - from the University of Illinois at Urbana-Champaign, see a [http://l2r.cs.uiuc.edu/~cogcomp/srl-demo.php web demo] of this tool<br />
<br />
=== Entity Recognition Tools ===<br />
* [http://l2r.cs.uiuc.edu/~cogcomp/asoftware.php?skey=NE Illinois Named Entity Tagger] - see a [http://l2r.cs.uiuc.edu/~cogcomp/ne_demo.php web demo] of this tool<br />
* [http://l2r.cs.uiuc.edu/~cogcomp/asoftware.php?skey=CORANKER Illinois Multi-lingual Named Entity Discovery Tool] - see a [http://l2r.cs.uiuc.edu/~cogcomp/ne_matcher_demo.php web demo] of this tool<br />
<br />
=== Similarity / Relatedness Tools ===<br />
* [http://ixa2.si.ehu.es/ukb UKB]: Open source WordNet-based similarity/relatedness tool, includes also pre-computed semantic vectors for all words<br />
<br />
=== Corpus Readers ===<br />
* [http://nltk.org NLTK] provides a corpus reader for the data from RTE Challenges 1, 2, and 3 - see the [http://nltk.org/doc/guides/corpus.html#rte Corpus Readers] Guide for more information.<br />
<br />
=== Related Libraries ===<br />
<br />
* [http://www.semantilog.org/pypes.html PyPES] general purpose library containing evaluation environment for RTE and McPIET text inference engine based on the ERG (English Resource Grammar)<br />
<br />
== Links ==<br />
* [http://homepages.inf.ed.ac.uk/jbos/rte/ Textual Entailment site by Johan Bos]<br />
* [http://ai-nlp.info.uniroma2.it/te/ Textual Entailment at the University of Rome "Tor Vergata"]<br />
[[Category:Textual Entailment Portal]]<br />
* [http://l2r.cs.uiuc.edu/~cogcomp/entailment-module-demos.php Illinois Textual Entailment System Component demos]</div>DGiampiccolohttps://aclweb.org/aclwiki/index.php?title=Textual_Entailment_Resource_Pool&diff=8045Textual Entailment Resource Pool2010-06-18T10:37:39Z<p>DGiampiccolo: </p>
<hr />
<div>[[Textual Entailment|Textual entailment]] systems rely on many different types of [[Natural Language Processing|NLP]] resources, including term banks, paraphrase lists, parsers, named-entity recognizers, etc. With so many resources being continuously released and improved, it can be difficult to know which particular resource to use when developing a system.<br />
<br />
In response, the [[Recognizing Textual Entailment|Recognizing Textual Entailment (RTE)]] shared task community initiated a new activity for building this ''Textual Entailment Resource Pool''. RTE participants and any other member of the NLP community are encouraged to contribute to the pool.<br />
<br />
In an effort to determine the relative impact of the resources, RTE participants are strongly encouraged to report, whenever possible, the contribution to the overall performance of each utilized resource. Formal qualitative and quantitative results should be included in a separate section of the system report as well as posted on the talk pages of this Textual Entailment Resource Pool.<br />
<br />
'''Adding''' a new resource is very easy. See how to '''use existing templates''' to do this in [[Help:Using Templates]].<br />
<br />
== Complete RTE Systems ==<br />
<br />
* [http://project.cgm.unive.it/html/venses.html VENSES] (from Ca' Foscari University of Venice, Italy)<br />
* [http://svn.ask.it.usyd.edu.au/trac/candc/wiki/nutcracker Nutcracker] (available for download)<br />
* [http://l2r.cs.uiuc.edu/~cogcomp/kindleDemo.php Entailment Demo] (from the University of Illinois at Urbana-Champaign)<br />
* [http://edits.fbk.eu/ EDITS - Edit Distance Textual Entailment Suite] (open source software developed by [http://hlt.fbk.eu/ Human Language Technology (HLT) group at FBK-Irst])<br />
<br />
== RTE data sets ==<br />
* [http://www.coli.uni-saarland.de/projects/salsa/fate FrameNet manually annotated RTE 2006 Test Set.] Provided by [http://www.coli.uni-saarland.de/projects/salsa/ SALSA project, Saarland University.]<br />
* [http://www.cs.biu.ac.il/~nlp/files/RTE_2006_Aligned.zip Manually Word Aligned RTE 2006 Data Sets.] Provided by [http://research.microsoft.com/nlp/ the Natural Language Processing Group, Microsoft Research.]<br />
* [http://www-nlp.stanford.edu/projects/contradiction/ RTE data sets annotated for a 3-way decision: entails, contradicts, unknown.] Provided by Stanford NLP Group.<br />
* [http://www.cs.utexas.edu/~pclark/bpi-test-suite/ BPI RTE data set] - 250 pairs, focusing on world knowledge. Provided jointly by [http://www.boeing.com/phantom/math_ct/index.html Boeing], [http://wordnet.cs.princeton.edu/ Princeton], and [http://www.isi.edu ISI].<br />
* [http://hlt.fbk.eu/en/Technology/TE_Specialized_Data Textual Entailment Specialized Data Sets] - 90 RTE-5 Test Set pairs annotated with linguistic phenomena + 203 monothematic pairs (i.e. pairs where only one linguistic phenomenon is relevant to the entailment relation) created from the 90 annotated pairs. Provided jointly by [http://hlt.fbk.eu/en/home FBK-Irst], and [http://www.celct.it/ CELCT].<br />
* [http://www.nist.gov/tac/data/ RTE-5 Search Pilot Data Set annotated with anaphora and coreference information] - RTE-5 Search Data Set annotated with anaphora/coreference information + Augmented RTE-5 Search Data Set, where all the referring expressions which need to be resolved in the entailing sentences are substituted by explicit expressions on the basis of the anaphora/coreference annotation. Provided by [http://www.celct.it/ CELCT] and distributed by [http://www.nist.gov/index.html NIST] at the [http://www.nist.gov/tac/data/ Past TAC Data] web page (2009 Search Pilot, annotated test/dev data).<br />
<br />
== Knowledge Resources ==<br />
The [[RTE Knowledge Resources]] page presents a list of knowledge resources that have been used in the RTE challanges. It also gives access to the [[RTE Knowledge Resources#Ablation tests|'''Ablation tests''']] carried out in RTE-5 and to [[RTE Knowledge Resources#Call for Resources|'''the RTE-6 Call for Resources''']]. <br />
The content of the page is as follows: <br />
<br />
* [[RTE Knowledge Resources#Call for Resources|Call for Resources]]<br />
* [[RTE Knowledge Resources#Ablation tests|Ablation tests]]<br />
* [[RTE Knowledge Resources#Publicly available Resources|Publicly available resources]]<br />
* [[RTE Knowledge Resources#Not available Resources|Not available resources]]<br />
<br />
<br />
== Tools ==<br />
<br />
=== Parsers ===<br />
* [http://svn.ask.it.usyd.edu.au/trac/candc C&C parser for Combinatory Categorial Grammar]<br />
* [[Minipar]]<br />
* [http://l2r.cs.uiuc.edu/~cogcomp/asoftware.php?skey=SP Shallow Parser] - from the University of Illinois at Urbana-Champaign, see a [http://l2r.cs.uiuc.edu/~cogcomp/shallow_parse_demo.php web demo] of this tool<br />
<br />
=== Role Labelling ===<br />
* [http://cemantix.org/assert ASSERT]<br />
* [http://www.coli.uni-saarland.de/projects/salsa/shal/ Shalmaneser]<br />
* [http://l2r.cs.uiuc.edu/~cogcomp/asoftware.php?skey=SRL Semantic Role Labeler] - from the University of Illinois at Urbana-Champaign, see a [http://l2r.cs.uiuc.edu/~cogcomp/srl-demo.php web demo] of this tool<br />
<br />
=== Entity Recognition Tools ===<br />
* [http://l2r.cs.uiuc.edu/~cogcomp/asoftware.php?skey=NE Illinois Named Entity Tagger] - see a [http://l2r.cs.uiuc.edu/~cogcomp/ne_demo.php web demo] of this tool<br />
* [http://l2r.cs.uiuc.edu/~cogcomp/asoftware.php?skey=CORANKER Illinois Multi-lingual Named Entity Discovery Tool] - see a [http://l2r.cs.uiuc.edu/~cogcomp/ne_matcher_demo.php web demo] of this tool<br />
<br />
=== Similarity / Relatedness Tools ===<br />
* [http://ixa2.si.ehu.es/ukb UKB]: Open source WordNet-based similarity/relatedness tool, includes also pre-computed semantic vectors for all words<br />
<br />
=== Corpus Readers ===<br />
* [http://nltk.org NLTK] provides a corpus reader for the data from RTE Challenges 1, 2, and 3 - see the [http://nltk.org/doc/guides/corpus.html#rte Corpus Readers] Guide for more information.<br />
<br />
=== Related Libraries ===<br />
<br />
* [http://www.semantilog.org/pypes.html PyPES] general purpose library containing evaluation environment for RTE and McPIET text inference engine based on the ERG (English Resource Grammar)<br />
<br />
== Links ==<br />
* [http://homepages.inf.ed.ac.uk/jbos/rte/ Textual Entailment site by Johan Bos]<br />
* [http://ai-nlp.info.uniroma2.it/te/ Textual Entailment at the University of Rome "Tor Vergata"]<br />
[[Category:Textual Entailment Portal]]<br />
* [http://l2r.cs.uiuc.edu/~cogcomp/entailment-module-demos.php Illinois Textual Entailment System Component demos]</div>DGiampiccolohttps://aclweb.org/aclwiki/index.php?title=Textual_Entailment_Resource_Pool&diff=8044Textual Entailment Resource Pool2010-06-18T10:34:25Z<p>DGiampiccolo: </p>
<hr />
<div>[[Textual Entailment|Textual entailment]] systems rely on many different types of [[Natural Language Processing|NLP]] resources, including term banks, paraphrase lists, parsers, named-entity recognizers, etc. With so many resources being continuously released and improved, it can be difficult to know which particular resource to use when developing a system.<br />
<br />
In response, the [[Recognizing Textual Entailment|Recognizing Textual Entailment (RTE)]] shared task community initiated a new activity for building this ''Textual Entailment Resource Pool''. RTE participants and any other member of the NLP community are encouraged to contribute to the pool.<br />
<br />
In an effort to determine the relative impact of the resources, RTE participants are strongly encouraged to report, whenever possible, the contribution to the overall performance of each utilized resource. Formal qualitative and quantitative results should be included in a separate section of the system report as well as posted on the talk pages of this Textual Entailment Resource Pool.<br />
<br />
'''Adding''' a new resource is very easy. See how to '''use existing templates''' to do this in [[Help:Using Templates]].<br />
<br />
== Complete RTE Systems ==<br />
<br />
* [http://project.cgm.unive.it/html/venses.html VENSES] (from Ca' Foscari University of Venice, Italy)<br />
* [http://svn.ask.it.usyd.edu.au/trac/candc/wiki/nutcracker Nutcracker] (available for download)<br />
* [http://l2r.cs.uiuc.edu/~cogcomp/kindleDemo.php Entailment Demo] (from the University of Illinois at Urbana-Champaign)<br />
* [http://edits.fbk.eu/ EDITS - Edit Distance Textual Entailment Suite] (open source software developed by [http://hlt.fbk.eu/ Human Language Technology (HLT) group at FBK-Irst])<br />
<br />
== RTE data sets ==<br />
* [http://www.coli.uni-saarland.de/projects/salsa/fate FrameNet manually annotated RTE 2006 Test Set.] Provided by [http://www.coli.uni-saarland.de/projects/salsa/ SALSA project, Saarland University.]<br />
* [http://www.cs.biu.ac.il/~nlp/files/RTE_2006_Aligned.zip Manually Word Aligned RTE 2006 Data Sets.] Provided by [http://research.microsoft.com/nlp/ the Natural Language Processing Group, Microsoft Research.]<br />
* [http://www-nlp.stanford.edu/projects/contradiction/ RTE data sets annotated for a 3-way decision: entails, contradicts, unknown.] Provided by Stanford NLP Group.<br />
* [http://www.cs.utexas.edu/~pclark/bpi-test-suite/ BPI RTE data set] - 250 pairs, focusing on world knowledge. Provided jointly by [http://www.boeing.com/phantom/math_ct/index.html Boeing], [http://wordnet.cs.princeton.edu/ Princeton], and [http://www.isi.edu ISI].<br />
* [http://hlt.fbk.eu/en/Technology/TE_Specialized_Data Textual Entailment Specialized Data Sets] - 90 RTE-5 Test Set pairs annotated with linguistic phenomena + 203 monothematic pairs (i.e. pairs where only one linguistic phenomenon is relevant to the entailment relation) created from the 90 annotated pairs. Provided jointly by [http://hlt.fbk.eu/en/home FBK-Irst], and [http://www.celct.it/ CELCT].<br />
* [http://www.nist.gov/tac/data/ RTE-5 Search Pilot Data Set annotated with anaphora and coreference information] - RTE-5 Search Data Set annotated with anaphora/coreference information + Augmented RTE-5 Search Data Set, where all the referring expressions which need to be resolved in the entailing sentences are substituted by explicit expressions on the basis of the anaphora/coreference annotation. Provided by [http://www.celct.it/ CELCT] and distributed by [http://www.nist.gov/index.html NIST] at the [http://www.nist.gov/tac/data/ Past TAC Data] web page (2009 Search Pilot, annotated test/dev data).<br />
<br />
== Knowledge Resources ==<br />
The [[RTE Knowledge Resources]] page presents a list of knowledge resources that have been used in the RTE challanges. It also gives access to the [[RTE Knowledge Resources#Ablation tests|'''Ablation tests''']] held in last challange and to the [[RTE Knowledge Resources#Call for Resources|'''RTE-6 Call for Resources''']]. <br />
The content of the page is as follows: <br />
<br />
* [[RTE Knowledge Resources#Call for Resources|Call for Resources]]<br />
* [[RTE Knowledge Resources#Ablation tests|Ablation tests]]<br />
* [[RTE Knowledge Resources#Publicly available Resources|Publicly available resources]]<br />
* [[RTE Knowledge Resources#Not available Resources|Not available resources]]<br />
<br />
<br />
== Tools ==<br />
<br />
=== Parsers ===<br />
* [http://svn.ask.it.usyd.edu.au/trac/candc C&C parser for Combinatory Categorial Grammar]<br />
* [[Minipar]]<br />
* [http://l2r.cs.uiuc.edu/~cogcomp/asoftware.php?skey=SP Shallow Parser] - from the University of Illinois at Urbana-Champaign, see a [http://l2r.cs.uiuc.edu/~cogcomp/shallow_parse_demo.php web demo] of this tool<br />
<br />
=== Role Labelling ===<br />
* [http://cemantix.org/assert ASSERT]<br />
* [http://www.coli.uni-saarland.de/projects/salsa/shal/ Shalmaneser]<br />
* [http://l2r.cs.uiuc.edu/~cogcomp/asoftware.php?skey=SRL Semantic Role Labeler] - from the University of Illinois at Urbana-Champaign, see a [http://l2r.cs.uiuc.edu/~cogcomp/srl-demo.php web demo] of this tool<br />
<br />
=== Entity Recognition Tools ===<br />
* [http://l2r.cs.uiuc.edu/~cogcomp/asoftware.php?skey=NE Illinois Named Entity Tagger] - see a [http://l2r.cs.uiuc.edu/~cogcomp/ne_demo.php web demo] of this tool<br />
* [http://l2r.cs.uiuc.edu/~cogcomp/asoftware.php?skey=CORANKER Illinois Multi-lingual Named Entity Discovery Tool] - see a [http://l2r.cs.uiuc.edu/~cogcomp/ne_matcher_demo.php web demo] of this tool<br />
<br />
=== Similarity / Relatedness Tools ===<br />
* [http://ixa2.si.ehu.es/ukb UKB]: Open source WordNet-based similarity/relatedness tool, includes also pre-computed semantic vectors for all words<br />
<br />
=== Corpus Readers ===<br />
* [http://nltk.org NLTK] provides a corpus reader for the data from RTE Challenges 1, 2, and 3 - see the [http://nltk.org/doc/guides/corpus.html#rte Corpus Readers] Guide for more information.<br />
<br />
=== Related Libraries ===<br />
<br />
* [http://www.semantilog.org/pypes.html PyPES] general purpose library containing evaluation environment for RTE and McPIET text inference engine based on the ERG (English Resource Grammar)<br />
<br />
== Links ==<br />
* [http://homepages.inf.ed.ac.uk/jbos/rte/ Textual Entailment site by Johan Bos]<br />
* [http://ai-nlp.info.uniroma2.it/te/ Textual Entailment at the University of Rome "Tor Vergata"]<br />
[[Category:Textual Entailment Portal]]<br />
* [http://l2r.cs.uiuc.edu/~cogcomp/entailment-module-demos.php Illinois Textual Entailment System Component demos]</div>DGiampiccolohttps://aclweb.org/aclwiki/index.php?title=Textual_Entailment_Resource_Pool&diff=8043Textual Entailment Resource Pool2010-06-18T10:32:34Z<p>DGiampiccolo: </p>
<hr />
<div>[[Textual Entailment|Textual entailment]] systems rely on many different types of [[Natural Language Processing|NLP]] resources, including term banks, paraphrase lists, parsers, named-entity recognizers, etc. With so many resources being continuously released and improved, it can be difficult to know which particular resource to use when developing a system.<br />
<br />
In response, the [[Recognizing Textual Entailment|Recognizing Textual Entailment (RTE)]] shared task community initiated a new activity for building this ''Textual Entailment Resource Pool''. RTE participants and any other member of the NLP community are encouraged to contribute to the pool.<br />
<br />
In an effort to determine the relative impact of the resources, RTE participants are strongly encouraged to report, whenever possible, the contribution to the overall performance of each utilized resource. Formal qualitative and quantitative results should be included in a separate section of the system report as well as posted on the talk pages of this Textual Entailment Resource Pool.<br />
<br />
'''Adding''' a new resource is very easy. See how to '''use existing templates''' to do this in [[Help:Using Templates]].<br />
<br />
== Complete RTE Systems ==<br />
<br />
* [http://project.cgm.unive.it/html/venses.html VENSES] (from Ca' Foscari University of Venice, Italy)<br />
* [http://svn.ask.it.usyd.edu.au/trac/candc/wiki/nutcracker Nutcracker] (available for download)<br />
* [http://l2r.cs.uiuc.edu/~cogcomp/kindleDemo.php Entailment Demo] (from the University of Illinois at Urbana-Champaign)<br />
* [http://edits.fbk.eu/ EDITS - Edit Distance Textual Entailment Suite] (open source software developed by [http://hlt.fbk.eu/ Human Language Technology (HLT) group at FBK-Irst])<br />
<br />
== RTE data sets ==<br />
* [http://www.coli.uni-saarland.de/projects/salsa/fate FrameNet manually annotated RTE 2006 Test Set.] Provided by [http://www.coli.uni-saarland.de/projects/salsa/ SALSA project, Saarland University.]<br />
* [http://www.cs.biu.ac.il/~nlp/files/RTE_2006_Aligned.zip Manually Word Aligned RTE 2006 Data Sets.] Provided by [http://research.microsoft.com/nlp/ the Natural Language Processing Group, Microsoft Research.]<br />
* [http://www-nlp.stanford.edu/projects/contradiction/ RTE data sets annotated for a 3-way decision: entails, contradicts, unknown.] Provided by Stanford NLP Group.<br />
* [http://www.cs.utexas.edu/~pclark/bpi-test-suite/ BPI RTE data set] - 250 pairs, focusing on world knowledge. Provided jointly by [http://www.boeing.com/phantom/math_ct/index.html Boeing], [http://wordnet.cs.princeton.edu/ Princeton], and [http://www.isi.edu ISI].<br />
* [http://hlt.fbk.eu/en/Technology/TE_Specialized_Data Textual Entailment Specialized Data Sets] - 90 RTE-5 Test Set pairs annotated with linguistic phenomena + 203 monothematic pairs (i.e. pairs where only one linguistic phenomenon is relevant to the entailment relation) created from the 90 annotated pairs. Provided jointly by [http://hlt.fbk.eu/en/home FBK-Irst], and [http://www.celct.it/ CELCT].<br />
* [http://www.nist.gov/tac/data/ RTE-5 Search Pilot Data Set annotated with anaphora and coreference information] - RTE-5 Search Data Set annotated with anaphora/coreference information + Augmented RTE-5 Search Data Set, where all the referring expressions which need to be resolved in the entailing sentences are substituted by explicit expressions on the basis of the anaphora/coreference annotation. Provided by [http://www.celct.it/ CELCT] and distributed by [http://www.nist.gov/index.html NIST] at the [http://www.nist.gov/tac/data/ Past TAC Data] web page (2009 Search Pilot, annotated test/dev data).<br />
<br />
== Knowledge Resources ==<br />
The [[RTE Knowledge Resources]] page presents a list of knowledge resources that have been used in the RTE challanges. It also gives access to the [[RTE Knowledge Resources#Ablation tests|'''Ablation tests''']] held in last challange and to the RTE-6 Call for Resources. The content of the page is as follows: <br />
<br />
* [[RTE Knowledge Resources#Call for Resources|Call for Resources]]<br />
* [[RTE Knowledge Resources#Ablation tests|Ablation tests]]<br />
* [[RTE Knowledge Resources#Publicly available Resources|Publicly available resources]]<br />
* [[RTE Knowledge Resources#Not available Resources|Not available resources]]<br />
* Footnotes<br />
<br />
== Tools ==<br />
<br />
=== Parsers ===<br />
* [http://svn.ask.it.usyd.edu.au/trac/candc C&C parser for Combinatory Categorial Grammar]<br />
* [[Minipar]]<br />
* [http://l2r.cs.uiuc.edu/~cogcomp/asoftware.php?skey=SP Shallow Parser] - from the University of Illinois at Urbana-Champaign, see a [http://l2r.cs.uiuc.edu/~cogcomp/shallow_parse_demo.php web demo] of this tool<br />
<br />
=== Role Labelling ===<br />
* [http://cemantix.org/assert ASSERT]<br />
* [http://www.coli.uni-saarland.de/projects/salsa/shal/ Shalmaneser]<br />
* [http://l2r.cs.uiuc.edu/~cogcomp/asoftware.php?skey=SRL Semantic Role Labeler] - from the University of Illinois at Urbana-Champaign, see a [http://l2r.cs.uiuc.edu/~cogcomp/srl-demo.php web demo] of this tool<br />
<br />
=== Entity Recognition Tools ===<br />
* [http://l2r.cs.uiuc.edu/~cogcomp/asoftware.php?skey=NE Illinois Named Entity Tagger] - see a [http://l2r.cs.uiuc.edu/~cogcomp/ne_demo.php web demo] of this tool<br />
* [http://l2r.cs.uiuc.edu/~cogcomp/asoftware.php?skey=CORANKER Illinois Multi-lingual Named Entity Discovery Tool] - see a [http://l2r.cs.uiuc.edu/~cogcomp/ne_matcher_demo.php web demo] of this tool<br />
<br />
=== Similarity / Relatedness Tools ===<br />
* [http://ixa2.si.ehu.es/ukb UKB]: Open source WordNet-based similarity/relatedness tool, includes also pre-computed semantic vectors for all words<br />
<br />
=== Corpus Readers ===<br />
* [http://nltk.org NLTK] provides a corpus reader for the data from RTE Challenges 1, 2, and 3 - see the [http://nltk.org/doc/guides/corpus.html#rte Corpus Readers] Guide for more information.<br />
<br />
=== Related Libraries ===<br />
<br />
* [http://www.semantilog.org/pypes.html PyPES] general purpose library containing evaluation environment for RTE and McPIET text inference engine based on the ERG (English Resource Grammar)<br />
<br />
== Links ==<br />
* [http://homepages.inf.ed.ac.uk/jbos/rte/ Textual Entailment site by Johan Bos]<br />
* [http://ai-nlp.info.uniroma2.it/te/ Textual Entailment at the University of Rome "Tor Vergata"]<br />
[[Category:Textual Entailment Portal]]<br />
* [http://l2r.cs.uiuc.edu/~cogcomp/entailment-module-demos.php Illinois Textual Entailment System Component demos]</div>DGiampiccolohttps://aclweb.org/aclwiki/index.php?title=Textual_Entailment_Resource_Pool&diff=8042Textual Entailment Resource Pool2010-06-18T10:14:33Z<p>DGiampiccolo: </p>
<hr />
<div>[[Textual Entailment|Textual entailment]] systems rely on many different types of [[Natural Language Processing|NLP]] resources, including term banks, paraphrase lists, parsers, named-entity recognizers, etc. With so many resources being continuously released and improved, it can be difficult to know which particular resource to use when developing a system.<br />
<br />
In response, the [[Recognizing Textual Entailment|Recognizing Textual Entailment (RTE)]] shared task community initiated a new activity for building this ''Textual Entailment Resource Pool''. RTE participants and any other member of the NLP community are encouraged to contribute to the pool.<br />
<br />
In an effort to determine the relative impact of the resources, RTE participants are strongly encouraged to report, whenever possible, the contribution to the overall performance of each utilized resource. Formal qualitative and quantitative results should be included in a separate section of the system report as well as posted on the talk pages of this Textual Entailment Resource Pool.<br />
<br />
'''Adding''' a new resource is very easy. See how to '''use existing templates''' to do this in [[Help:Using Templates]].<br />
<br />
== Complete RTE Systems ==<br />
<br />
* [http://project.cgm.unive.it/html/venses.html VENSES] (from Ca' Foscari University of Venice, Italy)<br />
* [http://svn.ask.it.usyd.edu.au/trac/candc/wiki/nutcracker Nutcracker] (available for download)<br />
* [http://l2r.cs.uiuc.edu/~cogcomp/kindleDemo.php Entailment Demo] (from the University of Illinois at Urbana-Champaign)<br />
* [http://edits.fbk.eu/ EDITS - Edit Distance Textual Entailment Suite] (open source software developed by [http://hlt.fbk.eu/ Human Language Technology (HLT) group at FBK-Irst])<br />
<br />
== RTE data sets ==<br />
* [http://www.coli.uni-saarland.de/projects/salsa/fate FrameNet manually annotated RTE 2006 Test Set.] Provided by [http://www.coli.uni-saarland.de/projects/salsa/ SALSA project, Saarland University.]<br />
* [http://www.cs.biu.ac.il/~nlp/files/RTE_2006_Aligned.zip Manually Word Aligned RTE 2006 Data Sets.] Provided by [http://research.microsoft.com/nlp/ the Natural Language Processing Group, Microsoft Research.]<br />
* [http://www-nlp.stanford.edu/projects/contradiction/ RTE data sets annotated for a 3-way decision: entails, contradicts, unknown.] Provided by Stanford NLP Group.<br />
* [http://www.cs.utexas.edu/~pclark/bpi-test-suite/ BPI RTE data set] - 250 pairs, focusing on world knowledge. Provided jointly by [http://www.boeing.com/phantom/math_ct/index.html Boeing], [http://wordnet.cs.princeton.edu/ Princeton], and [http://www.isi.edu ISI].<br />
* [http://hlt.fbk.eu/en/Technology/TE_Specialized_Data Textual Entailment Specialized Data Sets] - 90 RTE-5 Test Set pairs annotated with linguistic phenomena + 203 monothematic pairs (i.e. pairs where only one linguistic phenomenon is relevant to the entailment relation) created from the 90 annotated pairs. Provided jointly by [http://hlt.fbk.eu/en/home FBK-Irst], and [http://www.celct.it/ CELCT].<br />
* [http://www.nist.gov/tac/data/ RTE-5 Search Pilot Data Set annotated with anaphora and coreference information] - RTE-5 Search Data Set annotated with anaphora/coreference information + Augmented RTE-5 Search Data Set, where all the referring expressions which need to be resolved in the entailing sentences are substituted by explicit expressions on the basis of the anaphora/coreference annotation. Provided by [http://www.celct.it/ CELCT] and distributed by [http://www.nist.gov/index.html NIST] at the [http://www.nist.gov/tac/data/ Past TAC Data] web page (2009 Search Pilot, annotated test/dev data).<br />
<br />
== Knowledge Resources ==<br />
The [[RTE Knowledge Resources]] page presents a list of knowledge resources that have been used in the RTE challanges. It also gives access to the '''Ablation tests''' held in last challange and to the RTE-6 Call for Resources. The content of the page is as follows: <br />
<br />
* Call for Resources<br />
* Ablation Tests<br />
* Publicly available Resources<br />
* Not available Resources<br />
* Footnotes<br />
<br />
== Tools ==<br />
<br />
=== Parsers ===<br />
* [http://svn.ask.it.usyd.edu.au/trac/candc C&C parser for Combinatory Categorial Grammar]<br />
* [[Minipar]]<br />
* [http://l2r.cs.uiuc.edu/~cogcomp/asoftware.php?skey=SP Shallow Parser] - from the University of Illinois at Urbana-Champaign, see a [http://l2r.cs.uiuc.edu/~cogcomp/shallow_parse_demo.php web demo] of this tool<br />
<br />
=== Role Labelling ===<br />
* [http://cemantix.org/assert ASSERT]<br />
* [http://www.coli.uni-saarland.de/projects/salsa/shal/ Shalmaneser]<br />
* [http://l2r.cs.uiuc.edu/~cogcomp/asoftware.php?skey=SRL Semantic Role Labeler] - from the University of Illinois at Urbana-Champaign, see a [http://l2r.cs.uiuc.edu/~cogcomp/srl-demo.php web demo] of this tool<br />
<br />
=== Entity Recognition Tools ===<br />
* [http://l2r.cs.uiuc.edu/~cogcomp/asoftware.php?skey=NE Illinois Named Entity Tagger] - see a [http://l2r.cs.uiuc.edu/~cogcomp/ne_demo.php web demo] of this tool<br />
* [http://l2r.cs.uiuc.edu/~cogcomp/asoftware.php?skey=CORANKER Illinois Multi-lingual Named Entity Discovery Tool] - see a [http://l2r.cs.uiuc.edu/~cogcomp/ne_matcher_demo.php web demo] of this tool<br />
<br />
=== Similarity / Relatedness Tools ===<br />
* [http://ixa2.si.ehu.es/ukb UKB]: Open source WordNet-based similarity/relatedness tool, includes also pre-computed semantic vectors for all words<br />
<br />
=== Corpus Readers ===<br />
* [http://nltk.org NLTK] provides a corpus reader for the data from RTE Challenges 1, 2, and 3 - see the [http://nltk.org/doc/guides/corpus.html#rte Corpus Readers] Guide for more information.<br />
<br />
=== Related Libraries ===<br />
<br />
* [http://www.semantilog.org/pypes.html PyPES] general purpose library containing evaluation environment for RTE and McPIET text inference engine based on the ERG (English Resource Grammar)<br />
<br />
== Links ==<br />
* [http://homepages.inf.ed.ac.uk/jbos/rte/ Textual Entailment site by Johan Bos]<br />
* [http://ai-nlp.info.uniroma2.it/te/ Textual Entailment at the University of Rome "Tor Vergata"]<br />
[[Category:Textual Entailment Portal]]<br />
* [http://l2r.cs.uiuc.edu/~cogcomp/entailment-module-demos.php Illinois Textual Entailment System Component demos]</div>DGiampiccolohttps://aclweb.org/aclwiki/index.php?title=Textual_Entailment_Resource_Pool&diff=8041Textual Entailment Resource Pool2010-06-18T10:13:50Z<p>DGiampiccolo: </p>
<hr />
<div>[[Textual Entailment|Textual entailment]] systems rely on many different types of [[Natural Language Processing|NLP]] resources, including term banks, paraphrase lists, parsers, named-entity recognizers, etc. With so many resources being continuously released and improved, it can be difficult to know which particular resource to use when developing a system.<br />
<br />
In response, the [[Recognizing Textual Entailment|Recognizing Textual Entailment (RTE)]] shared task community initiated a new activity for building this ''Textual Entailment Resource Pool''. RTE participants and any other member of the NLP community are encouraged to contribute to the pool.<br />
<br />
In an effort to determine the relative impact of the resources, RTE participants are strongly encouraged to report, whenever possible, the contribution to the overall performance of each utilized resource. Formal qualitative and quantitative results should be included in a separate section of the system report as well as posted on the talk pages of this Textual Entailment Resource Pool.<br />
<br />
'''Adding''' a new resource is very easy. See how to '''use existing templates''' to do this in [[Help:Using Templates]].<br />
<br />
== Complete RTE Systems ==<br />
<br />
* [http://project.cgm.unive.it/html/venses.html VENSES] (from Ca' Foscari University of Venice, Italy)<br />
* [http://svn.ask.it.usyd.edu.au/trac/candc/wiki/nutcracker Nutcracker] (available for download)<br />
* [http://l2r.cs.uiuc.edu/~cogcomp/kindleDemo.php Entailment Demo] (from the University of Illinois at Urbana-Champaign)<br />
* [http://edits.fbk.eu/ EDITS - Edit Distance Textual Entailment Suite] (open source software developed by [http://hlt.fbk.eu/ Human Language Technology (HLT) group at FBK-Irst])<br />
<br />
== RTE data sets ==<br />
* [http://www.coli.uni-saarland.de/projects/salsa/fate FrameNet manually annotated RTE 2006 Test Set.] Provided by [http://www.coli.uni-saarland.de/projects/salsa/ SALSA project, Saarland University.]<br />
* [http://www.cs.biu.ac.il/~nlp/files/RTE_2006_Aligned.zip Manually Word Aligned RTE 2006 Data Sets.] Provided by [http://research.microsoft.com/nlp/ the Natural Language Processing Group, Microsoft Research.]<br />
* [http://www-nlp.stanford.edu/projects/contradiction/ RTE data sets annotated for a 3-way decision: entails, contradicts, unknown.] Provided by Stanford NLP Group.<br />
* [http://www.cs.utexas.edu/~pclark/bpi-test-suite/ BPI RTE data set] - 250 pairs, focusing on world knowledge. Provided jointly by [http://www.boeing.com/phantom/math_ct/index.html Boeing], [http://wordnet.cs.princeton.edu/ Princeton], and [http://www.isi.edu ISI].<br />
* [http://hlt.fbk.eu/en/Technology/TE_Specialized_Data Textual Entailment Specialized Data Sets] - 90 RTE-5 Test Set pairs annotated with linguistic phenomena + 203 monothematic pairs (i.e. pairs where only one linguistic phenomenon is relevant to the entailment relation) created from the 90 annotated pairs. Provided jointly by [http://hlt.fbk.eu/en/home FBK-Irst], and [http://www.celct.it/ CELCT].<br />
* [http://www.nist.gov/tac/data/ RTE-5 Search Pilot Data Set annotated with anaphora and coreference information] - RTE-5 Search Data Set annotated with anaphora/coreference information + Augmented RTE-5 Search Data Set, where all the referring expressions which need to be resolved in the entailing sentences are substituted by explicit expressions on the basis of the anaphora/coreference annotation. Provided by [http://www.celct.it/ CELCT] and distributed by [http://www.nist.gov/index.html NIST] at the [http://www.nist.gov/tac/data/ Past TAC Data] web page (2009 Search Pilot, annotated test/dev data).<br />
<br />
== Knowledge Resources ==<br />
The [[RTE Knowledge Resources]] page presents a list of knowledge resources that have been used in the RTE challanges. It also gives access to the """Ablation test""" held in last challange and to the RTE-6 Call for Resources. The content of the page is as follows: <br />
<br />
* Call for Resources<br />
* Ablation Tests<br />
* Publicly available Resources<br />
* Not available Resources<br />
* Footnotes<br />
<br />
== Tools ==<br />
<br />
=== Parsers ===<br />
* [http://svn.ask.it.usyd.edu.au/trac/candc C&C parser for Combinatory Categorial Grammar]<br />
* [[Minipar]]<br />
* [http://l2r.cs.uiuc.edu/~cogcomp/asoftware.php?skey=SP Shallow Parser] - from the University of Illinois at Urbana-Champaign, see a [http://l2r.cs.uiuc.edu/~cogcomp/shallow_parse_demo.php web demo] of this tool<br />
<br />
=== Role Labelling ===<br />
* [http://cemantix.org/assert ASSERT]<br />
* [http://www.coli.uni-saarland.de/projects/salsa/shal/ Shalmaneser]<br />
* [http://l2r.cs.uiuc.edu/~cogcomp/asoftware.php?skey=SRL Semantic Role Labeler] - from the University of Illinois at Urbana-Champaign, see a [http://l2r.cs.uiuc.edu/~cogcomp/srl-demo.php web demo] of this tool<br />
<br />
=== Entity Recognition Tools ===<br />
* [http://l2r.cs.uiuc.edu/~cogcomp/asoftware.php?skey=NE Illinois Named Entity Tagger] - see a [http://l2r.cs.uiuc.edu/~cogcomp/ne_demo.php web demo] of this tool<br />
* [http://l2r.cs.uiuc.edu/~cogcomp/asoftware.php?skey=CORANKER Illinois Multi-lingual Named Entity Discovery Tool] - see a [http://l2r.cs.uiuc.edu/~cogcomp/ne_matcher_demo.php web demo] of this tool<br />
<br />
=== Similarity / Relatedness Tools ===<br />
* [http://ixa2.si.ehu.es/ukb UKB]: Open source WordNet-based similarity/relatedness tool, includes also pre-computed semantic vectors for all words<br />
<br />
=== Corpus Readers ===<br />
* [http://nltk.org NLTK] provides a corpus reader for the data from RTE Challenges 1, 2, and 3 - see the [http://nltk.org/doc/guides/corpus.html#rte Corpus Readers] Guide for more information.<br />
<br />
=== Related Libraries ===<br />
<br />
* [http://www.semantilog.org/pypes.html PyPES] general purpose library containing evaluation environment for RTE and McPIET text inference engine based on the ERG (English Resource Grammar)<br />
<br />
== Links ==<br />
* [http://homepages.inf.ed.ac.uk/jbos/rte/ Textual Entailment site by Johan Bos]<br />
* [http://ai-nlp.info.uniroma2.it/te/ Textual Entailment at the University of Rome "Tor Vergata"]<br />
[[Category:Textual Entailment Portal]]<br />
* [http://l2r.cs.uiuc.edu/~cogcomp/entailment-module-demos.php Illinois Textual Entailment System Component demos]</div>DGiampiccolohttps://aclweb.org/aclwiki/index.php?title=Textual_Entailment_Resource_Pool&diff=8040Textual Entailment Resource Pool2010-06-18T10:09:34Z<p>DGiampiccolo: </p>
<hr />
<div>[[Textual Entailment|Textual entailment]] systems rely on many different types of [[Natural Language Processing|NLP]] resources, including term banks, paraphrase lists, parsers, named-entity recognizers, etc. With so many resources being continuously released and improved, it can be difficult to know which particular resource to use when developing a system.<br />
<br />
In response, the [[Recognizing Textual Entailment|Recognizing Textual Entailment (RTE)]] shared task community initiated a new activity for building this ''Textual Entailment Resource Pool''. RTE participants and any other member of the NLP community are encouraged to contribute to the pool.<br />
<br />
In an effort to determine the relative impact of the resources, RTE participants are strongly encouraged to report, whenever possible, the contribution to the overall performance of each utilized resource. Formal qualitative and quantitative results should be included in a separate section of the system report as well as posted on the talk pages of this Textual Entailment Resource Pool.<br />
<br />
'''Adding''' a new resource is very easy. See how to '''use existing templates''' to do this in [[Help:Using Templates]].<br />
<br />
== Complete RTE Systems ==<br />
<br />
* [http://project.cgm.unive.it/html/venses.html VENSES] (from Ca' Foscari University of Venice, Italy)<br />
* [http://svn.ask.it.usyd.edu.au/trac/candc/wiki/nutcracker Nutcracker] (available for download)<br />
* [http://l2r.cs.uiuc.edu/~cogcomp/kindleDemo.php Entailment Demo] (from the University of Illinois at Urbana-Champaign)<br />
* [http://edits.fbk.eu/ EDITS - Edit Distance Textual Entailment Suite] (open source software developed by [http://hlt.fbk.eu/ Human Language Technology (HLT) group at FBK-Irst])<br />
<br />
== RTE data sets ==<br />
* [http://www.coli.uni-saarland.de/projects/salsa/fate FrameNet manually annotated RTE 2006 Test Set.] Provided by [http://www.coli.uni-saarland.de/projects/salsa/ SALSA project, Saarland University.]<br />
* [http://www.cs.biu.ac.il/~nlp/files/RTE_2006_Aligned.zip Manually Word Aligned RTE 2006 Data Sets.] Provided by [http://research.microsoft.com/nlp/ the Natural Language Processing Group, Microsoft Research.]<br />
* [http://www-nlp.stanford.edu/projects/contradiction/ RTE data sets annotated for a 3-way decision: entails, contradicts, unknown.] Provided by Stanford NLP Group.<br />
* [http://www.cs.utexas.edu/~pclark/bpi-test-suite/ BPI RTE data set] - 250 pairs, focusing on world knowledge. Provided jointly by [http://www.boeing.com/phantom/math_ct/index.html Boeing], [http://wordnet.cs.princeton.edu/ Princeton], and [http://www.isi.edu ISI].<br />
* [http://hlt.fbk.eu/en/Technology/TE_Specialized_Data Textual Entailment Specialized Data Sets] - 90 RTE-5 Test Set pairs annotated with linguistic phenomena + 203 monothematic pairs (i.e. pairs where only one linguistic phenomenon is relevant to the entailment relation) created from the 90 annotated pairs. Provided jointly by [http://hlt.fbk.eu/en/home FBK-Irst], and [http://www.celct.it/ CELCT].<br />
* [http://www.nist.gov/tac/data/ RTE-5 Search Pilot Data Set annotated with anaphora and coreference information] - RTE-5 Search Data Set annotated with anaphora/coreference information + Augmented RTE-5 Search Data Set, where all the referring expressions which need to be resolved in the entailing sentences are substituted by explicit expressions on the basis of the anaphora/coreference annotation. Provided by [http://www.celct.it/ CELCT] and distributed by [http://www.nist.gov/index.html NIST] at the [http://www.nist.gov/tac/data/ Past TAC Data] web page (2009 Search Pilot, annotated test/dev data).<br />
<br />
== Knowledge Resources ==<br />
The [[RTE Knowledge Resources]] page presents a list of knowledge resources that have been used in the RTE challanges. It also gives access to the [[ABLATION TESTS]] held in last challange and to the RTE-6 Call for Resources. The content of the page is as follows: <br />
<br />
* Call for Resources<br />
* Ablation Tests<br />
* Publicly available Resources<br />
* Not available Resources<br />
* Footnotes<br />
<br />
== Tools ==<br />
<br />
=== Parsers ===<br />
* [http://svn.ask.it.usyd.edu.au/trac/candc C&C parser for Combinatory Categorial Grammar]<br />
* [[Minipar]]<br />
* [http://l2r.cs.uiuc.edu/~cogcomp/asoftware.php?skey=SP Shallow Parser] - from the University of Illinois at Urbana-Champaign, see a [http://l2r.cs.uiuc.edu/~cogcomp/shallow_parse_demo.php web demo] of this tool<br />
<br />
=== Role Labelling ===<br />
* [http://cemantix.org/assert ASSERT]<br />
* [http://www.coli.uni-saarland.de/projects/salsa/shal/ Shalmaneser]<br />
* [http://l2r.cs.uiuc.edu/~cogcomp/asoftware.php?skey=SRL Semantic Role Labeler] - from the University of Illinois at Urbana-Champaign, see a [http://l2r.cs.uiuc.edu/~cogcomp/srl-demo.php web demo] of this tool<br />
<br />
=== Entity Recognition Tools ===<br />
* [http://l2r.cs.uiuc.edu/~cogcomp/asoftware.php?skey=NE Illinois Named Entity Tagger] - see a [http://l2r.cs.uiuc.edu/~cogcomp/ne_demo.php web demo] of this tool<br />
* [http://l2r.cs.uiuc.edu/~cogcomp/asoftware.php?skey=CORANKER Illinois Multi-lingual Named Entity Discovery Tool] - see a [http://l2r.cs.uiuc.edu/~cogcomp/ne_matcher_demo.php web demo] of this tool<br />
<br />
=== Similarity / Relatedness Tools ===<br />
* [http://ixa2.si.ehu.es/ukb UKB]: Open source WordNet-based similarity/relatedness tool, includes also pre-computed semantic vectors for all words<br />
<br />
=== Corpus Readers ===<br />
* [http://nltk.org NLTK] provides a corpus reader for the data from RTE Challenges 1, 2, and 3 - see the [http://nltk.org/doc/guides/corpus.html#rte Corpus Readers] Guide for more information.<br />
<br />
=== Related Libraries ===<br />
<br />
* [http://www.semantilog.org/pypes.html PyPES] general purpose library containing evaluation environment for RTE and McPIET text inference engine based on the ERG (English Resource Grammar)<br />
<br />
== Links ==<br />
* [http://homepages.inf.ed.ac.uk/jbos/rte/ Textual Entailment site by Johan Bos]<br />
* [http://ai-nlp.info.uniroma2.it/te/ Textual Entailment at the University of Rome "Tor Vergata"]<br />
[[Category:Textual Entailment Portal]]<br />
* [http://l2r.cs.uiuc.edu/~cogcomp/entailment-module-demos.php Illinois Textual Entailment System Component demos]</div>DGiampiccolo