Difference between revisions of "Textual Entailment Resource Pool"
m (updated URL for downward entailing operator list) |
|||
Line 16: | Line 16: | ||
* [http://www.coli.uni-saarland.de/projects/salsa/fate FrameNet manually annotated RTE 2006 Test Set.] Provided by [http://www.coli.uni-saarland.de/projects/salsa/ SALSA project, Saarland University.] | * [http://www.coli.uni-saarland.de/projects/salsa/fate FrameNet manually annotated RTE 2006 Test Set.] Provided by [http://www.coli.uni-saarland.de/projects/salsa/ SALSA project, Saarland University.] | ||
* [http://www.cs.biu.ac.il/~nlp/files/RTE_2006_Aligned.zip Manually Word Aligned RTE 2006 Data Sets.] Provided by [http://research.microsoft.com/nlp/ the Natural Language Processing Group, Microsoft Research.] | * [http://www.cs.biu.ac.il/~nlp/files/RTE_2006_Aligned.zip Manually Word Aligned RTE 2006 Data Sets.] Provided by [http://research.microsoft.com/nlp/ the Natural Language Processing Group, Microsoft Research.] | ||
− | |||
* [http://www-nlp.stanford.edu/projects/contradiction/ RTE data sets annotated for a 3-way decision: entails, contradicts, unknown.] Provided by Stanford NLP Group. | * [http://www-nlp.stanford.edu/projects/contradiction/ RTE data sets annotated for a 3-way decision: entails, contradicts, unknown.] Provided by Stanford NLP Group. | ||
* [http://www.cs.utexas.edu/~pclark/bpi-test-suite/ BPI RTE data set] - 250 pairs, focusing on world knowledge. Provided jointly by [http://www.boeing.com/phantom/math_ct/index.html Boeing], [http://wordnet.cs.princeton.edu/ Princeton], and [http://www.isi.edu ISI]. | * [http://www.cs.utexas.edu/~pclark/bpi-test-suite/ BPI RTE data set] - 250 pairs, focusing on world knowledge. Provided jointly by [http://www.boeing.com/phantom/math_ct/index.html Boeing], [http://wordnet.cs.princeton.edu/ Princeton], and [http://www.isi.edu ISI]. |
Revision as of 07:59, 17 June 2009
Textual entailment systems rely on many different types of NLP resources, including term banks, paraphrase lists, parsers, named-entity recognizers, etc. With so many resources being continuously released and improved, it can be difficult to know which particular resource to use when developing a system.
In response, the Recognizing Textual Entailment (RTE) shared task community initiated a new activity for building this Textual Entailment Resource Pool. RTE participants and any other member of the NLP community are encouraged to contribute to the pool.
In an effort to determine the relative impact of the resources, RTE participants are strongly encouraged to report, whenever possible, the contribution to the overall performance of each utilized resource. Formal qualitative and quantitative results should be included in a separate section of the system report as well as posted on the talk pages of this Textual Entailment Resource Pool.
Adding a new resource is very easy. See how to use existing templates to do this in Help:Using Templates.
Complete RTE Systems
- VENSES (from Ca' Foscari University of Venice, Italy)
- Nutcracker (available for download)
- Entailment Demo (from the University of Illinois at Urbana-Champaign)
RTE data sets
- FrameNet manually annotated RTE 2006 Test Set. Provided by SALSA project, Saarland University.
- Manually Word Aligned RTE 2006 Data Sets. Provided by the Natural Language Processing Group, Microsoft Research.
- RTE data sets annotated for a 3-way decision: entails, contradicts, unknown. Provided by Stanford NLP Group.
- BPI RTE data set - 250 pairs, focusing on world knowledge. Provided jointly by Boeing, Princeton, and ISI.
Knowledge Resources
- DIRT Paraphrase Collection
- Downward entailing operators A partial list provided by the Cornell NLP group.
- FrameNet
- Sekine's Paraphrase Database
- TEASE Entailment Rule Collection
- VerbOcean
- WordNet
Tools
Parsers
- C&C parser for Combinatory Categorial Grammar
- Minipar
- Shallow Parser - from the University of Illinois at Urbana-Champaign, see a web demo of this tool
Role Labelling
- ASSERT
- Shalmaneser
- Semantic Role Labeler - from the University of Illinois at Urbana-Champaign, see a web demo of this tool
Entity Recognition Tools
- CCG Named Entity Tagger - see a web demo of this tool
- CCG Multi-lingual Named Entity Discovery Tool - see a web demo of this tool
Corpus Readers
- NLTK provides a corpus reader for the data from RTE Challenges 1, 2, and 3 - see the Corpus Readers Guide for more information.