Difference between revisions of "RTE Knowledge Resources"
Jump to navigation
Jump to search
m |
m |
||
Line 166: | Line 166: | ||
| [[Polarity rule base RTE Users|Users]] | | [[Polarity rule base RTE Users|Users]] | ||
|- bgcolor="#ECECEC" "align="left" | |- bgcolor="#ECECEC" "align="left" | ||
− | | | + | | Resources extracted from Parsed Corpora |
| Collections | | Collections | ||
| | | |
Revision as of 10:52, 8 April 2009
The table below lists the knowledge resources used by participants in the last RTE challenges. Other important RTE resources have been added in order to encourage people to add information about potential usage.
The table is sortable by Resource name, type, author and number of users.
Resource | Type | Author | Brief description | # Users* | Usage info |
---|---|---|---|---|---|
WordNet | Lexical | Princeton University | Lexical database of English nouns, verbs, adjectives and adverbs | 23 | Users |
Verbnet | Lexical | University of Colorado Boulder | On-line lexicon for English verbs organized into classes | 3 | Users |
VerbOcean | Lexical | University of Southern California | Broad-coverage semantic network of verbs | 5 | Users |
FrameNet | Lexical | ICSI (International Computer Science Institute) - Berkley University | On-line lexical resource for English words, based on frame semantics (valences) and supported by corpus evidence | 2 | Users |
NomBank Resources | Lexical | New York University | Lexical Resources containing syntactic frames for nouns, extracted from annotated corpora | 2 | Users |
Nomlex Plus | Lexical | New York University | Dictionary of English nominalizations: it describes the allowed complements for a nominalization and relates the nominal complements to the arguments of the corresponding verb | 1 | Users |
Parc Polarity Lexicon | Lexical | PARC - Palo Alto Research Center | Verbs classification with respect to semantic polarity | 1 | Users |
DIRT Paraphrase Collection | Collections of paraphrases (DIRT output) | Various | Output of the DIRT algorithm | 4 | Users |
PropBank Resources | Lexical | University of Colorado Boulder | Lexical Resources containing syntactic frames for verbs, extracted from annotated corpora | 2 | Users |
TEASE Collection | Syntactic-semantic | Bar Ilan University | Collection of Entailment Rules | 0 | Users |
BADC Acronym and Abbreviation List | Word List | BADC - British Atmospheric Data Centre | Acronym and Abbreviation List | 1 | Users |
Acronym Guide | Word List | Acronym-Guide.com | Acronym and Abbreviation Lists for English, branched in thematic directories | 1 | Users |
Dekang Lin’s Thesaurus | Thesaurus | University of Alberta | Thesaurus automatically constructed using a parsed corpus, based on distributional similarity scores | 1 | Users |
Roget's Thesaurus | Thesaurus | Peter Mark Roget | Roget's Thesaurus is a widely-used English thesaurus, created by Dr. Peter Mark Roget in 1805. The original edition had 15,000 words, and each new edition has been larger. The 1911 US edition (version 1.02) is made available by University of Chicago. | 1 | Users |
Web1T 5-grams | Word list | Google Inc. | Data set containing English word n-grams and their observed frequency counts. The n-gram counts were generated from approximately 1 trillion word tokens of text from publicly accessible Web pages | 1 | Users |
Wikipedia | Encyclopedia | Free encyclopedia. Used for extraction of lexical-semantic rules (from its more structured parts), named entity recognition, geographical information ecc. | 3 | Users | |
GNIS - Geographic Names Information System | Word List | USGS - United States Geological Survey | Database containing the Federal and national standard toponyms for USA, associated areas and Antarctica. | 1 | Users |
Geonames | Word List | Database containing eight million geographical names. It is integrating geographical data such as names of places in various languages, elevation, population and others from various sources. | 1 | Users | |
Gazetteer from TREC | Gazetteer | NIST - National Institute of Standards and Technology | Cities and other geographical names | 1 | Users |
Geographic Ontology | Gazetter | University of West Florida | Hierarchical data structure that allows the storage of natural and man-made feature data for use in a multitude of both manual and computerized Mapping, Charting & Geodesy systems. | 1 | Users |
Syntactic rule base | Entailment Rules' Collection | Bar-Ilan University | A manually-composed collection of entailment rules which define parse tree transformations. The rules cover generic syntactic phenomena such as appositions, conjunctions, passive, relative clause, etc. (Bar-Haim et al., AAAI-07) | 1 | Users |
Polarity rule base | Entailment Rules' Collection | Bar-Ilan University | A manually-composed collection of entailment rules which detect predicates whose polarity is negative (e.g. didn't dance) or unknown (e.g. plans to dance). The rules capture diverse phenomena that affect polarity, e.g. verbal negation, modal verbs, conditionals, and certain verbs that induce negative or "unknown" polarity context. The latter were taken mainly from VerbNet, and also from the PARC polarity lexicon. It extends a resource described in (Bar-Haim et al., AAAI-07) | 1 | Users |
Resources extracted from Parsed Corpora | Collections | Collections of rules, patterns etc. extracted from parsed corpora for RTE purpose. | 1 | Users | |
New resource (Participants are encouraged to contribute) |
Users | ||||
New resource (Participants are encouraged to contribute) |
Users |
[*] The numbers refer to the Users in RTE4 (data extracted both from related proceedings and from RTE Knowledge Resources Questionnaire) and in RTE3 (data extracted only from RTE Knowledge Resources Questionnaire) challenges.