RTE Knowledge Resources

From ACL Wiki
Revision as of 03:20, 8 April 2009 by Celct (talk | contribs)
Jump to navigation Jump to search

The table below lists the knowledge resources used by participants in the last RTE challenges. Other important RTE resources have been added in order to encourage people to add information about potential usage.
The table is sortable by Resource name, type, author and number of users.

Resource Type Author Brief description # Users* Usage info
WordNet Lexical Princeton University Lexical database of English nouns, verbs, adjectives and adverbs 23 Users
Verbnet Lexical University of Colorado Boulder On-line lexicon for English verbs organized into classes 3 Users
VerbOcean Lexical University of Southern California Broad-coverage semantic network of verbs 5 Users
FrameNet Lexical ICSI (International Computer Science Institute) - Berkley University On-line lexical resource for English words, based on frame semantics (valences) and supported by corpus evidence 2 Users
NomBank Resources Lexical New York University Lexical Resources containing syntactic frames for nouns, extracted from annotated corpora 2 Users
Nomlex Plus Lexical New York University Dictionary of English nominalizations: it describes the allowed complements for a nominalization and relates the nominal complements to the arguments of the corresponding verb 1 Users
Parc Polarity Lexicon Lexical PARC - Palo Alto Research Center Verbs classification with respect to semantic polarity 1 Users
DIRT Paraphrase Collection Collections of paraphrases (DIRT output) Various Output of the DIRT algorithm 4 Users
PropBank Resources Lexical University of Colorado Boulder Lexical Resources containing syntactic frames for verbs, extracted from annotated corpora 2 Users
TEASE Collection Syntactic-semantic Bar Ilan University Collection of Entailment Rules 0 Users
BADC Acronym and Abbreviation List Word List BADC - British Atmospheric Data Centre Acronym and Abbreviation List 1 Users
Acronym Guide Word List Acronym-Guide.com Acronym and Abbreviation Lists for English, branched in thematic directories 1 Users
Dekang Lin’s Thesaurus Thesaurus University of Alberta Thesaurus automatically constructed using a parsed corpus, based on distributional similarity scores 1 Users
Roget's Thesaurus Thesaurus Peter Mark Roget Roget's Thesaurus is a widely-used English thesaurus, created by Dr. Peter Mark Roget in 1805. The original edition had 15,000 words, and each new edition has been larger. The 1911 US edition (version 1.02) is made available by University of Chicago. Users
Web1T 5-grams Word list Google Inc. Data set containing English word n-grams and their observed frequency counts. The n-gram counts were generated from approximately 1 trillion word tokens of text from publicly accessible Web pages 1 Users
Wikipedia Encyclopedia Free encyclopedia. Used for extraction of lexical-semantic rules (from its more structured parts), named entity recognition, geographical information ecc. 3 Users
GNIS - Geographic Names Information System Word List USGS - United States Geological Survey Database containing the Federal and national standard toponyms for USA, associated areas and Antarctica. 1 Users
Geonames Word List Database containing eight million geographical names. It is integrating geographical data such as names of places in various languages, elevation, population and others from various sources. 1 Users
Users
Users
Users
Users


[*] The numbers refer to the Users in RTE4 (data extracted both from related proceedings and from RTE Knowledge Resources Questionnaire) and in RTE3 (data extracted only from RTE Knowledge Resources Questionnaire) challenges.