Difference between revisions of "RTE5 - Ablation Tests"

From ACL Wiki
(New page: === Publicly available Resources === {|class="wikitable sortable" cellpadding="3" cellspacing="0" style="margin-left: 20px;" border="1" |- bgcolor="#CDCDCD" ! Resource ! Type ! Author ! cl...)
 
Line 147:
  | Extraction of lexical reference rules from the text body (first sentence) and from metadata (links, redirects, parentheses) of Wikipedia
  | style="text-align: center;"|<small>1 - RTE4</small> <br> <small>0 - RTE3</small>
- | [[Lexical reference rules - RTE Users|Users]]
+ | [[WikiRules! - RTE Users|Users]]
  |- bgcolor="#ECECEC" "align="left"
  | ''New resource''

Revision as of 07:30, 23 November 2009

=== Publicly available Resources ===

{|class="wikitable sortable" cellpadding="3" cellspacing="0" style="margin-left: 20px;" border="1"
|- bgcolor="#CDCDCD"
! Resource !! Type !! Author !! Brief description !! RTE Users* !! Usage info
|-
| WordNet || Lexical DB || Princeton University || Lexical database of English nouns, verbs, adjectives and adverbs || 21 - RTE4 <br> 3 - RTE3 || Users
|-
| VerbNet || Lexical DB || University of Colorado Boulder || Lexicon of English verbs organized into classes that extend the Levin (1993) classes through refinement and the addition of subclasses, to achieve syntactic and semantic coherence among the members of a class || 2 - RTE4 <br> 2 - RTE3 || Users
|-
| VerbOcean || Lexical DB || Information Sciences Institute, University of Southern California || Broad-coverage semantic network of verbs || 4 - RTE4 <br> 1 - RTE3 || Users
|-
| FrameNet || Lexical DB || ICSI (International Computer Science Institute) - University of California, Berkeley || Lexical resource for English words, based on frame semantics (valences) and supported by corpus evidence || 1 - RTE4 <br> 1 - RTE3 || Users
|-
| NomBank || Lexical DB || New York University || Lexical resource containing syntactic frames for nouns, extracted from annotated corpora || 1 - RTE4 <br> 2 - RTE3 || Users
|-
| PropBank || Lexical DB || University of Colorado Boulder || Lexical resource containing syntactic frames for verbs, extracted from annotated corpora || 1 - RTE4 <br> 2 - RTE3 || Users
|-
| Nomlex Plus || Lexical DB || New York University || Dictionary of English nominalizations: it describes the allowed complements for a nominalization and relates the nominal complements to the arguments of the corresponding verb || 1 - RTE4 <br> 0 - RTE3 || Users
|-
| Wikipedia || Encyclopedia || || Free encyclopedia. Used for extraction of lexical-semantic rules (from its more structured parts), named entity recognition, geographical information, etc. || 3 - RTE4 <br> 0 - RTE3 || Users
|-
| TEASE Collection || Collection of Entailment Rules || Bar-Ilan University || Output of the TEASE algorithm || 0 - RTE4 <br> 0 - RTE3 || Users
|-
| BADC Acronym and Abbreviation List || Word List || BADC (British Atmospheric Data Centre) || Acronym and Abbreviation List || 1 - RTE4 <br> 0 - RTE3 || Users
|-
| Acronym Guide || Word List || Acronym-Guide.com || Acronym and abbreviation lists for English, organized into thematic directories || 1 - RTE4 <br> 0 - RTE3 || Users
|-
| Dekang Lin’s Thesaurus || Thesaurus || University of Alberta || Thesaurus automatically constructed from a parsed corpus, based on distributional similarity scores || 1 - RTE4 <br> 0 - RTE3 || Users
|-
| Roget's Thesaurus || Thesaurus || Peter Mark Roget (electronic version distributed by University of Chicago) || Widely used English thesaurus, created by Dr. Peter Mark Roget in 1805. The original edition had 15,000 words, and each new edition has been larger. The electronic edition (version 1.02) is made available by University of Chicago. || 0 - RTE4 <br> 1 - RTE3 || Users
|-
| Web1T 5-grams || Word List || Linguistic Data Consortium, University of Pennsylvania; Google Inc. || Data set containing English word n-grams and their observed frequency counts. The n-gram counts were generated from approximately 1 trillion word tokens of text from publicly accessible Web pages. || 1 - RTE4 <br> 0 - RTE3 || Users
|-
| GNIS - Geographic Names Information System || Gazetteer || USGS (United States Geological Survey) || Database containing the Federal and national standard toponyms for the USA, associated areas and Antarctica || 1 - RTE4 <br> 0 - RTE3 || Users
|-
| Geonames || Gazetteer || || Database containing eight million geographical names. It integrates geographical data such as place names in various languages, elevation, population and others from various sources. || 1 - RTE4 <br> 0 - RTE3 || Users
|-
| Sekine's Paraphrase Database || Collection of paraphrases || Department of Computer Science, New York University || Database created using Sekine's method, NOT cleaned up by a human. It includes 19,975 sets of paraphrases with 191,572 phrases. || 0 - RTE4 <br> 0 - RTE3 || Users
|-
| Microsoft Research Paraphrase Corpus || Collection of paraphrases || Microsoft Research || Text file containing 5800 pairs of sentences extracted from news sources on the web, along with human annotations indicating whether each pair captures a paraphrase/semantic equivalence relationship || 0 - RTE4 <br> 0 - RTE3 || Users
|-
| Downward entailing operators || Collection of entailing operators || Department of Computer Science, Cornell University, Ithaca NY || System output of an unsupervised algorithm that recovers many Downward Entailing operators, like 'doubt' || 0 - RTE4 <br> 0 - RTE3 || Users
|-
| WikiRules! || Lexical Reference rule-base || Bar-Ilan University || Extraction of lexical reference rules from the text body (first sentence) and from metadata (links, redirects, parentheses) of Wikipedia || 1 - RTE4 <br> 0 - RTE3 || Users
|- bgcolor="#ECECEC"
| ''New resource'' || || || Participants are encouraged to contribute || || Users
|}