Difference between revisions of "RTE5 - Ablation Tests"

From ACL Wiki
Jump to navigation Jump to search
(Removing all content from page)
Line 1: Line 1:
=== Publicly available Resources ===
+
 
{|class="wikitable sortable" cellpadding="3" cellspacing="0" style="margin-left: 20px;" border="1"
 
|- bgcolor="#CDCDCD"
 
! Resource
 
! Type
 
! Author
 
! class="unsortable"|Brief description
 
! RTE Users*
 
! class="unsortable"|Usage info
 
|- bgcolor="#ECECEC" "align="left"
 
| [[WordNet]]
 
| Lexical DB
 
| Princeton University
 
| Lexical database of English nouns, verbs, adjectives and adverbs
 
| style="text-align: center;"|<small>21 - RTE4</small> <br> <small>3 - RTE3</small>
 
| [[WordNet - RTE Users|Users]]
 
|- bgcolor="#ECECEC" "align="left"
 
| [http://verbs.colorado.edu/~mpalmer/projects/verbnet.html Verbnet]
 
| Lexical DB
 
| University of Colorado Boulder
 
| Lexicon for English verbs organized into classes extending Levin (1993) classes through refinement and addition of subclasses to achieve syntactic and semantic coherence among members of a class
 
| style="text-align: center;"|<small>2 - RTE4</small> <br> <small>2 - RTE3</small>
 
| [[Verbnet - RTE Users|Users]]
 
|- bgcolor="#ECECEC" "align="left"
 
| [[VerbOcean]]
 
| Lexical DB
 
| Information Sciences Institute, University of Southern California
 
| Broad-coverage semantic network of verbs
 
| style="text-align: center;"|<small>4 - RTE4</small> <br> <small>1 - RTE3</small>
 
| [[VerbOcean - RTE Users|Users]]
 
|- bgcolor="#ECECEC" "align="left"
 
| [http://framenet.icsi.berkeley.edu/ FrameNet]
 
| Lexical DB
 
| ICSI (International Computer Science Institute) - Berkley University
 
| Lexical resource for English words, based on frame semantics (valences) and supported by corpus evidence
 
| style="text-align: center;"|<small>1 - RTE4</small> <br> <small>1 - RTE3</small>
 
| [[Framenet - RTE Users|Users]]
 
|- bgcolor="#ECECEC" "align="left"
 
| [http://nlp.cs.nyu.edu/meyers/NomBank.html NomBank]
 
| Lexical DB
 
| New York University
 
| Lexical resource containing syntactic frames for nouns, extracted from annotated corpora
 
| style="text-align: center;"|<small>1 - RTE4</small> <br> <small>2 - RTE3</small>
 
| [[NomBank Resource - RTE Users|Users]]
 
|- bgcolor="#ECECEC" "align="left"
 
| [http://verbs.colorado.edu/~mpalmer/projects/ace.html PropBank]
 
| Lexical DB
 
| University of Colorado Boulder
 
| Lexical resource containing syntactic frames for verbs, extracted from annotated corpora
 
| style="text-align: center;"|<small>1 - RTE4</small> <br> <small>2 - RTE3</small>
 
| [[PropBank Resource - RTE Users|Users]]
 
|- bgcolor="#ECECEC" "align="left"
 
| [http://nlp.cs.nyu.edu/nomlex/index.html Nomlex] Plus
 
| Lexical DB
 
| New York University
 
| Dictionary of English nominalizations: it describes the allowed complements for a nominalization and relates the nominal complements to the arguments of the corresponding verb
 
| style="text-align: center;"|<small>1 - RTE4</small> <br> <small>0 - RTE3</small>
 
| [[Nomlex Plus - RTE Users|Users]]
 
|- bgcolor="#ECECEC" "align="left"
 
| [http://www.wikipedia.org/ Wikipedia]
 
| Encyclopedia
 
|
 
| Free encyclopedia. Used for extraction of lexical-semantic rules (from its more structured parts), named entity recognition, geographical information etc.
 
| style="text-align: center;"|<small>3 - RTE4</small> <br> <small>0 - RTE3</small>
 
| [[Wikipedia - RTE Users|Users]]
 
|- bgcolor="#ECECEC" "align="left"
 
| [[TEASE]] Collection
 
| Collection of Entailment Rules
 
| Bar-Ilan University
 
| Output of the TEASE algorithm
 
| style="text-align: center;"|<small>0 - RTE4</small> <br> <small>0 - RTE3</small>
 
| [[Tease Collection - RTE Users|Users]]
 
|- bgcolor="#ECECEC" "align="left"
 
| [http://badc.nerc.ac.uk/help/abbrevs.html BADC Acronym and Abbreviation List]
 
| Word List
 
| BADC (British Atmospheric Data Centre)
 
| Acronym and Abbreviation List
 
| style="text-align: center;"|<small>1 - RTE4</small> <br> <small>0 - RTE3</small>
 
| [[BADC Acronym and Abbreviation List - RTE Users|Users]]
 
|- bgcolor="#ECECEC" "align="left"
 
| [http://www.acronym-guide.com/ Acronym Guide]
 
| Word List
 
| Acronym-Guide.com
 
| Acronym and Abbreviation Lists for English, branched in thematic directories
 
| style="text-align: center;"|<small>1 - RTE4</small> <br> <small>0 - RTE3</small>
 
| [[Acronym Guide - RTE Users|Users]]
 
|- bgcolor="#ECECEC" "align="left"
 
| [http://www.cs.ualberta.ca/~lindek/downloads.htm Dekang Lin’s Thesaurus]
 
| Thesaurus
 
| University of Alberta
 
| Thesaurus automatically constructed using a parsed corpus, based on distributional similarity scores
 
| style="text-align: center;"|<small>1 - RTE4</small> <br> <small>0 - RTE3</small>
 
| [[Dekang Lin’s Thesaurus - RTE Users|Users]]
 
|- bgcolor="#ECECEC" "align="left"
 
| [http://en.wikipedia.org/wiki/Roget%27s_Thesaurus Roget's Thesaurus]
 
| Thesaurus
 
| Peter Mark Roget (Electronic version distributed by University of Chicago)
 
| Roget's Thesaurus is a widely-used English thesaurus, created by Dr. Peter Mark Roget in 1805. The original edition had 15,000 words, and each new edition has been larger. The electronic edition ([http://machaut.uchicago.edu/rogets version 1.02]) is made available by University of Chicago.
 
| style="text-align: center;"|<small>0 - RTE4</small> <br> <small>1 - RTE3</small>
 
| [[Roget's Thesaurus - RTE Users|Users]]
 
|- bgcolor="#ECECEC" "align="left"
 
| [http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC2006T13 Web1T 5-grams]
 
| Word list
 
| Linguistic Data Consortium, University of Pennsylvania; Google Inc.
 
| Data set containing English word n-grams and their observed frequency counts. The n-gram counts were generated from approximately 1 trillion word tokens of text from publicly accessible Web pages
 
| style="text-align: center;"|<small>1 - RTE4</small> <br> <small>0 - RTE3</small>
 
| [[Web1T - RTE Users|Users]]
 
|- bgcolor="#ECECEC" "align="left"
 
| [http://geonames.usgs.gov/index.html GNIS - Geographic Names Information System]
 
| Gazetteer
 
| USGS (United States Geological Survey)
 
| Database containing the Federal and national standard toponyms for USA, associated areas and Antarctica
 
| style="text-align: center;"|<small>1 - RTE4</small> <br> <small>0 - RTE3</small>
 
| [[GNIS - RTE Users|Users]]
 
|- bgcolor="#ECECEC" "align="left"
 
| [http://www.geonames.org/ Geonames]
 
| Gazetteer
 
|
 
| Database containing eight million geographical names. It is integrating geographical data such as names of places in various languages, elevation, population and others from various sources.
 
| style="text-align: center;"|<small>1 - RTE4</small> <br> <small>0 - RTE3</small>
 
| [[Geonames - RTE Users|Users]]
 
|- bgcolor="#ECECEC" "align="left"
 
| [http://nlp.cs.nyu.edu/paraphrase/ Sekine's Paraphrase Database]
 
| Collection of paraphrases
 
| Department of Computer Science, New York University
 
| Data-base created using Sekine's method, NOT cleaned up by human. It includes 19,975 sets of paraphrases with 191,572 phrases.
 
| style="text-align: center;"| <small>0 - RTE4</small> <br> <small>0 - RTE3</small>
 
| [[Sekine's Paraphrase Database - RTE Users|Users]]
 
|- bgcolor="#ECECEC" "align="left"
 
| [http://research.microsoft.com/research/downloads/Details/607D14D9-20CD-47E3-85BC-A2F65CD28042/Details.aspx Microsoft Research Paraphrase Corpus]
 
| Collection of paraphrases
 
| Microsoft Research
 
| Text file containing 5800 pairs of sentences which have been extracted from news sources on the web, along with human annotations indicating whether each pair captures a paraphrase/semantic equivalence relationship.
 
| style="text-align: center;"| <small>0 - RTE4</small> <br> <small>0 - RTE3</small>
 
| [[Microsoft Research Paraphrase Corpus - RTE Users|Users]]
 
|- bgcolor="#ECECEC" "align="left"
 
| [http://www.cs.cornell.edu/~cristian/Without_a_doubt_-_Data.html Downward entailing operators]
 
| Collection of entailing operators
 
| Department of Computer Science, Cornell University, Ithaca NY
 
| System output of an unsupervised algorithm recovering many Downward Entailing operators, like 'doubt'.
 
| style="text-align: center;"| <small>0 - RTE4</small> <br> <small>0 - RTE3</small>
 
| [[Downward entailing operators - RTE Users|Users]]
 
|- bgcolor="#ECECEC" "align="left"
 
|[http://cs.biu.ac.il/~shey/WikiRules.html WikiRules!]  <br>
 
| Lexical Reference rule-base
 
| Bar-Ilan University
 
| Extraction of lexical reference rules from the text body (first sentence) and from metadata (links, redirects, parentheses) of Wikipedia
 
| style="text-align: center;"|<small>1 - RTE4</small> <br> <small>0 - RTE3</small>
 
| [[WikiRules! - RTE Users|Users]]
 
|- bgcolor="#ECECEC" "align="left"
 
| ''New resource''
 
|
 
|
 
| ''Participants are encouraged to contribute''
 
| style="text-align: center;"|
 
| [[New Resource1 - RTE Users|Users]]
 
|- bgcolor="#ECECEC" "align="left"
 
| ''New resource''
 
|
 
|
 
| ''Participants are encouraged to contribute''
 
| style="text-align: center;"|
 
| [[New Resource2 - RTE Users|Users]]
 
|}
 
<br>
 
<br>
 

Revision as of 04:42, 24 November 2009