RTE5 - Ablation Tests: Difference between revisions

From ACL Wiki
Jump to navigation Jump to search
Celct (talk | contribs)
Celct (talk | contribs)
Removing all content from page
Line 1: Line 1:
=== Publicly available Resources ===
 
{|class="wikitable sortable" cellpadding="3" cellspacing="0" style="margin-left: 20px;" border="1"
|- bgcolor="#CDCDCD"
! Resource
! Type
! Author
! class="unsortable"|Brief description
! RTE Users*
! class="unsortable"|Usage info
|- bgcolor="#ECECEC" "align="left"
| [[WordNet]]
| Lexical DB
| Princeton University
| Lexical database of English nouns, verbs, adjectives and adverbs
| style="text-align: center;"|<small>21 - RTE4</small> <br> <small>3 - RTE3</small>
| [[WordNet - RTE Users|Users]]
|- bgcolor="#ECECEC" "align="left"
| [http://verbs.colorado.edu/~mpalmer/projects/verbnet.html Verbnet]
| Lexical DB
| University of Colorado Boulder
| Lexicon for English verbs organized into classes extending Levin (1993) classes through refinement and addition of subclasses to achieve syntactic and semantic coherence among members of a class
| style="text-align: center;"|<small>2 - RTE4</small> <br> <small>2 - RTE3</small>
| [[Verbnet - RTE Users|Users]]
|- bgcolor="#ECECEC" "align="left"
| [[VerbOcean]]
| Lexical DB
| Information Sciences Institute, University of Southern California
| Broad-coverage semantic network of verbs
| style="text-align: center;"|<small>4 - RTE4</small> <br> <small>1 - RTE3</small>
| [[VerbOcean - RTE Users|Users]]
|- bgcolor="#ECECEC" "align="left"
| [http://framenet.icsi.berkeley.edu/ FrameNet]
| Lexical DB
| ICSI (International Computer Science Institute) - Berkley University
| Lexical resource for English words, based on frame semantics (valences) and supported by corpus evidence
| style="text-align: center;"|<small>1 - RTE4</small> <br> <small>1 - RTE3</small>
| [[Framenet - RTE Users|Users]]
|- bgcolor="#ECECEC" "align="left"
| [http://nlp.cs.nyu.edu/meyers/NomBank.html NomBank]
| Lexical DB
| New York University
| Lexical resource containing syntactic frames for nouns, extracted from annotated corpora
| style="text-align: center;"|<small>1 - RTE4</small> <br> <small>2 - RTE3</small>
| [[NomBank Resource - RTE Users|Users]]
|- bgcolor="#ECECEC" "align="left"
| [http://verbs.colorado.edu/~mpalmer/projects/ace.html PropBank]
| Lexical DB
| University of Colorado Boulder
| Lexical resource containing syntactic frames for verbs, extracted from annotated corpora
| style="text-align: center;"|<small>1 - RTE4</small> <br> <small>2 - RTE3</small>
| [[PropBank Resource - RTE Users|Users]]
|- bgcolor="#ECECEC" "align="left"
| [http://nlp.cs.nyu.edu/nomlex/index.html Nomlex] Plus
| Lexical DB
| New York University
| Dictionary of English nominalizations: it describes the allowed complements for a nominalization and relates the nominal complements to the arguments of the corresponding verb
| style="text-align: center;"|<small>1 - RTE4</small> <br> <small>0 - RTE3</small>
| [[Nomlex Plus - RTE Users|Users]]
|- bgcolor="#ECECEC" "align="left"
| [http://www.wikipedia.org/ Wikipedia]
| Encyclopedia
|
| Free encyclopedia. Used for extraction of lexical-semantic rules (from its more structured parts), named entity recognition, geographical information etc.
| style="text-align: center;"|<small>3 - RTE4</small> <br> <small>0 - RTE3</small>
| [[Wikipedia - RTE Users|Users]]
|- bgcolor="#ECECEC" "align="left"
| [[TEASE]] Collection
| Collection of Entailment Rules
| Bar-Ilan University
| Output of the TEASE algorithm
| style="text-align: center;"|<small>0 - RTE4</small> <br> <small>0 - RTE3</small>
| [[Tease Collection - RTE Users|Users]]
|- bgcolor="#ECECEC" "align="left"
| [http://badc.nerc.ac.uk/help/abbrevs.html BADC Acronym and Abbreviation List]
| Word List
| BADC (British Atmospheric Data Centre)
| Acronym and Abbreviation List
| style="text-align: center;"|<small>1 - RTE4</small> <br> <small>0 - RTE3</small>
| [[BADC Acronym and Abbreviation List - RTE Users|Users]]
|- bgcolor="#ECECEC" "align="left"
| [http://www.acronym-guide.com/ Acronym Guide]
| Word List
| Acronym-Guide.com
| Acronym and Abbreviation Lists for English, branched in thematic directories
| style="text-align: center;"|<small>1 - RTE4</small> <br> <small>0 - RTE3</small>
| [[Acronym Guide - RTE Users|Users]]
|- bgcolor="#ECECEC" "align="left"
| [http://www.cs.ualberta.ca/~lindek/downloads.htm Dekang Lin’s Thesaurus]
| Thesaurus
| University of Alberta
| Thesaurus automatically constructed using a parsed corpus, based on distributional similarity scores
| style="text-align: center;"|<small>1 - RTE4</small> <br> <small>0 - RTE3</small>
| [[Dekang Lin’s Thesaurus - RTE Users|Users]]
|- bgcolor="#ECECEC" "align="left"
| [http://en.wikipedia.org/wiki/Roget%27s_Thesaurus Roget's Thesaurus]
| Thesaurus
| Peter Mark Roget (Electronic version distributed by University of Chicago)
| Roget's Thesaurus is a widely-used English thesaurus, created by Dr. Peter Mark Roget in 1805. The original edition had 15,000 words, and each new edition has been larger. The electronic edition ([http://machaut.uchicago.edu/rogets version 1.02]) is made available by University of Chicago.
| style="text-align: center;"|<small>0 - RTE4</small> <br> <small>1 - RTE3</small>
| [[Roget's Thesaurus - RTE Users|Users]]
|- bgcolor="#ECECEC" "align="left"
| [http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC2006T13 Web1T 5-grams]
| Word list
| Linguistic Data Consortium, University of Pennsylvania; Google Inc.
| Data set containing English word n-grams and their observed frequency counts. The n-gram counts were generated from approximately 1 trillion word tokens of text from publicly accessible Web pages
| style="text-align: center;"|<small>1 - RTE4</small> <br> <small>0 - RTE3</small>
| [[Web1T - RTE Users|Users]]
|- bgcolor="#ECECEC" "align="left"
| [http://geonames.usgs.gov/index.html GNIS - Geographic Names Information System]
| Gazetteer
| USGS (United States Geological Survey)
| Database containing the Federal and national standard toponyms for USA, associated areas and Antarctica
| style="text-align: center;"|<small>1 - RTE4</small> <br> <small>0 - RTE3</small>
| [[GNIS - RTE Users|Users]]
|- bgcolor="#ECECEC" "align="left"
| [http://www.geonames.org/ Geonames]
| Gazetteer
|
| Database containing eight million geographical names. It is integrating geographical data such as names of places in various languages, elevation, population and others from various sources.
| style="text-align: center;"|<small>1 - RTE4</small> <br> <small>0 - RTE3</small>
| [[Geonames - RTE Users|Users]]
|- bgcolor="#ECECEC" "align="left"
| [http://nlp.cs.nyu.edu/paraphrase/ Sekine's Paraphrase Database]
| Collection of paraphrases
| Department of Computer Science, New York University
| Data-base created using Sekine's method, NOT cleaned up by human. It includes 19,975 sets of paraphrases with 191,572 phrases.
| style="text-align: center;"| <small>0 - RTE4</small> <br> <small>0 - RTE3</small>
| [[Sekine's Paraphrase Database - RTE Users|Users]]
|- bgcolor="#ECECEC" "align="left"
| [http://research.microsoft.com/research/downloads/Details/607D14D9-20CD-47E3-85BC-A2F65CD28042/Details.aspx Microsoft Research Paraphrase Corpus]
| Collection of paraphrases
| Microsoft Research
| Text file containing 5800 pairs of sentences which have been extracted from news sources on the web, along with human annotations indicating whether each pair captures a paraphrase/semantic equivalence relationship.
| style="text-align: center;"| <small>0 - RTE4</small> <br> <small>0 - RTE3</small>
| [[Microsoft Research Paraphrase Corpus - RTE Users|Users]]
|- bgcolor="#ECECEC" "align="left"
| [http://www.cs.cornell.edu/~cristian/Without_a_doubt_-_Data.html Downward entailing operators]
| Collection of entailing operators
| Department of Computer Science, Cornell University, Ithaca NY
| System output of an unsupervised algorithm recovering many Downward Entailing operators, like 'doubt'.
| style="text-align: center;"| <small>0 - RTE4</small> <br> <small>0 - RTE3</small>
| [[Downward entailing operators - RTE Users|Users]]
|- bgcolor="#ECECEC" "align="left"
|[http://cs.biu.ac.il/~shey/WikiRules.html WikiRules!]  <br>
| Lexical Reference rule-base
| Bar-Ilan University
| Extraction of lexical reference rules from the text body (first sentence) and from metadata (links, redirects, parentheses) of Wikipedia
| style="text-align: center;"|<small>1 - RTE4</small> <br> <small>0 - RTE3</small>
| [[WikiRules! - RTE Users|Users]]
|- bgcolor="#ECECEC" "align="left"
| ''New resource''
|
|
| ''Participants are encouraged to contribute''
| style="text-align: center;"|
| [[New Resource1 - RTE Users|Users]]
|- bgcolor="#ECECEC" "align="left"
| ''New resource''
|
|
| ''Participants are encouraged to contribute''
| style="text-align: center;"|
| [[New Resource2 - RTE Users|Users]]
|}
<br>
<br>

Revision as of 10:42, 24 November 2009