|
|
| Line 1: |
Line 1: |
| === Publicly available Resources ===
| | |
| {|class="wikitable sortable" cellpadding="3" cellspacing="0" style="margin-left: 20px;" border="1"
| |
| |- bgcolor="#CDCDCD"
| |
| ! Resource
| |
| ! Type
| |
| ! Author
| |
| ! class="unsortable"|Brief description
| |
| ! RTE Users*
| |
| ! class="unsortable"|Usage info
| |
| |- bgcolor="#ECECEC" "align="left"
| |
| | [[WordNet]]
| |
| | Lexical DB
| |
| | Princeton University
| |
| | Lexical database of English nouns, verbs, adjectives and adverbs
| |
| | style="text-align: center;"|<small>21 - RTE4</small> <br> <small>3 - RTE3</small>
| |
| | [[WordNet - RTE Users|Users]]
| |
| |- bgcolor="#ECECEC" "align="left"
| |
| | [http://verbs.colorado.edu/~mpalmer/projects/verbnet.html Verbnet]
| |
| | Lexical DB
| |
| | University of Colorado Boulder
| |
| | Lexicon for English verbs organized into classes extending Levin (1993) classes through refinement and addition of subclasses to achieve syntactic and semantic coherence among members of a class
| |
| | style="text-align: center;"|<small>2 - RTE4</small> <br> <small>2 - RTE3</small>
| |
| | [[Verbnet - RTE Users|Users]]
| |
| |- bgcolor="#ECECEC" "align="left"
| |
| | [[VerbOcean]]
| |
| | Lexical DB
| |
| | Information Sciences Institute, University of Southern California
| |
| | Broad-coverage semantic network of verbs
| |
| | style="text-align: center;"|<small>4 - RTE4</small> <br> <small>1 - RTE3</small>
| |
| | [[VerbOcean - RTE Users|Users]]
| |
| |- bgcolor="#ECECEC" "align="left"
| |
| | [http://framenet.icsi.berkeley.edu/ FrameNet]
| |
| | Lexical DB
| |
| | ICSI (International Computer Science Institute) - Berkley University
| |
| | Lexical resource for English words, based on frame semantics (valences) and supported by corpus evidence
| |
| | style="text-align: center;"|<small>1 - RTE4</small> <br> <small>1 - RTE3</small>
| |
| | [[Framenet - RTE Users|Users]]
| |
| |- bgcolor="#ECECEC" "align="left"
| |
| | [http://nlp.cs.nyu.edu/meyers/NomBank.html NomBank]
| |
| | Lexical DB
| |
| | New York University
| |
| | Lexical resource containing syntactic frames for nouns, extracted from annotated corpora
| |
| | style="text-align: center;"|<small>1 - RTE4</small> <br> <small>2 - RTE3</small>
| |
| | [[NomBank Resource - RTE Users|Users]]
| |
| |- bgcolor="#ECECEC" "align="left"
| |
| | [http://verbs.colorado.edu/~mpalmer/projects/ace.html PropBank]
| |
| | Lexical DB
| |
| | University of Colorado Boulder
| |
| | Lexical resource containing syntactic frames for verbs, extracted from annotated corpora
| |
| | style="text-align: center;"|<small>1 - RTE4</small> <br> <small>2 - RTE3</small>
| |
| | [[PropBank Resource - RTE Users|Users]]
| |
| |- bgcolor="#ECECEC" "align="left"
| |
| | [http://nlp.cs.nyu.edu/nomlex/index.html Nomlex] Plus
| |
| | Lexical DB
| |
| | New York University
| |
| | Dictionary of English nominalizations: it describes the allowed complements for a nominalization and relates the nominal complements to the arguments of the corresponding verb
| |
| | style="text-align: center;"|<small>1 - RTE4</small> <br> <small>0 - RTE3</small>
| |
| | [[Nomlex Plus - RTE Users|Users]]
| |
| |- bgcolor="#ECECEC" "align="left"
| |
| | [http://www.wikipedia.org/ Wikipedia]
| |
| | Encyclopedia
| |
| |
| |
| | Free encyclopedia. Used for extraction of lexical-semantic rules (from its more structured parts), named entity recognition, geographical information etc.
| |
| | style="text-align: center;"|<small>3 - RTE4</small> <br> <small>0 - RTE3</small>
| |
| | [[Wikipedia - RTE Users|Users]]
| |
| |- bgcolor="#ECECEC" "align="left"
| |
| | [[TEASE]] Collection
| |
| | Collection of Entailment Rules
| |
| | Bar-Ilan University
| |
| | Output of the TEASE algorithm
| |
| | style="text-align: center;"|<small>0 - RTE4</small> <br> <small>0 - RTE3</small>
| |
| | [[Tease Collection - RTE Users|Users]]
| |
| |- bgcolor="#ECECEC" "align="left"
| |
| | [http://badc.nerc.ac.uk/help/abbrevs.html BADC Acronym and Abbreviation List]
| |
| | Word List
| |
| | BADC (British Atmospheric Data Centre)
| |
| | Acronym and Abbreviation List
| |
| | style="text-align: center;"|<small>1 - RTE4</small> <br> <small>0 - RTE3</small>
| |
| | [[BADC Acronym and Abbreviation List - RTE Users|Users]]
| |
| |- bgcolor="#ECECEC" "align="left"
| |
| | [http://www.acronym-guide.com/ Acronym Guide]
| |
| | Word List
| |
| | Acronym-Guide.com
| |
| | Acronym and Abbreviation Lists for English, branched in thematic directories
| |
| | style="text-align: center;"|<small>1 - RTE4</small> <br> <small>0 - RTE3</small>
| |
| | [[Acronym Guide - RTE Users|Users]]
| |
| |- bgcolor="#ECECEC" "align="left"
| |
| | [http://www.cs.ualberta.ca/~lindek/downloads.htm Dekang Lin’s Thesaurus]
| |
| | Thesaurus
| |
| | University of Alberta
| |
| | Thesaurus automatically constructed using a parsed corpus, based on distributional similarity scores
| |
| | style="text-align: center;"|<small>1 - RTE4</small> <br> <small>0 - RTE3</small>
| |
| | [[Dekang Lin’s Thesaurus - RTE Users|Users]]
| |
| |- bgcolor="#ECECEC" "align="left"
| |
| | [http://en.wikipedia.org/wiki/Roget%27s_Thesaurus Roget's Thesaurus]
| |
| | Thesaurus
| |
| | Peter Mark Roget (Electronic version distributed by University of Chicago)
| |
| | Roget's Thesaurus is a widely-used English thesaurus, created by Dr. Peter Mark Roget in 1805. The original edition had 15,000 words, and each new edition has been larger. The electronic edition ([http://machaut.uchicago.edu/rogets version 1.02]) is made available by University of Chicago.
| |
| | style="text-align: center;"|<small>0 - RTE4</small> <br> <small>1 - RTE3</small>
| |
| | [[Roget's Thesaurus - RTE Users|Users]]
| |
| |- bgcolor="#ECECEC" "align="left"
| |
| | [http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC2006T13 Web1T 5-grams]
| |
| | Word list
| |
| | Linguistic Data Consortium, University of Pennsylvania; Google Inc.
| |
| | Data set containing English word n-grams and their observed frequency counts. The n-gram counts were generated from approximately 1 trillion word tokens of text from publicly accessible Web pages
| |
| | style="text-align: center;"|<small>1 - RTE4</small> <br> <small>0 - RTE3</small>
| |
| | [[Web1T - RTE Users|Users]]
| |
| |- bgcolor="#ECECEC" "align="left"
| |
| | [http://geonames.usgs.gov/index.html GNIS - Geographic Names Information System]
| |
| | Gazetteer
| |
| | USGS (United States Geological Survey)
| |
| | Database containing the Federal and national standard toponyms for USA, associated areas and Antarctica
| |
| | style="text-align: center;"|<small>1 - RTE4</small> <br> <small>0 - RTE3</small>
| |
| | [[GNIS - RTE Users|Users]]
| |
| |- bgcolor="#ECECEC" "align="left"
| |
| | [http://www.geonames.org/ Geonames]
| |
| | Gazetteer
| |
| |
| |
| | Database containing eight million geographical names. It is integrating geographical data such as names of places in various languages, elevation, population and others from various sources.
| |
| | style="text-align: center;"|<small>1 - RTE4</small> <br> <small>0 - RTE3</small>
| |
| | [[Geonames - RTE Users|Users]]
| |
| |- bgcolor="#ECECEC" "align="left"
| |
| | [http://nlp.cs.nyu.edu/paraphrase/ Sekine's Paraphrase Database]
| |
| | Collection of paraphrases
| |
| | Department of Computer Science, New York University
| |
| | Data-base created using Sekine's method, NOT cleaned up by human. It includes 19,975 sets of paraphrases with 191,572 phrases.
| |
| | style="text-align: center;"| <small>0 - RTE4</small> <br> <small>0 - RTE3</small>
| |
| | [[Sekine's Paraphrase Database - RTE Users|Users]]
| |
| |- bgcolor="#ECECEC" "align="left"
| |
| | [http://research.microsoft.com/research/downloads/Details/607D14D9-20CD-47E3-85BC-A2F65CD28042/Details.aspx Microsoft Research Paraphrase Corpus]
| |
| | Collection of paraphrases
| |
| | Microsoft Research
| |
| | Text file containing 5800 pairs of sentences which have been extracted from news sources on the web, along with human annotations indicating whether each pair captures a paraphrase/semantic equivalence relationship.
| |
| | style="text-align: center;"| <small>0 - RTE4</small> <br> <small>0 - RTE3</small>
| |
| | [[Microsoft Research Paraphrase Corpus - RTE Users|Users]]
| |
| |- bgcolor="#ECECEC" "align="left"
| |
| | [http://www.cs.cornell.edu/~cristian/Without_a_doubt_-_Data.html Downward entailing operators]
| |
| | Collection of entailing operators
| |
| | Department of Computer Science, Cornell University, Ithaca NY
| |
| | System output of an unsupervised algorithm recovering many Downward Entailing operators, like 'doubt'.
| |
| | style="text-align: center;"| <small>0 - RTE4</small> <br> <small>0 - RTE3</small>
| |
| | [[Downward entailing operators - RTE Users|Users]]
| |
| |- bgcolor="#ECECEC" "align="left"
| |
| |[http://cs.biu.ac.il/~shey/WikiRules.html WikiRules!] <br>
| |
| | Lexical Reference rule-base
| |
| | Bar-Ilan University
| |
| | Extraction of lexical reference rules from the text body (first sentence) and from metadata (links, redirects, parentheses) of Wikipedia
| |
| | style="text-align: center;"|<small>1 - RTE4</small> <br> <small>0 - RTE3</small>
| |
| | [[WikiRules! - RTE Users|Users]]
| |
| |- bgcolor="#ECECEC" "align="left"
| |
| | ''New resource''
| |
| |
| |
| |
| |
| | ''Participants are encouraged to contribute''
| |
| | style="text-align: center;"|
| |
| | [[New Resource1 - RTE Users|Users]]
| |
| |- bgcolor="#ECECEC" "align="left"
| |
| | ''New resource''
| |
| |
| |
| |
| |
| | ''Participants are encouraged to contribute''
| |
| | style="text-align: center;"|
| |
| | [[New Resource2 - RTE Users|Users]]
| |
| |}
| |
| <br>
| |
| <br>
| |