Difference between revisions of "Corpora, datasets, lexicons"

From ACL Wiki
Line 46:
  
 
== Lexicons ==
 
+
(alphabetical order)
 
* [http://clipdemos.umiacs.umd.edu/catvar/ Catvar 2.0: The Categorial Variation Database]
 
* [http://www.wjh.harvard.edu/%7Einquirer/spreadsheet_guide.htm General Inquirer]
Line 54:
 
* [http://www.signiform.com/tt/htm/tt.htm ThoughtTreasure]
  
 +
=== WordNet and enhancements ===
 +
(alphabetical order)
 +
* [http://xwn.hlt.utdallas.edu/ eXtended WordNet] - glosses are syntactically parsed, transformed into logic forms, and content words are semantically disambiguated
 +
* [http://patty.isti.cnr.it/~esuli/software/SentiWordNet/ SentiWordNet] - assigns to each synset of WordNet three sentiment scores: positivity, negativity, objectivity
 
* [http://wordnet.princeton.edu/ WordNet] - the original
** [http://xwn.hlt.utdallas.edu/ eXtended WordNet] - glosses are syntactically parsed, transformed into logic forms, and content words are semantically disambiguated
+
* [http://tcc.itc.it/research/textec/topics/disambiguation/wordnetdomains.html WordNet Domains] - augmented with Domain Labels, such as POLITICS, ECONOMY, SPORT
** [http://tcc.itc.it/research/textec/topics/disambiguation/wordnetdomains.html WordNet Domains] - augmented with Domain Labels, such as POLITICS, ECONOMY, SPORT
 
** [http://patty.isti.cnr.it/~esuli/software/SentiWordNet/ SentiWordNet] - assigns to each synset of WordNet three sentiment scores: positivity, negativity, objectivity
 

Revision as of 07:47, 2 November 2006

Miscellaneous

Corpora

English

(alphabetical order)

Multilingual

(alphabetical order)

Other lists of corpora

(alphabetical order)

Datasets

Lexicons

(alphabetical order)

WordNet and enhancements

(alphabetical order)

  • eXtended WordNet - glosses are syntactically parsed, transformed into logic forms, and content words are semantically disambiguated
  • SentiWordNet - assigns to each synset of WordNet three sentiment scores: positivity, negativity, objectivity
  • WordNet - the original
  • WordNet Domains - augmented with Domain Labels, such as POLITICS, ECONOMY, SPORT