Knowledge collections and datasets (English): Difference between revisions

From ACL Wiki
Ionandr (talk | contribs)
Added spam filtering datasets.
m No edit summary
Line 2:


* [[Clustering by Committee]] - terms clustered and organized using the [[Distributional Hypothesis]]
* [[DIRT Paraphrase Collection]]
* [[DIRT Paraphrase Collection]] - Discovery of Inference Rules from Text
* [http://www.eat.rl.ac.uk/ Edinburgh Associative Thesaurus (EAT)]
* [http://framenet.icsi.berkeley.edu/ FrameNet]
Line 10:
* [[Spam filtering datasets]]
* [http://w3.usf.edu/FreeAssociation/ University of South Florida Free Association Norms]
* [[VerbOcean|VerbOcean - verbs organized by semantic relation, including temporal precedence, strength, etc.]]
* [[VerbOcean]] - verbs organized by semantic relation, including temporal precedence and strength
* [http://wordnet.princeton.edu/ WordNet]
* [http://www.cs.technion.ac.il/~gabr/resources/data/wordsim353/wordsim353.html WordSimilarity-353 Test Collection]

Revision as of 16:42, 19 November 2006

Datasets for Computational Linguistics and Natural Language Processing.

Additional Dataset Collections