Knowledge collections and datasets (English): Difference between revisions
Jump to navigation
Jump to search
No edit summary |
mNo edit summary |
||
| Line 12: | Line 12: | ||
* [http://w3.usf.edu/FreeAssociation/ University of South Florida Free Association Norms] | * [http://w3.usf.edu/FreeAssociation/ University of South Florida Free Association Norms] | ||
* [[VerbOcean]] - verbs organized by semantic relation, including temporal precedence and strength | * [[VerbOcean]] - verbs organized by semantic relation, including temporal precedence and strength | ||
* [ | * [[WordNet]] | ||
* [http://www.cs.technion.ac.il/~gabr/resources/data/wordsim353/wordsim353.html WordSimilarity-353 Test Collection] | * [http://www.cs.technion.ac.il/~gabr/resources/data/wordsim353/wordsim353.html WordSimilarity-353 Test Collection] | ||
Revision as of 01:13, 7 February 2007
Datasets for Computational Linguistics and Natural Language Processing.
- Clustering by Committee - terms clustered and organized using the Distributional Hypothesis
- DIRT Paraphrase Collection - Discovery of Inference Rules from Text
- Edinburgh Associative Thesaurus (EAT)
- FrameNet
- MRC Psycholinguistic Database
- Noun Compound Repository
- Reuters-21578 Text Categorization Collection
- Spam filtering datasets
- TEASE - Acquisition of Entailment Relations from the Web
- University of South Florida Free Association Norms
- VerbOcean - verbs organized by semantic relation, including temporal precedence and strength
- WordNet
- WordSimilarity-353 Test Collection