<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://www.aclweb.org/aclwiki/index.php?action=history&amp;feed=atom&amp;title=Word_sense_disambiguation_resources</id>
	<title>Word sense disambiguation resources - Revision history</title>
	<link rel="self" type="application/atom+xml" href="https://www.aclweb.org/aclwiki/index.php?action=history&amp;feed=atom&amp;title=Word_sense_disambiguation_resources"/>
	<link rel="alternate" type="text/html" href="https://www.aclweb.org/aclwiki/index.php?title=Word_sense_disambiguation_resources&amp;action=history"/>
	<updated>2026-05-23T05:49:52Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.43.6</generator>
	<entry>
		<id>https://www.aclweb.org/aclwiki/index.php?title=Word_sense_disambiguation_resources&amp;diff=10920&amp;oldid=prev</id>
		<title>Tristan Miller: migrated from https://www.ukp.tu-darmstadt.de/research/scientific-community/ukpedia/word-sense-disambiguation/</title>
		<link rel="alternate" type="text/html" href="https://www.aclweb.org/aclwiki/index.php?title=Word_sense_disambiguation_resources&amp;diff=10920&amp;oldid=prev"/>
		<updated>2014-12-12T11:42:57Z</updated>

		<summary type="html">&lt;p&gt;migrated from https://www.ukp.tu-darmstadt.de/research/scientific-community/ukpedia/word-sense-disambiguation/&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;[[Word sense disambiguation]] (WSD) is an open problem in natural language processing concerned with determining which sense (i.e., meaning) of a word is used in a particular context.  This article provides links to important WSD-related publications, software, corpora, and other resources.&lt;br /&gt;
&lt;br /&gt;
==Introductory material, overviews, and surveys==&lt;br /&gt;
* [http://en.wikipedia.org/wiki/Word_sense_disambiguation Word sense disambiguation] (Wikipedia)&lt;br /&gt;
* [http://www.scholarpedia.org/article/Word_sense_disambiguation Word sense disambiguation] (Scholarpedia)&lt;br /&gt;
* [http://aclweb.org/aclwiki/index.php?title=Word_sense_disambiguation Word sense disambiguation] (ACLWiki)&lt;br /&gt;
* Eneko Agirre and Philip Edmonds, editors. [http://www.wsdbook.org/ &amp;#039;&amp;#039;Word Sense Disambiguation: Algorithms and Applications&amp;#039;&amp;#039;], volume 33 of Text, Speech, and Language Technology. Springer, 2006. ISBN 978-1-4020-6870-6.&lt;br /&gt;
* [http://www.d.umn.edu/%7Etpederse/WSDTutorial.html Advances in Word Sense Disambiguation tutorial] by Rada Mihalcea and Ted Pedersen (2005)&lt;br /&gt;
* Roberto Navigli. [http://dl.acm.org/citation.cfm?doid=1459352.1459355 Word sense disambiguation: A survey]. &amp;#039;&amp;#039;ACM Computing Surveys&amp;#039;&amp;#039;, 41:10:1–10:69, February 2009. ISSN 0360-0300.&lt;br /&gt;
* Nancy Ide and Jean Véronis. [http://www.up.univ-mrs.fr/%7Everonis/pdf/1998wsd.pdf Introduction to the special issue on word sense disambiguation: The state of the art]. &amp;#039;&amp;#039;Computational Linguistics&amp;#039;&amp;#039;, 24(1):1–40, 1998. ISSN 0891-2017.&lt;br /&gt;
* K. C. Litkowski. Computational lexicons and dictionaries. In Keith Brown, editor, &amp;#039;&amp;#039;Encyclopedia of Language and Linguistics&amp;#039;&amp;#039;, pages 753–761. Elsevier Science, Oxford, second edition, 2005. ISBN 978-0-08-044299-0.&lt;br /&gt;
* Philip Edmonds. Lexical disambiguation. In Keith Brown, editor, &amp;#039;&amp;#039;Encyclopedia of Language and Linguistics&amp;#039;&amp;#039;, pages 607–623. Elsevier Science, Oxford, second edition, 2005. ISBN 978-0-08-044299-0.&lt;br /&gt;
* David Jurafsky and James H. Martin. &amp;#039;&amp;#039;Speech and Language Processing&amp;#039;&amp;#039;, chapter Computational Lexical Semantics. Prentice Hall, second edition, 2008. ISBN 978-0131873216.&lt;br /&gt;
* Christopher D. Manning and Hinrich Schütze. &amp;#039;&amp;#039;Foundations of Statistical Natural Language Processing&amp;#039;&amp;#039;, chapter Word Sense Disambiguation, pages 229–264. The MIT Press, 1999. ISBN 978-0262133609.&lt;br /&gt;
* David Yarowsky. Word sense disambiguation. In Nitin Indurkhya and Fred J. Damerau, editors, &amp;#039;&amp;#039;Handbook of Natural Language Processing&amp;#039;&amp;#039;, pages 315–338. Chapman and Hall/CRC, second edition, 2010. ISBN 978-1420085921.&lt;br /&gt;
&lt;br /&gt;
==Conferences, workshops, and journals==&lt;br /&gt;
* [http://www.dcs.shef.ac.uk/research/ilash/iccl/ The International Committee on Computational Linguistics (ICCL)] and its conferences:&lt;br /&gt;
** [http://www.dcs.shef.ac.uk/research/ilash/iccl/ International Conference on Computational Linguistics (COLING)]&lt;br /&gt;
* [http://www.aclweb.org/ The Association for Computational Linguistics (ACL)] and its associated organizations, conferences, workshops, and special interest groups:&lt;br /&gt;
** [http://www.clres.com/siglex.html ACL SIGLEX], the umbrella organization for the [http://www.senseval.org/ Semeval and Senseval] evaluation exercises:&lt;br /&gt;
*** [http://www.itri.brighton.ac.uk/events/senseval/ARCHIVE/index.html Senseval-1] (1998)&lt;br /&gt;
*** [http://www.sle.sharp.co.uk/senseval2 Senseval-2] (2001)&lt;br /&gt;
*** [http://www.senseval.org/senseval3 Senseval-3] (2004)&lt;br /&gt;
*** [http://nlp.cs.swarthmore.edu/semeval Semeval-1] (2007)&lt;br /&gt;
*** [http://semeval2.fbk.eu/ Semeval-2] (2010)&lt;br /&gt;
*** [http://www.cs.york.ac.uk/semeval/ Semeval-3] (2013)&lt;br /&gt;
* [http://ixa2.si.ehu.es/clirwsd/ Robust WSD task] at the [http://clef-campaign.org/ Cross Language Evaluation Forum (CLEF)]&lt;br /&gt;
* [http://www.mitpressjournals.org/loi/coli &amp;#039;&amp;#039;Computational Linguistics&amp;#039;&amp;#039;]. MIT Press. ISSN 0891-2017.&lt;br /&gt;
** [http://www.aclweb.org/anthology-new/J/J98/ &amp;#039;&amp;#039;Computational Linguistics&amp;#039;&amp;#039;, 24(1), 1998.  Special issue on word sense disambiguation].&lt;br /&gt;
* [http://journals.cambridge.org/action/displayJournal?jid=NLE &amp;#039;&amp;#039;Natural Language Engineering&amp;#039;&amp;#039;]. Cambridge University Press. ISSN 1351-3249.&lt;br /&gt;
**[http://journals.cambridge.org/action/displayIssue?decade=2000&amp;amp;jid=NLE&amp;amp;volumeId=8&amp;amp;issueId=04&amp;amp;iid=138358 &amp;#039;&amp;#039;Natural Language Engineering&amp;#039;&amp;#039;, 8(4), 2002.  Special issue on evaluating word sense disambiguation systems].&lt;br /&gt;
&lt;br /&gt;
==Sense inventories and other lexical resources==&lt;br /&gt;
; [http://www.webdante.com/ DANTE]&lt;br /&gt;
: A lexical database for English&lt;br /&gt;
; [http://www.ibiblio.org/webster/ GCIDE_XML]&lt;br /&gt;
: The GNU version of the &amp;#039;&amp;#039;Collaborative International Dictionary of English&amp;#039;&amp;#039; (CIDE), presented in XML&lt;br /&gt;
; [http://www.itri.brighton.ac.uk/events/senseval/ARCHIVE/resources.html#lex HECTOR]&lt;br /&gt;
: A 35-word English dictionary used for Senseval-1&lt;br /&gt;
; &amp;#039;&amp;#039;Longman Dictionary of Contemporary English&amp;#039;&amp;#039; (LDOCE).  Burnt Mill, Essex: Longman, 1978&lt;br /&gt;
: This proprietary dictionary saw considerable use by the WSD research community before less restrictively licensed resources became available.&lt;br /&gt;
; &amp;#039;&amp;#039;Roget&amp;#039;s International Thesaurus&amp;#039;&amp;#039;.  New York: Harper Collins, 1992&lt;br /&gt;
: This proprietary thesaurus saw considerable use by the WSD research community before less restrictively licensed resources became available.&lt;br /&gt;
; [http://rogets.site.uottawa.ca/ The Open Roget&amp;#039;s Project]&lt;br /&gt;
: A free implementation of the 1911 &amp;#039;&amp;#039;Roget&amp;#039;s Thesaurus&amp;#039;&amp;#039;.&lt;br /&gt;
===Wordnets and associated resources===&lt;br /&gt;
;[http://wordnet.princeton.edu/ WordNet]&lt;br /&gt;
: A lexical database for English&lt;br /&gt;
; [http://www.globalwordnet.org/gwa/wordnet_table.htm Wordnets in the world]&lt;br /&gt;
: A list of wordnets for various languages&lt;br /&gt;
; [http://xwn.hlt.utdallas.edu/ eXtended WordNet]&lt;br /&gt;
: A version of WordNet in which the glosses are syntactically parsed and transformed into logic forms, and their content words are semantically disambiguated&lt;br /&gt;
; [http://www.cse.unt.edu/%7Erada/downloads.html#wordnet Inter-version WordNet mappings]&lt;br /&gt;
: Mappings between synset offsets in various WordNet versions&lt;br /&gt;
; [http://www.lsi.upc.edu/~nlp/meaning/downloads.html MCR]&lt;br /&gt;
: An integration of five local wordnets, the EuroWordNet Top Concept ontology, MultiWordNet Domains, and hundreds of thousands of new semantic relations and properties automatically acquired from corpora.&lt;br /&gt;
&lt;br /&gt;
==Annotated corpora==&lt;br /&gt;
; [http://www.computing.dcu.ie/%7Easmeaton/SIGIR96-captions/ Alan Smeaton and Ian Quigley&amp;#039;s image captions]&lt;br /&gt;
: 8816 WordNet 1.5-annotated instances of 2304 lemmas in 2714 image captions&lt;br /&gt;
; [http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC97T12 DSO Corpus of Sense-Tagged English]&lt;br /&gt;
: Sense-tagged word occurrences for 121 nouns and 70 verbs occurring in the Brown Corpus and &amp;#039;&amp;#039;Wall Street Journal&amp;#039;&amp;#039; corpus&lt;br /&gt;
; [http://www.itri.brighton.ac.uk/events/senseval/ARCHIVE/resources.html HECTOR (Senseval-1)]&lt;br /&gt;
: Separate training and test corpora with 35 word types annotated with their HECTOR senses.  See also Ted Pedersen&amp;#039;s conversions.&lt;br /&gt;
; interest&lt;br /&gt;
: &amp;#039;&amp;#039;Wall Street Journal&amp;#039;&amp;#039; articles with 2369 instances of &amp;quot;interest&amp;quot; annotated with their LDOCE senses.  See Ted Pedersen&amp;#039;s conversions.&lt;br /&gt;
; line, hard, serve&lt;br /&gt;
: &amp;#039;&amp;#039;Wall Street Journal&amp;#039;&amp;#039; articles with over 12,000 instances of &amp;quot;line&amp;quot;, &amp;quot;hard&amp;quot;, and &amp;quot;serve&amp;quot; tagged with a subset of their WordNet 1.5 senses.  See Ted Pedersen&amp;#039;s conversions.&lt;br /&gt;
; [http://www.cse.unt.edu/%7Erada/downloads.html#omwe Open Mind Word Expert sense-tagged data]&lt;br /&gt;
: Various data sets for English, Romanian, and Hindi&lt;br /&gt;
; [http://www.cse.unt.edu/%7Erada/downloads.html#sensevalsemcor Rada Mihalcea&amp;#039;s Senseval-2 and Senseval-3 conversions into SemCor format]&lt;br /&gt;
: Senseval-2 and Senseval-3 English all-words data converted into SemCor format&lt;br /&gt;
; [http://www.cse.unt.edu/%7Erada/downloads.html#semcor SemCor]&lt;br /&gt;
: Brown Corpus texts annotated with WordNet 1.6 senses and automatically mapped to WordNet 1.7, WordNet 1.7.1, WordNet 2.0, WordNet 2.1, and WordNet 3.0&lt;br /&gt;
; [http://www.grsampson.net/Resources.html SEMiSUSANNE]&lt;br /&gt;
: 33 sense-tagged and structurally annotated documents from the Brown Corpus&lt;br /&gt;
; [http://ixa.si.ehu.es/Ixa/resources/sensecorpus Sensecorpus]&lt;br /&gt;
: Automatically extracted examples for all WordNet 1.6 noun senses and topic signatures built based on those examples&lt;br /&gt;
; [http://86.188.143.199/senseval2/Results/guidelines.htm#rawdata Senseval-2]&lt;br /&gt;
: Three all-words sense-annotated Penn Treebank II articles comprising in total some 5000 words of running text, plus some Penn Treebank II &amp;#039;&amp;#039;Wall Street Journal&amp;#039;&amp;#039; and British National Corpus text where 75 to 300 instances of a total of 73 nouns, adjectives, and verbs have been annotated with their WordNet 1.7 senses.  See also Ted Pedersen&amp;#039;s and Rada Mihalcea&amp;#039;s conversions.&lt;br /&gt;
; [http://www.d.umn.edu/%7Etpederse/data.html Ted Pedersen&amp;#039;s Sense-tagged Text]&lt;br /&gt;
: Versions of the Senseval-1, Senseval-2, line, hard, serve, and interest data that have been converted to a common format (Senseval-2), POS tagged, and parsed.&lt;br /&gt;
; [http://www.cse.unt.edu/%7Erada/downloads.html#twa TWA sense-tagged data]&lt;br /&gt;
: Sense-tagged data for six words with two-way ambiguities (bass, crane, motion, palm, plant, tank)&lt;br /&gt;
; [http://wordnet.princeton.edu/glosstag.shtml WordNet Gloss Disambiguation Project]&lt;br /&gt;
: A corpus of WordNet 3.0 glosses with word forms disambiguated to their WordNet 3.0 senses&lt;br /&gt;
&lt;br /&gt;
==Software==&lt;br /&gt;
; [http://sourceforge.net/projects/cuitools/ CuiTools]&lt;br /&gt;
: A complete word sense disambiguation system that assigns senses to biomedical text based on the UMLS&lt;br /&gt;
; [https://code.google.com/p/dkpro-wsd/ DKPro WSD]&lt;br /&gt;
: A collection of software components for word sense disambiguation based on the Apache UIMA framework.&lt;br /&gt;
; [http://www.cse.unt.edu/%7Erada/downloads.html#gwsd GWSD: Unsupervised Graph-based Word Sense Disambiguation]&lt;br /&gt;
: A system for unsupervised all-words graph-based word sense disambiguation&lt;br /&gt;
; [http://alias-i.com/lingpipe/ LingPipe]&lt;br /&gt;
: A Java natural language processing toolkit.  A [http://alias-i.com/lingpipe/demos/tutorial/wordSense/read-me.html tutorial on using LingPipe for word sense disambiguation] is available.&lt;br /&gt;
; [http://www.nltk.org/ Natural Language Toolkit (NLTK)]&lt;br /&gt;
: Python modules for NLP, including a module for reading Senseval-2 data&lt;br /&gt;
; [http://www.d.umn.edu/%7Etpederse/senseclusters.html SenseClusters]&lt;br /&gt;
: A package of (mostly) Perl programs that allows a user to cluster similar contexts together using unsupervised knowledge-lean methods.&lt;br /&gt;
; [http://www.cse.unt.edu/%7Erada/downloads.html#senselearner SenseLearner]&lt;br /&gt;
: An all-words word sense disambiguation tool&lt;br /&gt;
; [http://www.d.umn.edu/%7Etpederse/sensetools.html SenseTools]&lt;br /&gt;
: A suite of tools that allow for easy creation of supervised word sense disambiguation systems&lt;br /&gt;
; [http://www.d.umn.edu/%7Etpederse/tools.html Senseval-2 data format converters]&lt;br /&gt;
: Tools to convert between the following formats: Senseval-1, Senseval-2, Senseval-2 with conflated words, Headless Senseval-2, WePS, English Giga Word, plain text, National Library of Medicine Test Collection, Open Mind Data&lt;br /&gt;
; [http://senserelate.sourceforge.net/ WordNet::SenseRelate]&lt;br /&gt;
: Perl tools which use measures of semantic similarity and relatedness to perform word sense disambiguation&lt;br /&gt;
; [http://sourceforge.net/projects/wsdgate/ WSD Gate]&lt;br /&gt;
: A word sense disambiguation toolkit using GATE and WEKA&lt;br /&gt;
; [http://www.d.umn.edu/%7Etpederse/wsdshell.html WSD Shell]&lt;br /&gt;
: An improved version of the Duluth-Shell which was used as a driver for the Duluth Senseval-2 and Senseval-3 systems&lt;/div&gt;</summary>
		<author><name>Tristan Miller</name></author>
	</entry>
</feed>