<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://www.aclweb.org/aclwiki/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Wartena</id>
	<title>ACL Wiki - User contributions [en]</title>
	<link rel="self" type="application/atom+xml" href="https://www.aclweb.org/aclwiki/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Wartena"/>
	<link rel="alternate" type="text/html" href="https://www.aclweb.org/aclwiki/Special:Contributions/Wartena"/>
	<updated>2026-04-10T12:47:17Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.43.6</generator>
	<entry>
		<id>https://www.aclweb.org/aclwiki/index.php?title=TOEFL_Synonym_Questions_(State_of_the_art)&amp;diff=10277</id>
		<title>TOEFL Synonym Questions (State of the art)</title>
		<link rel="alternate" type="text/html" href="https://www.aclweb.org/aclwiki/index.php?title=TOEFL_Synonym_Questions_(State_of_the_art)&amp;diff=10277"/>
		<updated>2013-10-02T22:06:30Z</updated>

		<summary type="html">&lt;p&gt;Wartena: /* References */  Added Karlgren and Sahlgren 2001&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;* TOEFL = Test of English as a Foreign Language&lt;br /&gt;
* 80 multiple-choice synonym questions; 4 choices per question&lt;br /&gt;
* the TOEFL questions are available on request by contacting [http://lsa.colorado.edu/mail_sub.html LSA Support at CU Boulder], the people who manage the [http://lsa.colorado.edu/ LSA web site at Colorado]&lt;br /&gt;
* introduced in Landauer and Dumais (1997) as a way of evaluating algorithms for measuring degree of similarity between words&lt;br /&gt;
* subsequently used by many other researchers&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Sample question ==&lt;br /&gt;
&lt;br /&gt;
::{| border=&amp;quot;0&amp;quot; cellpadding=&amp;quot;1&amp;quot; cellspacing=&amp;quot;1&amp;quot; &lt;br /&gt;
|-&lt;br /&gt;
! Stem:&lt;br /&gt;
|&lt;br /&gt;
| levied&lt;br /&gt;
|-&lt;br /&gt;
! Choices:&lt;br /&gt;
| (a)&lt;br /&gt;
| imposed&lt;br /&gt;
|-&lt;br /&gt;
|&lt;br /&gt;
| (b)&lt;br /&gt;
| believed&lt;br /&gt;
|-&lt;br /&gt;
|&lt;br /&gt;
| (c)&lt;br /&gt;
| requested&lt;br /&gt;
|-&lt;br /&gt;
|&lt;br /&gt;
| (d)&lt;br /&gt;
| correlated&lt;br /&gt;
|-&lt;br /&gt;
! Solution:&lt;br /&gt;
| (a)&lt;br /&gt;
| imposed&lt;br /&gt;
|-&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Table of results ==&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot; cellpadding=&amp;quot;5&amp;quot; cellspacing=&amp;quot;1&amp;quot; width=&amp;quot;100%&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Algorithm&lt;br /&gt;
! Reference for algorithm&lt;br /&gt;
! Reference for experiment&lt;br /&gt;
! Type&lt;br /&gt;
! Correct&lt;br /&gt;
! 95% confidence&lt;br /&gt;
|-&lt;br /&gt;
| RES&lt;br /&gt;
| Resnik (1995)&lt;br /&gt;
| Jarmasz and Szpakowicz (2003)&lt;br /&gt;
| Hybrid&lt;br /&gt;
| 20.31%&lt;br /&gt;
| 12.89–31.83%&lt;br /&gt;
|-&lt;br /&gt;
| LC&lt;br /&gt;
| Leacock and Chodrow (1998)&lt;br /&gt;
| Jarmasz and Szpakowicz (2003)&lt;br /&gt;
| Lexicon-based&lt;br /&gt;
| 21.88%&lt;br /&gt;
| 13.91–33.21%&lt;br /&gt;
|-&lt;br /&gt;
| LIN&lt;br /&gt;
| Lin (1998)&lt;br /&gt;
| Jarmasz and Szpakowicz (2003)&lt;br /&gt;
| Hybrid&lt;br /&gt;
| 24.06%&lt;br /&gt;
| 15.99–35.94%&lt;br /&gt;
|-&lt;br /&gt;
| Random&lt;br /&gt;
| Random guessing&lt;br /&gt;
| 1 / 4 = 25.00%&lt;br /&gt;
| Random&lt;br /&gt;
| 25.00%&lt;br /&gt;
| 15.99–35.94%&lt;br /&gt;
|-&lt;br /&gt;
| JC&lt;br /&gt;
| Jiang and Conrath (1997)&lt;br /&gt;
| Jarmasz and Szpakowicz (2003)&lt;br /&gt;
| Hybrid&lt;br /&gt;
| 25.00%&lt;br /&gt;
| 15.99–35.94%&lt;br /&gt;
|-&lt;br /&gt;
| LSA&lt;br /&gt;
| Landauer and Dumais (1997)&lt;br /&gt;
| Landauer and Dumais (1997)&lt;br /&gt;
| Corpus-based&lt;br /&gt;
| 64.38%&lt;br /&gt;
| 52.90–74.80%&lt;br /&gt;
|-&lt;br /&gt;
| Human&lt;br /&gt;
| Average non-English US college applicant&lt;br /&gt;
| Landauer and Dumais (1997)&lt;br /&gt;
| Human&lt;br /&gt;
| 64.50%&lt;br /&gt;
| 53.01–74.88%&lt;br /&gt;
|-&lt;br /&gt;
| RI&lt;br /&gt;
| Karlgren and Sahlgren (2001)&lt;br /&gt;
| Karlgren and Sahlgren (2001)&lt;br /&gt;
| Corpus-based&lt;br /&gt;
| 72%&lt;br /&gt;
| &lt;br /&gt;
|-&lt;br /&gt;
| DS&lt;br /&gt;
| Pado and Lapata (2007)&lt;br /&gt;
| Pado and Lapata (2007)&lt;br /&gt;
| Corpus-based&lt;br /&gt;
| 73.00%&lt;br /&gt;
| 62.72-82.96%&lt;br /&gt;
|-&lt;br /&gt;
| PMI-IR&lt;br /&gt;
| Turney (2001)&lt;br /&gt;
| Turney (2001)&lt;br /&gt;
| Corpus-based&lt;br /&gt;
| 73.75%&lt;br /&gt;
| 62.72–82.96%&lt;br /&gt;
|-&lt;br /&gt;
| PairClass&lt;br /&gt;
| Turney (2008)&lt;br /&gt;
| Turney (2008)&lt;br /&gt;
| Corpus-based&lt;br /&gt;
| 76.25%&lt;br /&gt;
| 65.42-85.06%&lt;br /&gt;
|-&lt;br /&gt;
| HSO&lt;br /&gt;
| Hirst and St.-Onge (1998)&lt;br /&gt;
| Jarmasz and Szpakowicz (2003)&lt;br /&gt;
| Lexicon-based&lt;br /&gt;
| 77.91%&lt;br /&gt;
| 68.17–87.11%&lt;br /&gt;
|-&lt;br /&gt;
| JS&lt;br /&gt;
| Jarmasz and Szpakowicz (2003)&lt;br /&gt;
| Jarmasz and Szpakowicz (2003)&lt;br /&gt;
| Lexicon-based&lt;br /&gt;
| 78.75%&lt;br /&gt;
| 68.17–87.11%&lt;br /&gt;
|-&lt;br /&gt;
| PMI-IR&lt;br /&gt;
| Terra and Clarke (2003)&lt;br /&gt;
| Terra and Clarke (2003)&lt;br /&gt;
| Corpus-based&lt;br /&gt;
| 81.25%&lt;br /&gt;
| 70.97–89.11%&lt;br /&gt;
|-&lt;br /&gt;
| CWO&lt;br /&gt;
| Ruiz-Casado et al. (2005)&lt;br /&gt;
| Ruiz-Casado et al. (2005)&lt;br /&gt;
| Web-based&lt;br /&gt;
| 82.55%&lt;br /&gt;
| 72.38–90.09%&lt;br /&gt;
|-&lt;br /&gt;
| PPMIC&lt;br /&gt;
| Bullinaria and Levy (2007)&lt;br /&gt;
| Bullinaria and Levy (2007)&lt;br /&gt;
| Corpus-based&lt;br /&gt;
| 85.00%&lt;br /&gt;
| 75.26-92.00%&lt;br /&gt;
|-&lt;br /&gt;
| GLSA&lt;br /&gt;
| Matveeva et al. (2005)&lt;br /&gt;
| Matveeva et al. (2005)&lt;br /&gt;
| Corpus-based&lt;br /&gt;
| 86.25%&lt;br /&gt;
| 76.73-92.93%&lt;br /&gt;
|-&lt;br /&gt;
| LSA&lt;br /&gt;
| Rapp (2003)&lt;br /&gt;
| Rapp (2003)&lt;br /&gt;
| Corpus-based&lt;br /&gt;
| 92.50%&lt;br /&gt;
| 84.39-97.20%&lt;br /&gt;
|-&lt;br /&gt;
| ADW&lt;br /&gt;
| Pilehvar et al. (2013)&lt;br /&gt;
| Pilehvar et al. (2013)&lt;br /&gt;
| WordNet graph-based (unsupervised)&lt;br /&gt;
| 96.25%&lt;br /&gt;
| 89.43-99.22%&lt;br /&gt;
|-&lt;br /&gt;
| PR&lt;br /&gt;
| Turney et al. (2003)&lt;br /&gt;
| Turney et al. (2003)&lt;br /&gt;
| Hybrid&lt;br /&gt;
| 97.50%&lt;br /&gt;
| 91.26–99.70%&lt;br /&gt;
|-&lt;br /&gt;
| PCCP&lt;br /&gt;
| Bullinaria and Levy (2012)&lt;br /&gt;
| Bullinaria and Levy (2012)&lt;br /&gt;
| Corpus-based&lt;br /&gt;
| 100.00%&lt;br /&gt;
| 96.32-100.00%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== Explanation of table ==&lt;br /&gt;
&lt;br /&gt;
* &#039;&#039;&#039;Algorithm&#039;&#039;&#039; = name of algorithm&lt;br /&gt;
* &#039;&#039;&#039;Reference for algorithm&#039;&#039;&#039; = where to find out more about given algorithm&lt;br /&gt;
* &#039;&#039;&#039;Reference for experiment&#039;&#039;&#039; = where to find out more about evaluation of given algorithm with TOEFL questions&lt;br /&gt;
* &#039;&#039;&#039;Type&#039;&#039;&#039; = general type of algorithm: corpus-based, lexicon-based, hybrid&lt;br /&gt;
* &#039;&#039;&#039;Correct&#039;&#039;&#039; = percent of 80 questions that given algorithm answered correctly&lt;br /&gt;
* &#039;&#039;&#039;95% confidence&#039;&#039;&#039; = confidence interval calculated using the [[Statistical calculators|Binomial Exact Test]]&lt;br /&gt;
* table rows sorted in order of increasing percent correct&lt;br /&gt;
* several WordNet-based similarity measures are implemented in [http://www.d.umn.edu/~tpederse/ Ted Pedersen]&#039;s [http://www.d.umn.edu/~tpederse/similarity.html WordNet::Similarity] package&lt;br /&gt;
* LSA = Latent Semantic Analysis&lt;br /&gt;
* PCCP = Principal Component vectors with Caron P&lt;br /&gt;
* PMI-IR = Pointwise Mutual Information - Information Retrieval&lt;br /&gt;
* PR = Product Rule&lt;br /&gt;
* PPMIC = Positive Pointwise Mutual Information with Cosine&lt;br /&gt;
* GLSA = Generalized Latent Semantic Analysis&lt;br /&gt;
* CWO = Context Window Overlapping&lt;br /&gt;
* DS = Dependency Space&lt;br /&gt;
* RI = Random Indexing&lt;br /&gt;
&lt;br /&gt;
== Notes ==&lt;br /&gt;
&lt;br /&gt;
* the performance of a corpus-based algorithm depends on the corpus, so the difference in performance between two corpus-based systems may be due to the different corpora, rather than the different algorithms&lt;br /&gt;
* the TOEFL questions include nouns, verbs, and adjectives, but some of the WordNet-based algorithms were only designed to work with nouns; this explains some of the lower scores&lt;br /&gt;
* some of the algorithms may have been tuned on the TOEFL questions; read the references for details&lt;br /&gt;
* Landauer and Dumais (1997) report scores that were corrected for guessing by subtracting a penalty of 1/3 for each incorrect answer; they report a score of 52.5% when this penalty is applied; when the penalty is removed, their performance is 64.4% correct&lt;br /&gt;
&lt;br /&gt;
== References ==&lt;br /&gt;
&lt;br /&gt;
Bullinaria, J.A., and Levy, J.P. (2007). [http://www.cs.bham.ac.uk/~jxb/PUBS/BRM.pdf Extracting semantic representations from word co-occurrence statistics: A computational study]. &#039;&#039;Behavior Research Methods&#039;&#039;, 39(3), 510-526.&lt;br /&gt;
&lt;br /&gt;
Bullinaria, J.A., and Levy, J.P. (2012). [http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.228.9582&amp;amp;rep=rep1&amp;amp;type=pdf Extracting semantic representations from word co-occurrence statistics: stop-lists, stemming, and SVD]. &#039;&#039;Behavior Research Methods&#039;&#039;,  44(3):890-907.&lt;br /&gt;
&lt;br /&gt;
Hirst, G., and St-Onge, D. (1998). [http://mirror.eacoss.org/documentation/ITLibrary/IRIS/Data/1997/Hirst/Lexical/1997-Hirst-Lexical.pdf Lexical chains as representation of context for the detection and correction of malapropisms]. In C. Fellbaum (ed.), &#039;&#039;WordNet: An Electronic Lexical Database&#039;&#039;. Cambridge: MIT Press, 305-332.&lt;br /&gt;
&lt;br /&gt;
Jarmasz, M., and Szpakowicz, S. (2003). [http://www.csi.uottawa.ca/~szpak/recent_papers/TR-2003-01.pdf Roget’s thesaurus and semantic similarity], &#039;&#039;Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP-03)&#039;&#039;, Borovets, Bulgaria, September, pp. 212-219.&lt;br /&gt;
&lt;br /&gt;
Jiang, J.J., and Conrath, D.W. (1997). [http://wortschatz.uni-leipzig.de/~sbordag/aalw05/Referate/03_Assoziationen_BudanitskyResnik/Jiang_Conrath_97.pdf Semantic similarity based on corpus statistics and lexical taxonomy]. &#039;&#039;Proceedings of the International Conference on Research in Computational Linguistics&#039;&#039;, Taiwan.&lt;br /&gt;
&lt;br /&gt;
Karlgren, J. and Sahlgren, M. (2001). [http://www.sics.se/~jussi/Artiklar/2001_RWIbook/KarlgrenSahlgren2001.pdf From Words to Understanding]. In Uesaka, Y., Kanerva, P., &amp;amp; Asoh, H. (Eds.), &#039;&#039;Foundations of Real-World Intelligence&#039;&#039;, Stanford: CSLI Publications, pp. 294–308. &lt;br /&gt;
&lt;br /&gt;
Landauer, T.K., and Dumais, S.T. (1997). [http://lsa.colorado.edu/papers/plato/plato.annote.html A solution to Plato&#039;s problem: The latent semantic analysis theory of the acquisition, induction, and representation of knowledge]. &#039;&#039;Psychological Review&#039;&#039;, 104(2):211–240.&lt;br /&gt;
&lt;br /&gt;
Leacock, C., and Chodorow, M. (1998). [http://books.google.ca/books?id=Rehu8OOzMIMC&amp;amp;lpg=PA265&amp;amp;ots=IpnaLkZUec&amp;amp;lr&amp;amp;pg=PA265#v=onepage&amp;amp;q&amp;amp;f=false Combining local context and WordNet similarity for word sense identification]. In C. Fellbaum (ed.), &#039;&#039;WordNet: An Electronic Lexical Database&#039;&#039;. Cambridge: MIT Press, pp. 265-283.&lt;br /&gt;
&lt;br /&gt;
Lin, D. (1998). [http://www.cs.ualberta.ca/~lindek/papers/sim.pdf An information-theoretic definition of similarity]. &#039;&#039;Proceedings of the 15th International Conference on Machine Learning (ICML-98)&#039;&#039;, Madison, WI, pp. 296-304.&lt;br /&gt;
&lt;br /&gt;
Matveeva, I., Levow, G., Farahat, A., and Royer, C. (2005). [http://people.cs.uchicago.edu/~matveeva/SynGLSA_ranlp_final.pdf Generalized latent semantic analysis for term representation]. &#039;&#039;Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP-05)&#039;&#039;, Borovets, Bulgaria.&lt;br /&gt;
&lt;br /&gt;
Pado, S., and Lapata, M. (2007). [http://www.coli.uni-saarland.de/~pado/pub/papers/cl07_pado.pdf Dependency-based construction of semantic space models]. &#039;&#039;Computational Linguistics&#039;&#039;, 33(2), 161-199.&lt;br /&gt;
&lt;br /&gt;
Pilehvar, M.T., Jurgens D., and Navigli R. (2013). [http://wwwusers.di.uniroma1.it/~navigli/pubs/ACL_2013_Pilehvar_Jurgens_Navigli.pdf Align, disambiguate and walk: A unified approach for measuring semantic similarity]. &#039;&#039;Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (ACL 2013),&#039;&#039; Sofia, Bulgaria.&lt;br /&gt;
&lt;br /&gt;
Rapp, R. (2003). [http://www.amtaweb.org/summit/MTSummit/FinalPapers/19-Rapp-final.pdf Word sense discovery based on sense descriptor dissimilarity]. &#039;&#039;Proceedings of the Ninth Machine Translation Summit&#039;&#039;, pp. 315-322.&lt;br /&gt;
&lt;br /&gt;
Resnik, P. (1995). [http://citeseer.ist.psu.edu/resnik95using.html Using information content to evaluate semantic similarity]. &#039;&#039;Proceedings of the 14th International Joint Conference on Artificial Intelligence (IJCAI-95)&#039;&#039;, Montreal, pp. 448-453.&lt;br /&gt;
&lt;br /&gt;
Ruiz-Casado, M., Alfonseca, E. and Castells, P. (2005) [http://alfonseca.org/pubs/2005-ranlp1.pdf Using context-window overlapping in Synonym Discovery and Ontology Extension]. &#039;&#039;Proceedings of the International Conference Recent Advances in Natural Language Processing (RANLP-2005)&#039;&#039;, Borovets, Bulgaria.&lt;br /&gt;
&lt;br /&gt;
Terra, E., and Clarke, C.L.A. (2003). [http://acl.ldc.upenn.edu/N/N03/N03-1032.pdf Frequency estimates for statistical word similarity measures]. &#039;&#039;Proceedings of the Human Language Technology and North American Chapter of Association of Computational Linguistics Conference 2003 (HLT/NAACL 2003)&#039;&#039;, pp. 244–251.&lt;br /&gt;
&lt;br /&gt;
Turney, P.D. (2001). [http://arxiv.org/abs/cs.LG/0212033 Mining the Web for synonyms: PMI-IR versus LSA on TOEFL]. &#039;&#039;Proceedings of the Twelfth European Conference on Machine Learning (ECML-2001)&#039;&#039;, Freiburg, Germany, pp. 491-502.&lt;br /&gt;
&lt;br /&gt;
Turney, P.D., Littman, M.L., Bigham, J., and Shnayder, V. (2003). [http://arxiv.org/abs/cs.CL/0309035 Combining independent modules to solve multiple-choice synonym and analogy problems]. &#039;&#039;Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP-03)&#039;&#039;, Borovets, Bulgaria, pp. 482-489.&lt;br /&gt;
&lt;br /&gt;
Turney, P.D. (2008). [http://arxiv.org/abs/0809.0124 A uniform approach to analogies, synonyms, antonyms, and associations]. &#039;&#039;Proceedings of the 22nd International Conference on Computational Linguistics (Coling 2008)&#039;&#039;, Manchester, UK, pp. 905-912.&lt;br /&gt;
&lt;br /&gt;
== See also ==&lt;br /&gt;
&lt;br /&gt;
* [[Attributional and Relational Similarity (State of the art)]]&lt;br /&gt;
* [[ESL Synonym Questions (State of the art)|ESL Synonym Questions]]&lt;br /&gt;
* [[SAT Analogy Questions]]&lt;br /&gt;
* [[State of the art]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Category:State of the art]]&lt;/div&gt;</summary>
		<author><name>Wartena</name></author>
	</entry>
	<entry>
		<id>https://www.aclweb.org/aclwiki/index.php?title=TOEFL_Synonym_Questions_(State_of_the_art)&amp;diff=10276</id>
		<title>TOEFL Synonym Questions (State of the art)</title>
		<link rel="alternate" type="text/html" href="https://www.aclweb.org/aclwiki/index.php?title=TOEFL_Synonym_Questions_(State_of_the_art)&amp;diff=10276"/>
		<updated>2013-10-02T21:57:00Z</updated>

		<summary type="html">&lt;p&gt;Wartena: /* Explanation of table */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;* TOEFL = Test of English as a Foreign Language&lt;br /&gt;
* 80 multiple-choice synonym questions; 4 choices per question&lt;br /&gt;
* the TOEFL questions are available on request by contacting [http://lsa.colorado.edu/mail_sub.html LSA Support at CU Boulder], the people who manage the [http://lsa.colorado.edu/ LSA web site at Colorado]&lt;br /&gt;
* introduced in Landauer and Dumais (1997) as a way of evaluating algorithms for measuring degree of similarity between words&lt;br /&gt;
* subsequently used by many other researchers&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Sample question ==&lt;br /&gt;
&lt;br /&gt;
::{| border=&amp;quot;0&amp;quot; cellpadding=&amp;quot;1&amp;quot; cellspacing=&amp;quot;1&amp;quot; &lt;br /&gt;
|-&lt;br /&gt;
! Stem:&lt;br /&gt;
|&lt;br /&gt;
| levied&lt;br /&gt;
|-&lt;br /&gt;
! Choices:&lt;br /&gt;
| (a)&lt;br /&gt;
| imposed&lt;br /&gt;
|-&lt;br /&gt;
|&lt;br /&gt;
| (b)&lt;br /&gt;
| believed&lt;br /&gt;
|-&lt;br /&gt;
|&lt;br /&gt;
| (c)&lt;br /&gt;
| requested&lt;br /&gt;
|-&lt;br /&gt;
|&lt;br /&gt;
| (d)&lt;br /&gt;
| correlated&lt;br /&gt;
|-&lt;br /&gt;
! Solution:&lt;br /&gt;
| (a)&lt;br /&gt;
| imposed&lt;br /&gt;
|-&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Table of results ==&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot; cellpadding=&amp;quot;5&amp;quot; cellspacing=&amp;quot;1&amp;quot; width=&amp;quot;100%&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Algorithm&lt;br /&gt;
! Reference for algorithm&lt;br /&gt;
! Reference for experiment&lt;br /&gt;
! Type&lt;br /&gt;
! Correct&lt;br /&gt;
! 95% confidence&lt;br /&gt;
|-&lt;br /&gt;
| RES&lt;br /&gt;
| Resnik (1995)&lt;br /&gt;
| Jarmasz and Szpakowicz (2003)&lt;br /&gt;
| Hybrid&lt;br /&gt;
| 20.31%&lt;br /&gt;
| 12.89–31.83%&lt;br /&gt;
|-&lt;br /&gt;
| LC&lt;br /&gt;
| Leacock and Chodrow (1998)&lt;br /&gt;
| Jarmasz and Szpakowicz (2003)&lt;br /&gt;
| Lexicon-based&lt;br /&gt;
| 21.88%&lt;br /&gt;
| 13.91–33.21%&lt;br /&gt;
|-&lt;br /&gt;
| LIN&lt;br /&gt;
| Lin (1998)&lt;br /&gt;
| Jarmasz and Szpakowicz (2003)&lt;br /&gt;
| Hybrid&lt;br /&gt;
| 24.06%&lt;br /&gt;
| 15.99–35.94%&lt;br /&gt;
|-&lt;br /&gt;
| Random&lt;br /&gt;
| Random guessing&lt;br /&gt;
| 1 / 4 = 25.00%&lt;br /&gt;
| Random&lt;br /&gt;
| 25.00%&lt;br /&gt;
| 15.99–35.94%&lt;br /&gt;
|-&lt;br /&gt;
| JC&lt;br /&gt;
| Jiang and Conrath (1997)&lt;br /&gt;
| Jarmasz and Szpakowicz (2003)&lt;br /&gt;
| Hybrid&lt;br /&gt;
| 25.00%&lt;br /&gt;
| 15.99–35.94%&lt;br /&gt;
|-&lt;br /&gt;
| LSA&lt;br /&gt;
| Landauer and Dumais (1997)&lt;br /&gt;
| Landauer and Dumais (1997)&lt;br /&gt;
| Corpus-based&lt;br /&gt;
| 64.38%&lt;br /&gt;
| 52.90–74.80%&lt;br /&gt;
|-&lt;br /&gt;
| Human&lt;br /&gt;
| Average non-English US college applicant&lt;br /&gt;
| Landauer and Dumais (1997)&lt;br /&gt;
| Human&lt;br /&gt;
| 64.50%&lt;br /&gt;
| 53.01–74.88%&lt;br /&gt;
|-&lt;br /&gt;
| RI&lt;br /&gt;
| Karlgren and Sahlgren (2001)&lt;br /&gt;
| Karlgren and Sahlgren (2001)&lt;br /&gt;
| Corpus-based&lt;br /&gt;
| 72%&lt;br /&gt;
| &lt;br /&gt;
|-&lt;br /&gt;
| DS&lt;br /&gt;
| Pado and Lapata (2007)&lt;br /&gt;
| Pado and Lapata (2007)&lt;br /&gt;
| Corpus-based&lt;br /&gt;
| 73.00%&lt;br /&gt;
| 62.72-82.96%&lt;br /&gt;
|-&lt;br /&gt;
| PMI-IR&lt;br /&gt;
| Turney (2001)&lt;br /&gt;
| Turney (2001)&lt;br /&gt;
| Corpus-based&lt;br /&gt;
| 73.75%&lt;br /&gt;
| 62.72–82.96%&lt;br /&gt;
|-&lt;br /&gt;
| PairClass&lt;br /&gt;
| Turney (2008)&lt;br /&gt;
| Turney (2008)&lt;br /&gt;
| Corpus-based&lt;br /&gt;
| 76.25%&lt;br /&gt;
| 65.42-85.06%&lt;br /&gt;
|-&lt;br /&gt;
| HSO&lt;br /&gt;
| Hirst and St.-Onge (1998)&lt;br /&gt;
| Jarmasz and Szpakowicz (2003)&lt;br /&gt;
| Lexicon-based&lt;br /&gt;
| 77.91%&lt;br /&gt;
| 68.17–87.11%&lt;br /&gt;
|-&lt;br /&gt;
| JS&lt;br /&gt;
| Jarmasz and Szpakowicz (2003)&lt;br /&gt;
| Jarmasz and Szpakowicz (2003)&lt;br /&gt;
| Lexicon-based&lt;br /&gt;
| 78.75%&lt;br /&gt;
| 68.17–87.11%&lt;br /&gt;
|-&lt;br /&gt;
| PMI-IR&lt;br /&gt;
| Terra and Clarke (2003)&lt;br /&gt;
| Terra and Clarke (2003)&lt;br /&gt;
| Corpus-based&lt;br /&gt;
| 81.25%&lt;br /&gt;
| 70.97–89.11%&lt;br /&gt;
|-&lt;br /&gt;
| CWO&lt;br /&gt;
| Ruiz-Casado et al. (2005)&lt;br /&gt;
| Ruiz-Casado et al. (2005)&lt;br /&gt;
| Web-based&lt;br /&gt;
| 82.55%&lt;br /&gt;
| 72.38–90.09%&lt;br /&gt;
|-&lt;br /&gt;
| PPMIC&lt;br /&gt;
| Bullinaria and Levy (2007)&lt;br /&gt;
| Bullinaria and Levy (2007)&lt;br /&gt;
| Corpus-based&lt;br /&gt;
| 85.00%&lt;br /&gt;
| 75.26-92.00%&lt;br /&gt;
|-&lt;br /&gt;
| GLSA&lt;br /&gt;
| Matveeva et al. (2005)&lt;br /&gt;
| Matveeva et al. (2005)&lt;br /&gt;
| Corpus-based&lt;br /&gt;
| 86.25%&lt;br /&gt;
| 76.73-92.93%&lt;br /&gt;
|-&lt;br /&gt;
| LSA&lt;br /&gt;
| Rapp (2003)&lt;br /&gt;
| Rapp (2003)&lt;br /&gt;
| Corpus-based&lt;br /&gt;
| 92.50%&lt;br /&gt;
| 84.39-97.20%&lt;br /&gt;
|-&lt;br /&gt;
| ADW&lt;br /&gt;
| Pilehvar et al. (2013)&lt;br /&gt;
| Pilehvar et al. (2013)&lt;br /&gt;
| WordNet graph-based (unsupervised)&lt;br /&gt;
| 96.25%&lt;br /&gt;
| 89.43-99.22%&lt;br /&gt;
|-&lt;br /&gt;
| PR&lt;br /&gt;
| Turney et al. (2003)&lt;br /&gt;
| Turney et al. (2003)&lt;br /&gt;
| Hybrid&lt;br /&gt;
| 97.50%&lt;br /&gt;
| 91.26–99.70%&lt;br /&gt;
|-&lt;br /&gt;
| PCCP&lt;br /&gt;
| Bullinaria and Levy (2012)&lt;br /&gt;
| Bullinaria and Levy (2012)&lt;br /&gt;
| Corpus-based&lt;br /&gt;
| 100.00%&lt;br /&gt;
| 96.32-100.00%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== Explanation of table ==&lt;br /&gt;
&lt;br /&gt;
* &#039;&#039;&#039;Algorithm&#039;&#039;&#039; = name of algorithm&lt;br /&gt;
* &#039;&#039;&#039;Reference for algorithm&#039;&#039;&#039; = where to find out more about given algorithm&lt;br /&gt;
* &#039;&#039;&#039;Reference for experiment&#039;&#039;&#039; = where to find out more about evaluation of given algorithm with TOEFL questions&lt;br /&gt;
* &#039;&#039;&#039;Type&#039;&#039;&#039; = general type of algorithm: corpus-based, lexicon-based, hybrid&lt;br /&gt;
* &#039;&#039;&#039;Correct&#039;&#039;&#039; = percent of 80 questions that given algorithm answered correctly&lt;br /&gt;
* &#039;&#039;&#039;95% confidence&#039;&#039;&#039; = confidence interval calculated using the [[Statistical calculators|Binomial Exact Test]]&lt;br /&gt;
* table rows sorted in order of increasing percent correct&lt;br /&gt;
* several WordNet-based similarity measures are implemented in [http://www.d.umn.edu/~tpederse/ Ted Pedersen]&#039;s [http://www.d.umn.edu/~tpederse/similarity.html WordNet::Similarity] package&lt;br /&gt;
* LSA = Latent Semantic Analysis&lt;br /&gt;
* PCCP = Principal Component vectors with Caron P&lt;br /&gt;
* PMI-IR = Pointwise Mutual Information - Information Retrieval&lt;br /&gt;
* PR = Product Rule&lt;br /&gt;
* PPMIC = Positive Pointwise Mutual Information with Cosine&lt;br /&gt;
* GLSA = Generalized Latent Semantic Analysis&lt;br /&gt;
* CWO = Context Window Overlapping&lt;br /&gt;
* DS = Dependency Space&lt;br /&gt;
* RI = Random Indexing&lt;br /&gt;
&lt;br /&gt;
== Notes ==&lt;br /&gt;
&lt;br /&gt;
* the performance of a corpus-based algorithm depends on the corpus, so the difference in performance between two corpus-based systems may be due to the different corpora, rather than the different algorithms&lt;br /&gt;
* the TOEFL questions include nouns, verbs, and adjectives, but some of the WordNet-based algorithms were only designed to work with nouns; this explains some of the lower scores&lt;br /&gt;
* some of the algorithms may have been tuned on the TOEFL questions; read the references for details&lt;br /&gt;
* Landauer and Dumais (1997) report scores that were corrected for guessing by subtracting a penalty of 1/3 for each incorrect answer; they report a score of 52.5% when this penalty is applied; when the penalty is removed, their performance is 64.4% correct&lt;br /&gt;
&lt;br /&gt;
== References ==&lt;br /&gt;
&lt;br /&gt;
Bullinaria, J.A., and Levy, J.P. (2007). [http://www.cs.bham.ac.uk/~jxb/PUBS/BRM.pdf Extracting semantic representations from word co-occurrence statistics: A computational study]. &#039;&#039;Behavior Research Methods&#039;&#039;, 39(3), 510-526.&lt;br /&gt;
&lt;br /&gt;
Bullinaria, J.A., and Levy, J.P. (2012). [http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.228.9582&amp;amp;rep=rep1&amp;amp;type=pdf Extracting semantic representations from word co-occurrence statistics: stop-lists, stemming, and SVD]. &#039;&#039;Behavior Research Methods&#039;&#039;,  44(3):890-907.&lt;br /&gt;
&lt;br /&gt;
Hirst, G., and St-Onge, D. (1998). [http://mirror.eacoss.org/documentation/ITLibrary/IRIS/Data/1997/Hirst/Lexical/1997-Hirst-Lexical.pdf Lexical chains as representation of context for the detection and correction of malapropisms]. In C. Fellbaum (ed.), &#039;&#039;WordNet: An Electronic Lexical Database&#039;&#039;. Cambridge: MIT Press, 305-332.&lt;br /&gt;
&lt;br /&gt;
Jarmasz, M., and Szpakowicz, S. (2003). [http://www.csi.uottawa.ca/~szpak/recent_papers/TR-2003-01.pdf Roget’s thesaurus and semantic similarity], &#039;&#039;Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP-03)&#039;&#039;, Borovets, Bulgaria, September, pp. 212-219.&lt;br /&gt;
&lt;br /&gt;
Jiang, J.J., and Conrath, D.W. (1997). [http://wortschatz.uni-leipzig.de/~sbordag/aalw05/Referate/03_Assoziationen_BudanitskyResnik/Jiang_Conrath_97.pdf Semantic similarity based on corpus statistics and lexical taxonomy]. &#039;&#039;Proceedings of the International Conference on Research in Computational Linguistics&#039;&#039;, Taiwan.&lt;br /&gt;
&lt;br /&gt;
Landauer, T.K., and Dumais, S.T. (1997). [http://lsa.colorado.edu/papers/plato/plato.annote.html A solution to Plato&#039;s problem: The latent semantic analysis theory of the acquisition, induction, and representation of knowledge]. &#039;&#039;Psychological Review&#039;&#039;, 104(2):211–240.&lt;br /&gt;
&lt;br /&gt;
Leacock, C., and Chodorow, M. (1998). [http://books.google.ca/books?id=Rehu8OOzMIMC&amp;amp;lpg=PA265&amp;amp;ots=IpnaLkZUec&amp;amp;lr&amp;amp;pg=PA265#v=onepage&amp;amp;q&amp;amp;f=false Combining local context and WordNet similarity for word sense identification]. In C. Fellbaum (ed.), &#039;&#039;WordNet: An Electronic Lexical Database&#039;&#039;. Cambridge: MIT Press, pp. 265-283.&lt;br /&gt;
&lt;br /&gt;
Lin, D. (1998). [http://www.cs.ualberta.ca/~lindek/papers/sim.pdf An information-theoretic definition of similarity]. &#039;&#039;Proceedings of the 15th International Conference on Machine Learning (ICML-98)&#039;&#039;, Madison, WI, pp. 296-304.&lt;br /&gt;
&lt;br /&gt;
Matveeva, I., Levow, G., Farahat, A., and Royer, C. (2005). [http://people.cs.uchicago.edu/~matveeva/SynGLSA_ranlp_final.pdf Generalized latent semantic analysis for term representation]. &#039;&#039;Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP-05)&#039;&#039;, Borovets, Bulgaria.&lt;br /&gt;
&lt;br /&gt;
Pado, S., and Lapata, M. (2007). [http://www.coli.uni-saarland.de/~pado/pub/papers/cl07_pado.pdf Dependency-based construction of semantic space models]. &#039;&#039;Computational Linguistics&#039;&#039;, 33(2), 161-199.&lt;br /&gt;
&lt;br /&gt;
Pilehvar, M.T., Jurgens D., and Navigli R. (2013). [http://wwwusers.di.uniroma1.it/~navigli/pubs/ACL_2013_Pilehvar_Jurgens_Navigli.pdf Align, disambiguate and walk: A unified approach for measuring semantic similarity]. &#039;&#039;Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (ACL 2013),&#039;&#039; Sofia, Bulgaria.&lt;br /&gt;
&lt;br /&gt;
Rapp, R. (2003). [http://www.amtaweb.org/summit/MTSummit/FinalPapers/19-Rapp-final.pdf Word sense discovery based on sense descriptor dissimilarity]. &#039;&#039;Proceedings of the Ninth Machine Translation Summit&#039;&#039;, pp. 315-322.&lt;br /&gt;
&lt;br /&gt;
Resnik, P. (1995). [http://citeseer.ist.psu.edu/resnik95using.html Using information content to evaluate semantic similarity]. &#039;&#039;Proceedings of the 14th International Joint Conference on Artificial Intelligence (IJCAI-95)&#039;&#039;, Montreal, pp. 448-453.&lt;br /&gt;
&lt;br /&gt;
Ruiz-Casado, M., Alfonseca, E. and Castells, P. (2005) [http://alfonseca.org/pubs/2005-ranlp1.pdf Using context-window overlapping in Synonym Discovery and Ontology Extension]. &#039;&#039;Proceedings of the International Conference Recent Advances in Natural Language Processing (RANLP-2005)&#039;&#039;, Borovets, Bulgaria.&lt;br /&gt;
&lt;br /&gt;
Terra, E., and Clarke, C.L.A. (2003). [http://acl.ldc.upenn.edu/N/N03/N03-1032.pdf Frequency estimates for statistical word similarity measures]. &#039;&#039;Proceedings of the Human Language Technology and North American Chapter of Association of Computational Linguistics Conference 2003 (HLT/NAACL 2003)&#039;&#039;, pp. 244–251.&lt;br /&gt;
&lt;br /&gt;
Turney, P.D. (2001). [http://arxiv.org/abs/cs.LG/0212033 Mining the Web for synonyms: PMI-IR versus LSA on TOEFL]. &#039;&#039;Proceedings of the Twelfth European Conference on Machine Learning (ECML-2001)&#039;&#039;, Freiburg, Germany, pp. 491-502.&lt;br /&gt;
&lt;br /&gt;
Turney, P.D., Littman, M.L., Bigham, J., and Shnayder, V. (2003). [http://arxiv.org/abs/cs.CL/0309035 Combining independent modules to solve multiple-choice synonym and analogy problems]. &#039;&#039;Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP-03)&#039;&#039;, Borovets, Bulgaria, pp. 482-489.&lt;br /&gt;
&lt;br /&gt;
Turney, P.D. (2008). [http://arxiv.org/abs/0809.0124 A uniform approach to analogies, synonyms, antonyms, and associations]. &#039;&#039;Proceedings of the 22nd International Conference on Computational Linguistics (Coling 2008)&#039;&#039;, Manchester, UK, pp. 905-912.&lt;br /&gt;
&lt;br /&gt;
== See also ==&lt;br /&gt;
&lt;br /&gt;
* [[Attributional and Relational Similarity (State of the art)]]&lt;br /&gt;
* [[ESL Synonym Questions (State of the art)|ESL Synonym Questions]]&lt;br /&gt;
* [[SAT Analogy Questions]]&lt;br /&gt;
* [[State of the art]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Category:State of the art]]&lt;/div&gt;</summary>
		<author><name>Wartena</name></author>
	</entry>
	<entry>
		<id>https://www.aclweb.org/aclwiki/index.php?title=TOEFL_Synonym_Questions_(State_of_the_art)&amp;diff=10275</id>
		<title>TOEFL Synonym Questions (State of the art)</title>
		<link rel="alternate" type="text/html" href="https://www.aclweb.org/aclwiki/index.php?title=TOEFL_Synonym_Questions_(State_of_the_art)&amp;diff=10275"/>
		<updated>2013-10-02T21:56:03Z</updated>

		<summary type="html">&lt;p&gt;Wartena: /* Table of results */  Added Randon Indexing Result from Karlgren and Sahlgren&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;* TOEFL = Test of English as a Foreign Language&lt;br /&gt;
* 80 multiple-choice synonym questions; 4 choices per question&lt;br /&gt;
* the TOEFL questions are available on request by contacting [http://lsa.colorado.edu/mail_sub.html LSA Support at CU Boulder], the people who manage the [http://lsa.colorado.edu/ LSA web site at Colorado]&lt;br /&gt;
* introduced in Landauer and Dumais (1997) as a way of evaluating algorithms for measuring degree of similarity between words&lt;br /&gt;
* subsequently used by many other researchers&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Sample question ==&lt;br /&gt;
&lt;br /&gt;
::{| border=&amp;quot;0&amp;quot; cellpadding=&amp;quot;1&amp;quot; cellspacing=&amp;quot;1&amp;quot; &lt;br /&gt;
|-&lt;br /&gt;
! Stem:&lt;br /&gt;
|&lt;br /&gt;
| levied&lt;br /&gt;
|-&lt;br /&gt;
! Choices:&lt;br /&gt;
| (a)&lt;br /&gt;
| imposed&lt;br /&gt;
|-&lt;br /&gt;
|&lt;br /&gt;
| (b)&lt;br /&gt;
| believed&lt;br /&gt;
|-&lt;br /&gt;
|&lt;br /&gt;
| (c)&lt;br /&gt;
| requested&lt;br /&gt;
|-&lt;br /&gt;
|&lt;br /&gt;
| (d)&lt;br /&gt;
| correlated&lt;br /&gt;
|-&lt;br /&gt;
! Solution:&lt;br /&gt;
| (a)&lt;br /&gt;
| imposed&lt;br /&gt;
|-&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Table of results ==&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot; cellpadding=&amp;quot;5&amp;quot; cellspacing=&amp;quot;1&amp;quot; width=&amp;quot;100%&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Algorithm&lt;br /&gt;
! Reference for algorithm&lt;br /&gt;
! Reference for experiment&lt;br /&gt;
! Type&lt;br /&gt;
! Correct&lt;br /&gt;
! 95% confidence&lt;br /&gt;
|-&lt;br /&gt;
| RES&lt;br /&gt;
| Resnik (1995)&lt;br /&gt;
| Jarmasz and Szpakowicz (2003)&lt;br /&gt;
| Hybrid&lt;br /&gt;
| 20.31%&lt;br /&gt;
| 12.89–31.83%&lt;br /&gt;
|-&lt;br /&gt;
| LC&lt;br /&gt;
| Leacock and Chodrow (1998)&lt;br /&gt;
| Jarmasz and Szpakowicz (2003)&lt;br /&gt;
| Lexicon-based&lt;br /&gt;
| 21.88%&lt;br /&gt;
| 13.91–33.21%&lt;br /&gt;
|-&lt;br /&gt;
| LIN&lt;br /&gt;
| Lin (1998)&lt;br /&gt;
| Jarmasz and Szpakowicz (2003)&lt;br /&gt;
| Hybrid&lt;br /&gt;
| 24.06%&lt;br /&gt;
| 15.99–35.94%&lt;br /&gt;
|-&lt;br /&gt;
| Random&lt;br /&gt;
| Random guessing&lt;br /&gt;
| 1 / 4 = 25.00%&lt;br /&gt;
| Random&lt;br /&gt;
| 25.00%&lt;br /&gt;
| 15.99–35.94%&lt;br /&gt;
|-&lt;br /&gt;
| JC&lt;br /&gt;
| Jiang and Conrath (1997)&lt;br /&gt;
| Jarmasz and Szpakowicz (2003)&lt;br /&gt;
| Hybrid&lt;br /&gt;
| 25.00%&lt;br /&gt;
| 15.99–35.94%&lt;br /&gt;
|-&lt;br /&gt;
| LSA&lt;br /&gt;
| Landauer and Dumais (1997)&lt;br /&gt;
| Landauer and Dumais (1997)&lt;br /&gt;
| Corpus-based&lt;br /&gt;
| 64.38%&lt;br /&gt;
| 52.90–74.80%&lt;br /&gt;
|-&lt;br /&gt;
| Human&lt;br /&gt;
| Average non-English US college applicant&lt;br /&gt;
| Landauer and Dumais (1997)&lt;br /&gt;
| Human&lt;br /&gt;
| 64.50%&lt;br /&gt;
| 53.01–74.88%&lt;br /&gt;
|-&lt;br /&gt;
| RI&lt;br /&gt;
| Karlgren and Sahlgren (2001)&lt;br /&gt;
| Karlgren and Sahlgren (2001)&lt;br /&gt;
| Corpus-based&lt;br /&gt;
| 72%&lt;br /&gt;
| &lt;br /&gt;
|-&lt;br /&gt;
| DS&lt;br /&gt;
| Pado and Lapata (2007)&lt;br /&gt;
| Pado and Lapata (2007)&lt;br /&gt;
| Corpus-based&lt;br /&gt;
| 73.00%&lt;br /&gt;
| 62.72-82.96%&lt;br /&gt;
|-&lt;br /&gt;
| PMI-IR&lt;br /&gt;
| Turney (2001)&lt;br /&gt;
| Turney (2001)&lt;br /&gt;
| Corpus-based&lt;br /&gt;
| 73.75%&lt;br /&gt;
| 62.72–82.96%&lt;br /&gt;
|-&lt;br /&gt;
| PairClass&lt;br /&gt;
| Turney (2008)&lt;br /&gt;
| Turney (2008)&lt;br /&gt;
| Corpus-based&lt;br /&gt;
| 76.25%&lt;br /&gt;
| 65.42-85.06%&lt;br /&gt;
|-&lt;br /&gt;
| HSO&lt;br /&gt;
| Hirst and St.-Onge (1998)&lt;br /&gt;
| Jarmasz and Szpakowicz (2003)&lt;br /&gt;
| Lexicon-based&lt;br /&gt;
| 77.91%&lt;br /&gt;
| 68.17–87.11%&lt;br /&gt;
|-&lt;br /&gt;
| JS&lt;br /&gt;
| Jarmasz and Szpakowicz (2003)&lt;br /&gt;
| Jarmasz and Szpakowicz (2003)&lt;br /&gt;
| Lexicon-based&lt;br /&gt;
| 78.75%&lt;br /&gt;
| 68.17–87.11%&lt;br /&gt;
|-&lt;br /&gt;
| PMI-IR&lt;br /&gt;
| Terra and Clarke (2003)&lt;br /&gt;
| Terra and Clarke (2003)&lt;br /&gt;
| Corpus-based&lt;br /&gt;
| 81.25%&lt;br /&gt;
| 70.97–89.11%&lt;br /&gt;
|-&lt;br /&gt;
| CWO&lt;br /&gt;
| Ruiz-Casado et al. (2005)&lt;br /&gt;
| Ruiz-Casado et al. (2005)&lt;br /&gt;
| Web-based&lt;br /&gt;
| 82.55%&lt;br /&gt;
| 72.38–90.09%&lt;br /&gt;
|-&lt;br /&gt;
| PPMIC&lt;br /&gt;
| Bullinaria and Levy (2007)&lt;br /&gt;
| Bullinaria and Levy (2007)&lt;br /&gt;
| Corpus-based&lt;br /&gt;
| 85.00%&lt;br /&gt;
| 75.26-92.00%&lt;br /&gt;
|-&lt;br /&gt;
| GLSA&lt;br /&gt;
| Matveeva et al. (2005)&lt;br /&gt;
| Matveeva et al. (2005)&lt;br /&gt;
| Corpus-based&lt;br /&gt;
| 86.25%&lt;br /&gt;
| 76.73-92.93%&lt;br /&gt;
|-&lt;br /&gt;
| LSA&lt;br /&gt;
| Rapp (2003)&lt;br /&gt;
| Rapp (2003)&lt;br /&gt;
| Corpus-based&lt;br /&gt;
| 92.50%&lt;br /&gt;
| 84.39-97.20%&lt;br /&gt;
|-&lt;br /&gt;
| ADW&lt;br /&gt;
| Pilehvar et al. (2013)&lt;br /&gt;
| Pilehvar et al. (2013)&lt;br /&gt;
| WordNet graph-based (unsupervised)&lt;br /&gt;
| 96.25%&lt;br /&gt;
| 89.43-99.22%&lt;br /&gt;
|-&lt;br /&gt;
| PR&lt;br /&gt;
| Turney et al. (2003)&lt;br /&gt;
| Turney et al. (2003)&lt;br /&gt;
| Hybrid&lt;br /&gt;
| 97.50%&lt;br /&gt;
| 91.26–99.70%&lt;br /&gt;
|-&lt;br /&gt;
| PCCP&lt;br /&gt;
| Bullinaria and Levy (2012)&lt;br /&gt;
| Bullinaria and Levy (2012)&lt;br /&gt;
| Corpus-based&lt;br /&gt;
| 100.00%&lt;br /&gt;
| 96.32-100.00%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== Explanation of table ==&lt;br /&gt;
&lt;br /&gt;
* &#039;&#039;&#039;Algorithm&#039;&#039;&#039; = name of algorithm&lt;br /&gt;
* &#039;&#039;&#039;Reference for algorithm&#039;&#039;&#039; = where to find out more about given algorithm&lt;br /&gt;
* &#039;&#039;&#039;Reference for experiment&#039;&#039;&#039; = where to find out more about evaluation of given algorithm with TOEFL questions&lt;br /&gt;
* &#039;&#039;&#039;Type&#039;&#039;&#039; = general type of algorithm: corpus-based, lexicon-based, hybrid&lt;br /&gt;
* &#039;&#039;&#039;Correct&#039;&#039;&#039; = percent of 80 questions that given algorithm answered correctly&lt;br /&gt;
* &#039;&#039;&#039;95% confidence&#039;&#039;&#039; = confidence interval calculated using the [[Statistical calculators|Binomial Exact Test]]&lt;br /&gt;
* table rows sorted in order of increasing percent correct&lt;br /&gt;
* several WordNet-based similarity measures are implemented in [http://www.d.umn.edu/~tpederse/ Ted Pedersen]&#039;s [http://www.d.umn.edu/~tpederse/similarity.html WordNet::Similarity] package&lt;br /&gt;
* LSA = Latent Semantic Analysis&lt;br /&gt;
* PCCP = Principal Component vectors with Caron P&lt;br /&gt;
* PMI-IR = Pointwise Mutual Information - Information Retrieval&lt;br /&gt;
* PR = Product Rule&lt;br /&gt;
* PPMIC = Positive Pointwise Mutual Information with Cosine&lt;br /&gt;
* GLSA = Generalized Latent Semantic Analysis&lt;br /&gt;
* CWO = Context Window Overlapping&lt;br /&gt;
* DS = Dependency Space&lt;br /&gt;
&lt;br /&gt;
== Notes ==&lt;br /&gt;
&lt;br /&gt;
* the performance of a corpus-based algorithm depends on the corpus, so the difference in performance between two corpus-based systems may be due to the different corpora, rather than the different algorithms&lt;br /&gt;
* the TOEFL questions include nouns, verbs, and adjectives, but some of the WordNet-based algorithms were only designed to work with nouns; this explains some of the lower scores&lt;br /&gt;
* some of the algorithms may have been tuned on the TOEFL questions; read the references for details&lt;br /&gt;
* Landauer and Dumais (1997) report scores that were corrected for guessing by subtracting a penalty of 1/3 for each incorrect answer; they report a score of 52.5% when this penalty is applied; when the penalty is removed, their performance is 64.4% correct&lt;br /&gt;
&lt;br /&gt;
== References ==&lt;br /&gt;
&lt;br /&gt;
Bullinaria, J.A., and Levy, J.P. (2007). [http://www.cs.bham.ac.uk/~jxb/PUBS/BRM.pdf Extracting semantic representations from word co-occurrence statistics: A computational study]. &#039;&#039;Behavior Research Methods&#039;&#039;, 39(3), 510-526.&lt;br /&gt;
&lt;br /&gt;
Bullinaria, J.A., and Levy, J.P. (2012). [http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.228.9582&amp;amp;rep=rep1&amp;amp;type=pdf Extracting semantic representations from word co-occurrence statistics: stop-lists, stemming, and SVD]. &#039;&#039;Behavior Research Methods&#039;&#039;,  44(3):890-907.&lt;br /&gt;
&lt;br /&gt;
Hirst, G., and St-Onge, D. (1998). [http://mirror.eacoss.org/documentation/ITLibrary/IRIS/Data/1997/Hirst/Lexical/1997-Hirst-Lexical.pdf Lexical chains as representation of context for the detection and correction of malapropisms]. In C. Fellbaum (ed.), &#039;&#039;WordNet: An Electronic Lexical Database&#039;&#039;. Cambridge: MIT Press, 305-332.&lt;br /&gt;
&lt;br /&gt;
Jarmasz, M., and Szpakowicz, S. (2003). [http://www.csi.uottawa.ca/~szpak/recent_papers/TR-2003-01.pdf Roget’s thesaurus and semantic similarity], &#039;&#039;Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP-03)&#039;&#039;, Borovets, Bulgaria, September, pp. 212-219.&lt;br /&gt;
&lt;br /&gt;
Jiang, J.J., and Conrath, D.W. (1997). [http://wortschatz.uni-leipzig.de/~sbordag/aalw05/Referate/03_Assoziationen_BudanitskyResnik/Jiang_Conrath_97.pdf Semantic similarity based on corpus statistics and lexical taxonomy]. &#039;&#039;Proceedings of the International Conference on Research in Computational Linguistics&#039;&#039;, Taiwan.&lt;br /&gt;
&lt;br /&gt;
Landauer, T.K., and Dumais, S.T. (1997). [http://lsa.colorado.edu/papers/plato/plato.annote.html A solution to Plato&#039;s problem: The latent semantic analysis theory of the acquisition, induction, and representation of knowledge]. &#039;&#039;Psychological Review&#039;&#039;, 104(2):211–240.&lt;br /&gt;
&lt;br /&gt;
Leacock, C., and Chodorow, M. (1998). [http://books.google.ca/books?id=Rehu8OOzMIMC&amp;amp;lpg=PA265&amp;amp;ots=IpnaLkZUec&amp;amp;lr&amp;amp;pg=PA265#v=onepage&amp;amp;q&amp;amp;f=false Combining local context and WordNet similarity for word sense identification]. In C. Fellbaum (ed.), &#039;&#039;WordNet: An Electronic Lexical Database&#039;&#039;. Cambridge: MIT Press, pp. 265-283.&lt;br /&gt;
&lt;br /&gt;
Lin, D. (1998). [http://www.cs.ualberta.ca/~lindek/papers/sim.pdf An information-theoretic definition of similarity]. &#039;&#039;Proceedings of the 15th International Conference on Machine Learning (ICML-98)&#039;&#039;, Madison, WI, pp. 296-304.&lt;br /&gt;
&lt;br /&gt;
Matveeva, I., Levow, G., Farahat, A., and Royer, C. (2005). [http://people.cs.uchicago.edu/~matveeva/SynGLSA_ranlp_final.pdf Generalized latent semantic analysis for term representation]. &#039;&#039;Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP-05)&#039;&#039;, Borovets, Bulgaria.&lt;br /&gt;
&lt;br /&gt;
Pado, S., and Lapata, M. (2007). [http://www.coli.uni-saarland.de/~pado/pub/papers/cl07_pado.pdf Dependency-based construction of semantic space models]. &#039;&#039;Computational Linguistics&#039;&#039;, 33(2), 161-199.&lt;br /&gt;
&lt;br /&gt;
Pilehvar, M.T., Jurgens D., and Navigli R. (2013). [http://wwwusers.di.uniroma1.it/~navigli/pubs/ACL_2013_Pilehvar_Jurgens_Navigli.pdf Align, disambiguate and walk: A unified approach for measuring semantic similarity]. &#039;&#039;Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (ACL 2013),&#039;&#039; Sofia, Bulgaria.&lt;br /&gt;
&lt;br /&gt;
Rapp, R. (2003). [http://www.amtaweb.org/summit/MTSummit/FinalPapers/19-Rapp-final.pdf Word sense discovery based on sense descriptor dissimilarity]. &#039;&#039;Proceedings of the Ninth Machine Translation Summit&#039;&#039;, pp. 315-322.&lt;br /&gt;
&lt;br /&gt;
Resnik, P. (1995). [http://citeseer.ist.psu.edu/resnik95using.html Using information content to evaluate semantic similarity]. &#039;&#039;Proceedings of the 14th International Joint Conference on Artificial Intelligence (IJCAI-95)&#039;&#039;, Montreal, pp. 448-453.&lt;br /&gt;
&lt;br /&gt;
Ruiz-Casado, M., Alfonseca, E. and Castells, P. (2005) [http://alfonseca.org/pubs/2005-ranlp1.pdf Using context-window overlapping in Synonym Discovery and Ontology Extension]. &#039;&#039;Proceedings of the International Conference Recent Advances in Natural Language Processing (RANLP-2005)&#039;&#039;, Borovets, Bulgaria.&lt;br /&gt;
&lt;br /&gt;
Terra, E., and Clarke, C.L.A. (2003). [http://acl.ldc.upenn.edu/N/N03/N03-1032.pdf Frequency estimates for statistical word similarity measures]. &#039;&#039;Proceedings of the Human Language Technology and North American Chapter of Association of Computational Linguistics Conference 2003 (HLT/NAACL 2003)&#039;&#039;, pp. 244–251.&lt;br /&gt;
&lt;br /&gt;
Turney, P.D. (2001). [http://arxiv.org/abs/cs.LG/0212033 Mining the Web for synonyms: PMI-IR versus LSA on TOEFL]. &#039;&#039;Proceedings of the Twelfth European Conference on Machine Learning (ECML-2001)&#039;&#039;, Freiburg, Germany, pp. 491-502.&lt;br /&gt;
&lt;br /&gt;
Turney, P.D., Littman, M.L., Bigham, J., and Shnayder, V. (2003). [http://arxiv.org/abs/cs.CL/0309035 Combining independent modules to solve multiple-choice synonym and analogy problems]. &#039;&#039;Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP-03)&#039;&#039;, Borovets, Bulgaria, pp. 482-489.&lt;br /&gt;
&lt;br /&gt;
Turney, P.D. (2008). [http://arxiv.org/abs/0809.0124 A uniform approach to analogies, synonyms, antonyms, and associations]. &#039;&#039;Proceedings of the 22nd International Conference on Computational Linguistics (Coling 2008)&#039;&#039;, Manchester, UK, pp. 905-912.&lt;br /&gt;
&lt;br /&gt;
== See also ==&lt;br /&gt;
&lt;br /&gt;
* [[Attributional and Relational Similarity (State of the art)]]&lt;br /&gt;
* [[ESL Synonym Questions (State of the art)|ESL Synonym Questions]]&lt;br /&gt;
* [[SAT Analogy Questions]]&lt;br /&gt;
* [[State of the art]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Category:State of the art]]&lt;/div&gt;</summary>
		<author><name>Wartena</name></author>
	</entry>
</feed>