<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://www.aclweb.org/aclwiki/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Anna+gladkova</id>
	<title>ACL Wiki - User contributions [en]</title>
	<link rel="self" type="application/atom+xml" href="https://www.aclweb.org/aclwiki/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Anna+gladkova"/>
	<link rel="alternate" type="text/html" href="https://www.aclweb.org/aclwiki/Special:Contributions/Anna_gladkova"/>
	<updated>2026-04-08T22:57:55Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.43.6</generator>
	<entry>
		<id>https://www.aclweb.org/aclwiki/index.php?title=Google_analogy_test_set_(State_of_the_art)&amp;diff=11735</id>
		<title>Google analogy test set (State of the art)</title>
		<link rel="alternate" type="text/html" href="https://www.aclweb.org/aclwiki/index.php?title=Google_analogy_test_set_(State_of_the_art)&amp;diff=11735"/>
		<updated>2017-01-06T10:07:44Z</updated>

		<summary type="html">&lt;p&gt;Anna gladkova: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;* Test set developed by Mikolov et al. (2013b)&amp;lt;ref&amp;gt;Mikolov, T., Chen, K., Corrado, G., &amp;amp; Dean, J. (2013). Efficient estimation of word representations in vector space. In Proceedings of International Conference on Learning Representations (ICLR).&amp;lt;/ref&amp;gt; &lt;br /&gt;
* Contains 19544 question pairs (8,869 semantic and 10,675 syntactic (i.e. morphological) questions)&lt;br /&gt;
* 14 types of relations (9 morphological and 5 semantic)&lt;br /&gt;
* [http://download.tensorflow.org/data/questions-words.txt Original link deprecated, copy hosted @TensorFlow]&lt;br /&gt;
&lt;br /&gt;
This page reports results obtained with the &amp;quot;vanilla&amp;quot; 3CosAdd method, or vector offset&amp;lt;ref name=&amp;quot;Mikolov2013&amp;quot;/&amp;gt;. For other methods, see [[Analogy (State of the art)]] &lt;br /&gt;
&lt;br /&gt;
== Table of results == &lt;br /&gt;
&lt;br /&gt;
* &#039;&#039;&#039;Listed in chronological order.&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot; cellpadding=&amp;quot;5&amp;quot; cellspacing=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Model&lt;br /&gt;
! Reference&lt;br /&gt;
! Sem&lt;br /&gt;
! Syn&lt;br /&gt;
! Corpus and window size&lt;br /&gt;
|-&lt;br /&gt;
| CBOW (640 dim)&lt;br /&gt;
| Mikolov et al (2013) &amp;lt;ref name = &amp;quot;Mikolov2013&amp;quot;&amp;gt;Mikolov, T., Chen, K., Corrado, G., &amp;amp; Dean, J. (2013). Efficient estimation of word representations in vector space. In Proceedings of International Conference on Learning Representations (ICLR).&amp;lt;/ref&amp;gt;&lt;br /&gt;
| 24.0&lt;br /&gt;
| 64.0&lt;br /&gt;
| 6B Google News corpus, window 10&lt;br /&gt;
|-&lt;br /&gt;
| Skip-Gram (640 dim)&lt;br /&gt;
| Mikolov et al (2013) &amp;lt;ref name = &amp;quot;Mikolov2013&amp;quot;/&amp;gt;&lt;br /&gt;
| 55.0&lt;br /&gt;
| 59.0&lt;br /&gt;
| ibid&lt;br /&gt;
|-&lt;br /&gt;
| RNNLM (640 dim)&lt;br /&gt;
| Mikolov et al (2013) &amp;lt;ref name = &amp;quot;Mikolov2013&amp;quot;/&amp;gt;&lt;br /&gt;
| 9.0&lt;br /&gt;
| 36.0&lt;br /&gt;
| &lt;br /&gt;
|-&lt;br /&gt;
| NNLM (640 dim)&lt;br /&gt;
| Mikolov et al (2013) &amp;lt;ref name = &amp;quot;Mikolov2013&amp;quot;/&amp;gt;&lt;br /&gt;
| 23.0&lt;br /&gt;
| 53.0&lt;br /&gt;
| &lt;br /&gt;
|-&lt;br /&gt;
| GloVe (300 dim)&lt;br /&gt;
| Pennington et al (2014) &amp;lt;ref name = &amp;quot;GloVe&amp;quot;&amp;gt;Pennington, J., Socher, R., &amp;amp; Manning, C. D. (2014). Glove: Global vectors for word representation. In Proceedings of the Empiricial Methods in Natural Language Processing (EMNLP 2014) (Vol. 12, pp. 1532–1543). Retrieved from http://llcao.net/cu-deeplearning15/presentation/nn-pres.pdf&amp;lt;/ref&amp;gt;&lt;br /&gt;
| 81.9&lt;br /&gt;
| 69.3&lt;br /&gt;
| 42 B corpus, window 5&lt;br /&gt;
|-&lt;br /&gt;
| SVD&lt;br /&gt;
| Levy et al (2015) &amp;lt;ref name = &amp;quot;Levy2015&amp;quot;&amp;gt;Levy, O., Goldberg, Y., &amp;amp; Dagan, I. (2015). Improving distributional similarity with lessons learned from word embeddings. Transactions of the Association for Computational Linguistics, 3, 211–225.&amp;lt;/ref&amp;gt;&lt;br /&gt;
| colspan=&amp;quot;2&amp;quot; style=&amp;quot;text-align:center;&amp;quot;|55.4&lt;br /&gt;
| Wikipedia 1.5B, window 2&lt;br /&gt;
|-&lt;br /&gt;
| PPMI&lt;br /&gt;
| Levy et al (2015)  &amp;lt;ref name = &amp;quot;Levy2015&amp;quot;/&amp;gt;&lt;br /&gt;
| colspan=&amp;quot;2&amp;quot; style=&amp;quot;text-align:center;&amp;quot;|55.3&lt;br /&gt;
| ibid&lt;br /&gt;
|-&lt;br /&gt;
| Skip-Gram&lt;br /&gt;
| Levy et al (2015)  &amp;lt;ref name = &amp;quot;Levy2015&amp;quot;/&amp;gt;&lt;br /&gt;
| colspan=&amp;quot;2&amp;quot; style=&amp;quot;text-align:center;&amp;quot;|67.6&lt;br /&gt;
| ibid&lt;br /&gt;
|-&lt;br /&gt;
| GloVe&lt;br /&gt;
| Levy et al (2015)  &amp;lt;ref name = &amp;quot;Levy2015&amp;quot;/&amp;gt;&lt;br /&gt;
| colspan=&amp;quot;2&amp;quot; style=&amp;quot;text-align:center;&amp;quot;|56.9&lt;br /&gt;
| ibid&lt;br /&gt;
|-&lt;br /&gt;
| Skip-Gram (50 dim)&lt;br /&gt;
| Lai et al (2015) &amp;lt;ref name = &amp;quot;Lai2015&amp;quot;&amp;gt;Lai, S., Liu, K., Xu, L., &amp;amp; Zhao, J. (2015). How to Generate a Good Word Embedding? arXiv Preprint arXiv:1507.05523. Retrieved from http://arxiv.org/abs/1507.05523&amp;lt;/ref&amp;gt;&lt;br /&gt;
| 44.8&lt;br /&gt;
| 44.43&lt;br /&gt;
| W&amp;amp;N 2.8 B corpus, window 5&lt;br /&gt;
|-&lt;br /&gt;
| CBOW (50 dim)&lt;br /&gt;
| Lai et al (2015) &amp;lt;ref name = &amp;quot;Lai2015&amp;quot;/&amp;gt;&lt;br /&gt;
| 44.43&lt;br /&gt;
| 55.83&lt;br /&gt;
| ibid&lt;br /&gt;
|-&lt;br /&gt;
| DVRS+SG (300 dim)&lt;br /&gt;
| Garten et al (2015) &amp;lt;ref&amp;gt;Garten, J., Sagae, K., Ustun, V., &amp;amp; Dehghani, M. (2015). Combining Distributed Vector Representations for Words. In Proceedings of NAACL-HLT (pp. 95–101). Retrieved from http://www.researchgate.net/profile/Volkan_Ustun/publication/277332298_Combining_Distributed_Vector_Representations_for_Words/links/55705a6308aee1eea7586e93.pdf&amp;lt;/ref&amp;gt;&lt;br /&gt;
| 74.0&lt;br /&gt;
| 60.0&lt;br /&gt;
| enwiki9, window 10&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Methodological Issues ==&lt;br /&gt;
&lt;br /&gt;
* This test set is not balanced: 20-70 pairs per category, different number of semantic and morphological relations. See other sets at [[Analogy (State of the art)]].&lt;br /&gt;
* In the semantic part, &#039;&#039;country:capital&#039;&#039; relation accounts for over 50% of all semantic questions. &lt;br /&gt;
* Researchers usually report only the average accuracy for all semantic/syntactic questions, but there is a lot of variation for individual relations - between 10.53% and 99.41% &amp;lt;ref&amp;gt;Levy, O., Goldberg, Y., &amp;amp; Ramat-Gan, I. (2014). Linguistic Regularities in Sparse and Explicit Word Representations. In CoNLL (pp. 171–180). Retrieved from http://anthology.aclweb.org/W/W14/W14-1618.pdf&amp;lt;/ref&amp;gt;, also depending on parameters of the model &amp;lt;ref&amp;gt;Gladkova, A., Drozd, A., &amp;amp; Matsuoka, S. (2016). Analogy-based detection of morphological and semantic relations with word embeddings: what works and what doesn’t. In Proceedings of the NAACL-HLT SRW (pp. 47–54). San Diego, California, June 12-17, 2016: ACL. Retrieved from https://www.aclweb.org/anthology/N/N16/N16-2002.pdf&amp;lt;/ref&amp;gt;. Since the test is not balanced, the above results could be flattering to the embeddings, and averaging the mean scores for each subcategory would yield lower results.&lt;br /&gt;
* Accuracy also depends on the method with which analogies are solved &amp;lt;ref&amp;gt;Linzen, T. (2016). Issues in evaluating semantic spaces using word analogies. In Proceedings of the First Workshop on Evaluating Vector Space Representations for NLP. Association for Computational Linguistics. Retrieved from http://anthology.aclweb.org/W16-2503&amp;lt;/ref&amp;gt; &amp;lt;ref&amp;gt;Levy, O., Goldberg, Y., &amp;amp; Ramat-Gan, I. (2014). Linguistic Regularities in Sparse and Explicit Word Representations. In CoNLL (pp. 171–180). Retrieved from http://anthology.aclweb.org/W/W14/W14-1618.pdf&lt;br /&gt;
&amp;lt;/ref&amp;gt;. Set-based methods&amp;lt;ref&amp;gt;Drozd, A., Gladkova, A., &amp;amp; Matsuoka, S. (2016). Word embeddings, analogies, and machine learning: beyond king - man + woman = queen. In Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers (pp. 3519–3530). Osaka, Japan, December 11-17: ACL. Retrieved from https://www.aclweb.org/anthology/C/C16/C16-1332.pdf&amp;lt;/ref&amp;gt; considerably outperform pair-based methods, showing that models do in fact encode much &amp;quot;missed&amp;quot; information. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== References ==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Category:State of the art]]&lt;/div&gt;</summary>
		<author><name>Anna gladkova</name></author>
	</entry>
	<entry>
		<id>https://www.aclweb.org/aclwiki/index.php?title=Bigger_analogy_test_set_(State_of_the_art)&amp;diff=11734</id>
		<title>Bigger analogy test set (State of the art)</title>
		<link rel="alternate" type="text/html" href="https://www.aclweb.org/aclwiki/index.php?title=Bigger_analogy_test_set_(State_of_the_art)&amp;diff=11734"/>
		<updated>2017-01-06T10:06:22Z</updated>

		<summary type="html">&lt;p&gt;Anna gladkova: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
&lt;br /&gt;
== Dataset description ==&lt;br /&gt;
* New dataset proposed by Gladkova et al. (2016) &amp;lt;ref name = &amp;quot;Gladkova2016&amp;quot;&amp;gt;Gladkova, A., Drozd, A., &amp;amp; Matsuoka, S. (2016). Analogy-based detection of morphological and semantic relations with word embeddings: what works and what doesn’t. In Proceedings of the NAACL-HLT SRW (pp. 47–54). San Diego, California, June 12-17, 2016: ACL. Retrieved from https://www.aclweb.org/anthology/N/N16/N16-2002.pdf&lt;br /&gt;
&amp;lt;/ref&amp;gt;&lt;br /&gt;
* available [http://vsm.blackbird.pw/bats here]&lt;br /&gt;
* dataset balanced across 4 types of relations (inflectional morphology, derivational morphology, lexicographic semantics, encyclopedic semantics)&lt;br /&gt;
* 10 relations of each type, 50 unique pairs per category&lt;br /&gt;
* 99,200 questions in total&lt;br /&gt;
* more challenging than the Google set because of more diverse relations &lt;br /&gt;
* where applicable, more than one correct answer is supplied (e.g. both &#039;&#039;canine&#039;&#039; and &#039;&#039;animal&#039;&#039; are hypernyms of &#039;&#039;dog&#039;&#039;). &lt;br /&gt;
* comes with a testing script [https://github.com/undertherain/vsmlib/blob/master/scripts/test_analogy.py a testing script]  that implements 5 methods of solving analogies (See [[Analogy (State of the art)]])&lt;br /&gt;
&lt;br /&gt;
This page reports results obtained with the &amp;quot;vanilla&amp;quot; 3CosAdd method, or vector offset&amp;lt;ref name = &amp;quot;Mikolov2013&amp;quot;&amp;gt;Mikolov, T., Chen, K., Corrado, G., &amp;amp; Dean, J. (2013). Efficient estimation of word representations in vector space. In Proceedings of International Conference on Learning Representations (ICLR).&amp;lt;/ref&amp;gt;. &lt;br /&gt;
&lt;br /&gt;
== Table of results == &lt;br /&gt;
&lt;br /&gt;
* &#039;&#039;&#039;Listed in chronological order.&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot; cellpadding=&amp;quot;5&amp;quot; cellspacing=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Model&lt;br /&gt;
! Reference&lt;br /&gt;
! Inflectional &amp;lt;br/&amp;gt; morphology&lt;br /&gt;
! Derivational &amp;lt;br/&amp;gt; morphology&lt;br /&gt;
! Lexicographic &amp;lt;br/&amp;gt; semantics&lt;br /&gt;
! Encyclopedic &amp;lt;br/&amp;gt; semantics&lt;br /&gt;
! Corpus, window size, vector size&lt;br /&gt;
|-&lt;br /&gt;
| SVD&lt;br /&gt;
| Drozd et al. (2016) &amp;lt;ref name = &amp;quot;Drozd2016&amp;quot;&amp;gt;Drozd, A., Gladkova, A., &amp;amp; Matsuoka, S. (2016). Word embeddings, analogies, and machine learning: beyond king - man + woman = queen. In Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers (pp. 3519–3530). Osaka, Japan, December 11-17: ACL. Retrieved from https://www.aclweb.org/anthology/C/C16/C16-1332.pdf&amp;lt;/ref&amp;gt;&lt;br /&gt;
|44.0&lt;br /&gt;
|9.8&lt;br /&gt;
|10.1&lt;br /&gt;
|18.5&lt;br /&gt;
| 5B corpus (Araneum + Wikipedia + UkWac), window 3, 1000 dimensions&lt;br /&gt;
|-&lt;br /&gt;
| GloVe&lt;br /&gt;
| Drozd et al. (2016) &amp;lt;ref name = &amp;quot;Drozd2016&amp;quot;/&amp;gt;&lt;br /&gt;
| 59.9&lt;br /&gt;
| 10.2&lt;br /&gt;
| 10.9&lt;br /&gt;
|31.5&lt;br /&gt;
| 5B corpus (Araneum + Wikipedia + UkWac), window 8, 300 dimensions&lt;br /&gt;
|-&lt;br /&gt;
| Skip-Gram&lt;br /&gt;
| Drozd et al. (2016) &amp;lt;ref name = &amp;quot;Drozd2016&amp;quot;/&amp;gt;&lt;br /&gt;
| 61.0&lt;br /&gt;
| 11.2&lt;br /&gt;
| 9.1&lt;br /&gt;
| 26.5&lt;br /&gt;
| 5B corpus (Araneum + Wikipedia + UkWac), window 8, 300 dimensions&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Methodological issues ==&lt;br /&gt;
&lt;br /&gt;
* As with other analogy test sets, accuracy depends not only on the embedding and its parameters, but also on the method with which analogies are solved &amp;lt;ref&amp;gt;Linzen, T. (2016). Issues in evaluating semantic spaces using word analogies. In Proceedings of the First Workshop on Evaluating Vector Space Representations for NLP. Association for Computational Linguistics. Retrieved from http://anthology.aclweb.org/W16-2503&amp;lt;/ref&amp;gt; &amp;lt;ref&amp;gt;Levy, O., Goldberg, Y., &amp;amp; Ramat-Gan, I. (2014). Linguistic Regularities in Sparse and Explicit Word Representations. In CoNLL (pp. 171–180). Retrieved from http://anthology.aclweb.org/W/W14/W14-1618.pdf&lt;br /&gt;
&amp;lt;/ref&amp;gt;. Set-based methods&amp;lt;ref&amp;gt;Drozd, A., Gladkova, A., &amp;amp; Matsuoka, S. (2016). Word embeddings, analogies, and machine learning: beyond king - man + woman = queen. In Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers (pp. 3519–3530). Osaka, Japan, December 11-17: ACL. Retrieved from https://www.aclweb.org/anthology/C/C16/C16-1332.pdf&amp;lt;/ref&amp;gt; considerably outperform pair-based methods, showing that models do in fact encode much &amp;quot;missed&amp;quot; information.&lt;br /&gt;
* Therefore it is more accurate to think of analogy task as a way to describe and characterize an embedding, rather than evaluate it.&lt;br /&gt;
&lt;br /&gt;
== References ==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Category:State of the art]]&lt;/div&gt;</summary>
		<author><name>Anna gladkova</name></author>
	</entry>
	<entry>
		<id>https://www.aclweb.org/aclwiki/index.php?title=Bigger_analogy_test_set_(State_of_the_art)&amp;diff=11733</id>
		<title>Bigger analogy test set (State of the art)</title>
		<link rel="alternate" type="text/html" href="https://www.aclweb.org/aclwiki/index.php?title=Bigger_analogy_test_set_(State_of_the_art)&amp;diff=11733"/>
		<updated>2017-01-06T09:57:17Z</updated>

		<summary type="html">&lt;p&gt;Anna gladkova: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
&lt;br /&gt;
== Dataset description ==&lt;br /&gt;
* New dataset proposed by Gladkova et al. (2016) &amp;lt;ref name = &amp;quot;Gladkova2016&amp;quot;&amp;gt;Gladkova, A., Drozd, A., &amp;amp; Matsuoka, S. (2016). Analogy-based detection of morphological and semantic relations with word embeddings: what works and what doesn’t. In Proceedings of the NAACL-HLT SRW (pp. 47–54). San Diego, California, June 12-17, 2016: ACL. Retrieved from https://www.aclweb.org/anthology/N/N16/N16-2002.pdf&lt;br /&gt;
&amp;lt;/ref&amp;gt;&lt;br /&gt;
* available [http://vsm.blackbird.pw/bats here]&lt;br /&gt;
* dataset balanced across 4 types of relations (inflectional morphology, derivational morphology, lexicographic semantics, encyclopedic semantics)&lt;br /&gt;
* 10 relations of each type, 50 unique pairs per category&lt;br /&gt;
* 99,200 questions in total&lt;br /&gt;
* more challenging than the Google set because of more diverse relations &lt;br /&gt;
* where applicable, more than one correct answer is supplied (e.g. both &#039;&#039;canine&#039;&#039; and &#039;&#039;animal&#039;&#039; are hypernyms of &#039;&#039;dog&#039;&#039;). &lt;br /&gt;
* comes with a testing script [https://github.com/undertherain/vsmlib/blob/master/scripts/test_analogy.py a testing script]  that implements 5 methods of solving analogies (See [[Analogy (State of the art)]])&lt;br /&gt;
&lt;br /&gt;
This page reports results obtained with the &amp;quot;vanilla&amp;quot; 3CosAdd method, or vector offset&amp;lt;ref name = &amp;quot;Mikolov2013&amp;quot;&amp;gt;Mikolov, T., Chen, K., Corrado, G., &amp;amp; Dean, J. (2013). Efficient estimation of word representations in vector space. In Proceedings of International Conference on Learning Representations (ICLR).&amp;lt;/ref&amp;gt;. &lt;br /&gt;
&lt;br /&gt;
== Table of results == &lt;br /&gt;
&lt;br /&gt;
* &#039;&#039;&#039;Listed in chronological order.&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot; cellpadding=&amp;quot;5&amp;quot; cellspacing=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Model&lt;br /&gt;
! Reference&lt;br /&gt;
! Inflectional &amp;lt;br/&amp;gt; morphology&lt;br /&gt;
! Derivational &amp;lt;br/&amp;gt; morphology&lt;br /&gt;
! Lexicographic &amp;lt;br/&amp;gt; semantics&lt;br /&gt;
! Encyclopedic &amp;lt;br/&amp;gt; semantics&lt;br /&gt;
! Corpus, window size, vector size&lt;br /&gt;
|-&lt;br /&gt;
| SVD&lt;br /&gt;
| Drozd et al. (2016) &amp;lt;ref name = &amp;quot;Drozd2016&amp;quot;&amp;gt;Drozd, A., Gladkova, A., &amp;amp; Matsuoka, S. (2016). Word embeddings, analogies, and machine learning: beyond king - man + woman = queen. In Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers (pp. 3519–3530). Osaka, Japan, December 11-17: ACL. Retrieved from https://www.aclweb.org/anthology/C/C16/C16-1332.pdf&amp;lt;/ref&amp;gt;&lt;br /&gt;
|44.0&lt;br /&gt;
|9.8&lt;br /&gt;
|10.1&lt;br /&gt;
|18.5&lt;br /&gt;
| 5B corpus (Araneum + Wikipedia + UkWac), window 3, 1000 dimensions&lt;br /&gt;
|-&lt;br /&gt;
| GloVe&lt;br /&gt;
| Drozd et al. (2016) &amp;lt;ref name = &amp;quot;Drozd2016&amp;quot;/&amp;gt;&lt;br /&gt;
| 59.9&lt;br /&gt;
| 10.2&lt;br /&gt;
| 10.9&lt;br /&gt;
|31.5&lt;br /&gt;
| 5B corpus (Araneum + Wikipedia + UkWac), window 8, 300 dimensions&lt;br /&gt;
|-&lt;br /&gt;
| Skip-Gram&lt;br /&gt;
| Drozd et al. (2016) &amp;lt;ref name = &amp;quot;Drozd2016&amp;quot;/&amp;gt;&lt;br /&gt;
| 61.0&lt;br /&gt;
| 11.2&lt;br /&gt;
| 9.1&lt;br /&gt;
| 26.5&lt;br /&gt;
| 5B corpus (Araneum + Wikipedia + UkWac), window 8, 300 dimensions&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== References ==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Category:State of the art]]&lt;/div&gt;</summary>
		<author><name>Anna gladkova</name></author>
	</entry>
	<entry>
		<id>https://www.aclweb.org/aclwiki/index.php?title=Analogy_(State_of_the_art)&amp;diff=11732</id>
		<title>Analogy (State of the art)</title>
		<link rel="alternate" type="text/html" href="https://www.aclweb.org/aclwiki/index.php?title=Analogy_(State_of_the_art)&amp;diff=11732"/>
		<updated>2017-01-06T09:56:35Z</updated>

		<summary type="html">&lt;p&gt;Anna gladkova: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Analogy task ==&lt;br /&gt;
A proportional analogy holds between two word pairs: &#039;&#039;a&#039;&#039;:&#039;&#039;a*&#039;&#039; :: &#039;&#039;b&#039;&#039;:&#039;&#039;b*&#039;&#039; (&#039;&#039;a&#039;&#039; is to &#039;&#039;a*&#039;&#039; as &#039;&#039;b&#039;&#039; is to &#039;&#039;b*&#039;&#039;)&lt;br /&gt;
For example, &#039;&#039;Tokyo&#039;&#039; is to &#039;&#039;Japan&#039;&#039; as &#039;&#039;Paris&#039;&#039; is to &#039;&#039;France&#039;&#039;.&lt;br /&gt;
&lt;br /&gt;
With the &#039;&#039;&#039;pair-based&#039;&#039;&#039; methods, given &#039;&#039;a&#039;&#039;:&#039;&#039;a*&#039;&#039; :: &#039;&#039;b&#039;&#039;:&#039;&#039;?&#039;&#039;, the task is to find &#039;&#039;b*&#039;&#039;.&lt;br /&gt;
&lt;br /&gt;
With &#039;&#039;&#039;set-based&#039;&#039;&#039; methods, the task is to find &#039;&#039;b*&#039;&#039; given a set of other pairs (excluding &#039;&#039;b&#039;&#039;:&#039;&#039;b*&#039;&#039;) that hold the same relation as &#039;&#039;b&#039;&#039;:&#039;&#039;b*&#039;&#039;. &lt;br /&gt;
&lt;br /&gt;
In NLP analogies (Mikolov&#039;s &amp;quot;linguistic regularities&amp;quot;&amp;lt;ref name = &amp;quot;Mikolov2013a&amp;quot;/&amp;gt;) are interpreted broadly as basically any &amp;quot;similarities between pairs of words&amp;quot; &amp;lt;ref name = &amp;quot;Levy2014&amp;quot;/&amp;gt;, not just semantic.&lt;br /&gt;
&lt;br /&gt;
== Available analogy datasets (ordered by date) ==&lt;br /&gt;
* Listed by date&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot; cellpadding=&amp;quot;5&amp;quot; cellspacing=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Dataset&lt;br /&gt;
! Reference&lt;br /&gt;
! Number of questions&lt;br /&gt;
! Number of relations&lt;br /&gt;
! Dataset Link&lt;br /&gt;
! List of state-of-the-art results&lt;br /&gt;
!Comments&lt;br /&gt;
|-&lt;br /&gt;
| SAT&lt;br /&gt;
| Turney et al (2003)&amp;lt;ref&amp;gt;Turney, P., Littman, M. L., Bigham, J., &amp;amp; Shnayder, V. (2003). Combining independent modules to solve multiple-choice synonym and analogy problems. In Proceedings of the International Conference on Recent Advances in Natural Language Processing (pp. 482--489). Retrieved from http://nparc.cisti-icist.nrc-cnrc.gc.ca/npsi/ctrl?action=rtdoc&amp;amp;an=8913366 &amp;lt;/ref&amp;gt;&lt;br /&gt;
|374&lt;br /&gt;
| misc&lt;br /&gt;
| available on request from Peter Turney&lt;br /&gt;
| [[SAT Analogy Questions (State of the art)]]&lt;br /&gt;
| different task formulation: select the correct answer out of 5 proposed alternatives&lt;br /&gt;
|-&lt;br /&gt;
| SemEval 2012 Task 2&lt;br /&gt;
| Jurgens et al (2012)&amp;lt;ref&amp;gt;Jurgens, D. A., Turney, P. D., Mohammad, S. M., &amp;amp; Holyoak, K. J. (2012). Semeval-2012 task 2: Measuring degrees of relational similarity. In Proceedings of the First Joint Conference on Lexical and Computational Semantics (*SEM) (pp. 356–364). Montréal, Canada, June 7-8, 2012: Association for Computational Linguistics. Retrieved from http://dl.acm.org/citation.cfm?id=2387693&amp;lt;/ref&amp;gt;&lt;br /&gt;
| 3218&lt;br /&gt;
|79&lt;br /&gt;
|[https://sites.google.com/site/semeval2012task2/download SemEval2012-Task2]&lt;br /&gt;
| [[SemEval-2012 Task 2 (State of the art)]]&lt;br /&gt;
| different task formulation: ranking the degree to which a relation applies.&lt;br /&gt;
|-&lt;br /&gt;
| MSR&lt;br /&gt;
| Mikolov et al. (2013a)&amp;lt;ref name = &amp;quot;Mikolov2013a&amp;quot;&amp;gt;Mikolov, T., Yih, W., &amp;amp; Zweig, G. (2013). Linguistic Regularities in Continuous Space Word Representations. In HLT-NAACL (pp. 746–751). Retrieved from http://www.aclweb.org/anthology/N13-1#page=784&amp;lt;/ref&amp;gt; &lt;br /&gt;
| 8,000&lt;br /&gt;
| 8&lt;br /&gt;
|[http://research.microsoft.com/en-us/um/people/gzweig/Pubs/myz_naacl13_test_set.tgz MSR]&lt;br /&gt;
| [[Syntactic Analogies (State of the art)]]&lt;br /&gt;
|Syntactic (i.e. morphological) questions only&lt;br /&gt;
|-&lt;br /&gt;
| Google&lt;br /&gt;
| Mikolov et al. (2013b)&amp;lt;ref&amp;gt;Mikolov, T., Chen, K., Corrado, G., &amp;amp; Dean, J. (2013). Efficient estimation of word representations in vector space. In Proceedings of International Conference on Learning Representations (ICLR).&amp;lt;/ref&amp;gt; &lt;br /&gt;
| 19544&lt;br /&gt;
| 15&lt;br /&gt;
| [http://download.tensorflow.org/data/questions-words.txt Original link deprecated, copy hosted @TensorFlow]&lt;br /&gt;
| [[Google analogy test set (State of the art)]]&lt;br /&gt;
| unbalanced: 8,869 semantic and 10,675 syntactic questions, with 20-70 pairs per category; &#039;&#039;country:capital&#039;&#039; relation is over 50% of all semantic questions. Relations in the syntactic part largely the same as MSR.&lt;br /&gt;
|-&lt;br /&gt;
| BATS&lt;br /&gt;
| Gladkova et al. (2016)&amp;lt;ref name = &amp;quot;Gladkova2016&amp;quot;&amp;gt;Gladkova, A., Drozd, A., &amp;amp; Matsuoka, S. (2016). Analogy-based detection of morphological and semantic relations with word embeddings: what works and what doesn’t. In Proceedings of the NAACL-HLT SRW (pp. 47–54). San Diego, California, June 12-17, 2016: ACL. Retrieved from https://www.aclweb.org/anthology/N/N16/N16-2002.pdf&amp;lt;/ref&amp;gt;&lt;br /&gt;
| 99,200&lt;br /&gt;
| 40&lt;br /&gt;
|[http://vsm.blackbird.pw/bats BATS]&lt;br /&gt;
| [[Bigger analogy test set (State of the art)]]&lt;br /&gt;
|balanced across 4 types of relations: inflectional and derivational morphology, encyclopedic and lexicographic semantics. 10 relations of each type with 50 unique source pairs per relation. Multiple correct answers allowed where applicable.&lt;br /&gt;
|-&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Methods to solve analogies==&lt;br /&gt;
&lt;br /&gt;
=== Pair-based methods for solving analogies ===&lt;br /&gt;
* &#039;&#039;&#039;vector offset&#039;&#039;&#039; a.k.a. &#039;&#039;&#039;3CosAdd&#039;&#039;&#039; &amp;lt;ref&amp;gt;Mikolov, T., Yih, W., &amp;amp; Zweig, G. (2013). Linguistic Regularities in Continuous Space Word Representations. In HLT-NAACL (pp. 746–751). Retrieved from http://www.aclweb.org/anthology/N13-1#page=784&lt;br /&gt;
&amp;lt;/ref&amp;gt;&lt;br /&gt;
* &#039;&#039;&#039;3CosMul&#039;&#039;&#039; &amp;lt;ref name = &amp;quot;Levy2014&amp;quot;&amp;gt;Levy, O., Goldberg, Y., &amp;amp; Ramat-Gan, I. (2014). Linguistic Regularities in Sparse and Explicit Word Representations. In CoNLL (pp. 171–180). Retrieved from http://anthology.aclweb.org/W/W14/W14-1618.pdf&lt;br /&gt;
&amp;lt;/ref&amp;gt;&lt;br /&gt;
* others discussed by Linzen (2016)&amp;lt;ref name=&amp;quot;Linzen2016&amp;quot;&amp;gt;Linzen, T. (2016). Issues in evaluating semantic spaces using word analogies. In Proceedings of the First Workshop on Evaluating Vector Space Representations for NLP. Association for Computational Linguistics. Retrieved from http://anthology.aclweb.org/W16-2503&amp;lt;/ref&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
=== Set-based methods for solving analogies ===&lt;br /&gt;
* &#039;&#039;&#039;3CosAvg&#039;&#039;&#039; (vector offset averaged over multiple pairs) &amp;lt;ref name=&amp;quot;LRCos&amp;quot;&amp;gt;Drozd, A., Gladkova, A., &amp;amp; Matsuoka, S. (2016). Word embeddings, analogies, and machine learning: beyond king - man + woman = queen. In Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers (pp. 3519–3530). Osaka, Japan, December 11-17: ACL. Retrieved from https://www.aclweb.org/anthology/C/C16/C16-1332.pdf&lt;br /&gt;
&amp;lt;/ref&amp;gt;&lt;br /&gt;
* &#039;&#039;&#039;LRCos&#039;&#039;&#039; (supervised learning of the target class + cosine similarity to the &#039;&#039;b&#039;&#039; word) &amp;lt;ref name=&amp;quot;LRCos&amp;quot;/&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
== Issues with evaluating word embeddings on analogy task==&lt;br /&gt;
&#039;&#039;&#039;There is interplay between the chosen embedding, its parameters, particular relations &amp;lt;ref name = &amp;quot;Gladkova2016&amp;quot;/&amp;gt;, and method of solving analogies  &amp;lt;ref name=&amp;quot;Linzen2016&amp;quot;/&amp;gt; &amp;lt;ref name=&amp;quot;LRCos&amp;quot;/&amp;gt;. It is possible that analogies not solved by one method can be solved by another method on the same embedding. Therefore results for solving analogies with different methods should be taken as a way to &#039;&#039;explore&#039;&#039; or describe an embedding rather than &#039;&#039;evaluate&#039;&#039; it.&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Notes==&lt;br /&gt;
&amp;lt;references /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:State of the art]]&lt;/div&gt;</summary>
		<author><name>Anna gladkova</name></author>
	</entry>
	<entry>
		<id>https://www.aclweb.org/aclwiki/index.php?title=Analogy_(State_of_the_art)&amp;diff=11731</id>
		<title>Analogy (State of the art)</title>
		<link rel="alternate" type="text/html" href="https://www.aclweb.org/aclwiki/index.php?title=Analogy_(State_of_the_art)&amp;diff=11731"/>
		<updated>2017-01-06T09:54:12Z</updated>

		<summary type="html">&lt;p&gt;Anna gladkova: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Analogy task ==&lt;br /&gt;
A proportional analogy holds between two word pairs: &#039;&#039;a&#039;&#039;:&#039;&#039;a*&#039;&#039; :: &#039;&#039;b&#039;&#039;:&#039;&#039;b*&#039;&#039; (&#039;&#039;a&#039;&#039; is to &#039;&#039;a*&#039;&#039; as &#039;&#039;b&#039;&#039; is to &#039;&#039;b*&#039;&#039;)&lt;br /&gt;
For example, &#039;&#039;Tokyo&#039;&#039; is to &#039;&#039;Japan&#039;&#039; as &#039;&#039;Paris&#039;&#039; is to &#039;&#039;France&#039;&#039;.&lt;br /&gt;
&lt;br /&gt;
With the &#039;&#039;&#039;pair-based&#039;&#039;&#039; methods, given &#039;&#039;a&#039;&#039;:&#039;&#039;a*&#039;&#039; :: &#039;&#039;b&#039;&#039;:&#039;&#039;?&#039;&#039;, the task is to find &#039;&#039;b*&#039;&#039;.&lt;br /&gt;
&lt;br /&gt;
With &#039;&#039;&#039;set-based&#039;&#039;&#039; methods, the task is to find &#039;&#039;b*&#039;&#039; given a set of other pairs (excluding &#039;&#039;b&#039;&#039;:&#039;&#039;b*&#039;&#039;) that hold the same relation as &#039;&#039;b&#039;&#039;:&#039;&#039;b*&#039;&#039;. &lt;br /&gt;
&lt;br /&gt;
In NLP analogies (Mikolov&#039;s &amp;quot;linguistic regularities&amp;quot;&amp;lt;ref name = &amp;quot;Mikolov2013a&amp;quot;/&amp;gt;) are interpreted broadly as basically any &amp;quot;similarities between pairs of words&amp;quot; &amp;lt;ref name = &amp;quot;Levy2014&amp;quot;/&amp;gt;, not just semantic.&lt;br /&gt;
&lt;br /&gt;
== Available analogy datasets (ordered by date) ==&lt;br /&gt;
* Listed by date&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot; cellpadding=&amp;quot;5&amp;quot; cellspacing=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Dataset&lt;br /&gt;
! Reference&lt;br /&gt;
! Number of questions&lt;br /&gt;
! Number of relations&lt;br /&gt;
! Dataset Link&lt;br /&gt;
! List of state-of-the-art results&lt;br /&gt;
!Comments&lt;br /&gt;
|-&lt;br /&gt;
| SAT&lt;br /&gt;
| Turney et al (2003)&amp;lt;ref&amp;gt;Turney, P., Littman, M. L., Bigham, J., &amp;amp; Shnayder, V. (2003). Combining independent modules to solve multiple-choice synonym and analogy problems. In Proceedings of the International Conference on Recent Advances in Natural Language Processing (pp. 482--489). Retrieved from http://nparc.cisti-icist.nrc-cnrc.gc.ca/npsi/ctrl?action=rtdoc&amp;amp;an=8913366 &amp;lt;/ref&amp;gt;&lt;br /&gt;
|374&lt;br /&gt;
| misc&lt;br /&gt;
| available on request from Peter Turney&lt;br /&gt;
| [[SAT Analogy Questions (State of the art)]]&lt;br /&gt;
| different task formulation: select the correct answer out of 5 proposed alternatives&lt;br /&gt;
|-&lt;br /&gt;
| SemEval 2012 Task 2&lt;br /&gt;
| Jurgens et al (2012)&amp;lt;ref&amp;gt;Jurgens, D. A., Turney, P. D., Mohammad, S. M., &amp;amp; Holyoak, K. J. (2012). Semeval-2012 task 2: Measuring degrees of relational similarity. In Proceedings of the First Joint Conference on Lexical and Computational Semantics (*SEM) (pp. 356–364). Montréal, Canada, June 7-8, 2012: Association for Computational Linguistics. Retrieved from http://dl.acm.org/citation.cfm?id=2387693&amp;lt;/ref&amp;gt;&lt;br /&gt;
| 3218&lt;br /&gt;
|79&lt;br /&gt;
|[https://sites.google.com/site/semeval2012task2/download SemEval2012-Task2]&lt;br /&gt;
| [[SemEval-2012 Task 2 (State of the art)]]&lt;br /&gt;
| different task formulation: ranking the degree to which a relation applies.&lt;br /&gt;
|-&lt;br /&gt;
| MSR&lt;br /&gt;
| Mikolov et al. (2013a)&amp;lt;ref name = &amp;quot;Mikolov2013a&amp;quot;&amp;gt;Mikolov, T., Yih, W., &amp;amp; Zweig, G. (2013). Linguistic Regularities in Continuous Space Word Representations. In HLT-NAACL (pp. 746–751). Retrieved from http://www.aclweb.org/anthology/N13-1#page=784&amp;lt;/ref&amp;gt; &lt;br /&gt;
| 8,000&lt;br /&gt;
| 8&lt;br /&gt;
|[http://research.microsoft.com/en-us/um/people/gzweig/Pubs/myz_naacl13_test_set.tgz MSR]&lt;br /&gt;
| [[Syntactic Analogies (State of the art)]]&lt;br /&gt;
|Syntactic (i.e. morphological) questions only&lt;br /&gt;
|-&lt;br /&gt;
| Google&lt;br /&gt;
| Mikolov et al. (2013b)&amp;lt;ref&amp;gt;Mikolov, T., Chen, K., Corrado, G., &amp;amp; Dean, J. (2013). Efficient estimation of word representations in vector space. In Proceedings of International Conference on Learning Representations (ICLR).&amp;lt;/ref&amp;gt; &lt;br /&gt;
| 19544&lt;br /&gt;
| 15&lt;br /&gt;
| [http://download.tensorflow.org/data/questions-words.txt Original link deprecated, copy hosted @TensorFlow]&lt;br /&gt;
| [[Google analogy test set (State of the art)]]&lt;br /&gt;
| unbalanced: 8,869 semantic and 10,675 syntactic questions, with 20-70 pairs per category; &#039;&#039;country:capital&#039;&#039; relation is over 50% of all semantic questions. Relations in the syntactic part largely the same as MSR.&lt;br /&gt;
|-&lt;br /&gt;
| BATS&lt;br /&gt;
| Gladkova et al. (2016)&amp;lt;ref name = &amp;quot;Gladkova2016&amp;quot;&amp;gt;Gladkova, A., Drozd, A., &amp;amp; Matsuoka, S. (2016). Analogy-based detection of morphological and semantic relations with word embeddings: what works and what doesn’t. In Proceedings of the NAACL-HLT SRW (pp. 47–54). San Diego, California, June 12-17, 2016: ACL. Retrieved from https://www.aclweb.org/anthology/N/N16/N16-2002.pdf&amp;lt;/ref&amp;gt;&lt;br /&gt;
| 99,200&lt;br /&gt;
| 40&lt;br /&gt;
|[https://s3.amazonaws.com/blackbirdprojects/tut_vsm/BATS_3.0.zip BATS]&lt;br /&gt;
| [[Bigger analogy test set (State of the art)]]&lt;br /&gt;
|balanced across 4 types of relations: inflectional and derivational morphology, encyclopedic and lexicographic semantics. 10 relations of each type with 50 unique source pairs per relation. Multiple correct answers allowed where applicable.&lt;br /&gt;
|-&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Methods to solve analogies==&lt;br /&gt;
&lt;br /&gt;
=== Pair-based methods for solving analogies ===&lt;br /&gt;
* &#039;&#039;&#039;vector offset&#039;&#039;&#039; a.k.a. &#039;&#039;&#039;3CosAdd&#039;&#039;&#039; &amp;lt;ref&amp;gt;Mikolov, T., Yih, W., &amp;amp; Zweig, G. (2013). Linguistic Regularities in Continuous Space Word Representations. In HLT-NAACL (pp. 746–751). Retrieved from http://www.aclweb.org/anthology/N13-1#page=784&lt;br /&gt;
&amp;lt;/ref&amp;gt;&lt;br /&gt;
* &#039;&#039;&#039;3CosMul&#039;&#039;&#039; &amp;lt;ref name = &amp;quot;Levy2014&amp;quot;&amp;gt;Levy, O., Goldberg, Y., &amp;amp; Ramat-Gan, I. (2014). Linguistic Regularities in Sparse and Explicit Word Representations. In CoNLL (pp. 171–180). Retrieved from http://anthology.aclweb.org/W/W14/W14-1618.pdf&lt;br /&gt;
&amp;lt;/ref&amp;gt;&lt;br /&gt;
* others discussed by Linzen (2016)&amp;lt;ref name=&amp;quot;Linzen2016&amp;quot;&amp;gt;Linzen, T. (2016). Issues in evaluating semantic spaces using word analogies. In Proceedings of the First Workshop on Evaluating Vector Space Representations for NLP. Association for Computational Linguistics. Retrieved from http://anthology.aclweb.org/W16-2503&amp;lt;/ref&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
=== Set-based methods for solving analogies ===&lt;br /&gt;
* &#039;&#039;&#039;3CosAvg&#039;&#039;&#039; (vector offset averaged over multiple pairs) &amp;lt;ref name=&amp;quot;LRCos&amp;quot;&amp;gt;Drozd, A., Gladkova, A., &amp;amp; Matsuoka, S. (2016). Word embeddings, analogies, and machine learning: beyond king - man + woman = queen. In Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers (pp. 3519–3530). Osaka, Japan, December 11-17: ACL. Retrieved from https://www.aclweb.org/anthology/C/C16/C16-1332.pdf&lt;br /&gt;
&amp;lt;/ref&amp;gt;&lt;br /&gt;
* &#039;&#039;&#039;LRCos&#039;&#039;&#039; (supervised learning of the target class + cosine similarity to the &#039;&#039;b&#039;&#039; word) &amp;lt;ref name=&amp;quot;LRCos&amp;quot;/&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
== Issues with evaluating word embeddings on analogy task==&lt;br /&gt;
&#039;&#039;&#039;There is interplay between the chosen embedding, its parameters, particular relations &amp;lt;ref name = &amp;quot;Gladkova2016&amp;quot;/&amp;gt;, and method of solving analogies  &amp;lt;ref name=&amp;quot;Linzen2016&amp;quot;/&amp;gt; &amp;lt;ref name=&amp;quot;LRCos&amp;quot;/&amp;gt;. It is possible that analogies not solved by one method can be solved by another method on the same embedding. Therefore results for solving analogies with different methods should be taken as a way to &#039;&#039;explore&#039;&#039; or describe an embedding rather than &#039;&#039;evaluate&#039;&#039; it.&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Notes==&lt;br /&gt;
&amp;lt;references /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:State of the art]]&lt;/div&gt;</summary>
		<author><name>Anna gladkova</name></author>
	</entry>
	<entry>
		<id>https://www.aclweb.org/aclwiki/index.php?title=Bigger_analogy_test_set_(State_of_the_art)&amp;diff=11730</id>
		<title>Bigger analogy test set (State of the art)</title>
		<link rel="alternate" type="text/html" href="https://www.aclweb.org/aclwiki/index.php?title=Bigger_analogy_test_set_(State_of_the_art)&amp;diff=11730"/>
		<updated>2017-01-06T09:52:07Z</updated>

		<summary type="html">&lt;p&gt;Anna gladkova: This page lists published results on Bigger Analogy Test Set (BATS)&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
&lt;br /&gt;
== Dataset description ==&lt;br /&gt;
* New dataset proposed by Gladkova et al. (2016) &amp;lt;ref name = &amp;quot;Gladkova2016&amp;quot;&amp;gt;Gladkova, A., Drozd, A., &amp;amp; Matsuoka, S. (2016). Analogy-based detection of morphological and semantic relations with word embeddings: what works and what doesn’t. In Proceedings of the NAACL-HLT SRW (pp. 47–54). San Diego, California, June 12-17, 2016: ACL. Retrieved from https://www.aclweb.org/anthology/N/N16/N16-2002.pdf&lt;br /&gt;
&amp;lt;/ref&amp;gt;&lt;br /&gt;
* dataset balanced across 4 types of relations (inflectional morphology, derivational morphology, lexicographic semantics, encyclopedic semantics)&lt;br /&gt;
* 10 relations of each type, 50 unique pairs per category&lt;br /&gt;
* 99,200 questions in total&lt;br /&gt;
* more challenging than the Google set because of more diverse relations &lt;br /&gt;
* where applicable, more than one correct answer is supplied (e.g. both &#039;&#039;canine&#039;&#039; and &#039;&#039;animal&#039;&#039; are hypernyms of &#039;&#039;dog&#039;&#039;). &lt;br /&gt;
* comes with a testing script [https://github.com/undertherain/vsmlib/blob/master/scripts/test_analogy.py a testing script]  that implements 5 methods of solving analogies (See [[Analogy (State of the art)]])&lt;br /&gt;
&lt;br /&gt;
This page reports results obtained with the &amp;quot;vanilla&amp;quot; 3CosAdd method, or vector offset&amp;lt;ref name = &amp;quot;Mikolov2013&amp;quot;&amp;gt;Mikolov, T., Chen, K., Corrado, G., &amp;amp; Dean, J. (2013). Efficient estimation of word representations in vector space. In Proceedings of International Conference on Learning Representations (ICLR).&amp;lt;/ref&amp;gt;. &lt;br /&gt;
&lt;br /&gt;
== Table of results == &lt;br /&gt;
&lt;br /&gt;
* &#039;&#039;&#039;Listed in chronological order.&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot; cellpadding=&amp;quot;5&amp;quot; cellspacing=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Model&lt;br /&gt;
! Reference&lt;br /&gt;
! Inflectional &amp;lt;br/&amp;gt; morphology&lt;br /&gt;
! Derivational &amp;lt;br/&amp;gt; morphology&lt;br /&gt;
! Lexicographic &amp;lt;br/&amp;gt; semantics&lt;br /&gt;
! Encyclopedic &amp;lt;br/&amp;gt; semantics&lt;br /&gt;
! Corpus, window size, vector size&lt;br /&gt;
|-&lt;br /&gt;
| SVD&lt;br /&gt;
| Drozd et al. (2016) &amp;lt;ref name = &amp;quot;Drozd2016&amp;quot;&amp;gt;Drozd, A., Gladkova, A., &amp;amp; Matsuoka, S. (2016). Word embeddings, analogies, and machine learning: beyond king - man + woman = queen. In Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers (pp. 3519–3530). Osaka, Japan, December 11-17: ACL. Retrieved from https://www.aclweb.org/anthology/C/C16/C16-1332.pdf&amp;lt;/ref&amp;gt;&lt;br /&gt;
|44.0&lt;br /&gt;
|9.8&lt;br /&gt;
|10.1&lt;br /&gt;
|18.5&lt;br /&gt;
| 5B corpus (Araneum + Wikipedia + UkWac), window 3, 1000 dimensions&lt;br /&gt;
|-&lt;br /&gt;
| GloVe&lt;br /&gt;
| Drozd et al. (2016) &amp;lt;ref name = &amp;quot;Drozd2016&amp;quot;/&amp;gt;&lt;br /&gt;
| 59.9&lt;br /&gt;
| 10.2&lt;br /&gt;
| 10.9&lt;br /&gt;
|31.5&lt;br /&gt;
| 5B corpus (Araneum + Wikipedia + UkWac), window 8, 300 dimensions&lt;br /&gt;
|-&lt;br /&gt;
| Skip-Gram&lt;br /&gt;
| Drozd et al. (2016) &amp;lt;ref name = &amp;quot;Drozd2016&amp;quot;/&amp;gt;&lt;br /&gt;
| 61.0&lt;br /&gt;
| 11.2&lt;br /&gt;
| 9.1&lt;br /&gt;
| 26.5&lt;br /&gt;
| 5B corpus (Araneum + Wikipedia + UkWac), window 8, 300 dimensions&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== References ==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Category:State of the art]]&lt;/div&gt;</summary>
		<author><name>Anna gladkova</name></author>
	</entry>
	<entry>
		<id>https://www.aclweb.org/aclwiki/index.php?title=Analogy_(State_of_the_art)&amp;diff=11729</id>
		<title>Analogy (State of the art)</title>
		<link rel="alternate" type="text/html" href="https://www.aclweb.org/aclwiki/index.php?title=Analogy_(State_of_the_art)&amp;diff=11729"/>
		<updated>2017-01-06T07:02:49Z</updated>

		<summary type="html">&lt;p&gt;Anna gladkova: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Analogy task ==&lt;br /&gt;
A proportional analogy holds between two word pairs: &#039;&#039;a&#039;&#039;:&#039;&#039;a*&#039;&#039; :: &#039;&#039;b&#039;&#039;:&#039;&#039;b*&#039;&#039; (&#039;&#039;a&#039;&#039; is to &#039;&#039;a*&#039;&#039; as &#039;&#039;b&#039;&#039; is to &#039;&#039;b*&#039;&#039;)&lt;br /&gt;
For example, &#039;&#039;Tokyo&#039;&#039; is to &#039;&#039;Japan&#039;&#039; as &#039;&#039;Paris&#039;&#039; is to &#039;&#039;France&#039;&#039;.&lt;br /&gt;
&lt;br /&gt;
With the &#039;&#039;&#039;pair-based&#039;&#039;&#039; methods, given &#039;&#039;a&#039;&#039;:&#039;&#039;a*&#039;&#039; :: &#039;&#039;b&#039;&#039;:&#039;&#039;?&#039;&#039;, the task is to find &#039;&#039;b*&#039;&#039;.&lt;br /&gt;
&lt;br /&gt;
With &#039;&#039;&#039;set-based&#039;&#039;&#039; methods, the task is to find &#039;&#039;b*&#039;&#039; given a set of other pairs (excluding &#039;&#039;b&#039;&#039;:&#039;&#039;b*&#039;&#039;) that hold the same relation as &#039;&#039;b&#039;&#039;:&#039;&#039;b*&#039;&#039;. &lt;br /&gt;
&lt;br /&gt;
In NLP analogies (Mikolov&#039;s &amp;quot;linguistic regularities&amp;quot;&amp;lt;ref name = &amp;quot;Mikolov2013a&amp;quot;/&amp;gt;) are interpreted broadly as basically any &amp;quot;similarities between pairs of words&amp;quot; &amp;lt;ref name = &amp;quot;Levy2014&amp;quot;/&amp;gt;, not just semantic.&lt;br /&gt;
&lt;br /&gt;
== Available analogy datasets (ordered by date) ==&lt;br /&gt;
* Listed by date&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot; cellpadding=&amp;quot;5&amp;quot; cellspacing=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Dataset&lt;br /&gt;
! Reference&lt;br /&gt;
! Number of questions&lt;br /&gt;
! Number of relations&lt;br /&gt;
! Dataset Link&lt;br /&gt;
! List of state-of-the-art results&lt;br /&gt;
!Comments&lt;br /&gt;
|-&lt;br /&gt;
| SAT&lt;br /&gt;
| Turney et al (2003)&amp;lt;ref&amp;gt;Turney, P., Littman, M. L., Bigham, J., &amp;amp; Shnayder, V. (2003). Combining independent modules to solve multiple-choice synonym and analogy problems. In Proceedings of the International Conference on Recent Advances in Natural Language Processing (pp. 482--489). Retrieved from http://nparc.cisti-icist.nrc-cnrc.gc.ca/npsi/ctrl?action=rtdoc&amp;amp;an=8913366 &amp;lt;/ref&amp;gt;&lt;br /&gt;
|374&lt;br /&gt;
| misc&lt;br /&gt;
| available on request from Peter Turney&lt;br /&gt;
| [[SAT Analogy Questions (State of the art)]]&lt;br /&gt;
| different task formulation: select the correct answer out of 5 proposed alternatives&lt;br /&gt;
|-&lt;br /&gt;
| SemEval 2012 Task 2&lt;br /&gt;
| Jurgens et al (2012)&amp;lt;ref&amp;gt;Jurgens, D. A., Turney, P. D., Mohammad, S. M., &amp;amp; Holyoak, K. J. (2012). Semeval-2012 task 2: Measuring degrees of relational similarity. In Proceedings of the First Joint Conference on Lexical and Computational Semantics (*SEM) (pp. 356–364). Montréal, Canada, June 7-8, 2012: Association for Computational Linguistics. Retrieved from http://dl.acm.org/citation.cfm?id=2387693&amp;lt;/ref&amp;gt;&lt;br /&gt;
| 3218&lt;br /&gt;
|79&lt;br /&gt;
|[https://sites.google.com/site/semeval2012task2/download SemEval2012-Task2]&lt;br /&gt;
| [[SemEval-2012 Task 2 (State of the art)]]&lt;br /&gt;
| different task formulation: ranking the degree to which a relation applies.&lt;br /&gt;
|-&lt;br /&gt;
| MSR&lt;br /&gt;
| Mikolov et al. (2013a)&amp;lt;ref name = &amp;quot;Mikolov2013a&amp;quot;&amp;gt;Mikolov, T., Yih, W., &amp;amp; Zweig, G. (2013). Linguistic Regularities in Continuous Space Word Representations. In HLT-NAACL (pp. 746–751). Retrieved from http://www.aclweb.org/anthology/N13-1#page=784&amp;lt;/ref&amp;gt; &lt;br /&gt;
| 8,000&lt;br /&gt;
| 8&lt;br /&gt;
|[http://research.microsoft.com/en-us/um/people/gzweig/Pubs/myz_naacl13_test_set.tgz MSR]&lt;br /&gt;
| [[Syntactic Analogies (State of the art)]]&lt;br /&gt;
|Syntactic (i.e. morphological) questions only&lt;br /&gt;
|-&lt;br /&gt;
| Google&lt;br /&gt;
| Mikolov et al. (2013b)&amp;lt;ref&amp;gt;Mikolov, T., Chen, K., Corrado, G., &amp;amp; Dean, J. (2013). Efficient estimation of word representations in vector space. In Proceedings of International Conference on Learning Representations (ICLR).&amp;lt;/ref&amp;gt; &lt;br /&gt;
| 19544&lt;br /&gt;
| 15&lt;br /&gt;
| [http://download.tensorflow.org/data/questions-words.txt Original link deprecated, copy hosted @TensorFlow]&lt;br /&gt;
| [[Google analogy test set (State of the art)]]&lt;br /&gt;
| unbalanced: 8,869 semantic and 10,675 syntactic questions, with 20-70 pairs per category; &#039;&#039;country:capital&#039;&#039; relation is over 50% of all semantic questions. Relations in the syntactic part largely the same as MSR.&lt;br /&gt;
|-&lt;br /&gt;
| BATS&lt;br /&gt;
| Gladkova et al. (2016)&amp;lt;ref name = &amp;quot;Gladkova2016&amp;quot;&amp;gt;Gladkova, A., Drozd, A., &amp;amp; Matsuoka, S. (2016). Analogy-based detection of morphological and semantic relations with word embeddings: what works and what doesn’t. In Proceedings of the NAACL-HLT SRW (pp. 47–54). San Diego, California, June 12-17, 2016: ACL. Retrieved from https://www.aclweb.org/anthology/N/N16/N16-2002.pdf&amp;lt;/ref&amp;gt;&lt;br /&gt;
| 99,200&lt;br /&gt;
| 40&lt;br /&gt;
|[https://s3.amazonaws.com/blackbirdprojects/tut_vsm/BATS_3.0.zip BATS]&lt;br /&gt;
|&lt;br /&gt;
|balanced across 4 types of relations: inflectional and derivational morphology, encyclopedic and lexicographic semantics. 10 relations of each type with 50 unique source pairs per relation. Multiple correct answers allowed where applicable.&lt;br /&gt;
|-&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Methods to solve analogies==&lt;br /&gt;
&lt;br /&gt;
=== Pair-based methods for solving analogies ===&lt;br /&gt;
* &#039;&#039;&#039;vector offset&#039;&#039;&#039; a.k.a. &#039;&#039;&#039;3CosAdd&#039;&#039;&#039; &amp;lt;ref&amp;gt;Mikolov, T., Yih, W., &amp;amp; Zweig, G. (2013). Linguistic Regularities in Continuous Space Word Representations. In HLT-NAACL (pp. 746–751). Retrieved from http://www.aclweb.org/anthology/N13-1#page=784&lt;br /&gt;
&amp;lt;/ref&amp;gt;&lt;br /&gt;
* &#039;&#039;&#039;3CosMul&#039;&#039;&#039; &amp;lt;ref name = &amp;quot;Levy2014&amp;quot;&amp;gt;Levy, O., Goldberg, Y., &amp;amp; Ramat-Gan, I. (2014). Linguistic Regularities in Sparse and Explicit Word Representations. In CoNLL (pp. 171–180). Retrieved from http://anthology.aclweb.org/W/W14/W14-1618.pdf&lt;br /&gt;
&amp;lt;/ref&amp;gt;&lt;br /&gt;
* others discussed by Linzen (2016)&amp;lt;ref name=&amp;quot;Linzen2016&amp;quot;&amp;gt;Linzen, T. (2016). Issues in evaluating semantic spaces using word analogies. In Proceedings of the First Workshop on Evaluating Vector Space Representations for NLP. Association for Computational Linguistics. Retrieved from http://anthology.aclweb.org/W16-2503&amp;lt;/ref&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
=== Set-based methods for solving analogies ===&lt;br /&gt;
* &#039;&#039;&#039;3CosAvg&#039;&#039;&#039; (vector offset averaged over multiple pairs) &amp;lt;ref name=&amp;quot;LRCos&amp;quot;&amp;gt;Drozd, A., Gladkova, A., &amp;amp; Matsuoka, S. (2016). Word embeddings, analogies, and machine learning: beyond king - man + woman = queen. In Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers (pp. 3519–3530). Osaka, Japan, December 11-17: ACL. Retrieved from https://www.aclweb.org/anthology/C/C16/C16-1332.pdf&lt;br /&gt;
&amp;lt;/ref&amp;gt;&lt;br /&gt;
* &#039;&#039;&#039;LRCos&#039;&#039;&#039; (supervised learning of the target class + cosine similarity to the &#039;&#039;b&#039;&#039; word) &amp;lt;ref name=&amp;quot;LRCos&amp;quot;/&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
== Issues with evaluating word embeddings on analogy task==&lt;br /&gt;
&#039;&#039;&#039;There is interplay between the chosen embedding, its parameters, particular relations &amp;lt;ref name = &amp;quot;Gladkova2016&amp;quot;/&amp;gt;, and method of solving analogies  &amp;lt;ref name=&amp;quot;Linzen2016&amp;quot;/&amp;gt; &amp;lt;ref name=&amp;quot;LRCos&amp;quot;/&amp;gt;. It is possible that analogies not solved by one method can be solved by another method on the same embedding. Therefore results for solving analogies with different methods should be taken as a way to &#039;&#039;explore&#039;&#039; or describe an embedding rather than &#039;&#039;evaluate&#039;&#039; it.&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Notes==&lt;br /&gt;
&amp;lt;references /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:State of the art]]&lt;/div&gt;</summary>
		<author><name>Anna gladkova</name></author>
	</entry>
	<entry>
		<id>https://www.aclweb.org/aclwiki/index.php?title=Google_analogy_test_set_(State_of_the_art)&amp;diff=11728</id>
		<title>Google analogy test set (State of the art)</title>
		<link rel="alternate" type="text/html" href="https://www.aclweb.org/aclwiki/index.php?title=Google_analogy_test_set_(State_of_the_art)&amp;diff=11728"/>
		<updated>2017-01-06T07:01:57Z</updated>

		<summary type="html">&lt;p&gt;Anna gladkova: Created page with &amp;quot;* Test set developed by Mikolov et al. (2013b)&amp;lt;ref&amp;gt;Mikolov, T., Chen, K., Corrado, G., &amp;amp; Dean, J. (2013). Efficient estimation of word representations in vector space. In Proc...&amp;quot;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;* Test set developed by Mikolov et al. (2013b)&amp;lt;ref&amp;gt;Mikolov, T., Chen, K., Corrado, G., &amp;amp; Dean, J. (2013). Efficient estimation of word representations in vector space. In Proceedings of International Conference on Learning Representations (ICLR).&amp;lt;/ref&amp;gt; &lt;br /&gt;
* Contains 19544 question pairs (8,869 semantic and 10,675 syntactic (i.e. morphological) questions)&lt;br /&gt;
* 14 types of relations (9 morphological and 5 semantic)&lt;br /&gt;
* [http://download.tensorflow.org/data/questions-words.txt Original link deprecated, copy hosted @TensorFlow]&lt;br /&gt;
&lt;br /&gt;
This page reports results obtained with the &amp;quot;vanilla&amp;quot; 3CosAdd method, or vector offset&amp;lt;ref name=&amp;quot;Mikolov2013&amp;quot;/&amp;gt;. For other methods, see [[Analogy (State of the art)]] &lt;br /&gt;
&lt;br /&gt;
== Table of results == &lt;br /&gt;
&lt;br /&gt;
* &#039;&#039;&#039;Listed in chronological order.&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot; cellpadding=&amp;quot;5&amp;quot; cellspacing=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Model&lt;br /&gt;
! Reference&lt;br /&gt;
! Sem&lt;br /&gt;
! Syn&lt;br /&gt;
! Corpus and window size&lt;br /&gt;
|-&lt;br /&gt;
| CBOW (640 dim)&lt;br /&gt;
| Mikolov et al (2013) &amp;lt;ref name = &amp;quot;Mikolov2013&amp;quot;&amp;gt;Mikolov, T., Chen, K., Corrado, G., &amp;amp; Dean, J. (2013). Efficient estimation of word representations in vector space. In Proceedings of International Conference on Learning Representations (ICLR).&amp;lt;/ref&amp;gt;&lt;br /&gt;
| 24.0&lt;br /&gt;
| 64.0&lt;br /&gt;
| 6B Google News corpus, window 10&lt;br /&gt;
|-&lt;br /&gt;
| Skip-Gram (640 dim)&lt;br /&gt;
| Mikolov et al (2013) &amp;lt;ref name = &amp;quot;Mikolov2013&amp;quot;/&amp;gt;&lt;br /&gt;
| 55.0&lt;br /&gt;
| 59.0&lt;br /&gt;
| ibid&lt;br /&gt;
|-&lt;br /&gt;
| RNNLM (640 dim)&lt;br /&gt;
| Mikolov et al (2013) &amp;lt;ref name = &amp;quot;Mikolov2013&amp;quot;/&amp;gt;&lt;br /&gt;
| 9.0&lt;br /&gt;
| 36.0&lt;br /&gt;
| &lt;br /&gt;
|-&lt;br /&gt;
| NNLM (640 dim)&lt;br /&gt;
| Mikolov et al (2013) &amp;lt;ref name = &amp;quot;Mikolov2013&amp;quot;/&amp;gt;&lt;br /&gt;
| 23.0&lt;br /&gt;
| 53.0&lt;br /&gt;
| &lt;br /&gt;
|-&lt;br /&gt;
| GloVe (300 dim)&lt;br /&gt;
| Pennington et al (2014) &amp;lt;ref name = &amp;quot;GloVe&amp;quot;&amp;gt;Pennington, J., Socher, R., &amp;amp; Manning, C. D. (2014). Glove: Global vectors for word representation. In Proceedings of the Empiricial Methods in Natural Language Processing (EMNLP 2014) (Vol. 12, pp. 1532–1543). Retrieved from http://llcao.net/cu-deeplearning15/presentation/nn-pres.pdf&amp;lt;/ref&amp;gt;&lt;br /&gt;
| 81.9&lt;br /&gt;
| 69.3&lt;br /&gt;
| 42 B corpus, window 5&lt;br /&gt;
|-&lt;br /&gt;
| SVD&lt;br /&gt;
| Levy et al (2015) &amp;lt;ref name = &amp;quot;Levy2015&amp;quot;&amp;gt;Levy, O., Goldberg, Y., &amp;amp; Dagan, I. (2015). Improving distributional similarity with lessons learned from word embeddings. Transactions of the Association for Computational Linguistics, 3, 211–225.&amp;lt;/ref&amp;gt;&lt;br /&gt;
| colspan=&amp;quot;2&amp;quot; style=&amp;quot;text-align:center;&amp;quot;|55.4&lt;br /&gt;
| Wikipedia 1.5B, window 2&lt;br /&gt;
|-&lt;br /&gt;
| PPMI&lt;br /&gt;
| Levy et al (2015)  &amp;lt;ref name = &amp;quot;Levy2015&amp;quot;/&amp;gt;&lt;br /&gt;
| colspan=&amp;quot;2&amp;quot; style=&amp;quot;text-align:center;&amp;quot;|55.3&lt;br /&gt;
| ibid&lt;br /&gt;
|-&lt;br /&gt;
| Skip-Gram&lt;br /&gt;
| Levy et al (2015)  &amp;lt;ref name = &amp;quot;Levy2015&amp;quot;/&amp;gt;&lt;br /&gt;
| colspan=&amp;quot;2&amp;quot; style=&amp;quot;text-align:center;&amp;quot;|67.6&lt;br /&gt;
| ibid&lt;br /&gt;
|-&lt;br /&gt;
| GloVe&lt;br /&gt;
| Levy et al (2015)  &amp;lt;ref name = &amp;quot;Levy2015&amp;quot;/&amp;gt;&lt;br /&gt;
| colspan=&amp;quot;2&amp;quot; style=&amp;quot;text-align:center;&amp;quot;|56.9&lt;br /&gt;
| ibid&lt;br /&gt;
|-&lt;br /&gt;
| Skip-Gram (50 dim)&lt;br /&gt;
| Lai et al (2015) &amp;lt;ref name = &amp;quot;Lai2015&amp;quot;&amp;gt;Lai, S., Liu, K., Xu, L., &amp;amp; Zhao, J. (2015). How to Generate a Good Word Embedding? arXiv Preprint arXiv:1507.05523. Retrieved from http://arxiv.org/abs/1507.05523&amp;lt;/ref&amp;gt;&lt;br /&gt;
| 44.8&lt;br /&gt;
| 44.43&lt;br /&gt;
| W&amp;amp;N 2.8 B corpus, window 5&lt;br /&gt;
|-&lt;br /&gt;
| CBOW (50 dim)&lt;br /&gt;
| Lai et al (2015) &amp;lt;ref name = &amp;quot;Lai2015&amp;quot;/&amp;gt;&lt;br /&gt;
| 44.43&lt;br /&gt;
| 55.83&lt;br /&gt;
| ibid&lt;br /&gt;
|-&lt;br /&gt;
| DVRS+SG (300 dim)&lt;br /&gt;
| Garten et al (2015) &amp;lt;ref&amp;gt;Garten, J., Sagae, K., Ustun, V., &amp;amp; Dehghani, M. (2015). Combining Distributed Vector Representations for Words. In Proceedings of NAACL-HLT (pp. 95–101). Retrieved from http://www.researchgate.net/profile/Volkan_Ustun/publication/277332298_Combining_Distributed_Vector_Representations_for_Words/links/55705a6308aee1eea7586e93.pdf&amp;lt;/ref&amp;gt;&lt;br /&gt;
| 74.0&lt;br /&gt;
| 60.0&lt;br /&gt;
| enwiki9, window 10&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Methodological Issues ==&lt;br /&gt;
&lt;br /&gt;
* This test set is not balanced: 20-70 pairs per category, different number of semantic and morphological relations. See other sets at [[Analogy (State of the art)]].&lt;br /&gt;
* In the semantic part, &#039;&#039;country:capital&#039;&#039; relation accounts for over 50% of all semantic questions. &lt;br /&gt;
* Researchers usually report only the average accuracy for all semantic/syntactic questions, but there is a lot of variation for individual relations - between 10.53% and 99.41% &amp;lt;ref&amp;gt;Levy, O., Goldberg, Y., &amp;amp; Ramat-Gan, I. (2014). Linguistic Regularities in Sparse and Explicit Word Representations. In CoNLL (pp. 171–180). Retrieved from http://anthology.aclweb.org/W/W14/W14-1618.pdf&amp;lt;/ref&amp;gt;, also depending on parameters of the model &amp;lt;ref&amp;gt;Gladkova, A., Drozd, A., &amp;amp; Matsuoka, S. (2016). Analogy-based detection of morphological and semantic relations with word embeddings: what works and what doesn’t. In Proceedings of the NAACL-HLT SRW (pp. 47–54). San Diego, California, June 12-17, 2016: ACL. Retrieved from https://www.aclweb.org/anthology/N/N16/N16-2002.pdf&amp;lt;/ref&amp;gt;. Since the test is not balanced, the above results could be flattering to the embeddings, and averaging the mean scores for each subcategory would yield lower results.&lt;br /&gt;
* Accuracy also depends on the method with which analogies are solved &amp;lt;ref&amp;gt;Linzen, T. (2016). Issues in evaluating semantic spaces using word analogies. In Proceedings of the First Workshop on Evaluating Vector Space Representations for NLP. Association for Computational Linguistics. Retrieved from http://anthology.aclweb.org/W16-2503&amp;lt;/ref&amp;gt;. Set-based methods&amp;lt;ref&amp;gt;Drozd, A., Gladkova, A., &amp;amp; Matsuoka, S. (2016). Word embeddings, analogies, and machine learning: beyond king - man + woman = queen. In Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers (pp. 3519–3530). Osaka, Japan, December 11-17: ACL. Retrieved from https://www.aclweb.org/anthology/C/C16/C16-1332.pdf&amp;lt;/ref&amp;gt; considerably outperform pair-based methods, showing that models do in fact encode much &amp;quot;missed&amp;quot; information.  &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== References ==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Category:State of the art]]&lt;/div&gt;</summary>
		<author><name>Anna gladkova</name></author>
	</entry>
	<entry>
		<id>https://www.aclweb.org/aclwiki/index.php?title=Analogy_(State_of_the_art)&amp;diff=11727</id>
		<title>Analogy (State of the art)</title>
		<link rel="alternate" type="text/html" href="https://www.aclweb.org/aclwiki/index.php?title=Analogy_(State_of_the_art)&amp;diff=11727"/>
		<updated>2017-01-06T05:31:20Z</updated>

		<summary type="html">&lt;p&gt;Anna gladkova: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Analogy task ==&lt;br /&gt;
A proportional analogy holds between two word pairs: &#039;&#039;a&#039;&#039;:&#039;&#039;a*&#039;&#039; :: &#039;&#039;b&#039;&#039;:&#039;&#039;b*&#039;&#039; (&#039;&#039;a&#039;&#039; is to &#039;&#039;a*&#039;&#039; as &#039;&#039;b&#039;&#039; is to &#039;&#039;b*&#039;&#039;)&lt;br /&gt;
For example, &#039;&#039;Tokyo&#039;&#039; is to &#039;&#039;Japan&#039;&#039; as &#039;&#039;Paris&#039;&#039; is to &#039;&#039;France&#039;&#039;.&lt;br /&gt;
&lt;br /&gt;
With the &#039;&#039;&#039;pair-based&#039;&#039;&#039; methods, given &#039;&#039;a&#039;&#039;:&#039;&#039;a*&#039;&#039; :: &#039;&#039;b&#039;&#039;:&#039;&#039;?&#039;&#039;, the task is to find &#039;&#039;b*&#039;&#039;.&lt;br /&gt;
&lt;br /&gt;
With &#039;&#039;&#039;set-based&#039;&#039;&#039; methods, the task is to find &#039;&#039;b*&#039;&#039; given a set of other pairs (excluding &#039;&#039;b&#039;&#039;:&#039;&#039;b*&#039;&#039;) that hold the same relation as &#039;&#039;b&#039;&#039;:&#039;&#039;b*&#039;&#039;. &lt;br /&gt;
&lt;br /&gt;
In NLP analogies (Mikolov&#039;s &amp;quot;linguistic regularities&amp;quot;&amp;lt;ref name = &amp;quot;Mikolov2013a&amp;quot;/&amp;gt;) are interpreted broadly as basically any &amp;quot;similarities between pairs of words&amp;quot; &amp;lt;ref name = &amp;quot;Levy2014&amp;quot;/&amp;gt;, not just semantic.&lt;br /&gt;
&lt;br /&gt;
== Available analogy datasets (ordered by date) ==&lt;br /&gt;
* Listed by date&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot; cellpadding=&amp;quot;5&amp;quot; cellspacing=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Dataset&lt;br /&gt;
! Reference&lt;br /&gt;
! Number of questions&lt;br /&gt;
! Number of relations&lt;br /&gt;
! Dataset Link&lt;br /&gt;
! List of state-of-the-art results&lt;br /&gt;
!Comments&lt;br /&gt;
|-&lt;br /&gt;
| SAT&lt;br /&gt;
| Turney et al (2003)&amp;lt;ref&amp;gt;Turney, P., Littman, M. L., Bigham, J., &amp;amp; Shnayder, V. (2003). Combining independent modules to solve multiple-choice synonym and analogy problems. In Proceedings of the International Conference on Recent Advances in Natural Language Processing (pp. 482--489). Retrieved from http://nparc.cisti-icist.nrc-cnrc.gc.ca/npsi/ctrl?action=rtdoc&amp;amp;an=8913366 &amp;lt;/ref&amp;gt;&lt;br /&gt;
|374&lt;br /&gt;
| misc&lt;br /&gt;
| available on request from Peter Turney&lt;br /&gt;
| [[SAT Analogy Questions (State of the art)]]&lt;br /&gt;
| different task formulation: select the correct answer out of 5 proposed alternatives&lt;br /&gt;
|-&lt;br /&gt;
| SemEval 2012 Task 2&lt;br /&gt;
| Jurgens et al (2012)&amp;lt;ref&amp;gt;Jurgens, D. A., Turney, P. D., Mohammad, S. M., &amp;amp; Holyoak, K. J. (2012). Semeval-2012 task 2: Measuring degrees of relational similarity. In Proceedings of the First Joint Conference on Lexical and Computational Semantics (*SEM) (pp. 356–364). Montréal, Canada, June 7-8, 2012: Association for Computational Linguistics. Retrieved from http://dl.acm.org/citation.cfm?id=2387693&amp;lt;/ref&amp;gt;&lt;br /&gt;
| 3218&lt;br /&gt;
|79&lt;br /&gt;
|[https://sites.google.com/site/semeval2012task2/download SemEval2012-Task2]&lt;br /&gt;
| [[SemEval-2012 Task 2 (State of the art)]]&lt;br /&gt;
| different task formulation: ranking the degree to which a relation applies.&lt;br /&gt;
|-&lt;br /&gt;
| MSR&lt;br /&gt;
| Mikolov et al. (2013a)&amp;lt;ref name = &amp;quot;Mikolov2013a&amp;quot;&amp;gt;Mikolov, T., Yih, W., &amp;amp; Zweig, G. (2013). Linguistic Regularities in Continuous Space Word Representations. In HLT-NAACL (pp. 746–751). Retrieved from http://www.aclweb.org/anthology/N13-1#page=784&amp;lt;/ref&amp;gt; &lt;br /&gt;
| 8,000&lt;br /&gt;
| 8&lt;br /&gt;
|[http://research.microsoft.com/en-us/um/people/gzweig/Pubs/myz_naacl13_test_set.tgz MSR]&lt;br /&gt;
| [[Syntactic Analogies (State of the art)]]&lt;br /&gt;
|Syntactic (i.e. morphological) questions only&lt;br /&gt;
|-&lt;br /&gt;
| Google&lt;br /&gt;
| Mikolov et al. (2013b)&amp;lt;ref&amp;gt;Mikolov, T., Chen, K., Corrado, G., &amp;amp; Dean, J. (2013). Efficient estimation of word representations in vector space. In Proceedings of International Conference on Learning Representations (ICLR).&amp;lt;/ref&amp;gt; &lt;br /&gt;
| 19544&lt;br /&gt;
| 15&lt;br /&gt;
| [http://download.tensorflow.org/data/questions-words.txt Original link deprecated, copy hosted @TensorFlow]&lt;br /&gt;
|&lt;br /&gt;
| unbalanced: 8,869 semantic and 10,675 syntactic questions, with 20-70 pairs per category; &#039;&#039;country:capital&#039;&#039; relation is over 50% of all semantic questions. Relations in the syntactic part largely the same as MSR.&lt;br /&gt;
|-&lt;br /&gt;
| BATS&lt;br /&gt;
| Gladkova et al. (2016)&amp;lt;ref name = &amp;quot;Gladkova2016&amp;quot;&amp;gt;Gladkova, A., Drozd, A., &amp;amp; Matsuoka, S. (2016). Analogy-based detection of morphological and semantic relations with word embeddings: what works and what doesn’t. In Proceedings of the NAACL-HLT SRW (pp. 47–54). San Diego, California, June 12-17, 2016: ACL. Retrieved from https://www.aclweb.org/anthology/N/N16/N16-2002.pdf&amp;lt;/ref&amp;gt;&lt;br /&gt;
| 99,200&lt;br /&gt;
| 40&lt;br /&gt;
|[https://s3.amazonaws.com/blackbirdprojects/tut_vsm/BATS_3.0.zip BATS]&lt;br /&gt;
|&lt;br /&gt;
|balanced across 4 types of relations: inflectional and derivational morphology, encyclopedic and lexicographic semantics. 10 relations of each type with 50 unique source pairs per relation. Multiple correct answers allowed where applicable.&lt;br /&gt;
|-&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Methods to solve analogies==&lt;br /&gt;
&lt;br /&gt;
=== Pair-based methods for solving analogies ===&lt;br /&gt;
* &#039;&#039;&#039;vector offset&#039;&#039;&#039; a.k.a. &#039;&#039;&#039;3CosAdd&#039;&#039;&#039; &amp;lt;ref&amp;gt;Mikolov, T., Yih, W., &amp;amp; Zweig, G. (2013). Linguistic Regularities in Continuous Space Word Representations. In HLT-NAACL (pp. 746–751). Retrieved from http://www.aclweb.org/anthology/N13-1#page=784&lt;br /&gt;
&amp;lt;/ref&amp;gt;&lt;br /&gt;
* &#039;&#039;&#039;3CosMul&#039;&#039;&#039; &amp;lt;ref name = &amp;quot;Levy2014&amp;quot;&amp;gt;Levy, O., Goldberg, Y., &amp;amp; Ramat-Gan, I. (2014). Linguistic Regularities in Sparse and Explicit Word Representations. In CoNLL (pp. 171–180). Retrieved from http://anthology.aclweb.org/W/W14/W14-1618.pdf&lt;br /&gt;
&amp;lt;/ref&amp;gt;&lt;br /&gt;
* others discussed by Linzen (2016)&amp;lt;ref name=&amp;quot;Linzen2016&amp;quot;&amp;gt;Linzen, T. (2016). Issues in evaluating semantic spaces using word analogies. In Proceedings of the First Workshop on Evaluating Vector Space Representations for NLP. Association for Computational Linguistics. Retrieved from http://anthology.aclweb.org/W16-2503&amp;lt;/ref&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
=== Set-based methods for solving analogies ===&lt;br /&gt;
* &#039;&#039;&#039;3CosAvg&#039;&#039;&#039; (vector offset averaged over multiple pairs) &amp;lt;ref name=&amp;quot;LRCos&amp;quot;&amp;gt;Drozd, A., Gladkova, A., &amp;amp; Matsuoka, S. (2016). Word embeddings, analogies, and machine learning: beyond king - man + woman = queen. In Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers (pp. 3519–3530). Osaka, Japan, December 11-17: ACL. Retrieved from https://www.aclweb.org/anthology/C/C16/C16-1332.pdf&lt;br /&gt;
&amp;lt;/ref&amp;gt;&lt;br /&gt;
* &#039;&#039;&#039;LRCos&#039;&#039;&#039; (supervised learning of the target class + cosine similarity to the &#039;&#039;b&#039;&#039; word) &amp;lt;ref name=&amp;quot;LRCos&amp;quot;/&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
== Issues with evaluating word embeddings on analogy task==&lt;br /&gt;
&#039;&#039;&#039;There is interplay between the chosen embedding, its parameters, particular relations &amp;lt;ref name = &amp;quot;Gladkova2016&amp;quot;/&amp;gt;, and method of solving analogies  &amp;lt;ref name=&amp;quot;Linzen2016&amp;quot;/&amp;gt; &amp;lt;ref name=&amp;quot;LRCos&amp;quot;/&amp;gt;. It is possible that analogies not solved by one method can be solved by another method on the same embedding. Therefore results for solving analogies with different methods should be taken as a way to &#039;&#039;explore&#039;&#039; or describe an embedding rather than &#039;&#039;evaluate&#039;&#039; it.&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Notes==&lt;br /&gt;
&amp;lt;references /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:State of the art]]&lt;/div&gt;</summary>
		<author><name>Anna gladkova</name></author>
	</entry>
	<entry>
		<id>https://www.aclweb.org/aclwiki/index.php?title=Analogy_(State_of_the_art)&amp;diff=11726</id>
		<title>Analogy (State of the art)</title>
		<link rel="alternate" type="text/html" href="https://www.aclweb.org/aclwiki/index.php?title=Analogy_(State_of_the_art)&amp;diff=11726"/>
		<updated>2017-01-06T05:08:03Z</updated>

		<summary type="html">&lt;p&gt;Anna gladkova: This page lists available datasets and methods for solving analogies with distributional models&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Analogy task ==&lt;br /&gt;
A proportional analogy holds between two word pairs: &#039;&#039;a&#039;&#039;:&#039;&#039;a*&#039;&#039; :: &#039;&#039;b&#039;&#039;:&#039;&#039;b*&#039;&#039; (&#039;&#039;a&#039;&#039; is to &#039;&#039;a*&#039;&#039; as &#039;&#039;b&#039;&#039; is to &#039;&#039;b*&#039;&#039;)&lt;br /&gt;
For example, &#039;&#039;Tokyo&#039;&#039; is to &#039;&#039;Japan&#039;&#039; as &#039;&#039;Paris&#039;&#039; is to &#039;&#039;France&#039;&#039;.&lt;br /&gt;
&lt;br /&gt;
With the &#039;&#039;&#039;pair-based&#039;&#039;&#039; methods, given &#039;&#039;a&#039;&#039;:&#039;&#039;a*&#039;&#039; :: &#039;&#039;b&#039;&#039;:&#039;&#039;?&#039;&#039;, the task is to find &#039;&#039;b*&#039;&#039;.&lt;br /&gt;
&lt;br /&gt;
With &#039;&#039;&#039;set-based&#039;&#039;&#039; methods, the task is to find &#039;&#039;b*&#039;&#039; given a set of other pairs (excluding &#039;&#039;b&#039;&#039;:&#039;&#039;b*&#039;&#039;) that hold the same relation as &#039;&#039;b&#039;&#039;:&#039;&#039;b*&#039;&#039;. &lt;br /&gt;
&lt;br /&gt;
In NLP analogies (Mikolov&#039;s &amp;quot;linguistic regularities&amp;quot;&amp;lt;ref name = &amp;quot;Mikolov2013a&amp;quot;/&amp;gt;) are interpreted broadly as basically any &amp;quot;similarities between pairs of words&amp;quot; &amp;lt;ref name = &amp;quot;Levy2014&amp;quot;/&amp;gt;, not just semantic.&lt;br /&gt;
&lt;br /&gt;
== Available analogy datasets (ordered by date) ==&lt;br /&gt;
* Listed by date&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot; cellpadding=&amp;quot;5&amp;quot; cellspacing=&amp;quot;1&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Dataset&lt;br /&gt;
! Reference&lt;br /&gt;
! Number of questions&lt;br /&gt;
! Number of relations&lt;br /&gt;
! Dataset Link&lt;br /&gt;
! List of state-of-the-art results&lt;br /&gt;
!Comments&lt;br /&gt;
|-&lt;br /&gt;
| SAT&lt;br /&gt;
| Turney et al (2003)&amp;lt;ref&amp;gt;Turney, P., Littman, M. L., Bigham, J., &amp;amp; Shnayder, V. (2003). Combining independent modules to solve multiple-choice synonym and analogy problems. In Proceedings of the International Conference on Recent Advances in Natural Language Processing (pp. 482--489). Retrieved from http://nparc.cisti-icist.nrc-cnrc.gc.ca/npsi/ctrl?action=rtdoc&amp;amp;an=8913366 &amp;lt;/ref&amp;gt;&lt;br /&gt;
|374&lt;br /&gt;
| misc&lt;br /&gt;
| available on request from Peter Turney&lt;br /&gt;
| [[SAT Analogy Questions (State of the art)]]&lt;br /&gt;
| different task formulation: select the correct answer out of 5 proposed alternatives&lt;br /&gt;
|-&lt;br /&gt;
| SemEval 2012 Task 2&lt;br /&gt;
| Jurgens et al (2012)&amp;lt;ref&amp;gt;Jurgens, D. A., Turney, P. D., Mohammad, S. M., &amp;amp; Holyoak, K. J. (2012). Semeval-2012 task 2: Measuring degrees of relational similarity. In Proceedings of the First Joint Conference on Lexical and Computational Semantics (*SEM) (pp. 356–364). Montréal, Canada, June 7-8, 2012: Association for Computational Linguistics. Retrieved from http://dl.acm.org/citation.cfm?id=2387693&amp;lt;/ref&amp;gt;&lt;br /&gt;
| 3218&lt;br /&gt;
|79&lt;br /&gt;
|[https://sites.google.com/site/semeval2012task2/download SemEval2012-Task2]&lt;br /&gt;
| [[SemEval-2012 Task 2 (State of the art)]]&lt;br /&gt;
| different task formulation: ranking the degree to which a relation applies.&lt;br /&gt;
|-&lt;br /&gt;
| MSR&lt;br /&gt;
| Mikolov et al. (2013a)&amp;lt;ref name = &amp;quot;Mikolov2013a&amp;quot;&amp;gt;Mikolov, T., Yih, W., &amp;amp; Zweig, G. (2013). Linguistic Regularities in Continuous Space Word Representations. In HLT-NAACL (pp. 746–751). Retrieved from http://www.aclweb.org/anthology/N13-1#page=784&amp;lt;/ref&amp;gt; &lt;br /&gt;
| 8,000&lt;br /&gt;
| 8&lt;br /&gt;
|[http://research.microsoft.com/en-us/um/people/gzweig/Pubs/myz_naacl13_test_set.tgz MSR]&lt;br /&gt;
| [[Syntactic Analogies (State of the art)]]&lt;br /&gt;
|Syntactic (i.e. morphological) questions only&lt;br /&gt;
|-&lt;br /&gt;
| Google&lt;br /&gt;
| Mikolov et al. (2013b)&amp;lt;ref&amp;gt;Mikolov, T., Chen, K., Corrado, G., &amp;amp; Dean, J. (2013). Efficient estimation of word representations in vector space. In Proceedings of International Conference on Learning Representations (ICLR).&amp;lt;/ref&amp;gt; &lt;br /&gt;
| 19544&lt;br /&gt;
| 15&lt;br /&gt;
| [http://download.tensorflow.org/data/questions-words.txt Original link deprecated, copy hosted @TensorFlow]&lt;br /&gt;
|&lt;br /&gt;
| unbalanced: 8,869 semantic and 10,675 syntactic questions, with 20-70 pairs per category; &#039;&#039;country:capital&#039;&#039; relation is over 50% of all semantic questions. Relations in the syntactic part largely the same as MSR.&lt;br /&gt;
|-&lt;br /&gt;
| BATS&lt;br /&gt;
| Gladkova et al. (2016)&amp;lt;ref name = &amp;quot;Gladkova2016&amp;quot;&amp;gt;Gladkova, A., Drozd, A., &amp;amp; Matsuoka, S. (2016). Analogy-based detection of morphological and semantic relations with word embeddings: what works and what doesn’t. In Proceedings of the NAACL-HLT SRW (pp. 47–54). San Diego, California, June 12-17, 2016: ACL. Retrieved from https://www.aclweb.org/anthology/N/N16/N16-2002.pdf&amp;lt;/ref&amp;gt;&lt;br /&gt;
| 99,200&lt;br /&gt;
| 40&lt;br /&gt;
|[https://s3.amazonaws.com/blackbirdprojects/tut_vsm/BATS_3.0.zip BATS]&lt;br /&gt;
|&lt;br /&gt;
|balanced across 4 types of relations: inflectional and derivational morphology, encyclopedic and lexicographic semantics. 10 relations of each type with 50 unique source pairs per relation. Multiple correct answers allowed where applicable.&lt;br /&gt;
|-&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Methods to solve analogies==&lt;br /&gt;
&lt;br /&gt;
=== Pair-based methods for solving analogies ===&lt;br /&gt;
* &#039;&#039;&#039;vector offset&#039;&#039;&#039; a.k.a. &#039;&#039;&#039;3CosAdd&#039;&#039;&#039; &amp;lt;ref&amp;gt;Mikolov, T., Yih, W., &amp;amp; Zweig, G. (2013). Linguistic Regularities in Continuous Space Word Representations. In HLT-NAACL (pp. 746–751). Retrieved from http://www.aclweb.org/anthology/N13-1#page=784&lt;br /&gt;
&amp;lt;/ref&amp;gt;&lt;br /&gt;
* &#039;&#039;&#039;3CosMul&#039;&#039;&#039; &amp;lt;ref name = &amp;quot;Levy2014&amp;quot;&amp;gt;Levy, O., Goldberg, Y., &amp;amp; Ramat-Gan, I. (2014). Linguistic Regularities in Sparse and Explicit Word Representations. In CoNLL (pp. 171–180). Retrieved from http://anthology.aclweb.org/W/W14/W14-1618.pdf&lt;br /&gt;
&amp;lt;/ref&amp;gt;&lt;br /&gt;
* others discussed by Linzen (2016)&amp;lt;ref name=&amp;quot;Linzen2016&amp;quot;&amp;gt;Linzen, T. (2016). Issues in evaluating semantic spaces using word analogies. In Proceedings of the First Workshop on Evaluating Vector Space Representations for NLP. Association for Computational Linguistics. Retrieved from http://anthology.aclweb.org/W16-2503&amp;lt;/ref&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
=== Set-based methods for solving analogies ===&lt;br /&gt;
* &#039;&#039;&#039;3CosAvg&#039;&#039;&#039; (vector offset averaged over multiple pairs) &amp;lt;ref name=&amp;quot;LRCos&amp;quot;&amp;gt;Drozd, A., Gladkova, A., &amp;amp; Matsuoka, S. (2016). Word embeddings, analogies, and machine learning: beyond king - man + woman = queen. In Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers (pp. 3519–3530). Osaka, Japan, December 11-17: ACL. Retrieved from https://www.aclweb.org/anthology/C/C16/C16-1332.pdf&lt;br /&gt;
&amp;lt;/ref&amp;gt;&lt;br /&gt;
* &#039;&#039;&#039;LRCos&#039;&#039;&#039; (supervised learning of the target class + cosine similarity to the &#039;&#039;b&#039;&#039; word) &amp;lt;ref name=&amp;quot;LRCos&amp;quot;/&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
== Issues with evaluating word embeddings on analogy task==&lt;br /&gt;
&#039;&#039;&#039;There is interplay between the chosen embedding, its parameters, particular relations &amp;lt;ref name = &amp;quot;Gladkova2016&amp;quot;/&amp;gt;, and method of solving analogies  &amp;lt;ref name=&amp;quot;Linzen2016&amp;quot;/&amp;gt; &amp;lt;ref name=&amp;quot;LRCos&amp;quot;/&amp;gt;. It is possible that analogies not solved by one method can be solved by another method on the same embedding. Therefore results for solving analogies with different methods should be taken as a way to &#039;&#039;explore&#039;&#039; or describe an embedding rather than &#039;&#039;evaluate&#039;&#039; it.&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Notes==&lt;br /&gt;
&amp;lt;references /&amp;gt;&lt;/div&gt;</summary>
		<author><name>Anna gladkova</name></author>
	</entry>
</feed>