<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://www.aclweb.org/aclwiki/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Simzalabim</id>
	<title>ACL Wiki - User contributions [en]</title>
	<link rel="self" type="application/atom+xml" href="https://www.aclweb.org/aclwiki/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Simzalabim"/>
	<link rel="alternate" type="text/html" href="https://www.aclweb.org/aclwiki/Special:Contributions/Simzalabim"/>
	<updated>2026-04-09T15:40:26Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.43.6</generator>
	<entry>
		<id>https://www.aclweb.org/aclwiki/index.php?title=CONLL-2003_(State_of_the_art)&amp;diff=12574</id>
		<title>CONLL-2003 (State of the art)</title>
		<link rel="alternate" type="text/html" href="https://www.aclweb.org/aclwiki/index.php?title=CONLL-2003_(State_of_the_art)&amp;diff=12574"/>
		<updated>2019-07-12T13:29:02Z</updated>

		<summary type="html">&lt;p&gt;Simzalabim: /* References */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;* &#039;&#039;&#039;Performance measure:&#039;&#039;&#039; F = 2 * Precision * Recall / (Recall + Precision)&lt;br /&gt;
* &#039;&#039;&#039;Precision:&#039;&#039;&#039; percentage of named entities found by the algorithm that are correct&lt;br /&gt;
* &#039;&#039;&#039;Recall:&#039;&#039;&#039; percentage of named entities defined in the corpus that were found by the program&lt;br /&gt;
* Exact match (for all words of a chunk) is used in the calculation of precision and recall (see [http://www.cnts.ua.ac.be/conll2000/chunking/output.html CONLL scoring software])&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
* &#039;&#039;&#039;Training data:&#039;&#039;&#039; Train split of CONLL-2003 corpus&lt;br /&gt;
* &#039;&#039;&#039;Dryrun data:&#039;&#039;&#039; Testa split of CONLL-2003 corpus&lt;br /&gt;
* &#039;&#039;&#039;Testing data:&#039;&#039;&#039; Testb split of CONLL-2003 corpus&lt;br /&gt;
* The corpus contains a very high ratio of metonymic references (city names standing for sport teams)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Table of results ==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot; cellpadding=&amp;quot;5&amp;quot; cellspacing=&amp;quot;1&amp;quot; width=&amp;quot;100%&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! System name&lt;br /&gt;
! Short description&lt;br /&gt;
! System type (1)&lt;br /&gt;
! Main publications&lt;br /&gt;
! Software&lt;br /&gt;
! Results&lt;br /&gt;
|-&lt;br /&gt;
| FIJZ&lt;br /&gt;
| Best CONLL-2003 participant&lt;br /&gt;
| S&lt;br /&gt;
| Florian, Ittycheriah, Jing and Zhang (2003)&lt;br /&gt;
| -&lt;br /&gt;
| 88.76%&lt;br /&gt;
|-&lt;br /&gt;
| Baseline&lt;br /&gt;
| Vocabulary transfer from training to testing&lt;br /&gt;
| S&lt;br /&gt;
| Tjong Kim Sang and De Meulder(2003)&lt;br /&gt;
| -&lt;br /&gt;
| 59.61% &lt;br /&gt;
|-&lt;br /&gt;
| Balie&lt;br /&gt;
| Unsupervised approach: no prior training&lt;br /&gt;
| U&lt;br /&gt;
| Nadeau, Turney and Matwin (2006)&lt;br /&gt;
| [http://balie.sourceforge.net sourceforge.net]&lt;br /&gt;
| 55.98%&lt;br /&gt;
|-&lt;br /&gt;
| BI-LSTM-CRF&lt;br /&gt;
| Bidirectional LSTM-CRF Model&lt;br /&gt;
| S&lt;br /&gt;
| Huang et al. (2015)&lt;br /&gt;
| -&lt;br /&gt;
| 90.10%&lt;br /&gt;
|-&lt;br /&gt;
| BI-LSTM-CRF&lt;br /&gt;
| Bidirectional LSTM-CRF Model&lt;br /&gt;
| S&lt;br /&gt;
| Akbik, Blythe, &amp;amp; Vollgraf (2018)&lt;br /&gt;
| https://github.com/zalandoresearch/flair&lt;br /&gt;
| 93.09%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
* (1) &#039;&#039;&#039;System type&#039;&#039;&#039;: R = hand-crafted rules, S = supervised learning, U = unsupervised learning, H = hybrid&lt;br /&gt;
&lt;br /&gt;
== References ==&lt;br /&gt;
&lt;br /&gt;
Florian, R., Ittycheriah, A., Jing, H. and Zhang, T. (2003) [http://www.cnts.ua.ac.be/conll2003/pdf/16871flo.pdf Named Entity Recognition through Classifier Combination]. &#039;&#039;Proceedings of CoNLL-2003&#039;&#039;. Edmonton, Canada. &lt;br /&gt;
&lt;br /&gt;
Nadeau, D., Turney, P. D. and Matwin, S. (2006) [http://iit-iti.nrc-cnrc.gc.ca/publications/nrc-48727_e.html Unsupervised Named-Entity Recognition: Generating Gazetteers and Resolving Ambiguity]. &#039;&#039;Proceedings 19th Canadian Conference on Artificial Intelligence&#039;&#039;. Québec, Canada.&lt;br /&gt;
&lt;br /&gt;
Tjong Kim Sang, E. F. and De Meulder, F. (2003) [http://www.cnts.ua.ac.be/conll2003/pdf/14247tjo.pdf Introduction to the CoNLL-2003 Shared Task: Language-Independent Named Entity Recognition]. &#039;&#039;Proceedings of CoNLL-2003&#039;&#039;. Edmonton, Canada.&lt;br /&gt;
&lt;br /&gt;
Z. H. Huang, W. Xu, and K. Yu. (2015) [http://arxiv.org/abs/1508.01991 Bidirectional LSTM-CRF Models for Sequence Tagging]. &#039;&#039;In arXiv:1508.01991&#039;&#039;. 2015.&lt;br /&gt;
&lt;br /&gt;
Akbik, A., Blythe, D., and Vollgraf, R. (2018). Contextual string embeddings for sequence labeling. In Proceedings of the 27th International Conference on Computational Linguistics (pp. 1638-1649).&lt;br /&gt;
&lt;br /&gt;
== See also ==&lt;br /&gt;
&lt;br /&gt;
* [[Named Entity Recognition (State of the art)|Named Entity Recognition]]&lt;br /&gt;
* [[State of the art]]&lt;br /&gt;
&lt;br /&gt;
[[Category:State of the art]]&lt;/div&gt;</summary>
		<author><name>Simzalabim</name></author>
	</entry>
	<entry>
		<id>https://www.aclweb.org/aclwiki/index.php?title=CONLL-2003_(State_of_the_art)&amp;diff=12573</id>
		<title>CONLL-2003 (State of the art)</title>
		<link rel="alternate" type="text/html" href="https://www.aclweb.org/aclwiki/index.php?title=CONLL-2003_(State_of_the_art)&amp;diff=12573"/>
		<updated>2019-07-12T13:26:17Z</updated>

		<summary type="html">&lt;p&gt;Simzalabim: /* Table of results */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;* &#039;&#039;&#039;Performance measure:&#039;&#039;&#039; F = 2 * Precision * Recall / (Recall + Precision)&lt;br /&gt;
* &#039;&#039;&#039;Precision:&#039;&#039;&#039; percentage of named entities found by the algorithm that are correct&lt;br /&gt;
* &#039;&#039;&#039;Recall:&#039;&#039;&#039; percentage of named entities defined in the corpus that were found by the program&lt;br /&gt;
* Exact match (for all words of a chunk) is used in the calculation of precision and recall (see [http://www.cnts.ua.ac.be/conll2000/chunking/output.html CONLL scoring software])&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
* &#039;&#039;&#039;Training data:&#039;&#039;&#039; Train split of CONLL-2003 corpus&lt;br /&gt;
* &#039;&#039;&#039;Dryrun data:&#039;&#039;&#039; Testa split of CONLL-2003 corpus&lt;br /&gt;
* &#039;&#039;&#039;Testing data:&#039;&#039;&#039; Testb split of CONLL-2003 corpus&lt;br /&gt;
* The corpus contains a very high ratio of metonymic references (city names standing for sport teams)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Table of results ==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot; cellpadding=&amp;quot;5&amp;quot; cellspacing=&amp;quot;1&amp;quot; width=&amp;quot;100%&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! System name&lt;br /&gt;
! Short description&lt;br /&gt;
! System type (1)&lt;br /&gt;
! Main publications&lt;br /&gt;
! Software&lt;br /&gt;
! Results&lt;br /&gt;
|-&lt;br /&gt;
| FIJZ&lt;br /&gt;
| Best CONLL-2003 participant&lt;br /&gt;
| S&lt;br /&gt;
| Florian, Ittycheriah, Jing and Zhang (2003)&lt;br /&gt;
| -&lt;br /&gt;
| 88.76%&lt;br /&gt;
|-&lt;br /&gt;
| Baseline&lt;br /&gt;
| Vocabulary transfer from training to testing&lt;br /&gt;
| S&lt;br /&gt;
| Tjong Kim Sang and De Meulder(2003)&lt;br /&gt;
| -&lt;br /&gt;
| 59.61% &lt;br /&gt;
|-&lt;br /&gt;
| Balie&lt;br /&gt;
| Unsupervised approach: no prior training&lt;br /&gt;
| U&lt;br /&gt;
| Nadeau, Turney and Matwin (2006)&lt;br /&gt;
| [http://balie.sourceforge.net sourceforge.net]&lt;br /&gt;
| 55.98%&lt;br /&gt;
|-&lt;br /&gt;
| BI-LSTM-CRF&lt;br /&gt;
| Bidirectional LSTM-CRF Model&lt;br /&gt;
| S&lt;br /&gt;
| Huang et al. (2015)&lt;br /&gt;
| -&lt;br /&gt;
| 90.10%&lt;br /&gt;
|-&lt;br /&gt;
| BI-LSTM-CRF&lt;br /&gt;
| Bidirectional LSTM-CRF Model&lt;br /&gt;
| S&lt;br /&gt;
| Akbik, Blythe, &amp;amp; Vollgraf (2018)&lt;br /&gt;
| https://github.com/zalandoresearch/flair&lt;br /&gt;
| 93.09%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
* (1) &#039;&#039;&#039;System type&#039;&#039;&#039;: R = hand-crafted rules, S = supervised learning, U = unsupervised learning, H = hybrid&lt;br /&gt;
&lt;br /&gt;
== References ==&lt;br /&gt;
&lt;br /&gt;
Florian, R., Ittycheriah, A., Jing, H. and Zhang, T. (2003) [http://www.cnts.ua.ac.be/conll2003/pdf/16871flo.pdf Named Entity Recognition through Classifier Combination]. &#039;&#039;Proceedings of CoNLL-2003&#039;&#039;. Edmonton, Canada. &lt;br /&gt;
&lt;br /&gt;
Nadeau, D., Turney, P. D. and Matwin, S. (2006) [http://iit-iti.nrc-cnrc.gc.ca/publications/nrc-48727_e.html Unsupervised Named-Entity Recognition: Generating Gazetteers and Resolving Ambiguity]. &#039;&#039;Proceedings 19th Canadian Conference on Artificial Intelligence&#039;&#039;. Québec, Canada.&lt;br /&gt;
&lt;br /&gt;
Tjong Kim Sang, E. F. and De Meulder, F. (2003) [http://www.cnts.ua.ac.be/conll2003/pdf/14247tjo.pdf Introduction to the CoNLL-2003 Shared Task: Language-Independent Named Entity Recognition]. &#039;&#039;Proceedings of CoNLL-2003&#039;&#039;. Edmonton, Canada.&lt;br /&gt;
&lt;br /&gt;
Z. H. Huang, W. Xu, and K. Yu. (2015) [http://arxiv.org/abs/1508.01991 Bidirectional LSTM-CRF Models for Sequence Tagging]. &#039;&#039;In arXiv:1508.01991&#039;&#039;. 2015.&lt;br /&gt;
&lt;br /&gt;
== See also ==&lt;br /&gt;
&lt;br /&gt;
* [[Named Entity Recognition (State of the art)|Named Entity Recognition]]&lt;br /&gt;
* [[State of the art]]&lt;br /&gt;
&lt;br /&gt;
[[Category:State of the art]]&lt;/div&gt;</summary>
		<author><name>Simzalabim</name></author>
	</entry>
</feed>