<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://www.aclweb.org/aclwiki/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Yversley</id>
	<title>ACL Wiki - User contributions [en]</title>
	<link rel="self" type="application/atom+xml" href="https://www.aclweb.org/aclwiki/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Yversley"/>
	<link rel="alternate" type="text/html" href="https://www.aclweb.org/aclwiki/Special:Contributions/Yversley"/>
	<updated>2026-04-30T19:21:54Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.43.6</generator>
	<entry>
		<id>https://www.aclweb.org/aclwiki/index.php?title=Resources_for_German&amp;diff=6165</id>
		<title>Resources for German</title>
		<link rel="alternate" type="text/html" href="https://www.aclweb.org/aclwiki/index.php?title=Resources_for_German&amp;diff=6165"/>
		<updated>2009-02-03T09:48:37Z</updated>

		<summary type="html">&lt;p&gt;Yversley: /* Lexicons */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Corpora==&lt;br /&gt;
&amp;lt;!-- Please keep this list in alphabetical order --&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* [http://www.phonetik.uni-muenchen.de/Bas/BasKorporaeng.html Bavarian Archive for Speech Signals Corpora]&lt;br /&gt;
* [http://corpora.ids-mannheim.de/~cosmas/ COSMAS II]&lt;br /&gt;
* [http://www.ims.uni-stuttgart.de/projekte/tc/CQP.html Experimental Corpus Query System (University of Stuttgart, Germany)]&lt;br /&gt;
* [http://www.wortschatz.uni-leipzig.de/ German plain text and Co-occurrences at LCC]&lt;br /&gt;
* [http://www.coli.uni-sb.de/sfb378/negra-corpus/negra-corpus.html NEGRA Corpus]&lt;br /&gt;
* [http://www.ims.uni-stuttgart.de/projekte/TIGER/TIGERCorpus/ TIGER treebank]&lt;br /&gt;
* [http://www.sfs.uni-tuebingen.de/en_tuebadz.shtml Tübingen Treebank of Written German (TüBa-D/Z)]&lt;br /&gt;
* [http://www.sfs.uni-tuebingen.de/en_tuebads.shtml Tübingen Treebank of Spoken German (TüBa-D/S, aka Verbmobil treebank)]&lt;br /&gt;
* [http://www.sfs.uni-tuebingen.de/en_tuepp.shtml Tübingen Partially Parsed Corpus of Written German (TüPP-D/Z)]&lt;br /&gt;
&lt;br /&gt;
==Evaluation datasets==&lt;br /&gt;
* [http://www.ukp.tu-darmstadt.de/data/semRelDatasets Semantic relatedness evaluation]&lt;br /&gt;
&lt;br /&gt;
==Lexicons==&lt;br /&gt;
* [http://www.ims.uni-stuttgart.de/projekte/IMSLex/ IMSLex German Lexicon]&lt;br /&gt;
* [http://www.ims.uni-stuttgart.de/tcl/RESOURCES/German-Lexicon-en.html Lexical information for German]&lt;br /&gt;
* [http://www.cl.uzh.ch/CL/siclemat/sprachanalyse/molif/ mOlif morphological analyzer]&lt;br /&gt;
&lt;br /&gt;
==Resource Access==&lt;br /&gt;
* [http://wortschatz.uni-leipzig.de/Webservices/ Web service access to German language statistics]&lt;br /&gt;
&lt;br /&gt;
==Timeline Analysis==&lt;br /&gt;
* [http://wortschatz.uni-leipzig.de/wort-des-tages/ German Words of the Day]&lt;br /&gt;
* [http://www.sfs.uni-tuebingen.de/~lothar/nw/ Wortwarte (selection of German neologisms for each day) ]&lt;br /&gt;
&lt;br /&gt;
[[Category:Resources by language|German]]&lt;/div&gt;</summary>
		<author><name>Yversley</name></author>
	</entry>
	<entry>
		<id>https://www.aclweb.org/aclwiki/index.php?title=NP_Chunking_(State_of_the_art)&amp;diff=6164</id>
		<title>NP Chunking (State of the art)</title>
		<link rel="alternate" type="text/html" href="https://www.aclweb.org/aclwiki/index.php?title=NP_Chunking_(State_of_the_art)&amp;diff=6164"/>
		<updated>2009-02-03T09:35:41Z</updated>

		<summary type="html">&lt;p&gt;Yversley: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;* &#039;&#039;&#039;Performance measure:&#039;&#039;&#039; F = 2 * Precision * Recall / (Recall + Precision)&lt;br /&gt;
* &#039;&#039;&#039;Precision:&#039;&#039;&#039; percentage of NPs found by the algorithm that are correct&lt;br /&gt;
* &#039;&#039;&#039;Recall:&#039;&#039;&#039; percentage of NPs defined in the corpus that were found by the chunking program&lt;br /&gt;
* &#039;&#039;&#039;Training data:&#039;&#039;&#039; sections 15-18 of Wall Street Journal corpus (Ramshaw and Marcus)&lt;br /&gt;
* &#039;&#039;&#039;Testing data:&#039;&#039;&#039; section 20 of Wall Street Journal corpus (Ramshaw and Marcus)&lt;br /&gt;
* original data of the NP chunking experiments by Lance Ramshaw and Mitch Marcus&lt;br /&gt;
* data contains one word per line and each line contains six fields of which only the first three fields are relevant: the word, the part-of-speech tag assigned by the Brill tagger, and the correct IOB tag&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Table of results ==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot; cellpadding=&amp;quot;5&amp;quot; cellspacing=&amp;quot;1&amp;quot; width=&amp;quot;100%&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! System name&lt;br /&gt;
! Short description&lt;br /&gt;
! Main publications&lt;br /&gt;
! Software&lt;br /&gt;
! Reports (F)&lt;br /&gt;
|-&lt;br /&gt;
| KM00&lt;br /&gt;
| B-I-O tagging using SVM classifiers with polynomial kernel&lt;br /&gt;
| Kudo and Matsumoto (2000), CONLL&lt;br /&gt;
| [http://chasen.org/~taku/software/yamcha/ YAMCHA Toolkit] (but models are not provided)&lt;br /&gt;
| 93.79%&lt;br /&gt;
|-&lt;br /&gt;
| KM01&lt;br /&gt;
| learning as in KM00, but voting between different representations&lt;br /&gt;
| Kudo and Matsumoto (2001), NAACL&lt;br /&gt;
| No&lt;br /&gt;
| 94.22%&lt;br /&gt;
|-&lt;br /&gt;
| SP03&lt;br /&gt;
| Second order conditional random fields&lt;br /&gt;
| Fei Sha and Fernando Pereira (2003), HLT/NAACL&lt;br /&gt;
| No&lt;br /&gt;
| 94.3%&lt;br /&gt;
|-&lt;br /&gt;
| SS05&lt;br /&gt;
| specialized HMM + voting between different representations&lt;br /&gt;
| Shen and Sarkar (2005)&lt;br /&gt;
| No&lt;br /&gt;
| 95.23%&lt;br /&gt;
|-&lt;br /&gt;
| M05&lt;br /&gt;
| Second order conditional random fields + multi-label classification&lt;br /&gt;
| Ryan McDonald, KOby Crammer and Fernando Pereira (2005), HLT/EMNLP&lt;br /&gt;
| No&lt;br /&gt;
| 94.29%&lt;br /&gt;
|-&lt;br /&gt;
| V06&lt;br /&gt;
| Conditional random fields + Stochastic Meta Decent (SMD)&lt;br /&gt;
| S. V. N. Vishwanathan, Nicol N. Schraudolph, Mark Schmidt, and Kevin Murphy (2006), ICML&lt;br /&gt;
| No&lt;br /&gt;
| 93.6%&lt;br /&gt;
|-&lt;br /&gt;
| S08&lt;br /&gt;
| Second order latent-dynamic conditional random fields + an improved inference method based on A* search&lt;br /&gt;
| Xu Sun, Louis-Philippe Morency, Daisuke Okanohara and Jun&#039;ichi Tsujii (2008), COLING&lt;br /&gt;
| HCRF Library&lt;br /&gt;
| 94.34%&lt;br /&gt;
|-&lt;br /&gt;
| C00&lt;br /&gt;
| Chunks from the Charniak Parser&lt;br /&gt;
| Hollingshead, Fisher and Roark (2005), Charniak (2000)&lt;br /&gt;
| ?&lt;br /&gt;
| 94.20%&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== References ==&lt;br /&gt;
&lt;br /&gt;
E. Charniak (2000). [http://aclweb.org/anthology-new/A/A00/A00-2018.pdf A Maximum-Entropy inspired parser], NAACL 2000&lt;br /&gt;
&lt;br /&gt;
K. Hollingshead, S. Fisher and B. Roark (2005). [http://www.aclweb.org/anthology-new/H/H05/H05-1099.pdf Comparing and combining finite-state and context-free parsers.]  HLT/EMNLP 2005.&lt;br /&gt;
&lt;br /&gt;
T. Kudo and Y. Matsumoto (2000). [http://acl.ldc.upenn.edu/W/W00/W00-0730.pdf Use of support vector learning for chunk identification]. &#039;&#039;Proceedings of the 4th Conference on CoNLL-2000 and LLL-2000&#039;&#039;, pages 142-144, Lisbon, Portugal.&lt;br /&gt;
&lt;br /&gt;
T. Kudo and Y. Matsumoto (2001). [http://acl.ldc.upenn.edu/N/N01/N01-1025.pdf Chunking with support vector machines]. &#039;&#039;Proceedings of NAACL-2001&#039;&#039;.&lt;br /&gt;
&lt;br /&gt;
F. Sha and F. Pereira (2003). [http://www-rcf.usc.edu/~feisha/htmls/Papers.html Shallow Parsing with Conditional Random Fields]. &#039;&#039;Proceedings of HLT-NAACL 2003&#039;&#039;, pages 213-220. Edmonton, Canada.&lt;br /&gt;
&lt;br /&gt;
H. Shen and A. Sarkar (2005). [http://www.cs.sfu.ca/~anoop/papers/pdf/ai05.pdf Voting between multiple data representations for text chunking]. &#039;&#039;Proceedings of the Eighteenth Meeting of the Canadian Society for Computational Intelligence, Canadian AI 2005&#039;&#039;.&lt;br /&gt;
&lt;br /&gt;
R. McDonald, K. Crammer and F. Pereira (2005). [http://ryanmcd.googlepages.com/segmentationHLT-EMNLP2005.pdf Flexible Text Segmentation with Structured Multilabel Classification]. &#039;&#039;Human Language Technologies and Empirical Methods in Natural Language Processing (HLT-EMNLP), 2005&#039;&#039; &lt;br /&gt;
&lt;br /&gt;
S. V. N. Vishwanathan, N. Schraudolph, M. Schmidt, and K. Murphy. Accelerated Training Conditional Random Fields with Stochastic Gradient Methods. In Proc. Intl. Conf. Machine Learning, pp. 969 – 976, ACM Press, New York, NY, USA, 2006.&lt;br /&gt;
&lt;br /&gt;
X. Sun, L.P. Morency, D. OKanohara and J. Tsujii (2008). [http://www.aclweb.org/anthology-new/C/C08/C08-1106.pdf Modeling Latent-Dynamic in Shallow Parsing: A Latent Conditional Model with Improved Inference]. &#039;&#039;Proceedings of The 22nd International Conference on Computational Linguistics (COLING 2008)&#039;&#039;. Pages 841-848. Manchester, UK.&lt;br /&gt;
&lt;br /&gt;
== See also ==&lt;br /&gt;
&lt;br /&gt;
* [[State of the art]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== External links ==&lt;br /&gt;
&lt;br /&gt;
* dataset is available from [ftp://ftp.cis.upenn.edu/pub/chunker/ ftp://ftp.cis.upenn.edu/pub/chunker/]&lt;br /&gt;
* more information is available from [http://ifarm.nl/erikt/research/np-chunking.html NP Chunking]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Category:State of the art]]&lt;/div&gt;</summary>
		<author><name>Yversley</name></author>
	</entry>
	<entry>
		<id>https://www.aclweb.org/aclwiki/index.php?title=Resources_for_German&amp;diff=6163</id>
		<title>Resources for German</title>
		<link rel="alternate" type="text/html" href="https://www.aclweb.org/aclwiki/index.php?title=Resources_for_German&amp;diff=6163"/>
		<updated>2009-02-03T09:22:41Z</updated>

		<summary type="html">&lt;p&gt;Yversley: /* Corpora */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Corpora==&lt;br /&gt;
&amp;lt;!-- Please keep this list in alphabetical order --&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* [http://www.phonetik.uni-muenchen.de/Bas/BasKorporaeng.html Bavarian Archive for Speech Signals Corpora]&lt;br /&gt;
* [http://corpora.ids-mannheim.de/~cosmas/ COSMAS II]&lt;br /&gt;
* [http://www.ims.uni-stuttgart.de/projekte/tc/CQP.html Experimental Corpus Query System (University of Stuttgart, Germany)]&lt;br /&gt;
* [http://www.wortschatz.uni-leipzig.de/ German plain text and Co-occurrences at LCC]&lt;br /&gt;
* [http://www.coli.uni-sb.de/sfb378/negra-corpus/negra-corpus.html NEGRA Corpus]&lt;br /&gt;
* [http://www.ims.uni-stuttgart.de/projekte/TIGER/TIGERCorpus/ TIGER treebank]&lt;br /&gt;
* [http://www.sfs.uni-tuebingen.de/en_tuebadz.shtml Tübingen Treebank of Written German (TüBa-D/Z)]&lt;br /&gt;
* [http://www.sfs.uni-tuebingen.de/en_tuebads.shtml Tübingen Treebank of Spoken German (TüBa-D/S, aka Verbmobil treebank)]&lt;br /&gt;
* [http://www.sfs.uni-tuebingen.de/en_tuepp.shtml Tübingen Partially Parsed Corpus of Written German (TüPP-D/Z)]&lt;br /&gt;
&lt;br /&gt;
==Evaluation datasets==&lt;br /&gt;
* [http://www.ukp.tu-darmstadt.de/data/semRelDatasets Semantic relatedness evaluation]&lt;br /&gt;
&lt;br /&gt;
==Lexicons==&lt;br /&gt;
* [http://www.ims.uni-stuttgart.de/tcl/RESOURCES/German-Lexicon-en.html Lexical information for German] &lt;br /&gt;
&lt;br /&gt;
==Resource Access==&lt;br /&gt;
* [http://wortschatz.uni-leipzig.de/Webservices/ Web service access to German language statistics]&lt;br /&gt;
&lt;br /&gt;
==Timeline Analysis==&lt;br /&gt;
* [http://wortschatz.uni-leipzig.de/wort-des-tages/ German Words of the Day]&lt;br /&gt;
* [http://www.sfs.uni-tuebingen.de/~lothar/nw/ Wortwarte (selection of German neologisms for each day) ]&lt;br /&gt;
&lt;br /&gt;
[[Category:Resources by language|German]]&lt;/div&gt;</summary>
		<author><name>Yversley</name></author>
	</entry>
	<entry>
		<id>https://www.aclweb.org/aclwiki/index.php?title=Resources_for_German&amp;diff=6162</id>
		<title>Resources for German</title>
		<link rel="alternate" type="text/html" href="https://www.aclweb.org/aclwiki/index.php?title=Resources_for_German&amp;diff=6162"/>
		<updated>2009-02-03T09:20:34Z</updated>

		<summary type="html">&lt;p&gt;Yversley: /* Timeline Analysis */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Corpora==&lt;br /&gt;
&amp;lt;!-- Please keep this list in alphabetical order --&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* [http://www.phonetik.uni-muenchen.de/Bas/BasKorporaeng.html Bavarian Archive for Speech Signals Corpora]&lt;br /&gt;
* [http://corpora.ids-mannheim.de/~cosmas/ COSMAS II]&lt;br /&gt;
* [http://www.ims.uni-stuttgart.de/projekte/tc/CQP.html Experimental Corpus Query System (University of Stuttgart, Germany)]&lt;br /&gt;
* [http://www.wortschatz.uni-leipzig.de/ German plain text and Co-occurrences at LCC]&lt;br /&gt;
* [http://www.coli.uni-sb.de/sfb378/negra-corpus/negra-corpus.html NEGRA Corpus]&lt;br /&gt;
* [http://www.ims.uni-stuttgart.de/projekte/TIGER/TIGERCorpus/ TIGER treebank]&lt;br /&gt;
* [http://www.sfs.uni-tuebingen.de/en_tuebadz.shtml Tübingen Treebank of Written German (TüBa-D/Z)]&lt;br /&gt;
&lt;br /&gt;
==Evaluation datasets==&lt;br /&gt;
* [http://www.ukp.tu-darmstadt.de/data/semRelDatasets Semantic relatedness evaluation]&lt;br /&gt;
&lt;br /&gt;
==Lexicons==&lt;br /&gt;
* [http://www.ims.uni-stuttgart.de/tcl/RESOURCES/German-Lexicon-en.html Lexical information for German] &lt;br /&gt;
&lt;br /&gt;
==Resource Access==&lt;br /&gt;
* [http://wortschatz.uni-leipzig.de/Webservices/ Web service access to German language statistics]&lt;br /&gt;
&lt;br /&gt;
==Timeline Analysis==&lt;br /&gt;
* [http://wortschatz.uni-leipzig.de/wort-des-tages/ German Words of the Day]&lt;br /&gt;
* [http://www.sfs.uni-tuebingen.de/~lothar/nw/ Wortwarte (selection of German neologisms for each day) ]&lt;br /&gt;
&lt;br /&gt;
[[Category:Resources by language|German]]&lt;/div&gt;</summary>
		<author><name>Yversley</name></author>
	</entry>
	<entry>
		<id>https://www.aclweb.org/aclwiki/index.php?title=Resources_for_German&amp;diff=6161</id>
		<title>Resources for German</title>
		<link rel="alternate" type="text/html" href="https://www.aclweb.org/aclwiki/index.php?title=Resources_for_German&amp;diff=6161"/>
		<updated>2009-02-03T09:19:17Z</updated>

		<summary type="html">&lt;p&gt;Yversley: added Tiger and Negra&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Corpora==&lt;br /&gt;
&amp;lt;!-- Please keep this list in alphabetical order --&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* [http://www.phonetik.uni-muenchen.de/Bas/BasKorporaeng.html Bavarian Archive for Speech Signals Corpora]&lt;br /&gt;
* [http://corpora.ids-mannheim.de/~cosmas/ COSMAS II]&lt;br /&gt;
* [http://www.ims.uni-stuttgart.de/projekte/tc/CQP.html Experimental Corpus Query System (University of Stuttgart, Germany)]&lt;br /&gt;
* [http://www.wortschatz.uni-leipzig.de/ German plain text and Co-occurrences at LCC]&lt;br /&gt;
* [http://www.coli.uni-sb.de/sfb378/negra-corpus/negra-corpus.html NEGRA Corpus]&lt;br /&gt;
* [http://www.ims.uni-stuttgart.de/projekte/TIGER/TIGERCorpus/ TIGER treebank]&lt;br /&gt;
* [http://www.sfs.uni-tuebingen.de/en_tuebadz.shtml Tübingen Treebank of Written German (TüBa-D/Z)]&lt;br /&gt;
&lt;br /&gt;
==Evaluation datasets==&lt;br /&gt;
* [http://www.ukp.tu-darmstadt.de/data/semRelDatasets Semantic relatedness evaluation]&lt;br /&gt;
&lt;br /&gt;
==Lexicons==&lt;br /&gt;
* [http://www.ims.uni-stuttgart.de/tcl/RESOURCES/German-Lexicon-en.html Lexical information for German] &lt;br /&gt;
&lt;br /&gt;
==Resource Access==&lt;br /&gt;
* [http://wortschatz.uni-leipzig.de/Webservices/ Web service access to German language statistics]&lt;br /&gt;
&lt;br /&gt;
==Timeline Analysis==&lt;br /&gt;
* [http://wortschatz.uni-leipzig.de/wort-des-tages/ German Words of the Day]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Category:Resources by language|German]]&lt;/div&gt;</summary>
		<author><name>Yversley</name></author>
	</entry>
</feed>