Difference between revisions of "POS Induction (State of the art)"

From ACL Wiki
Jump to navigation Jump to search
(Berg-Kirkpatrick et al. (2010))
m (Reverted edits by Creek (talk) to last revision by Auser)
(4 intermediate revisions by 2 users not shown)
Line 1: Line 1:
 +
==Evaluation==
 +
'''Many-to-1:''' Map every induced label to a gold standard tag greedily (45 labels to 45 tags of the Penn tag set). Use the mapping to compute tag accuracy on the Wall Street Journal portion of the Penn TreeBank.
 +
 +
==Results==
 +
 
{| border="1" cellpadding="5" cellspacing="1" width="100%"
 
{| border="1" cellpadding="5" cellspacing="1" width="100%"
 
|-
 
|-
Line 7: Line 12:
 
! Many-to-1
 
! Many-to-1
 
|-
 
|-
| Prototype-based+Brown
+
| Brown+proto
 
| MRF initialized with Brown prototypes
 
| MRF initialized with Brown prototypes
 
| Christodoulopoulos, Goldwater and Steedman (2010)
 
| Christodoulopoulos, Goldwater and Steedman (2010)
Line 18: Line 23:
 
|  
 
|  
 
| 75.5%
 
| 75.5%
 +
|-
 +
| Clark DMF
 +
| Distributional clustering + morphology + frequency
 +
| Clark (2003)
 +
| [http://www.cs.rhul.ac.uk/home/alexc/pos2.tar.gz alexc]
 +
| 71.2%*
 
|-
 
|-
 
|}
 
|}
  
 +
<nowiki>*</nowiki> according to Christodoulopoulos, Goldwater and Steedman (2010)
  
 
== References ==
 
== References ==
Line 27: Line 39:
  
 
* [http://www.aclweb.org/anthology/N/N10/N10-1083.pdf Taylor Berg-Kirkpatrick, Alexandre Bouchard-Cote, John DeNero, and Dan Klein. 2010. Painless Unsupervised Learning with Features. NAACL 2010.]
 
* [http://www.aclweb.org/anthology/N/N10/N10-1083.pdf Taylor Berg-Kirkpatrick, Alexandre Bouchard-Cote, John DeNero, and Dan Klein. 2010. Painless Unsupervised Learning with Features. NAACL 2010.]
 +
 +
* [http://www.aclweb.org/anthology/E/E03/E03-1009.pdf Alexander Clark. 2003. Combining distributional and morphological information for part of speech induction. In Proceedings of EACL 2003, pages 59–66, Morristown, NJ, USA.]
  
 
== See also ==
 
== See also ==

Revision as of 05:17, 25 June 2012

Evaluation

Many-to-1: Map every induced label to a gold standard tag greedily (45 labels to 45 tags of the Penn tag set). Use the mapping to compute tag accuracy on the Wall Street Journal portion of the Penn TreeBank.

Results

System name Short description Main publications Software Many-to-1
Brown+proto MRF initialized with Brown prototypes Christodoulopoulos, Goldwater and Steedman (2010) 76.1%
Logistic regression with features and LBFGS Berg-Kirkpatrick et al. (2010) 75.5%
Clark DMF Distributional clustering + morphology + frequency Clark (2003) alexc 71.2%*

* according to Christodoulopoulos, Goldwater and Steedman (2010)

References

See also