Language Identification (State of the art): Difference between revisions

From ACL Wiki
Kiwibird (talk | contribs)
 
Line 8:
| SystemName || How does it work? || Author and Article [http://www.bla.com] || Software? || 98% according to... || Any extra comments?  
|-
− | textcat    || n-gram matching  || Ted Dunning (1994) "[http://citeseer.ist.psu.edu/dunning94statistical.html Statistical Identification of Language]" || Yes || - || -
+ | textcat    || n-gram matching  || Cavnar, W. B. and J. M. Trenkle (1994) "[http://www.nonlineardynamics.com/trenkle/papers/sdr94ps.gz N-Gram-Based Text Categorization]" || Yes || - || -
|-


|}


==See also==
* [[Language Identification Tools]]


[[Category:State of the art]]

Latest revision as of 08:56, 6 December 2012

"Standard" measure:

"Standard" datasets:

{|
! System Name !! Short Description !! Main Publications !! Software (if available) !! Results !! Comments (e.g. extra resources used, train/test times, ...)
|-
| SystemName || How does it work? || Author and Article [http://www.bla.com] || Software? || 98% according to... || Any extra comments?
|-
| textcat || n-gram matching || Cavnar, W. B. and J. M. Trenkle (1994) "[http://www.nonlineardynamics.com/trenkle/papers/sdr94ps.gz N-Gram-Based Text Categorization]" || Yes || - || -
|}
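
To illustrate what "n-gram matching" in the textcat row refers to, here is a minimal Python sketch of rank-order profile comparison (the "out-of-place" measure) in the spirit of Cavnar and Trenkle (1994). It is not textcat's source code. The function names, the underscore word padding, the alphabetical tie-breaking, and the two toy training sentences are all assumptions made for illustration; usable profiles would need far more training text per language.

```python
from collections import Counter


def ngram_profile(text, max_n=5, top_k=300):
    """Return the top_k most frequent character n-grams (n = 1..max_n),
    ordered from most to least frequent."""
    counts = Counter()
    for token in text.lower().split():
        padded = f"_{token}_"  # assumption: "_" marks word boundaries
        for n in range(1, max_n + 1):
            for i in range(len(padded) - n + 1):
                counts[padded[i:i + n]] += 1
    # Sort by descending count, then alphabetically so ties are deterministic.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [gram for gram, _ in ranked[:top_k]]


def out_of_place(doc_profile, lang_profile):
    """Sum of rank differences between two profiles; an n-gram missing
    from lang_profile costs a fixed maximum penalty."""
    rank = {gram: r for r, gram in enumerate(lang_profile)}
    max_penalty = len(lang_profile)
    return sum(
        abs(r - rank[gram]) if gram in rank else max_penalty
        for r, gram in enumerate(doc_profile)
    )


def identify(text, lang_profiles):
    """Return the language whose profile is closest to the text's profile."""
    doc = ngram_profile(text)
    return min(lang_profiles, key=lambda lang: out_of_place(doc, lang_profiles[lang]))


# Toy training samples (hypothetical); real profiles need much more text.
SAMPLES = {
    "en": "the quick brown fox jumps over the lazy dog",
    "de": "der schnelle braune fuchs springt hoch und weit",
}
PROFILES = {lang: ngram_profile(text) for lang, text in SAMPLES.items()}

if __name__ == "__main__":
    print(identify("the dog jumps over the fox", PROFILES))
```

The language with the smallest out-of-place distance is chosen. With profiles this small the result on unseen text is unreliable, so the sketch only shows the mechanics of the comparison, not the accuracy of the method.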

See also