Language Identification (State of the art)
==See also==
* [[Language Identification Tools]]
[[Category:State of the art]]
Latest revision as of 08:56, 6 December 2012
"Standard" measure:
"Standard" datasets:
| System Name | Short Description | Main Publications | Software (if available) | Results | Comments (e.g. extra resources used, train/test times, ...) |
|---|---|---|---|---|---|
| SystemName | How does it work? | Author and Article [1] | Software? | 98% according to... | Any extra comments? |
| textcat | n-gram matching | Cavnar, W. B. and J. M. Trenkle (1994) "[N-Gram-Based Text Categorization](http://www.nonlineardynamics.com/trenkle/papers/sdr94ps.gz)" | Yes | - | - |