Difference between revisions of "Language Identification (State of the art)"
Jump to navigation
Jump to search
Line 8: | Line 8: | ||
| SystemName || How does it work? || Author and Article [http://www.bla.com] || Software? || 98% according to... || Any extra comments? | | SystemName || How does it work? || Author and Article [http://www.bla.com] || Software? || 98% according to... || Any extra comments? | ||
|- | |- | ||
− | | textcat || n-gram matching || | + | | textcat || n-gram matching || Cavnar, W. B. and J. M. Trenkle (1994) "[http://www.nonlineardynamics.com/trenkle/papers/sdr94ps.gz N-Gram-Based Text Categorization]" || Yes || - || - |
|- | |- | ||
Revision as of 17:43, 23 May 2008
"Standard" measure:
"Standard" datasets:
System Name | Short Description | Main Publications | Software (if available) | Results | Comments (i.e. extra resources used, train/test times, ...) |
---|---|---|---|---|---|
SystemName | How does it work? | Author and Article [1] | Software? | 98% according to... | Any extra comments? |
textcat | n-gram matching | Cavnar, W. B. and J. M. Trenkle (1994) "N-Gram-Based Text Categorization" | Yes | - | - |