Difference between revisions of "Language Identification (State of the art)"

From ACL Wiki
("Standard" datasets:)
(See also)
 
(2 intermediate revisions by 2 users not shown)
Line 8:
  | SystemName || How does it work? || Author and Article [http://www.bla.com] || Software? || 98% according to... || Any extra comments?
  |-
- | textcat || n-gram matching || Ted Dunning (1994) "[http://citeseer.ist.psu.edu/dunning94statistical.html Statistical Identification of Language]" || Yes || - || -
+ | textcat || n-gram matching || Cavnar, W. B. and J. M. Trenkle (1994) "[http://www.nonlineardynamics.com/trenkle/papers/sdr94ps.gz N-Gram-Based Text Categorization]" || Yes || - || -
  |-
  |}
+ ==See also==
+ * [[Language Identification Tools]]
  [[Category:State of the art]]

Latest revision as of 01:56, 6 December 2012

"Standard" measure:

"Standard" datasets:

{|
! System Name !! Short Description !! Main Publications !! Software (if available) !! Results !! Comments (e.g. extra resources used, train/test times, ...)
|-
| SystemName || How does it work? || Author and Article [http://www.bla.com] || Software? || 98% according to... || Any extra comments?
|-
| textcat || n-gram matching || Cavnar, W. B. and J. M. Trenkle (1994) "[http://www.nonlineardynamics.com/trenkle/papers/sdr94ps.gz N-Gram-Based Text Categorization]" || Yes || - || -
|}
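
The "n-gram matching" entry for textcat can be illustrated with a minimal sketch of rank-order profile comparison in the spirit of Cavnar and Trenkle (1994). This is not the textcat source code: the function names, the underscore padding, the maximum n-gram length of 3, and the profile size of 300 are assumptions made for illustration only.

```python
from collections import Counter


def ngram_profile(text, max_n=3, top_k=300):
    """Return up to top_k character n-grams (lengths 1..max_n), most frequent first.

    Assumption: each whitespace-separated token is padded with one
    underscore on each side; this is a simplification chosen for the sketch.
    """
    counts = Counter()
    for token in text.lower().split():
        padded = f"_{token}_"
        for n in range(1, max_n + 1):
            for i in range(len(padded) - n + 1):
                counts[padded[i:i + n]] += 1
    # Sort by descending count, then alphabetically so ties are deterministic.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [gram for gram, _ in ranked[:top_k]]


def out_of_place(doc_profile, lang_profile):
    """Sum of rank differences between two profiles.

    An n-gram missing from lang_profile costs a fixed maximum penalty
    (here, the length of lang_profile).
    """
    rank = {gram: r for r, gram in enumerate(lang_profile)}
    max_penalty = len(lang_profile)
    return sum(
        abs(r - rank[gram]) if gram in rank else max_penalty
        for r, gram in enumerate(doc_profile)
    )


def identify(text, lang_profiles):
    """Return the language whose profile is closest to the text's profile."""
    doc = ngram_profile(text)
    return min(lang_profiles, key=lambda lang: out_of_place(doc, lang_profiles[lang]))


# Toy usage with hypothetical training snippets (far too small for real use).
samples = {
    "en": "the quick brown fox jumps over the lazy dog and the cat",
    "de": "der schnelle braune fuchs springt über den faulen hund und die katze",
}
profiles = {lang: ngram_profile(text) for lang, text in samples.items()}
guess = identify("the dog and the fox", profiles)
```

In practice the language profiles would be built from much larger training texts, and the result column of the table would report accuracy on a held-out test set.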

See also