Difference between revisions of "Language Identification (State of the art)"

From ACL Wiki
Jump to navigation Jump to search
Line 8: Line 8:
 
| SystemName || How does it work? || Author and Article [http://www.bla.com] || Software? || 98% according to... || Any extra comments?  
 
| SystemName || How does it work? || Author and Article [http://www.bla.com] || Software? || 98% according to... || Any extra comments?  
 
|-
 
|-
| textcat    || n-gram matching  || Ted Dunning (1994) "[http://citeseer.ist.psu.edu/dunning94statistical.html Statistical Identification of Language]" || Yes || - || -
+
| textcat    || n-gram matching  || Cavnar, W. B. and J. M. Trenkle (1994) "[http://www.nonlineardynamics.com/trenkle/papers/sdr94ps.gz N-Gram-Based Text Categorization]" || Yes || - || -
 
|-
 
|-
  

Revision as of 17:43, 23 May 2008

"Standard" measure:

"Standard" datasets:

System Name Short Description Main Publications Software (if available) Results Comments (i.e. extra resources used, train/test times, ...)
SystemName How does it work? Author and Article [1] Software? 98% according to... Any extra comments?
textcat n-gram matching Cavnar, W. B. and J. M. Trenkle (1994) "N-Gram-Based Text Categorization" Yes - -