Difference between revisions of "Part-of-speech tagging"

From ACL Wiki
(4 intermediate revisions by 3 users not shown)
Line 1: Line 1:
 
'''Part-of-speech tagging''' is the task of assigning a part-of-speech tag to each word in a given text.
 
'''Part-of-speech tagging''' is the task of assigning a part-of-speech tag to each word in a given text.
 
<math>u' = u^2\,</math>
 
  
 
==History==
 
==History==
Line 8: Line 6:
  
 
==Software==
 
==Software==
 +
*[http://sourceforge.net/projects/acopost ACOPOST] - a collection of taggers using maximum entropy, second order Markov, exemplar, and transformation-based models.  See also [http://hermes.sourceforge.net/acopost.html this site]. Free, open source license.
 
*[http://danieldk.org/Code/Citar Citar] - uses [http://en.wikipedia.org/wiki/Trigram trigram]-based [http://en.wikipedia.org/wiki/Hidden_Markov_model HMM]s.  Free, open source license.
 
*[http://danieldk.org/Code/Citar Citar] - uses [http://en.wikipedia.org/wiki/Trigram trigram]-based [http://en.wikipedia.org/wiki/Hidden_Markov_model HMM]s.  Free, open source license.
 
*[http://crfpp.sourceforge.net CRF++] - uses [http://en.wikipedia.org/wiki/Conditional_random_field Conditional random fields]. Free, open source license.
 
*[http://crfpp.sourceforge.net CRF++] - uses [http://en.wikipedia.org/wiki/Conditional_random_field Conditional random fields]. Free, open source license.
Line 13: Line 12:
 
*[http://sourceforge.net/projects/gposttl GPoSTTL] - Enhanced [http://en.wikipedia.org/wiki/Brill_tagger TBL] tagger for English.  Open source license.
 
*[http://sourceforge.net/projects/gposttl GPoSTTL] - Enhanced [http://en.wikipedia.org/wiki/Brill_tagger TBL] tagger for English.  Open source license.
 
*[http://code.google.com/p/hunpos/ HunPos] - uses trigram-based HMMs.  Free, open source license.
 
*[http://code.google.com/p/hunpos/ HunPos] - uses trigram-based HMMs.  Free, open source license.
*[http://l2r.cs.uiuc.edu/~cogcomp/asoftware.php?skey=FLBJPOS LBJ POS Tagger] - Uses Averaged [http://en.wikipedia.org/wiki/Perceptron Perceptron] based sequential model. Java API, Free, open source license.
+
*[http://cogcomp.cs.illinois.edu/page/software_view/3 Illinois LBJ POS Tagger] - Uses averaged [http://en.wikipedia.org/wiki/Perceptron Perceptron] based sequential model. Java API, Free, open source license.
 
*[http://ilk.uvt.nl/mbt Memory-based tagger] (MBT) - uses [http://ilk.uvt.nl/timbl TiMBL].  Free, open source license.
 
*[http://ilk.uvt.nl/mbt Memory-based tagger] (MBT) - uses [http://ilk.uvt.nl/timbl TiMBL].  Free, open source license.
 +
*[http://ufal.mff.cuni.cz/morce/index.php Morče] - Uses Averaged [http://en.wikipedia.org/wiki/Perceptron Perceptron] based  model. Free, open source license.
 
*[http://nlp.stanford.edu/software/tagger.shtml Stanford Tagger] - uses [http://en.wikipedia.org/wiki/Logistic_regression Maximum entropy models].  Free, open source license.
 
*[http://nlp.stanford.edu/software/tagger.shtml Stanford Tagger] - uses [http://en.wikipedia.org/wiki/Logistic_regression Maximum entropy models].  Free, open source license.
*[http://www.lsi.upc.es/~nlp/SVMTool SVMTool] - uses [http://en.wikipedia.org/wiki/Support_vector_machine Support vector machines].  Free, open source license.
+
*[http://www.lsi.upc.es/~nlp/SVMTool SVMTool] - uses [http://en.wikipedia.org/wiki/Support_vector_machine Support vector machines].  Free, open source license, but depends on non-Free/open source [http://svmlight.joachims.org/ SVMlight].
  
 
==See also==
 
==See also==

Revision as of 07:24, 18 September 2012

Part-of-speech tagging is the task of assigning a part-of-speech tag to each word in a given text.
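
To make the task definition concrete, here is a minimal sketch of a most-frequent-tag baseline: each word receives the tag it was seen with most often in training data. This is an illustration only and is not how any of the taggers listed under Software work. The training sentences, the Penn Treebank-style tags, the `NN` fallback for unknown words, and the function names are all assumptions made for this example.

```python
from collections import Counter, defaultdict

# Hypothetical toy training data: sentences of (word, tag) pairs.
# The tags follow Penn Treebank conventions (DT, NN, VBZ) by assumption.
TRAIN = [
    [("the", "DT"), ("dog", "NN"), ("barks", "VBZ")],
    [("the", "DT"), ("cat", "NN"), ("sleeps", "VBZ")],
    [("a", "DT"), ("dog", "NN"), ("sleeps", "VBZ")],
]


def train_baseline(sentences):
    """Map each lowercased word to the tag it occurs with most often."""
    counts = defaultdict(Counter)
    for sentence in sentences:
        for word, tag in sentence:
            counts[word.lower()][tag] += 1
    return {word: tags.most_common(1)[0][0] for word, tags in counts.items()}


def tag(words, model, default="NN"):
    """Assign one tag per word; unknown words get the assumed default tag."""
    return [(word, model.get(word.lower(), default)) for word in words]


MODEL = train_baseline(TRAIN)

if __name__ == "__main__":
    print(tag("the cat barks".split(), MODEL))
```

A baseline like this ignores context, so it cannot resolve words whose tag depends on their neighbours; sequence models such as the HMM and conditional random field taggers listed under Software are meant to handle that.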

History

Further reading

Software

See also

External links