Difference between revisions of "User:Sivareddy"

From ACL Wiki
Jump to: navigation, search
(Page describing all my softwares)
Line 7: Line 7:
 
Research Interests: Lexical Semantics, Semantic Composition, Multiwords, Machine Learning, Word Sense Disambiguation/Induction, Lexical Acquisition, Web Corpora, Web as a Resource for NLP problems, Cross Language Resources, Syntactic Parsing, Question Answering Inference
 
Research Interests: Lexical Semantics, Semantic Composition, Multiwords, Machine Learning, Word Sense Disambiguation/Induction, Lexical Acquisition, Web Corpora, Web as a Resource for NLP problems, Cross Language Resources, Syntactic Parsing, Question Answering Inference
  
Keywords: Polysemy, Compositionality, Semantic Composition, Domain WSD, Vector Space Models, Semantics, IIIT Hyderabad, York, Lexical Computing Ltd., Sketch Engine
+
Keywords: [[Polysemy]], [[Compositionality]], [[Semantic Composition]], [[Domain WSD]], [[Vector Space Models]], [[Semantics]], IIIT Hyderabad, York, Lexical Computing Ltd., [[Sketch Engine]], [[Resources]], [[POS Taggers]], [[Morphological Analyzers]]
 +
 
 +
Please find some of the resources developed by me.
 +
 
 +
== Compound Noun Compositionality Dataset ==
 +
 
 +
[http://sivareddy.in/papers/files/ijcnlp_compositionality_data.tgz '''Compositionality Dataset'''] described in [http://sivareddy.in/papers/ijcnlp2011empirical.pdf Reddy, McCarthy and Manandhar (2011, IJCNLP)]. [http://dianamccarthy.co.uk/downloads.html Alternate download link] from [http://dianamccarthy.co.uk/ Diana McCarthy]
 +
 
 +
== POS Taggers, Corpora, Lemmatizers, Morph Analyzers for Indian Languages ==
 +
 
 +
Most of these tools are developed by the methods described in [http://sivareddy.in/papers/clia2011IndianCrossLang.pdf Reddy and Sharoff (2011, CLIA @ IJCNLP)]. Some of the taggers are built using cross-lingual resources and some using mono-lingual resources. Please read corresponding README's of each tool for additional information. This work is supported by [http://sketchengine.co.uk Sketch Engine] and [http://corpus.leeds.ac.uk/it/ Intellitext project]. If you need resources for any other Indian languages, please contact me.
 +
 
 +
=== Kannada Tools ===
 +
 
 +
[http://sivareddy.in/papers/files/kannada-pos-tagger-2.0.tgz Download v2.0] [http://sivareddy.in/papers/files/kannada.sample.out.txt Sample Output of the tagger] For the complete corpus described in the paper, please contact me. [http://corpus.leeds.ac.uk/tools/ Alternate download link] from [http://www.comp.leeds.ac.uk/ssharoff/ Serge Sharoff]
 +
 
 +
=== Telugu Tools ===
 +
 
 +
[http://sivareddy.in/papers/files/telugu-pos-tagger-2.0.tgz Download v2.0] [http://sivareddy.in/papers/files/telugu.sample.out.txt Sample Output of the tagger]
 +
 
 +
=== Hindi Tools ===
 +
 
 +
[http://sivareddy.in/papers/files/hindi-pos-tagger-2.0.tgz Download v2.0] [http://sivareddy.in/papers/files/hindi.sample.out.txt Sample Output of the tagger] [#apertium-indonesian-malaysian ]
 +
 
 +
== Indonesian and Malay morphological analyzer, part-of-speech (POS) tagger, Machine Translation System ==
 +
 
 +
With support from [http://sketchengine.co.uk Sketch Engine], I have made few contributions to the [http://wiki.apertium.org/wiki/Main_Page Apertium] Indonesian-Malay language pair. All the tools can be downloaded from svn repository https://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-id-ms/ To download use the command "svn co https://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-id-ms/" <br />

Revision as of 11:43, 11 May 2012

Name: Siva Reddy

Webpage: http://sivareddy.in

CV: http://sivareddy.in/cv_siva.pdf

Research Interests: Lexical Semantics, Semantic Composition, Multiwords, Machine Learning, Word Sense Disambiguation/Induction, Lexical Acquisition, Web Corpora, Web as a Resource for NLP problems, Cross Language Resources, Syntactic Parsing, Question Answering Inference

Keywords: Polysemy, Compositionality, Semantic Composition, Domain WSD, Vector Space Models, Semantics, IIIT Hyderabad, York, Lexical Computing Ltd., Sketch Engine, Resources, POS Taggers, Morphological Analyzers

Please find some of the resources developed by me.

Compound Noun Compositionality Dataset

Compositionality Dataset described in Reddy, McCarthy and Manandhar (2011, IJCNLP). Alternate download link from Diana McCarthy

POS Taggers, Corpora, Lemmatizers, Morph Analyzers for Indian Languages

Most of these tools are developed by the methods described in Reddy and Sharoff (2011, CLIA @ IJCNLP). Some of the taggers are built using cross-lingual resources and some using mono-lingual resources. Please read corresponding README's of each tool for additional information. This work is supported by Sketch Engine and Intellitext project. If you need resources for any other Indian languages, please contact me.

Kannada Tools

Download v2.0 Sample Output of the tagger For the complete corpus described in the paper, please contact me. Alternate download link from Serge Sharoff

Telugu Tools

Download v2.0 Sample Output of the tagger

Hindi Tools

Download v2.0 Sample Output of the tagger [#apertium-indonesian-malaysian ]

Indonesian and Malay morphological analyzer, part-of-speech (POS) tagger, Machine Translation System

With support from Sketch Engine, I have made few contributions to the Apertium Indonesian-Malay language pair. All the tools can be downloaded from svn repository https://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-id-ms/ To download use the command "svn co https://apertium.svn.sourceforge.net/svnroot/apertium/incubator/apertium-id-ms/"