Proceedings of the Third International Joint Conference on Natural Language Processing: Volume-I
I08-1: Entire volume
I08-1000: Front Matter
I08-1001: Badam-Osor Khaltar; Atsushi Fujii
A Lemmatization Method for Modern Mongolian and its Application to Information Retrieval
I08-1002: Hai Zhao; Chunyu Kit
An Empirical Comparison of Goodness Measures for Unsupervised Chinese Word Segmentation with a Unified Framework
I08-1003: Michael Tepper; Fei Xia
A Hybrid Approach to the Induction of Underlying Morphology
I08-1004: GuoDong Zhou; Fang Kong; QiaoMing Zhu
Context-Sensitive Convolution Tree Kernel for Pronoun Resolution
I08-1005: GuoDong Zhou; JunHui Li; LongHua Qian; QiaoMing Zhu
Semi-Supervised Learning for Relation Extraction
I08-1006: Xiaoyan Zhang; Ting Wang; Huowang Chen
Story Link Detection based on Dynamic Information Extending
I08-1007: Eiji Aramaki; Takeshi Imai; Kengo Miyo; Kazuhiko Ohe
Orthographic Disambiguation Incorporating Transliterated Probability
I08-1008: Min Zhang; Chengjie Sun; Haizhou Li; AiTi Aw; Chew Lim Tan; Xiaolong Wang
Name Origin Recognition Using Maximum Entropy Model and Diverse Features
I08-1009: Harshit Surana; Anil Kumar Singh
A More Discerning and Adaptable Multilingual Transliteration Mechanism for Indian Languages
I08-1010: G. Bharadwaja Kumar; Kavi Narayana Murthy
UCSG: A Wide Coverage Shallow Parsing System
I08-1011: Prachya Boonkwan; Thepchai Supnithi
Memory-Inductive Categorial Grammar: An Approach to Gap Resolution in Analytic-Language Translation
I08-1012: Wenliang Chen; Daisuke Kawahara; Kiyotaka Uchimoto; Yujie Zhang; Hitoshi Isahara
Dependency Parsing with Short Dependency Relations in Unlabeled Data
I08-1013: Béatrice Daille; Emmanuel Morin
An Effective Compositional Model for Lexical Alignment
I08-1014: Wilson Wong; Wei Liu; Mohammed Bennamoun
Determining the Unithood of Word Sequences Using a Probabilistic Approach
I08-1015: Dinakar Jayarajan; Dipti Deodhare; Balaraman Ravindran
Lexical Chains as Document Features
I08-1016: Ani Nenkova
Entity-driven Rewrite for Multi-document Summarization
I08-1017: Xiaofeng Wu; Chengqing Zong
A New Approach to Automatic Document Summarization
I08-1018: Harendra Bhandari; Masashi Shimbo; Takahiko Ito; Yuji Matsumoto
Generic Text Summarization Using Probabilistic Latent Semantic Indexing
I08-1019: Yasunari Miyabe; Hiroya Takamura; Manabu Okumura
Identifying Cross-Document Relations between Sentences
I08-1020: Horacio Saggion
Experiments on Semantic-based Clustering for Cross-document Coreference
I08-1021: Long Qiu; Min-Yen Kan; Tat-Seng Chua
Modeling Context in Scenario Template Creation
I08-1022: Ke Wu; Xiaolin Wang; Bao-Liang Lu
Cross Language Text Categorization Using a Bilingual Lexicon
I08-1023: Yong Wang; Yiqun Liu; Min Zhang; Shaoping Ma; Liyun Ru
Identify Temporal Websites Based on User Behavior Analysis
I08-1024: Youssef Kadri; Jian-Yun Nie
A Comparative Study for Query Translation using Linear Combination and Confidence Measure
I08-1025: Keiji Shinzato; Tomohide Shibata; Daisuke Kawahara; Chikara Hashimoto; Sadao Kurohashi
TSUBAKI: An Open Search Engine Infrastructure for Developing New Information Access Methodology
I08-1026: Fan Ding; Bin Wang
A Study on Effectiveness of Syntactic Relationship in Dependence Retrieval Model
I08-1027: Takashi Shichiri; Hiroaki Nanjo; Takehiko Yoshimi
Automatic Estimation of Word Significance oriented for Speech-based Information Retrieval
I08-1028: Yuichiro Fukubayashi; Kazunori Komatani; Mikio Nakano; Kotaro Funakoshi; Hiroshi Tsujino; Tetsuya Ogata; Hiroshi G. Okuno
Rapid Prototyping of Robust Language Understanding Modules for Spoken Dialogue Systems
I08-1029: Gina-Anne Levow
Automatic Prosodic Labeling with Conditional Random Fields and Rich Acoustic Features
I08-1030: Ruiqiang Zhang; Eiichiro Sumita
Chinese Unknown Word Translation by Subword Re-segmentation
I08-1031: Jong-Hoon Oh; Hitoshi Isahara
Hypothesis Selection in Machine Transliteration: A Web Mining Approach
I08-1032: Takeshi Abekawa; Kyo Kageura
What Prompts Translators to Modify Draft Translations? An Analysis of Basic Modification Patterns for Use in the Automatic Notification of Awkwardly Translated Text
I08-1033: Ming-Hong Bai; Keh-Jiann Chen; Jason S. Chang
Improving Word Alignment by Adjusting Chinese Word Segmentation
I08-1034: Marina Sokolova; Vivi Nastase; Stan Szpakowicz
The Telling Tail: Signals of Success in Electronic Negotiation Texts
I08-1035: Dipanjan Das; Mohit Kumar; Alexander I. Rudnicky
Automatic Extraction of Briefing Templates
I08-1036: Oksana Yakhnenko; Barbara Rosario
Mining the Web for Relations between Digital Devices using a Probabilistic Maximum Margin Model
I08-1037: Yu-Chun Wang; Richard Tzong-Han Tsai; Wen-Lian Hsu
Learning Patterns from the Web to Translate Named Entities for Cross Language Information Retrieval
I08-1038: Bo Wang; Houfeng Wang
Bootstrapping Both Product Features and Opinion Words from Chinese Customer Reviews with Cross-Inducing
I08-1039: Daisuke Ikeda; Hiroya Takamura; Lev-Arie Ratinov; Manabu Okumura
Learning to Shift the Polarity of Words for Sentiment Classification
I08-1040: Taras Zagibalov; John Carroll
Unsupervised Classification of Sentiment and Objectivity in Chinese Text
I08-1041: Saima Aman; Stan Szpakowicz
Using Roget's Thesaurus for Fine-grained Emotion Recognition
I08-1042: Jesús Giménez; Lluís Márquez
Heterogeneous Automatic MT Evaluation Through Non-Parametric Metric Combinations
I08-1043: Young-Sook Hwang; Young-Kil Kim; Sangkyu Park
Paraphrasing Depending on Bilingual Context Toward Generalization of Translation Knowledge
I08-1044: Xiaofeng Yu; Wai Lam; Shing-Kit Chan
A Framework Based on Graphical Models with Logic for Chinese Named Entity Recognition
I08-1045: Sujan Kumar Saha; Sudeshna Sarkar; Pabitra Mitra
A Hybrid Feature Set based Maximum Entropy Hindi Named Entity Recognition
I08-1046: Stanley Wai Keong Yong; Jian Su
An Effective Method of Using Web Based Information for Relation Extraction
I08-1047: Mamoru Komachi; Hisami Suzuki
Minimally Supervised Learning of Semantic Knowledge from Query Logs
I08-1048: Jingbo Zhu; Huizhen Wang; Eduard Hovy
Learning a Stopping Criterion for Active Learning for Word Sense Disambiguation and Text Classification
I08-1049: Jin-Shea Kuo; Haizhou Li
Multi-View Co-Training of Transliteration Model
I08-1050: Kenji Hirohata; Naoaki Okazaki; Sophia Ananiadou; Mitsuru Ishizuka
Identifying Sections in Scientific Abstracts using Conditional Random Fields
I08-1051: Aljoscha Burchardt; Sebastian Padó; Dennis Spohr; Anette Frank; Ulrich Heid
Formalising Multi-layer Corpora in OWL DL - Lexicon Modelling, Querying and Consistency Control
I08-1052: Kiyoaki Shirai; Takenobu Tokunaga; Chu-Ren Huang; Shu-Kai Hsieh; Tzu-Yi Kuo; Virach Sornlertlamvanich; Thatsanee Charoenporn
Constructing Taxonomy of Numerative Classifiers for Asian Languages
I08-1053: Nikesh Garera; David Yarowsky
Translating Compounds by Learning Component Gloss Translation Models via Multiple Languages
I08-1054: Marius Pasca
Answering Definition Questions via Temporally-Anchored Text Snippets
I08-1055: Ryuichiro Higashinaka; Hideki Isozaki
Corpus-based Question Answering for why-Questions
I08-1056: Lucian Vlad Lita; Jaime Carbonell
Cluster-Based Query Expansion for Statistical Question Answering
I08-1057: Chen-Ming Hung
A Semantic Feature for Relation Recognition Using a Web-based Corpus
I08-1058: Yo Ehara; Kumiko Tanaka-Ishii
Multilingual Text Entry using Automatic Language Detection
I08-1059: Michael Gamon; Jianfeng Gao; Chris Brockett; Alexandre Klementiev; William B. Dolan; Dmitriy Belenko; Lucy Vanderwende
Using Contextual Speller Techniques and Language Modeling for ESL Error Correction
I08-1060: Takashi Tsunakawa; Jun'ichi Tsujii
Bilingual Synonym Identification with Spelling Variations
I08-1061: Nikesh Garera; David Yarowsky
Minimally Supervised Multilingual Taxonomy and Translation Lexicon Induction
I08-1062: Jessica Ramírez; Masayuki Asahara; Yuji Matsumoto
Japanese-Spanish Thesaurus Construction Using English as a Pivot
I08-1063: M. Saravanan; B. Ravindran; S. Raman
Automatic Identification of Rhetorical Roles using Conditional Random Fields for Legal Document Summarization
I08-1064: Kathrin Spreyer; Anette Frank
Projection-based Acquisition of a Temporal Labeller
I08-1065: Shuya Abe; Kentaro Inui; Yuji Matsumoto
Acquiring Event Relation Knowledge by Learning Cooccurrence Patterns and Fertilizing Cooccurrence Samples with Verbal Nouns
I08-1066: Deyi Xiong; Min Zhang; AiTi Aw; Haitao Mi; Qun Liu; Shouxun Lin
Refinements in BTG-based Statistical Machine Translation
I08-1067: Ananthakrishnan Ramanathan; Jayprasad Hegde; Ritesh M. Shah; Pushpak Bhattacharyya; Sasikumar M
Simple Syntactic and Morphological Processing Can Help English-Hindi Statistical Machine Translation
I08-1068: Rohini U; Vamshi Ambati; Vasudeva Varma
Statistical Machine Translation Models for Personalized Search
I08-1069: Fei Xia; William D. Lewis
Repurposing Theoretical Linguistic Data for Tool Development and Search
I08-1070: Atsushi Fujita; Satoshi Sato
Computing Paraphrasability of Syntactic Variants Using Web Snippets
I08-1071: Wisam Dakka; Silviu Cucerzan
Augmenting Wikipedia with Named Entity Tags
I08-1072: Masato Hagiwara; Yasuhiro Ogawa; Katsuhiko Toyama
Context Feature Selection for Distributional Similarity
I08-1073: Ryu Iida; Diana McCarthy; Rob Koeling
Gloss-Based Semantic Similarity Metrics for Predominant Sense Acquisition
I08-1074: Su Nam Kim; Timothy Baldwin
Benchmarking Noun Compound Interpretation
Proceedings of the Third International Joint Conference on Natural Language Processing: Volume-II
I08-2: Entire volume
I08-2075: Aasish Pappu; Ratna Sanyal
Vaakkriti: Sanskrit Tokenizer
I08-2076: Amir Azim Sharifloo; Mehrnoush Shamsfard
A Bottom Up approach to Persian Stemming
I08-2077: Asif Ekbal; Rejwanul Haque; Sivaji Bandyopadhyay
Named Entity Recognition in Bengali: A Conditional Random Field Approach
I08-2078: Shing-Kit Chan; Wai Lam; Xiaofeng Yu
An Online Cascaded Approach to Biomedical Named Entity Recognition
I08-2079: Qiang Zhou
Automatic Rule Acquisition for Chinese Intra-chunk Relations
I08-2080: Ryohei Sasano; Sadao Kurohashi
Japanese Named Entity Recognition Using Structural Natural Language Processing
I08-2081: YingJu Xia; Hao Yu; Gang Zou
Dimensionality Reduction with Multilingual Resource
I08-2082: Xing Yi; Jianfeng Gao; William B. Dolan
A Web-based English Proofing System for English as a Second Language Users
I08-2083: Hung-Chi Huang; Hsin-Hsi Chen; Ming-Shun Lin
Analysis of Intention in Dialogues Using Category Trees and Its Application to Advertisement Recommendation
I08-2084: Thuy Vu; AiTi Aw; Min Zhang
Term Extraction Through Unithood and Termhood Unification
I08-2085: Yeha Lee; Seung-Hoon Na; Jong-Hyeok Lee
Search Result Clustering Using Label Language Model
I08-2086: HaiXiang Huang; Atsushi Fujii
Effects of Related Term Extraction in Transliteration into Chinese
I08-2087: Dakun Zhang; Le Sun; Wenbo Li
A Structured Prediction Approach for Statistical Machine Translation
I08-2088: Keiji Yasuda; Ruiqiang Zhang; Hirofumi Yamamoto; Eiichiro Sumita
Method of Selecting Training Data to Build a Compact and Efficient Translation Model
I08-2089: Holger Schwenk; Philipp Koehn
Large and Diverse Language Models for Statistical Machine Translation
I08-2090: Javier Couto; Jean-Luc Minel
A Linguistic and Navigational Knowledge Approach to Text Navigation
I08-2091: Virach Sornlertlamvanich; Thatsanee Charoenporn; Chumpol Mokarat; Hitoshi Isahara; Hammam Riza; Purev Jaimai
Synset Assignment for Bi-lingual Dictionary with Limited Resource
I08-2092: Tomoya Noro; Takehiro Tokuda
Ranking Words for Building a Japanese Defining Vocabulary
I08-2093: William D. Lewis; Fei Xia
Automatically Identifying Computationally Relevant Typological Features
I08-2094: Suguru Matsuyoshi; Satoshi Sato
Automatic Paraphrasing of Japanese Functional Expressions Using a Hierarchically Organized Dictionary
I08-2095: Sibabrata Paladhi; Sivaji Bandyopadhyay
Generation of Referring Expression Using Prefix Tree Structure
I08-2096: Tuomo Kakkonen; Erkki Sutinen
Coverage-based Evaluation of Parser Generalizability
I08-2097: Daisuke Kawahara; Kiyotaka Uchimoto
Learning Reliability of Parses for Domain Adaptation of Dependency Parsing
I08-2098: Duen-Chi Yang; Yu-Ming Hsieh; Keh-Jiann Chen
Resolving Ambiguities of Chinese Conjunctive Structures by Divide-and-conquer Approaches
I08-2099: Rafiya Begum; Samar Husain; Arun Dhwaj; Dipti Misra Sharma; Lakshmi Bai; Rajeev Sangal
Dependency Annotation Scheme for Indian Languages
I08-2100: Masaki Murata; Sachiyo Tsukawaki; Toshiyuki Kanamaru; Qing Ma; Hitoshi Isahara
Non-Factoid Japanese Question Answering through Passage Retrieval that Is Weighted Based on Types of Answers
I08-2101: Mohamad Ali Honarpisheh; Gholamreza Ghassem-Sani; Ghassem Mirroshandel
A Multi-Document Multi-Lingual Automatic Summarization System
I08-2102: Megumi Makino; Kazuhide Yamamoto
Summarization by Analogy: An Example-based Approach for News Articles
I08-2103: Donghong Ji; Yu Nie
Sentence Ordering based on Cluster Adjacency in Multi-Document Summarization
I08-2104: Tomoyosi Akiba; Kei Shimizu; Atsushi Fujii
Statistical Machine Translation based Passage Retrieval for Cross-Lingual Question Answering
I08-2105: Vivi Nastase
Unsupervised All-words Word Sense Disambiguation with Grammatical Dependencies
I08-2106: Patrick Saint-Dizier
Syntactic and Semantic Frames in PrepNet
I08-2107: Lin Sun; Anna Korhonen; Yuval Krymolowski
Automatic Classification of English Verbs Using Rich Syntactic Features
I08-2108: Timothy Baldwin; Su Nam Kim; Francis Bond; Sanae Fujita; David Martinez; Takaaki Tanaka
MRD-based Word Sense Disambiguation: Further Extending Lesk
I08-2109: Wanxiang Che; Min Zhang; AiTi Aw; Chew Lim Tan; Ting Liu; Sheng Li
Fast Computing Grammar-driven Convolution Tree Kernel for Semantic Role Labeling
I08-2110: Tomohide Shibata; Michitaka Odani; Jun Harashima; Takashi Oonishi; Sadao Kurohashi
SYNGRAPH: A Flexible Matching Method based on Synonymous Expression Extraction from an Ordinary Dictionary and a Web Corpus
I08-2111: Eduard Bejček; Pavel Straňák; Pavel Schlesinger
Annotation of Multiword Expressions in the Prague Dependency Treebank
I08-2112: Paul McNamee; Rion Snow; Patrick Schone; James Mayfield
Learning Named Entity Hyponyms for Question Answering
I08-2113: Kamadev Bhanuprasad; Mats Svenson
Errgrams -- A Way to Improving ASR for Highly Inflected Dravidian Languages
I08-2114: Sunita Maithani; J.S. Rawat
Noise as a Tool for Spoken Language Identification
I08-2115: Sameer Badaskar; Sachin Agarwal; Shilpa Arora
Identifying Real or Fake Articles: Towards better Language Modeling
I08-2116: Akinori Fujino; Hideki Isozaki; Jun Suzuki
Multi-label Text Categorization with Model Combination based on F1-score Maximization
I08-2117: Manabu Sassano
An Experimental Comparison of the Voted Perceptron and Support Vector Machines in Japanese Analysis Tasks
I08-2118: Venkatesan Chakravarthy; Sachindra Joshi; Ganesh Ramakrishnan; Shantanu Godbole; Sreeram Balakrishnan
Learning Decision Lists with Known Rules for Text Mining
I08-2119: Mengqiu Wang
A Re-examination of Dependency Path Kernels for Relation Extraction
I08-2120: Bo Li; Juan Liu
Mining Chinese-English Parallel Corpora from the Web
I08-2121: Kenji Tateishi; Dai Kusui
Fast Duplicated Documents Detection using Multi-level Prefix-filter
I08-2122: Yoshinobu Kano; Ngan Nguyen; Rune Sætre; Kazuhiro Yoshida; Keiichiro Fukamachi; Yusuke Miyao; Yoshimasa Tsuruoka; Sophia Ananiadou; Jun'ichi Tsujii
Towards Data and Goal Oriented Analysis: Tool Inter-operability and Combinatorial Comparison
I08-2123: Danushka Bollegala; Yutaka Matsuo; Mitsuru Ishizuka
A Co-occurrence Graph-based Approach for Personal Name Alias Extraction from Anchor Texts
I08-2124: Donghui Feng; Gully Burns; Jingbo Zhu; Eduard Hovy
Towards Automated Semantic Analysis on Biomedical Research Articles
I08-2125: Lucian Vlad Lita; Shipeng Yu; Stefan Niculescu; Jinbo Bi
Large Scale Diagnostic Code Classification for Medical Patient Records
I08-2126: Asuka Sumida; Kentaro Torisawa
Hacking Wikipedia for Hyponymy Relation Acquisition
I08-2127: Naoaki Okazaki; Mitsuru Ishizuka; Jun'ichi Tsujii
A Discriminative Approach to Japanese Abbreviation Extraction
I08-2128: Mostafa Al Masum Shaikh; Helmut Prendinger; Mitsuru Ishizuka
Linguistic Interpretation of Emotions for Affect Sensing from Text
I08-2129: Hiroyuki Akama; Maki Miyake; Jaeyoung Jung
How to Take Advantage of the Limitations with Markov Clustering?--The Foundations of Branching Markov Clustering (BMCL)
I08-2130: Hongzhi Xu; Chunping Li
Combining Context Features by Canonical Belief Network for Chinese Part-Of-Speech Tagging
I08-2131: Ahmed Hassan; Sara Noeman; Hany Hassan
Language Independent Text Correction using Finite State Automata
I08-2132: Yaodong Chen; Ting Wang; Huowang Chen; Xishan Xu
Semantic Role Labeling of Chinese Using Transductive SVM and Semantic Heuristics
I08-2133: Maria Georgescul; Alexander Clark; Susan Armstrong
A Comparative Study of Mixture Models for Automatic Topic Segmentation of Multiparty Dialogues
I08-2134: Richard Tzong-Han Tsai; Hsi-Chuan Hung
Exploiting Unlabeled Text to Extract New Words of Different Semantic Transparency for Chinese Word Segmentation
I08-2135: Monojit Choudhury; Animesh Mukherjee; Niloy Ganguly
Social Network Inspired Models of NLP and Language Evolution
I08-2136: Rada Mihalcea; Vivi Nastase
How to Add a New Language on the NLP Map: Building Resources and Tools for Languages with Scarce Resources
I08-2137: Horacio Saggion
Introduction to Text Summarization and Other Information Access Technologies
I08-2138: Mandeep Singh Gill; Gurpreet Singh Lehal; Shiv Sharma Joshi
A Punjabi Grammar Checker
I08-2139: Jiří Mírovský
Netgraph – Making Searching in Treebanks Easy
I08-2140: Son Doan; QuocHung-Ngo; Ai Kawazoe; Nigel Collier
Global Health Monitor - A Web-based System for Detecting and Mapping Infectious Diseases
I08-2141: Anil Kumar Singh
A Mechanism to Provide Language-Encoding Support and an NLP Friendly Editor
I08-2142: Ruvan Weerasinghe; Asanka Wasala; Dulip Herath; Viraj Welgama
NLP Applications of Sinhala: TTS & OCR
I08-2143: Swati Gupta; Marilyn A. Walker; Daniela M. Romano
POLLy: A Conversational System that uses a Shared Representation to Generate Action and Social Language
I08-2144: CLIA Consortium
Cross Lingual Information Access System for Indian Languages
I08-2145:
Author Index
Proceedings of the IJCNLP-08 Workshop on NLP for Less Privileged Languages
I08-3000: Front Matter
I08-3001: Anne David; Michael Maxwell
Invited Talk: Building Language Resources: Ways to move forward
I08-3002: Virach Sornlertlamvanich
Invited Talk: Cross Language Resource Sharing
I08-3003: Monojit Choudhury
Invited Talk: Breaking the Zipfian Barrier of NLP
I08-3004: Anil Kumar Singh
Natural Language Processing for Less Privileged Languages: Where do we come from? Where are we going?
I08-3005: Thatsanee Charoenporn; Virach Sornlertlamvanich; Hitoshi Isahara; Kergrit Robkop
KUI: an ubiquitous tool for collective intelligence development
I08-3006: Tirthankar Dasgupta; Sandipan Dandapat; Anupam Basu
Prototype Machine Translation System From Text-To-Indian Sign Language
I08-3007: Michael Maxwell; Anne David
Joint Grammar Development by Linguists and Computer Scientists
I08-3008: Daniel Zeman; Philip Resnik
Cross-Language Parser Adaptation between Related Languages
I08-3009: Sandeva Goonetilleke; Yoshihiko Hayashi; Yuichi Itoh; Fumio Kishino
SriShell Primo: A Predictive Sinhala Text Input System
I08-3010: Zin Maung Maung; Yoshiki Mikami
A Rule-based Syllable Segmentation of Myanmar Text
I08-3011: I. Alegria; X. Arregi; X. Artola; A. Diaz de Ilarraza; G. Labaka; M. Lersundi; A. Mayor; K. Sarasola
Strategies for sustainable MT for Basque: incremental design, reusability, standardization and open-source
I08-3012: Sandipan Sarkar; Sivaji Bandyopadhyay
Design of a Rule-based Stemmer for Natural Language Text in Bengali
I08-3013: Jackson Muhirwe; Trond Trosterud
Finite State Solutions For Reduplication In Kinyarwanda Language
I08-3014: Mohammad Naveed Ali; M. A. Khan; Muhammad Aamir Khan
An Optimal Order of Factors for the Computational Treatment of Personal Anaphoric Devices in Urdu Discourse
I08-3015: Thoudam Doren Singh; Sivaji Bandyopadhyay
Morphology Driven Manipuri POS Tagger
I08-3016: Krishnakumar V; Indrani Roy
Acharya - A Text Editor and Framework for working with Indic Scripts
I08-3017: R.K. Aggarwal; M. Dave
Implementing a Speech Recognition System Interface for Indian Languages
I08-3018: Hammam Riza
Indigenous Languages of Indonesia: Creating Language Resources for Language Preservation
I08-3019: Chirag Patel; Karthik Gali
Part-Of-Speech Tagging for Gujarati Using Conditional Random Fields
I08-3020: David Ellis; Mathias Creutz; Timo Honkela; Mikko Kurimo
Speech to speech machine translation: Biblical chatter from Finnish to English
Proceedings of the Sixth SIGHAN Workshop on Chinese Language Processing
I08-4: Entire volume
I08-4000: Front Matter
I08-4001: Zhou-Jun Li; Wen-Han Chao; Yue-Xin Chen
An Example-based Decoder for Spoken Language Machine Translation
I08-4002: Chengguo Jin; Seung-Hoon Na; Dong-Il Kim; Jong-Hyeok Lee
Automatic Extraction of English-Chinese Transliteration Pairs using Dynamic Window and Tokenizer
I08-4003: Jin-Shea Kuo; Haizhou Li; Chih-Lung Lin
Mining Transliterations from Web Query Results: An Incremental Approach
I08-4004: Feiliang Ren; Jingbo Zhu
An Effective Hybrid Machine Learning Approach for Coreference Resolution
I08-4005: Yuchang Cheng; Masayuki Asahara; Yuji Matsumoto
Use of Event Types for Temporal Relation Identification in Chinese Text
I08-4006: Jinghua Wang; Jianyi Liu; Ping Zhang
Chinese Word Sense Disambiguation with PageRank and HowNet
I08-4007: Bor-shen Lin
Stochastic Dependency Parsing Based on A* Admissible Search
I08-4008: Jia Lu; Masayuki Asahara; Yuji Matsumoto
Analyzing Chinese Synthetic Words with Tree-based Information and a Survey on Chinese Morphologically Derived Words
I08-4009: Zhenxing Wang; Changning Huang; Jingbo Zhu
Which Performs Better on In-Vocabulary Word Segmentation: Based on Word or Character?
I08-4010: Guangjin Jin; Xiao Chen
The Fourth International Chinese Language Processing Bakeoff: Chinese Word Segmentation, Named Entity Recognition and Chinese POS Tagging
I08-4011: Aitao Chen; Ya Zhang; Gordon Sun
A Two-Stage Approach to Chinese Part-of-Speech Tagging
I08-4012: Jiang Li; Rile Hu; Guohua Zhang; Yuezhong Tang; Zhanjiang Song; Xia Wang
NOKIA Research Center Beijing Chinese Word Segmentation System for the SIGHAN Bakeoff 2007
I08-4013: Xinnian Mao; Yuan Dong; Saike He; Sencheng Bao; Haila Wang
Chinese Word Segmentation and Named Entity Recognition Based on Conditional Random Fields
I08-4014: Ying Qin; Caixia Yuan; Jiashen Sun; Xiaojie Wang
BUPT Systems in the SIGHAN Bakeoff 2007
I08-4015: Zhenxing Wang; Changning Huang; Jingbo Zhu
The Character-based CRF Segmenter of MSRA&NEU for the 4th Bakeoff
I08-4016: Xiaofeng Yu; Wai Lam; Shing-Kit Chan; Yiu Kei Wu; Bo Chen
Chinese NER Using CRFs and Logic for the Fourth SIGHAN Bakeoff
I08-4017: Hai Zhao; Chunyu Kit
Unsupervised Segmentation Helps Supervised Learning of Character Tagging for Word Segmentation and Named Entity Recognition
I08-4018: Samuel W.K. Chan; Mickey W.C. Chong
An Agent-Based Approach to Chinese Word Segmentation
I08-4019: Xiaohe Chen; Bin Li; Junzhi Lu; Hongdong Nian; Xuri Tang
Nanjing Normal University Segmenter for the Fourth SIGHAN Bakeoff
I08-4020: Yuanyong Feng; Ruihong Huang; Le Sun
Two Step Chinese Named Entity Recognition Based on Conditional Random Fields Models
I08-4021: Guohong Fu; Jonathan J. Webster
A Morpheme-based Part-of-Speech Tagger for Chinese
I08-4022: Jingzhou He; Houfeng Wang
Chinese Named Entity Recognition and Word Segmentation Based on Character
I08-4023: Degen Huang; Xiao Sun; Shidou Jiao; Lishuang Li; Zhuoye Ding; Ru Wan
HMM and CRF Based Hybrid Model for Chinese Lexical Analysis
I08-4024: Ka Seng Leong; Fai Wong; Yiping Li; Ming Chui Dong
Chinese Tagging Based on Maximum Entropy Model
I08-4025: Dong Song; Anoop Sarkar
Training a Perceptron with Global and Local Features for Chinese Word Segmentation
I08-4026: Guang-Lu Sun; Cheng-Jie Sun; Ke Sun; Xiao-Long Wang
A Study of Chinese Lexical Analysis Based on Discriminative Models
I08-4027: Jia-Lin Tsai
Word Boundary Token Model for the SIGHAN Bakeoff 2007
I08-4028: Xihong Wu; Xiaojun Lin; Xinhao Wang; Chunyao Wu; Yaozhong Zhang; Dianhai Yu
An Improved CRF based Chinese Language Processing System for SIGHAN Bakeoff 2007
I08-4029: Yu-Chieh Wu; Jie-Chi Yang; Yue-Shi Lee
Description of the NCU Chinese Word Segmentation and Part-of-Speech Tagging for SIGHAN Bakeoff 2007
I08-4030: Zhiting Xu; Xian Qian; Yuejie Zhang; Yaqian Zhou
CRF-based Hybrid Model for Word Segmentation, NER and even POS Tagging
I08-4031: Fan Yang; Jun Zhao; Bo Zou
CRFs-Based Named Entity Recognition Incorporated with Heuristic Entity List Searching
I08-4032: Jianfeng Zhang; Jiaheng Zheng; Hu Zhang; Hongye Tan
A Chinese Word Segmentation System Based on Cascade Model
I08-4033: Ruiqiang Zhang; Eiichiro Sumita
Achilles: NiCT/ATR Chinese Morphological Analyzer for the Fourth Sighan Bakeoff
I08-4034:
Author Index
Proceedings of the IJCNLP-08 Workshop on Named Entity Recognition for South and South East Asian Languages
I08-5000: Front Matter
I08-5001: Sobha L
Invited Talk: Named Entity Recognition: Different Approaches
I08-5002: Sivaji Bandyopadhyay
Invited Talk: Multilingual Named Entity Recognition
I08-5003: Anil Kumar Singh
Named Entity Recognition for South and South East Asian Languages: Taking Stock
I08-5004: Sujan Kumar Saha; Sanjay Chatterji; Sandipan Dandapat; Sudeshna Sarkar; Pabitra Mitra
A Hybrid Named Entity Recognition System for South and South East Asian Languages
I08-5005: Karthik Gali; Harshit Surana; Ashwini Vaidya; Praneeth Shishtla; Dipti Misra Sharma
Aggregating Machine Learning and Rule Based Heuristics for Named Entity Recognition
I08-5006: Asif Ekbal; Rejwanul Haque; Amitava Das; Venkateswarlu Poka; Sivaji Bandyopadhyay
Language Independent Named Entity Recognition in Indian Languages
I08-5007: P Srikanth; Kavi Narayana Murthy
Named Entity Recognition for Telugu
I08-5008: Asif Ekbal; Sivaji Bandyopadhyay
Bengali Named Entity Recognition Using Support Vector Machine
I08-5009: Vijayakrishna R; Sobha L
Domain Focused Named Entity Recognizer for Tamil Using Conditional Random Fields
I08-5010: Praneeth M Shishtla; Prasad Pingali; Vasudeva Varma
A Character n-gram Based Approach for Improved Recall in Indian Language NER
I08-5011: Bidyut Baran Chaudhuri; Suvankar Bhattacharya
An Experiment on Automatic Detection of Named Entities in Bangla
I08-5012: Praveen P; Ravi Kiran V
Hybrid Named Entity Recognition System for South and South East Asian Languages
I08-5013: Amit Goyal
Named Entity Recognition for South Asian Languages
I08-5014: Animesh Nayan; B. Ravi Kiran Rao; Pawandeep Singh; Sudip Sanyal; Ratna Sanyal
Named Entity Recognition for Indian Languages
I08-5015: Praneeth M Shishtla; Karthik Gali; Prasad Pingali; Vasudeva Varma
Experiments in Telugu NER: A Conditional Random Field Approach
Proceedings of the 2nd workshop on Cross Lingual Information Access (CLIA) Addressing the Information Need of Multilingual Societies
I08-6: Entire volume
I08-6000: Front Matter
I08-6001: Peter Chew; Ahmed Abdelali
The Effects of Language Relatedness on Multilingual Information Retrieval: A Case Study With Indo-European and Semitic Languages
I08-6002: Pattabhi R K Rao T; Sobha L
Identifying Similar and Co-referring Documents Across Languages
I08-6003: Achim Ruopp; Fei Xia
Finding parallel texts on the web using cross-language information retrieval
I08-6004: K Saravanan; A Kumaran
Some Experiments in Mining Named Entity Transliteration Pairs from Comparable Corpora
I08-6005: Gareth Jones; Fabio Fantino; Eamonn Newman; Ying Zhang
Domain-Specific Query Translation for Multilingual Information Access using Machine Translation Augmented With Dictionaries Mined from Wikipedia
I08-6006: Prasad Pingali; Suryaganesh; Sreeharsha Yella; Vasudeva Varma
Statistical Transliteration for Cross Language Information Retrieval using HMM alignment model and CRF
I08-6007: Anurag Bhardwaj; Damien Jose; Venu Govindaraju
Script Independent Word Spotting in Multilingual Documents
I08-6008: Sibabrata Paladhi; Sivaji Bandyopadhyay
A Document Graph Based Query Focused Multi-Document Summarizer
I08-6009: Manoj Kumar Chinnakotla; Sagar Ranadive; Om P. Damani; Pushpak Bhattacharyya
Hindi and Marathi to English Cross Language Information Retrieval
I08-6010: Debasis Mandal; Sandipan Dandapat; Mayank Gupta; Pratyush Banerjee; Sudeshna Sarkar
Bengali and Hindi to English CLIR Evaluation
I08-6011: Sivaji Bandyopadhyay; Tapabrata Mondal; Sudip Kumar Naskar; Asif Ekbal; Rejwanul Haque; Srinivasa Rao Godavarthy
Bengali, Hindi and Telugu to English Ad-hoc Bilingual Task
I08-6012: Jagadeesh Jagarlamudi; A Kumaran
Cross-Lingual Information Retrieval System for Indian Languages
I08-6013: Prasad Pingali; Vasudeva Varma
Hindi and Telugu to English CLIR using Query Expansion
I08-6014: Mandar Mitra; Prosenjit Majumdar
FIRE: Forum for Information Retrieval Evaluation
I08-6015:
Author Index
Proceedings of the 6th Workshop on Asian Language Resources
I08-7: Entire volume
I08-7000: Front Matter
I08-7001: Asif Ekbal; Sivaji Bandyopadhyay
Development of Bengali Named Entity Tagged Corpus and its Use in NER Systems
I08-7002: Sujan Kumar Saha; Sudeshna Sarkar; Pabitra Mitra
Gazetteer Preparation for Named Entity Recognition in Indian Languages
I08-7003: Gaoying Cui; Qin Lu; Wenjie Li
Preliminary Chinese Term Classification for Ontology Construction
I08-7004: Makiko Matsuda; Tomoe Takahashi; Hiroki Goto; Yoshikazu Hayase; Robin Lee Nagano; Yoshiki Mikami
Technical Terminology in Asian Languages: Different Approaches to Adopting Engineering Terms
I08-7005: Wunna Ko Ko; Thin Zar Phyo
Selection of XML tag set for Myanmar National Corpus
I08-7006: Hla Hla Htay; Kavi Narayana Murthy
Myanmar Word Segmentation using Syllable level Longest Matching
I08-7007: Rizza Caminero; Yoshiki Mikami
The Link Structure of Language Communities and its Implication for Language-specific Crawling
I08-7008: Tirthankar Dasgupta; Sambit Shukla; Sandeep Kumar; Synny Diwakar; Anupam Basu
A Multilingual Multimedia Indian Sign Language Dictionary Tool
I08-7009: Deniz Zeyrek; Bonnie Webber
A Discourse Resource for Turkish: Annotating Discourse Connectives in the METU Corpus
I08-7010: Rashmi Prasad; Samar Husain; Dipti Sharma; Aravind Joshi
Towards an Annotated Corpus of Discourse Relations in Hindi
I08-7011: Yin-Sheng Tai; D. Victoria Rau; Meng-Chien Yang
A Semantic Study on Yami Ontology in Traditional Songs
I08-7012: Rama Sree R.J.; Uma Maheswara Rao G; Madhu Murthy K.V.
Assessment and Development of POS Tag Set for Telugu
I08-7013: Sankaran Baskaran; Kalika Bali; Tanmoy Bhattacharya; Pushpak Bhattacharyya; Girish Nath Jha; Rajendran S; Saravanan K; Sobha L; Subbarao K V
Designing a Common POS-Tagset Framework for Indian Languages
I08-7014: Hammam Riza
Resources Report on Languages of Indonesia
I08-7015: Ryo Nishimura; Yasuhiko Watanabe; Yoshihiro Okada
Confirmed Language Resource for Answering How Type Questions Developed by Using Mails Posted to a Mailing List
I08-7016: Purev Jaimai; Odbayar Chimeddorj
Corpus building for Mongolian language
I08-7017: Sarmad Hussain
Resources for Urdu Language Processing
I08-7018: Kikuo Maekawa
Balanced Corpus of Contemporary Written Japanese
I08-7019: Viet Hoang-Anh; Thu Dinh-Thi-Phuong; Thang Huynh-Quyet
A Basic Framework to Build a Test Collection for the Vietnamese Text Categorization
I08-7020: Virach Sornlertlamvanich; Thatsanee Charoenporn; Suphanut Thayaboon; Chumpol Mokarat; Hitoshi Isahara
Enhanced Tools for Online Collaborative Language Resource Development
I08-7021: Shuichi Itahashi; Koiti Hasida
Japanese Effort Toward Sharing Text and Speech Corpora
I08-7022:
Author Index
Proceedings of the Workshop on Technologies and Corpora for Asia-Pacific Speech Translation (TCAST)
I08-8000: Front Matter
I08-8001: Jonghoon Lee; Donghyeon Lee; Gary Geunbae Lee
Transformation-based Sentence Splitting method for Statistical Machine Translation
I08-8002: Chai Wutiwiwatchai; Thepchai Supnithi; Krit Kosawat
Speech-to-Speech Translation Activities in Thailand
I08-8003: Andrew Finch; Eiichiro Sumita
Phrase-based Machine Transliteration
I08-8004: Sakriani Sakti; Eka Kelana; Hammam Riza; Shinsuke Sakai; Konstantin Markov; Satoshi Nakamura
Development of Indonesian Large Vocabulary Continuous Speech Recognition System within A-STAR Project
I08-8005: Hyungbae Jeon; Kyuwoong Hwang; Hoon Chung; Seunghi Kim; Jun Park; Yunkeun Lee
Using Confidence Vector in Multi-Stage Speech Recognition
I08-8006: Hammam Riza; Oskar Riandi
Toward Asian Speech Translation System: Developing Speech Recognition and Machine Translation for Indonesian Language
