ACL Anthology
A Digital Archive of Research Papers in Computational Linguistics

Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)

L10-1001  [bib]: Hercules Dalianis; Hao-chun Xing; Xin Zhang
Creating a Reusable English-Chinese Parallel Corpus for Bilingual Dictionary Construction

L10-1002  [bib]: Lluís Padró; Miquel Collado; Samuel Reese; Marina Lloberes; Irene Castellón
FreeLing 2.1: Five Years of Open-source Language Processing Tools

L10-1003  [bib]: Amit Kirschenbaum; Shuly Wintner
A General Method for Creating a Bilingual Transliteration Dictionary

L10-1004  [bib]: Huan-An Kao; Hsin-Hsi Chen
Comment Extraction from Blog Posts and Its Applications to Opinion Mining

L10-1005  [bib]: Thomas Schmidt; Wilfried Schütte
FOLKER: An Annotation Tool for Efficient Transcription of Natural, Multi-party Interaction

L10-1006  [bib]: Roberto Navigli; Paola Velardi; Juana María Ruiz-Martínez
An Annotated Dataset for Extracting Definitions and Hypernyms from the Web

L10-1007  [bib]: Maria Khokhlova; Victor Zakharov
Studying Word Sketches for Russian

L10-1008  [bib]: Marta R. Costa-jussà; José A. R. Fonollosa
Using Linear Interpolation and Weighted Reordering Hypotheses in the Moses System

L10-1009  [bib]: Onno Crasborn
The Sign Linguistics Corpora Network: Towards Standards for Signed Language Resources

L10-1010  [bib]: Antoinette Hawayek; Riccardo Del Gratta; Giuseppe Cappelli
A Bilingual Dictionary Mexican Sign Language-Spanish/Spanish-Mexican Sign Language

L10-1011  [bib]: Serge Sharoff; Zhili Wu; Katja Markert
The Web Library of Babel: evaluating genre collections

L10-1012  [bib]: Hans-Ulrich Krieger
A General Methodology for Equipping Ontologies with Time

L10-1013  [bib]: Ting Qian; Kristy Hollingshead; Su-youn Yoon; Kyoung-young Kim; Richard Sproat
A Python Toolkit for Universal Transliteration

L10-1014  [bib]: K. Bretonnel Cohen; Christophe Roeder; William A. Baumgartner Jr.; Lawrence E. Hunter; Karin Verspoor
Test Suite Design for Biomedical Ontology Concept Recognition Systems

L10-1015  [bib]: Els Lefever; Véronique Hoste
Construction of a Benchmark Data Set for Cross-lingual Word Sense Disambiguation

L10-1016  [bib]: Alberto Barrón-Cedeño; Martin Potthast; Paolo Rosso; Benno Stein
Corpus and Evaluation Measures for Automatic Plagiarism Detection

L10-1017  [bib]: Claus Zinn; Peter Wittenburg; Jacquelijn Ringersma
An Evolving eScience Environment for Research Data in Linguistics

L10-1018  [bib]: Simon Scerri; Gerhard Gossen; Brian Davis; Siegfried Handschuh
Classifying Action Items for Semantic Email

L10-1019  [bib]: Yulia Tsvetkov; Shuly Wintner
Automatic Acquisition of Parallel Corpora from Websites with Dynamic Content

L10-1020  [bib]: Vassiliki Rentoumi; Stefanos Petrakis; Manfred Klenner; George A. Vouros; Vangelis Karkaletsis
United we Stand: Improving Sentiment Analysis by Joining Machine Learning and Rule Based Methods

L10-1021  [bib]: Núria Bel
Handling of Missing Values in Lexical Acquisition

L10-1022  [bib]: Elin Carlsson; Hercules Dalianis
Influence of Module Order on Rule-Based De-identification of Personal Names in Electronic Patient Records Written in Swedish

L10-1023  [bib]: Marta R. Costa-jussà; Mireia Farrús; José B. Mariño; José A. R. Fonollosa
Automatic and Human Evaluation Study of a Rule-based and a Statistical Catalan-Spanish Machine Translation Systems

L10-1024  [bib]: Anil Kumar Singh; Bharat Ram Ambati
An Integrated Digital Tool for Accessing Language Resources

L10-1025  [bib]: Jakob Schou Pedersen; Lars Bo Larsen
A Speech Corpus for Dyslexic Reading Training

L10-1026  [bib]: Yassine Benajiba; Imed Zitouni
Arabic Word Segmentation for Better Unit of Analysis

L10-1027  [bib]: James Pustejovsky; Kiyong Lee; Harry Bunt; Laurent Romary
ISO-TimeML: An International Standard for Semantic Annotation

L10-1028  [bib]: Ranka Stanković; Ivan Obradović; Olivera Kitanović
GIS Application Improvement with Multilingual Lexical and Terminological Resources

L10-1029  [bib]: Nathanael Chambers; Dan Jurafsky
A Database of Narrative Schemas

L10-1030  [bib]: Dimitrios Kokkinakis; Ulla Gerdin
A Swedish Scientific Medical Corpus for Terminology Management and Linguistic Exploration

L10-1031  [bib]: Thomas Proisl; Besim Kabashi
Using High-Quality Resources in NLP: The Valency Dictionary of English as a Resource for Left-Associative Grammars

L10-1032  [bib]: Xabier Saralegi; Maddalen Lopez de Lacalle
Dictionary and Monolingual Corpus-based Query Translation for Basque-English CLIR

L10-1033  [bib]: Véronika Lux-Pogodalla; Dominique Besagni; Karën Fort
FastKwic, an “Intelligent” Concordancer Using FASTR

L10-1034  [bib]: Cvetana Krstev; Ranka Stanković; Duško Vitas
A Description of Morphological Features of Serbian: a Revision using Feature System Declaration

L10-1035  [bib]: Plaban Kr. Bhowmick; Anupam Basu; Pabitra Mitra
Determining Reliability of Subjective and Multi-label Emotion Annotation through Novel Fuzzy Agreement Measure

L10-1036  [bib]: Xavier Tannier; Véronique Moriceau
FIDJI: Web Question-Answering at Quaero 2009

L10-1037  [bib]: Kai Wörner
A Tool for Feature-Structure Stand-Off-Annotation on Transcriptions of Spoken Discourse

L10-1038  [bib]: Jan Odijk
The CLARIN-NL Project

L10-1039  [bib]: Virach Sornlertlamvanich; Thatsanee Charoenporn; Hitoshi Isahara
Language Resource Management System for Asian WordNet Collaboration and Its Web Service Application

L10-1040  [bib]: Jinho D. Choi; Claire Bonial; Martha Palmer
Propbank Frameset Annotation Guidelines Using a Dedicated Editor, Cornerstone

L10-1041  [bib]: Johannes Handl; Carsten Weber
A Multilayered Declarative Approach to Cope with Morphotactics and Allomorphy in Derivational Morphology

L10-1042  [bib]: Olga Lyashevskaya
Bank of Russian Constructions and Valencies

L10-1043  [bib]: Zareen Syed; Evelyne Viegas; Savas Parastatidis
Automatic Discovery of Semantic Relations using MindNet

L10-1044  [bib]: Adam Kilgarriff; Siva Reddy; Jan Pomikálek; Avinesh PVS
A Corpus Factory for Many Languages

L10-1045  [bib]: Richard Johansson; Alessandro Moschitti
A Flexible Representation of Heterogeneous Annotation Data

L10-1046  [bib]: Olivier Galibert; Sophie Rosset; Xavier Tannier; Fanny Grandry
Hybrid Citation Extraction from Patents

L10-1047  [bib]: Luca Dini; Giampaolo Mazzini
The Impact of Grammar Enhancement on Semantic Resources Induction

L10-1048  [bib]: Yiou Wang; Kiyotaka Uchimoto; Jun’ichi Kazama; Canasai Kruengkrai; Kentaro Torisawa
Adapting Chinese Word Segmentation for Machine Translation Based on Short Units

L10-1049  [bib]: Ekaterina Ovchinnikova; Laure Vieu; Alessandro Oltramari; Stefano Borgo; Theodore Alexandrov
Data-Driven and Ontological Analysis of FrameNet for Natural Language Reasoning

L10-1050  [bib]: Samira Shaikh; Tomek Strzalkowski; Aaron Broadwell; Jennifer Stromer-Galley; Sarah Taylor; Nick Webb
MPC: A Multi-Party Chat Corpus for Modeling Social Phenomena in Discourse

L10-1051  [bib]: Yanli Sun
Mining the Correlation between Human and Automatic Evaluation at Sentence Level

L10-1052  [bib]: Alberto Simões; José João Almeida; Rita Farinha
Processing and Extracting Data from Dicionário Aberto

L10-1053  [bib]: Ulli Waltinger
GermanPolarityClues: A Lexical Resource for German Sentiment Analysis

L10-1054  [bib]: Antonio Pareja-Lora; Guadalupe Aguado de Cea
Ontology-based Interoperation of Linguistic Tools for an Improved Lemma Annotation in Spanish

L10-1055  [bib]: Torsten Zesch; Iryna Gurevych
The More the Better? Assessing the Influence of Wikipedia’s Growth on Semantic Relatedness Measures

L10-1056  [bib]: Nick Campbell; Akiko Tabata
A Software Toolkit for Viewing Annotated Multimodal Data Interactively over the Web

L10-1057  [bib]: Ana Cristina Mendes; Luísa Coheur; Paula Vaz Lobo
Named Entity Recognition in Questions: Towards a Golden Collection

L10-1058  [bib]: Patrizia Paggio; Jens Allwood; Elisabeth Ahlsén; Kristiina Jokinen; Costanza Navarretta
The NOMCO Multimodal Nordic Resource - Goals and Characteristics

L10-1059  [bib]: Kikuo Maekawa; Makoto Yamazaki; Takehiko Maruyama; Masaya Yamaguchi; Hideki Ogura; Wakako Kashino; Toshinobu Ogiso; Hanae Koiso; Yasuharu Den
Design, Compilation, and Preliminary Analyses of Balanced Corpus of Contemporary Written Japanese

L10-1060  [bib]: Lieve Macken
An Annotation Scheme and Gold Standard for Dutch-English Word Alignment

L10-1061  [bib]: Stefan Scherer; Ingo Siegert; Lutz Bigalke; Sascha Meudt
Developing an Expressive Speech Labeling Tool Incorporating the Temporal Characteristics of Emotion

L10-1062  [bib]: Ahmet Aker; Robert Gaizauskas
Model Summaries for Location-related Images

L10-1063  [bib]: Justus Roux; Pieter Scholtz; Daleen Klop; Claus Povlsen; Bart Jongejan; Asta Magnusdottir
Incorporating Speech Synthesis in the Development of a Mobile Platform for e-learning.

L10-1064  [bib]: Bernard Jacquemin
A Derivational Rephrasing Experiment for Question Answering

L10-1065  [bib]: Sherri Condon; Dan Parvaz; John Aberdeen; Christy Doran; Andrew Freeman; Marwan Awad
Evaluation of Machine Translation Errors in English and Iraqi Arabic

L10-1066  [bib]: Mahdi Mohseni; Behrouz Minaei-bidgoli
A Persian Part-Of-Speech Tagger Based on Morphological Analysis

L10-1067  [bib]: Olga Babko-Malaya; Dan Hunter; Connie Fournelle; Jim White
Evaluation of Document Citations in Phase 2 Gale Distillation

L10-1068  [bib]: Çağrı Çöltekin
A Freely Available Morphological Analyzer for Turkish

L10-1069  [bib]: Martin Volk; Noah Bubenhofer; Adrian Althaus; Maya Bangerter; Lenz Furrer; Beni Ruef
Challenges in Building a Multilingual Alpine Heritage Corpus

L10-1070  [bib]: Silvia Pareti; Irina Prodanof
Annotating Attribution Relations: Towards an Italian Discourse Treebank

L10-1071  [bib]: Nick Webb; David Benyon; Preben Hansen; Oli Mival
Evaluating Human-Machine Conversation for Appropriateness

L10-1072  [bib]: Beáta Megyesi; Bengt Dahlqvist; Éva Á. Csató; Joakim Nivre
The English-Swedish-Turkish Parallel Treebank

L10-1073  [bib]: Pradeep Dantuluri; Brian Davis; Siegfried Handschuh
A Use Case for Controlled Languages as Interfaces to Semantic Web Applications

L10-1074  [bib]: Charles Teissèdre; Delphine Battistelli; Jean-Luc Minel
Resources for Calendar Expressions Semantic Tagging and Temporal Navigation through Texts

L10-1075  [bib]: Oscar Saz; Eduardo Lleida; Carlos Vaquero; W.-Ricardo Rodríguez
The Alborada-I3A Corpus of Disordered Speech

L10-1076  [bib]: Sophia Ananiadou; John McNaught; James Thomas; Mark Rickinson; Sandy Oliver
Evaluating a Text Mining Based Educational Search Portal

L10-1077  [bib]: Jennifer Pedler; Roger Mitton
A Large List of Confusion Sets for Spellchecking Assessed Against a Corpus of Real-word Errors

L10-1078  [bib]: Alexander Schmitt; Gregor Bertrand; Tobias Heinroth; Wolfgang Minker; Jackson Liscombe
WITcHCRafT: A Workbench for Intelligent exploraTion of Human ComputeR conversaTions

L10-1079  [bib]: Svetlana Stoyanchev; Paul Piwek
Constructing the CODA Corpus: A Parallel Corpus of Monologues and Expository Dialogues

L10-1080  [bib]: Dain Kaplan; Ryu Iida; Takenobu Tokunaga
Annotation Process Management Revisited

L10-1081  [bib]: Giulio Paci; Giorgio Pedrazzi; Roberta Turra
Wikipedia-based Approach for Linking Ontology Concepts to their Realisations in Text

L10-1082  [bib]: Marianne Laurent; Philippe Bretier; Carole Manquillet
Ad-hoc Evaluations Along the Lifecycle of Industrial Spoken Dialogue Systems: Heading to Harmonisation?

L10-1083  [bib]: Masahiro Nakano; Hideyuki Shibuki; Rintaro Miyazaki; Madoka Ishioroshi; Koichi Kaneko; Tatsunori Mori
Construction of Text Summarization Corpus for the Credibility of Information on the Web

L10-1084  [bib]: João Silva; António Branco; Patricia Gonçalves
Top-Performing Robust Constituency Parsing of Portuguese: Freely Available in as Many Ways as you Can Get it

L10-1085  [bib]: Sylviane Cardey; Krzysztof Bogacki; Xavier Blanco; Ruslan Mitkov
Resources for Controlled Languages for Alert Messages and Protocols in the European Perspective

L10-1086  [bib]: Tomaž Erjavec
MULTEXT-East Version 4: Multilingual Morphosyntactic Specifications, Lexicons and Corpora

L10-1087  [bib]: Tomaž Erjavec; Darja Fišer; Simon Krek; Nina Ledinek
The JOS Linguistically Tagged Corpus of Slovene

L10-1088  [bib]: Livio Robaldo; Eleni Miltsakaki; Alessia Bianchini
Corpus-based Semantics of Concession: Where do Expectations Come from?

L10-1089  [bib]: Darja Fišer; Senja Pollak; Špela Vintar
Learning to Mine Definitions from Slovene Structured and Unstructured Knowledge-Rich Resources

L10-1090  [bib]: Dan Tufiş; Dan Ştefănescu
A Differential Semantics Approach to the Annotation of Synsets in WordNet

L10-1091  [bib]: Elena Grishina
Multimodal Russian Corpus (MURCO): First Steps

L10-1092  [bib]: Jörg Tiedemann
Lingua-Align: An Experimental Toolbox for Automatic Tree-to-Tree Alignment

L10-1093  [bib]: Cécile Grivaz
Human Judgements on Causation in French Texts

L10-1094  [bib]: Rita Marinelli; Adriana Roventini; Giovanni Spadoni; Sebastiana Cucurullo
Lexical Semantic Resources in a Terminological Network

L10-1095  [bib]: Aleksander Wawer
Is Sentiment a Property of Synsets? Evaluating Resources for Sentiment Classification using Machine Learning

L10-1096  [bib]: Iñaki Alegria; Garbiñe Aranbarri; Klara Ceberio; Gorka Labaka; Bittor Laskurain; Ruben Urizar
A Morphological Processor Based on Foma for Biscayan (a Basque dialect)

L10-1097  [bib]: Adam Przepiórkowski; Rafał L. Górski; Marek Łaziński; Piotr Pęzik
Recent Developments in the National Corpus of Polish

L10-1098  [bib]: António Branco; Francisco Costa; João Silva; Sara Silveira; Sérgio Castro; Mariana Avelãs; Clara Pinto; João Graça
Developing a Deep Linguistic Databank Supporting a Collection of Treebanks: the CINTIL DeepGramBank

L10-1099  [bib]: Lars Borin; Markus Forsberg; Dimitrios Kokkinakis
Diabase: Towards a Diachronic BLARK in Support of Historical Studies

L10-1100  [bib]: Anne Abeillé; Danièle Godard
The Grande Grammaire du Français Project

L10-1101  [bib]: Satoshi Sekine; Kapil Dalwani
Ngram Search Engine with Patterns Combining Token, POS, Chunk and NE Information

L10-1102  [bib]: Alexander Schmitt; Tim Polzehl; Wolfgang Minker; Jackson Liscombe
The Influence of the Utterance Length on the Recognition of Aged Voices

L10-1103  [bib]: Marta Recasens; Eduard Hovy; M. Antònia Martí
A Typology of Near-Identity Relations for Coreference (NIDENT)

L10-1104  [bib]: Ineke Schuurman; Véronique Hoste; Paola Monachesi
Interacting Semantic Layers of Annotation in SoNaR, a Reference Corpus of Contemporary Written Dutch

L10-1105  [bib]: Daan Broeder; Marc Kemps-Snijders; Dieter Van Uytvanck; Menzo Windhouwer; Peter Withers; Peter Wittenburg; Claus Zinn
A Data Category Registry- and Component-based Metadata Framework

L10-1106  [bib]: Isa Maks; Piek Vossen
Annotation Scheme and Gold Standard for Dutch Subjective Adjectives

L10-1107  [bib]: Mark Arehart
Indexing Methods for Faster and More Effective Person Name Search

L10-1108  [bib]: Hiroyuki Shinnou; Minoru Sasaki
Detection of Peculiar Examples using LOF and One Class SVM

L10-1109  [bib]: Naushad UzZaman; James Allen
TRIOS-TimeBank Corpus: Extended TimeBank Corpus with Help of Deep Understanding of Text

L10-1110  [bib]: Adam Funk; Kalina Bontcheva
Ontology-Based Categorization of Web Services with Machine Learning

L10-1111  [bib]: Yugo Murawaki; Sadao Kurohashi
Online Japanese Unknown Morpheme Detection using Orthographic Variation

L10-1112  [bib]: Bracha Nir; Brian MacWhinney; Shuly Wintner
A Morphologically-Analyzed CHILDES Corpus of Hebrew

L10-1113  [bib]: Kristiina Jokinen
Non-verbal Signals for Turn-taking and Feedback

L10-1114  [bib]: Alejandro Abejón; Doroteo T. Toledano; Danilo Spada; González Victor; Daniel Hernández López
A Study of the Influence of Speech Type on Automatic Language Recognition Performance

L10-1115  [bib]: Gosse Bouma
Cross-lingual Ontology Alignment using EuroWordNet and Wikipedia

L10-1116  [bib]: François Lefebvre-Albaret; Patrice Dalle
Video Retrieval in Sign Language Videos : How to Model and Compare Signs?

L10-1117  [bib]: Claire Gardent; Alejandra Lorenzo
Identifying Sources of Weakness in Syntactic Lexicon Extraction

L10-1118  [bib]: Marco Passarotti; Felice Dell'Orletta
Improvements in Parsing the Index Thomisticus Treebank. Revision, Combination and a Feature Model for Medieval Latin

L10-1119  [bib]: Ineke Schuurman; Vincent Vandeghinste
Cultural Aspects of Spatiotemporal Analysis in Multilingual Applications

L10-1120  [bib]: Takehiro Teraoka; Jun Okamoto; Shun Ishizaki
An Associative Concept Dictionary for Verbs and its Application to Elliptical Word Estimation

L10-1121  [bib]: Sowmya V. B.; Monojit Choudhury; Kalika Bali; Tirthankar Dasgupta; Anupam Basu
Resource Creation for Training and Testing of Transliteration Systems for Indian Languages

L10-1122  [bib]: Antonio Balvet; Lucie Barque; Rafael Marín
Building a Lexicon of French Deverbal Nouns from a Semantically Annotated Corpus

L10-1123  [bib]: Sara Tonelli; Giuseppe Riccardi; Rashmi Prasad; Aravind Joshi
Annotation of Discourse Relations for Conversational Spoken Dialogs

L10-1124  [bib]: Sandra Williams; Richard Power
A Fact-aligned Corpus of Numerical Expressions

L10-1125  [bib]: Ludovic Quintard; Olivier Galibert; Gilles Adda; Brigitte Grau; Dominique Laurent; Véronique Moriceau; Sophie Rosset; Xavier Tannier; Anne Vilnat
Question Answering on Web Data: The QA Evaluation in Quæro

L10-1126  [bib]: Silvia Quarteroni; Alessandro Moschitti
A Comprehensive Resource to Evaluate Complex Open Domain Question Answering

L10-1127  [bib]: Olivier Galibert; Ludovic Quintard; Sophie Rosset; Pierre Zweigenbaum; Claire Nédellec; Sophie Aubin; Laurent Gillard; Jean-Pierre Raysz; Delphine Pois; Xavier Tannier; Louise Deléger; Dominique Laurent
Named and Specific Entity Detection in Varied Data: The Quæro Named Entity Baseline Evaluation

L10-1128  [bib]: Kyota Tsutsumida; Jun Okamoto; Shun Ishizaki; Makoto Nakatsuji; Akimichi Tanaka; Tadasu Uchiyama
Study of Word Sense Disambiguation System that uses Contextual Features - Approach of Combining Associative Concept Dictionary and Corpus -

L10-1129  [bib]: Lars Ahrenberg
Alignment-based Profiling of Europarl Data in an English-Swedish Parallel Corpus

L10-1130  [bib]: Muhammad Kamran Malik; Tafseer Ahmed; Sebastian Sulger; Tina Bögel; Atif Gulzar; Ghulam Raza; Sarmad Hussain; Miriam Butt
Transliterating Urdu for a Broad-Coverage Urdu/Hindi LFG Grammar

L10-1131  [bib]: Volha Petukhova; Harry Bunt
Towards an Integrated Scheme for Semantic Annotation of Multimodal Dialogue Data

L10-1132  [bib]: Cristina Bosco; Simonetta Montemagni; Alessandro Mazzei; Vincenzo Lombardo; Felice Dell'Orletta; Alessandro Lenci; Leonardo Lesmo; Giuseppe Attardi; Maria Simi; Alberto Lavelli; Johan Hall; Jens Nilsson; Joakim Nivre
Comparing the Influence of Different Treebank Annotations on Dependency Parsing

L10-1133  [bib]: Christian Federmann
Appraise: An Open-Source Toolkit for Manual Phrase-Based Evaluation of Translations

L10-1134  [bib]: Hai Zhao; Yan Song; Chunyu Kit
How Large a Corpus Do We Need: Statistical Method Versus Rule-based Method

L10-1135  [bib]: Bolette S. Pedersen; Sanni Nimb; Anna Braasch
Merging Specialist Taxonomies and Folk Taxonomies in Wordnets - A case Study of Plants, Animals and Foods in the Danish Wordnet

L10-1136  [bib]: Marta Tatu; Dan Moldovan
Inducing Ontologies from Folksonomies using Natural Language Understanding

L10-1137  [bib]: Orphée De Clercq; Maribel Montero Perez
Data Collection and IPR in Multilingual Parallel Corpora. Dutch Parallel Corpus

L10-1138  [bib]: Agata Cybulska; Piek Vossen
Event Models for Historical Perspectives: Determining Relations between High and Low Level Events in Text, Based on the Classification of Time, Location and Participants.

L10-1139  [bib]: Christian Scheible
An Evaluation of Predicate Argument Clustering using Pseudo-Disambiguation

L10-1140  [bib]: Fabienne Venant
Meaning Representation: From Continuity to Discreteness

L10-1141  [bib]: Matthieu Vernier; Laura Monceaux; Béatrice Daille
Learning Subjectivity Phrases missing from Resources through a Large Set of Semantic Tests

L10-1142  [bib]: Marco Guerini; Carlo Strapparava; Oliviero Stock
Evaluation Metrics for Persuasive NLP with Google AdWords

L10-1143  [bib]: Bart Desmet; Véronique Hoste
Towards a Balanced Named Entity Corpus for Dutch

L10-1144  [bib]: Grégory Senay; Georges Linarès; Benjamin Lecouteux; Stanislas Oger; Thierry Michel
Transcriber Driving Strategies for Transcription Aid System

L10-1145  [bib]: Nikola Ljubešić; Tomislava Lauc; Damir Boras
Building a Gold Standard for Event Detection in Croatian

L10-1146  [bib]: Ziqi Zhang; José Iria; Fabio Ciravegna
Improving Domain-specific Entity Recognition with Automatic Term Recognition and Feature Extraction

L10-1147  [bib]: Jana Straková; Pavel Pecina
Czech Information Retrieval with Syntax-based Language Models

L10-1148  [bib]: Giuseppe Attardi; Stefano Dei Rossi; Giulia Di Pietro; Alessandro Lenci; Simonetta Montemagni; Maria Simi
A Resource and Tool for Super-sense Tagging of Italian Texts

L10-1149  [bib]: Izaskun Aldezabal; María Jesús Aranzabe; Arantza Díaz de Ilarraza; Ainara Estarrona
Building the Basque PropBank

L10-1150  [bib]: Brigitte Bigi; Christine Meunier; Irina Nesterenko; Roxane Bertrand
Automatic Detection of Syllable Boundaries in Spontaneous Speech

L10-1151  [bib]: Nikos Tsourakis; Agnes Lisowska; Manny Rayner; Pierrette Bouillon
Examining the Effects of Rephrasing User Input on Two Mobile Spoken Language Systems

L10-1152  [bib]: Samuel Reese; Gemma Boleda; Montse Cuadros; Lluís Padró; German Rigau
Wikicorpus: A Word-Sense Disambiguated Multilingual Wikipedia Corpus

L10-1153  [bib]: Markus Dickinson; Charles Jochim
Evaluating Distributional Properties of Tagsets

L10-1154  [bib]: Maxim Khalilov; José A. R. Fonollosa; Inguna Skadina; Edgars Brālītis; Lauma Pretkalnina
Towards Improving English-Latvian Translation: A System Comparison and a New Rescoring Feature

L10-1155  [bib]: Grigori Sidorov; Alberto Barrón-Cedeño; Paolo Rosso
English-Spanish Large Statistical Dictionary of Inflectional Forms

L10-1156  [bib]: Fernando Fernández-Martínez; Juan Manuel Lucas-Cuesta; Roberto Barra Chicote; Javier Ferreiros; Javier Macías-Guarasa
HIFI-AV: An Audio-visual Corpus for Spoken Language Human-Machine Dialogue Research in Spanish

L10-1157  [bib]: Joana Hois
Inter-Annotator Agreement on a Linguistic Ontology for Spatial Language - A Case Study for GUM-Space

L10-1158  [bib]: Dekang Lin; Kenneth Church; Heng Ji; Satoshi Sekine; David Yarowsky; Shane Bergsma; Kailash Patil; Emily Pitler; Rachel Lathbury; Vikram Rao; Kapil Dalwani; Sushant Narsale
New Tools for Web-Scale N-grams

L10-1159  [bib]: Eric Auer; Albert Russel; Han Sloetjes; Peter Wittenburg; Oliver Schreer; S. Masnieri; Daniel Schneider; Sebastian Tschöpel
ELAN as Flexible Annotation Framework for Sound and Image Processing Detectors

L10-1160  [bib]: Damjan Vlaj; Aleksandra Zögling Markuš; Marko Kos; Zdravko Kačič
Acquisition and Annotation of Slovenian Lombard Speech Database

L10-1161  [bib]: Bruno Cartoni; Marie-Aude Lefer
The MuLeXFoR Database: Representing Word-Formation Processes in a Multilingual Lexicographic Environment

L10-1162  [bib]: Fang Xu; Dietrich Klakow
Paragraph Acquisition and Selection for List Question Using Amazon’s Mechanical Turk

L10-1163  [bib]: Lun-Wei Ku; Ting-Hao Huang; Hsin-Hsi Chen
Construction of a Chinese Opinion Treebank

L10-1164  [bib]: Takeshi Abekawa; Masao Utiyama; Eiichiro Sumita; Kyo Kageura
Community-based Construction of Draft and Final Translation Corpus Through a Translation Hosting Site Minna no Hon'yaku (MNH)

L10-1165  [bib]: Vamshi Ambati; Stephan Vogel; Jaime Carbonell
Active Learning and Crowd-Sourcing for Machine Translation

L10-1166  [bib]: Hercules Dalianis; Sumithra Velupillai
How Certain are Clinical Assessments? Annotating Swedish Clinical Text for (Un)certainties, Speculations and Negations

L10-1167  [bib]: Winston Anderson; Laurette Pretorius; Albert Kotzé
Base Concepts in the African Languages Compared to Upper Ontologies and the WordNet Top Ontology

L10-1168  [bib]: Keyan Zhou; Aijun Li; Zhigang Yin; Chengqing Zong
CASIA-CASSIL: a Chinese Telephone Conversation Corpus in Real Scenarios with Multi-leveled Annotation

L10-1169  [bib]: Kwanchiva Saykham; Ananlada Chotimongkol; Chai Wutiwiwatchai
Online Temporal Language Model Adaptation for a Thai Broadcast News Transcription System

L10-1170  [bib]: Ruud Koolen; Emiel Krahmer
The D-TUNA Corpus: A Dutch Dataset for the Evaluation of Referring Expression Generation Algorithms

L10-1171  [bib]: Aina Peris; Mariona Taulé; Gemma Boleda; Horacio Rodríguez
ADN-Classifier: Automatically Assigning Denotation Types to Nominalizations

L10-1172  [bib]: Lene Antonsen; Trond Trosterud; Linda Wiechetek
Reusing Grammatical Resources for New Languages

L10-1173  [bib]: Line Adde; Torbjørn Svendsen
NameDat: A Database of English Proper Names Spoken by Native Norwegians

L10-1174  [bib]: Natalie D. Snoeren; Martine Adda-Decker; Gilles Adda
The Study of Writing Variants in an Under-resourced Language: Some Evidence from Mobile N-Deletion in Luxembourgish

L10-1175  [bib]: Katarzyna Głowińska; Adam Przepiórkowski
The Design of Syntactic Annotation Levels in the National Corpus of Polish

L10-1176  [bib]: Yuki Kamiya; Tomohiro Ohno; Shigeki Matsubara; Hideki Kashioka
Construction of Back-Channel Utterance Corpus for Responsive Spoken Dialogue System Development

L10-1177  [bib]: Marina B. Ruiter; Toni C. M. Rietveld; Catia Cucchiarini; Emiel J. Krahmer; Helmer Strik
Human Language Technology and Communicative Disabilities: Requirements and Possibilities for the Future

L10-1178  [bib]: Felix Burkhardt; Martin Eckert; Wiebke Johannsen; Joachim Stegmann
A Database of Age and Gender Annotated Telephone Speech

L10-1179  [bib]: Maarten Marx; Anne Schuth
DutchParl. The Parliamentary Documents in Dutch

L10-1180  [bib]: Verena Henrich; Erhard Hinrichs
GernEdiT - The GermaNet Editing Tool

L10-1181  [bib]: Erhard Hinrichs; Verena Henrich; Thomas Zastrow
Sustainability of Linguistic Data and Analysis in the Context of a Collaborative eScience Environment

L10-1182  [bib]: Fabienne Fritzinger; Frank Richter; Marion Weller
Pattern-Based Extraction of Negative Polarity Items from Dependency-Parsed Text

L10-1183  [bib]: Attila Görög; Piek Vossen
Computer Assisted Semantic Annotation in the DutchSemCor Project

L10-1184  [bib]: Marie Hinrichs; Thomas Zastrow; Erhard Hinrichs
WebLicht: Web-based LRT Services in a Distributed eScience Infrastructure

L10-1185  [bib]: Francisco Torreira; Mirjam Ernestus
The Nijmegen Corpus of Casual Spanish

L10-1186  [bib]: Diana Santos; Luís Miguel Cabral; Corina Forascu; Pamela Forner; Fredric Gey; Katrin Lamm; Thomas Mandl; Petya Osenova; Anselmo Peñas; Álvaro Rodrigo; Julia Schulz; Yvonne Skalban; Erik Tjong Kim Sang
GikiCLEF: Crosscultural Issues in Multilingual Information Access

L10-1187  [bib]: Dieter Van Uytvanck; Claus Zinn; Daan Broeder; Peter Wittenburg; Mariano Gardellini
Virtual Language Observatory: The Portal to the Language Resources and Technology Universe

L10-1188  [bib]: Pavel Skrelin; Nina Volskaya; Daniil Kocharov; Karina Evgrafova; Olga Glotova; Vera Evdokimova
A Fully Annotated Corpus of Russian Speech

L10-1189  [bib]: Werner Spiegl; Korbinian Riedhammer; Stefan Steidl; Elmar Nöth
FAU IISAH Corpus -- A German Speech Database Consisting of Human-Machine and Human-Human Interaction Acquired by Close-Talking and Far-Distance Microphones

L10-1190  [bib]: Kais Dukes; Nizar Habash
Morphological Annotation of Quranic Arabic

L10-1191  [bib]: Florian Schiel
BAStat : New Statistical Resources at the Bavarian Archive for Speech Signals

L10-1192  [bib]: Kais Dukes; Eric Atwell; Abdul-Baquee M. Sharaf
Syntactic Annotation Guidelines for the Quranic Arabic Dependency Treebank

L10-1193  [bib]: Tommi Vatanen; Jaakko J. Väyrynen; Sami Virpioja
Language Identification of Short Text Segments with N-gram Models

L10-1194  [bib]: Joseph Polifroni; Imre Kiss; Mark Adler
Bootstrapping Named Entity Extraction for the Creation of Mobile Services

L10-1195  [bib]: Bert Réveil; Jean-Pierre Martens; Henk van den Heuvel
Improving Proper Name Recognition by Adding Automatically Learned Pronunciation Variants to the Lexicon

L10-1196  [bib]: Majdi Sawalha; Eric Atwell
Fine-Grain Morphological Analyzer and Part-of-Speech Tagger for Arabic Text

L10-1197  [bib]: Carlos Periñán-Pascual; Francisco Arcas-Túnez
The Architecture of FunGramKB

L10-1198  [bib]: Patrick Bauer; David Scheler; Tim Fingscheidt
WTIMIT: The TIMIT Speech Corpus Transmitted Over The 3G AMR Wideband Mobile Network

L10-1199  [bib]: Philip van Oosten; Dries Tanghe; Véronique Hoste
Towards an Improved Methodology for Automated Readability Prediction

L10-1200  [bib]: Majdi Sawalha; Eric Atwell
Constructing and Using Broad-coverage Lexical Resource for Enhancing Morphological Analysis of Arabic

L10-1201  [bib]: Jérôme Urbain; Elisabetta Bevacqua; Thierry Dutoit; Alexis Moinet; Radoslaw Niewiadomski; Catherine Pelachaud; Benjamin Picart; Joëlle Tilmanne; Johannes Wagner
The AVLaughterCycle Database

L10-1202  [bib]: Gerlof Bouma; Lilja Øvrelid; Jonas Kuhn
Towards a Large Parallel Corpus of Cleft Constructions

L10-1203  [bib]: Ziqi Zhang; Anna Lisa Gentile; Lei Xia; José Iria; Sam Chapman
A Random Graph Walk based Approach to Computing Semantic Relatedness Using Knowledge from Wikipedia

L10-1204  [bib]: Adrien Lardilleux; Julien Gosme; Yves Lepage
Bilingual Lexicon Induction: Effortless Evaluation of Word Alignment Tools and Production of Resources for Improbable Language Pairs

L10-1205  [bib]: Marijn Schraagen; Gerrit Bloothooft
Evaluating Repetitions, or how to Improve your Multilingual ASR System by doing Nothing

L10-1206  [bib]: Akira Utsumi
Exploring the Relationship between Semantic Spaces and Semantic Relations

L10-1207  [bib]: Christina Leitner; Martin Schickbichler; Stefan Petrik
Example-Based Automatic Phonetic Transcription

L10-1208  [bib]: Anne Garcia-Fernandez; Sophie Rosset; Anne Vilnat
MACAQ : A Multi Annotated Corpus to Study how we Adapt Answers to Various Questions

L10-1209  [bib]: Carlos-D. Martínez-Hinarejos; Vicent Tamarit; José-M. Benedí
Evaluation of HMM-based Models for the Annotation of Unsegmented Dialogue Turns

L10-1210  [bib]: Raheel Nawaz; Paul Thompson; John McNaught; Sophia Ananiadou
Meta-Knowledge Annotation of Bio-Events

L10-1211  [bib]: Yoshinobu Kano; Ruben Dorado; Luke McCrohon; Sophia Ananiadou; Jun'ichi Tsujii
U-Compare: An Integrated Language Resource Evaluation Platform Including a Comprehensive UIMA Resource Library

L10-1212  [bib]: Janne Bondi Johannessen; Kristin Hagen; Anders Nøklestad; Joel Priestley
Enhancing Language Resources with Maps

L10-1213  [bib]: Jana Z. Sukkarieh; Eleanor Bolge
Building a Textual Entailment Suite for the Evaluation of Automatic Content Scoring Technologies

L10-1214  [bib]: Haïfa Zargayouna; Adeline Nazarenko
Evaluation of Textual Knowledge Acquisition Tools: a Challenging Task

L10-1215  [bib]: Gerard de Melo; Gerhard Weikum
Providing Multilingual, Multimodal Answers to Lexical Database Queries

L10-1216  [bib]: Alessandro Lenci; Martina Johnson; Gabriella Lapesa
Building an Italian FrameNet through Semi-automatic Corpus Analysis

L10-1217  [bib]: Rein Ove Sikveland; Anton Öttl; Ingunn Amdal; Mirjam Ernestus; Torbjørn Svendsen; Jens Edlund
Spontal-N: A Corpus of Interactional Spoken Norwegian

L10-1218  [bib]: Svetla Koeva; Diana Blagoeva; Siya Kolkovska
Bulgarian National Corpus Project

L10-1219  [bib]: Donghui Lin; Yoshiaki Murakami; Toru Ishida; Yohei Murakami; Masahiro Tanaka
Composing Human and Machine Translation Services: Language Grid for Improving Localization Processes

L10-1220  [bib]: Boris Haselbach; Ulrich Heid
The Development of a Morphosyntactic Tagset for Afrikaans and its Use with Statistical Tagging

L10-1221  [bib]: Sophia Yat Mei Lee; Ying Chen; Shoushan Li; Chu-Ren Huang
Emotion Cause Events: Corpus Construction and Analysis

L10-1222  [bib]: Costanza Navarretta
The DAD Parallel Corpora and their Uses

L10-1223  [bib]: Andrew Thwaites; Jeroen Geertzen; William D. Marslen-Wilson; Paula Buttery
LIPS: A Tool for Predicting the Lexical Isolation Point of a Word

L10-1224  [bib]: Caroline Williams; Andrew Thwaites; Paula Buttery; Jeroen Geertzen; Billi Randall; Meredith Shafto; Barry Devereux; Lorraine Tyler
The Cambridge Cookie-Theft Corpus: A Corpus of Directed and Spontaneous Speech of Brain-Damaged Patients and Healthy Individuals

L10-1225  [bib]: Henk van den Heuvel; René van Horik; Stef Scagliola; Eric Sanders; Paula Witkamp
The VeteranTapes: Research Corpus, Fragment Processing Tool, and Enhanced Publications for the e-Humanities

L10-1226  [bib]: Raffaella Bernardi; Manuel Kirschner; Zorana Ratkovic
Context Fusion: The Role of Discourse Structure and Centering Theory

L10-1227  [bib]: Jun Okamoto; Shun Ishizaki
Homographic Ideogram Understanding Using Contextual Dynamic Network

L10-1228  [bib]: Cheikh M. Bamba Dione; Jonas Kuhn; Sina Zarrieß
Design and Development of Part-of-Speech-Tagging Resources for Wolof (Niger-Congo, spoken in Senegal)

L10-1229  [bib]: Roser Morante
Descriptive Analysis of Negation Cues in Biomedical Texts

L10-1230  [bib]: Xuchen Yao; Irina Borisova; Mehwish Alam
PDTB XML: the XMLization of the Penn Discourse TreeBank 2.0

L10-1231  [bib]: Agnieszka Mykowiecka; Katarzyna Głowińska; Joanna Rabiega-Wiśniewska
Domain-related Annotation of Polish Spoken Dialogue Corpus LUNA.PL

L10-1232  [bib]: Lubomir Otrusina; Pavel Smrz
A New Approach to Pseudoword Generation

L10-1233  [bib]: Horacio Saggion; Elena Stein-Sparvieri; David Maldavsky; Sandra Szasz
NLP Resources for the Analysis of Patient/Therapist Interviews

L10-1234  [bib]: Dafydd Gibbon; Moses Ekpenyong; Eno-Abasi Urua
Medefaidrin: Resources Documenting the Birth and Death Language Life-cycle

L10-1235  [bib]: Satoshi Sato; Sayoko Kaide
A Person-Name Filter for Automatic Compilation of Bilingual Person-Name Lexicons

L10-1236  [bib]: Rania Al-Sabbagh; Roxana Girju
Mining the Web for the Induction of a Dialectical Arabic Lexicon

L10-1237  [bib]: Tobias Heinroth; Dan Denich; Alexander Schmitt; Wolfgang Minker
Efficient Spoken Dialogue Domain Representation and Interpretation

L10-1238  [bib]: Philippe Dreuw; Hermann Ney; Gregorio Martinez; Onno Crasborn; Justus Piater; Jose Miguel Moya; Mark Wheatley
The SignSpeak Project - Bridging the Gap Between Signers and Speakers

L10-1239  [bib]: Junko Kubo; Keita Tsuji; Shigeo Sugimoto
Automatic Term Recognition Based on the Statistical Differences of Relative Frequencies in Different Corpora

L10-1240  [bib]: Violeta Seretan; Eric Wehrli; Luka Nerima; Gabriela Soare
FipsRomanian: Towards a Romanian Version of the Fips Syntactic Parser

L10-1241  [bib]: Jens Edlund; Jonas Beskow; Kjell Elenius; Kahl Hellmer; Sofia Strömbergsson; David House
Spontal: A Swedish Spontaneous Dialogue Corpus of Audio, Video and Motion Capture

L10-1242  [bib]: Walid Magdy; Jinming Min; Johannes Leveling; Gareth J. F. Jones
Building a Domain-specific Document Collection for Evaluating Metadata Effects on Information Retrieval

L10-1243  [bib]: Horacio Saggion; Adam Funk
Interpreting SentiWordNet for Opinion Classification

L10-1244  [bib]: Doris Baum; Daniel Schneider; Rolf Bardeli; Jochen Schwenninger; Barbara Samlowski; Thomas Winkler; Joachim Köhler
DiSCo - A German Evaluation Corpus for Challenging Problems in the Broadcast Domain

L10-1245  [bib]: Antonio Balvet; Cyril Courtin; Dominique Boutet; Christian Cuxac; Ivani Fusellier-Souza; Brigitte Garcia; Marie-Thérèse L’Huillier; Marie-Anne Sallandre
The Creagest Project: a Digitized and Annotated Corpus for French Sign Language (LSF) and Natural Gestural Languages

L10-1246  [bib]: Jesús Tomás; Alejandro Canovas; Jaime Lloret; Miguel García Pineda; Jose L. Abad
Speech Translation in Pedagogical Environment Using Additional Sources of Knowledge

L10-1247  [bib]: Elena Grishina; Svetlana Savchuk; Alexej Poljakov
Design and Data Collection for the Accentological Corpus of the Russian Language

L10-1248  [bib]: Paul Felt; Owen Merkling; Marc Carmen; Eric Ringger; Warren Lemmon; Kevin Seppi; Robbie Haertel
CCASH: A Web Application Framework for Efficient, Distributed Language Resource Development

L10-1249  [bib]: Michael Pucher; Friedrich Neubarth; Volker Strom; Sylvia Moosmüller; Gregor Hofer; Christian Kranzler; Gudrun Schuchmann; Dietmar Schabus
Resources for Speech Synthesis of Viennese Varieties

L10-1250  [bib]: Michael Wiegand; Dietrich Klakow
Predictive Features for Detecting Indefinite Polar Sentences

L10-1251  [bib]: Ulrich Heid; Fabienne Fritzinger; Erhard Hinrichs; Marie Hinrichs; Thomas Zastrow
Term and Collocation Extraction by Means of Complex Linguistic Web Services

L10-1252  [bib]: Nicoletta Calzolari; Claudia Soria
Preparing the field for an Open Resource Infrastructure: the role of the FLaReNet Network of Excellence

L10-1253  [bib]: Nicoletta Calzolari; Claudia Soria; Riccardo Del Gratta; Sara Goggi; Valeria Quochi; Irene Russo; Khalid Choukri; Joseph Mariani; Stelios Piperidis
The LREC Map of Language Resources and Technologies

L10-1254  [bib]: Nicolas Moreau; Olivier Hamon; Djamel Mostefa; Sophie Rosset; Olivier Galibert; Lori Lamel; Jordi Turmo; Pere R. Comas; Paolo Rosso; Davide Buscaldi; Khalid Choukri
Evaluation Protocol and Tools for Question-Answering on Speech Transcripts

L10-1255  [bib]: Roser Sanromà; Gemma Boleda
The Database of Catalan Adjectives

L10-1256  [bib]: Amal Zouaq; Michel Gagnon; Benoit Ozell
Can Syntactic and Logical Graphs help Word Sense Disambiguation?

L10-1257  [bib]: Meng Wang; Chu-Ren Huang; Shiwen Yu; Weiwei Sun
Automatic Acquisition of Chinese Novel Noun Compounds

L10-1258  [bib]: Nelleke Oostdijk; Suzan Verberne; Cornelis Koster
Constructing a Broad-coverage Lexicon for Text Mining in the Patent Domain

L10-1259  [bib]: Paul Bedaride; Claire Gardent
Syntactic Testsuites and Textual Entailment Recognition

L10-1260  [bib]: Jan Štěpánek; Petr Pajas
Querying Diverse Treebanks in a Uniform Way

L10-1261  [bib]: Rodolfo Delmonte; Antonella Bristot; Vincenzo Pallotta
Deep Linguistic Processing with GETARUNS for Spoken Dialogue Understanding

L10-1262  [bib]: Emad Mohamed; Sandra Kübler
Arabic Part of Speech Tagging

L10-1263  [bib]: Alexander Pak; Patrick Paroubek
Twitter as a Corpus for Sentiment Analysis and Opinion Mining

L10-1264  [bib]: Rena Nemoto; Martine Adda-Decker; Jacques Durand
Word Boundaries in French: Evidence from Large Speech Corpora

L10-1265  [bib]: Benoît Sagot; Laurence Danlos; Rosa Stern
A Lexicon of French Quotation Verbs for Automatic Quotation Extraction

L10-1266  [bib]: Marie Mikulová; Jan Štěpánek
Ways of Evaluation of the Annotators in Building the Prague Czech-English Dependency Treebank

L10-1267  [bib]: Klaar Vanopstal; Robert Vander Stichele; Godelieve Laureys; Joost Buysschaert
Assessing the Impact of English Language Skills and Education Level on PubMed Searches by Dutch-speaking Users

L10-1268  [bib]: Yasuharu Den; Hanae Koiso; Takehiko Maruyama; Kikuo Maekawa; Katsuya Takanashi; Mika Enomoto; Nao Yoshida
Two-level Annotation of Utterance-units in Japanese Dialogs: An Empirically Emerged Scheme

L10-1269  [bib]: Marie Candito; Benoît Crabbé; Pascal Denis
Statistical French Dependency Parsing: Treebank Conversion and First Results

L10-1270  [bib]: Yue Ma; Adeline Nazarenko; Laurent Audibert
Formal Description of Resources for Ontology-based Semantic Annotation

L10-1271  [bib]: Luis Javier Rodríguez-Fuentes; Mikel Penagarikano; Germán Bordel; Amparo Varona; Mireia Díez
KALAKA: A TV Broadcast Speech Database for the Evaluation of Language Recognition Systems

L10-1272  [bib]: Jarmila Panevová; Magda Ševčíková
Annotation of Morphological Meanings of Verbs Revisited

L10-1273  [bib]: Andrew Hickl; Sanda Harabagiu
Unsupervised Discovery of Collective Action Frames for Socio-Cultural Analysis

L10-1274  [bib]: Ting-Hao Huang; Lun-Wei Ku; Hsin-Hsi Chen
Predicting Morphological Types of Chinese Bi-Character Words by Machine Learning Approaches

L10-1275  [bib]: Nicole Novielli; Carlo Strapparava
Studying the Lexicon of Dialogue Acts

L10-1276  [bib]: Jorge Vivaldi; Iria da Cunha; Juan Manuel Torres-Moreno; Patricia Velázquez-Morales
Automatic Summarization Using Terminological and Semantic Resources

L10-1277  [bib]: Olivier Hamon
Is my Judge a good One?

L10-1278  [bib]: Mátyás Brendel; Riccardo Zaccarelli; Laurence Devillers
Building a System for Emotions Detection from Speech to Control an Affective Avatar

L10-1279  [bib]: Roxane Segers; Piek Vossen
Facilitating Non-expert Users of the KYOTO Platform: the TMEKO Editing Protocol for Synset to Ontology Mappings

L10-1280  [bib]: Ekaterina Buyko; Elena Beisswanger; Udo Hahn
The GeneReg Corpus for Gene Expression Regulation Events ― An Overview of the Corpus and its In-Domain and Out-of-Domain Interoperability

L10-1281  [bib]: Graham Neubig; Shinsuke Mori
Word-based Partial Annotation for Efficient Corpus Construction

L10-1282  [bib]: Gertrud Faaß; Ulrich Heid; Helmut Schmid
Design and Application of a Gold Standard for Morphological Analysis: SMOR as an Example of Morphological Evaluation

L10-1283  [bib]: Richard Schwarz; Hinrich Schütze; Fabienne Martin; Achim Stein
Identification of Rare & Novel Senses Using Translations in a Parallel Corpus

L10-1284  [bib]: Cláudia Freitas; Cristina Mota; Diana Santos; Hugo Gonçalo Oliveira; Paula Carvalho
Second HAREM: Advancing the State of the Art of Named Entity Recognition in Portuguese

L10-1285  [bib]: Marc Kupietz; Cyril Belica; Holger Keibel; Andreas Witt
The German Reference Corpus DeReKo: A Primordial Sample for Linguistic Research

L10-1286  [bib]: Eva Sassolini; Alessandra Cinini
Cultural Heritage: Knowledge Extraction from Web Documents

L10-1287  [bib]: Marta Villegas; Núria Bel; Santiago Bel; Víctor Rodríguez
A Case Study on Interoperability for Language Resources and Applications

L10-1288  [bib]: Bruno Cartoni; Pierre Zweigenbaum
Semi-Automated Extension of a Specialized Medical Lexicon for French

L10-1289  [bib]: Kyle Duarte; Sylvie Gibet
Heterogeneous Data Sources for Signed Language Analysis and Synthesis: The SignCom Project

L10-1290  [bib]: Patrick Paroubek; Alexander Pak; Djamel Mostefa
Annotations for Opinion Mining Evaluation in the Industrial Context of the DOXA project

L10-1291  [bib]: Milen Kouylekov; Yashar Mehdad; Matteo Negri
Mining Wikipedia for Large-scale Repositories of Context-Sensitive Entailment Rules

L10-1292  [bib]: Sara Stymne; Lars Ahrenberg
Using a Grammar Checker for Evaluation and Postprocessing of Statistical Machine Translation

L10-1293  [bib]: Marion Weller; Ulrich Heid
Extraction of German Multiword Expressions from Parsed Corpora Using Context Features

L10-1294  [bib]: Patrick Paroubek; Olivier Hamon; Eric de La Clergerie; Cyril Grouin; Anne Vilnat
The Second Evaluation Campaign of PASSAGE on Parsing of French

L10-1295  [bib]: Kepa Joseba Rodríguez; Francesca Delogu; Yannick Versley; Egon W. Stemle; Massimo Poesio
Anaphoric Annotation of Wikipedia and Blogs in the Live Memories Corpus

L10-1296  [bib]: Dan Flickinger; Stephan Oepen; Gisle Ytrestøl
WikiWoods: Syntacto-Semantic Annotation for English Wikipedia

L10-1297  [bib]: Josef Ruppenhofer; Caroline Sporleder; Fabian Shirokov
Speaker Attribution in Cabinet Protocols

L10-1298  [bib]: Nick Webb; David Benyon; Jay Bradley; Preben Hansen; Oli Mival
Wizard of Oz Experiments for a Companion Dialogue System: Eliciting Companionable Conversation

L10-1299  [bib]: Carlos Gómez Gallo; T. Florian Jaeger; Katrina Furth
A Database for the Exploration of Spanish Planning

L10-1300  [bib]: Irina Temnikova
Cognitive Evaluation Approach for a Controlled Language Post-Editing Experiment

L10-1301  [bib]: Jens Allwood; Harald Hammarström; Andries Hendrikse; Mtholeni N. Ngcobo; Nozibele Nomdebevana; Laurette Pretorius; Mac van der Merwe
Work on Spoken (Multimodal) Language Corpora in South Africa

L10-1302  [bib]: Nils Reiter; Oliver Hellwig; Anand Mishra; Anette Frank; Jens Burkhardt
Using NLP Methods for the Analysis of Rituals

L10-1303  [bib]: C. Anton Rytting; Paul Rodrigues; Tim Buckwalter; David Zajic; Bridget Hirsch; Jeff Carnes; Nathanael Lynn; Sarah Wayland; Chris Taylor; Jason White; Charles Blake III; Evelyn Browne; Corey Miller; Tristan Purvis
Error Correction for Arabic Dictionary Lookup

L10-1304  [bib]: Christopher R. Walker; Hannah Copperman
Evaluating Complex Semantic Artifacts

L10-1305  [bib]: Mohamed Altantawy; Nizar Habash; Owen Rambow; Ibrahim Saleh
Morphological Analysis and Generation of Arabic Nouns: A Morphemic Functional Approach

L10-1306  [bib]: Hiroyuki Kaji; Takashi Tsunakawa; Daisuke Okada
Using Comparable Corpora to Adapt a Translation Model to Domains

L10-1307  [bib]: Hannah Copperman; Christopher R. Walker
Fred’s Reusable Evaluation Device: Providing Support for Quick and Reliable Linguistic Annotation

L10-1308  [bib]: Alexis Baird; Christopher R. Walker
The Creation of a Large-Scale LFG-Based Gold Parsebank

L10-1309  [bib]: Kathrin Baker; Michael Bloodgood; Bonnie Dorr; Nathaniel W. Filardo; Lori Levin; Christine Piatko
A Modality Lexicon and its use in Automatic Tagging

L10-1310  [bib]: Michael Tanenblatt; Anni Coden; Igor Sominsky
The ConceptMapper Approach to Named Entity Recognition

L10-1311  [bib]: Yoshihiko Hayashi; Thierry Declerck; Chiharu Narawa
LAF/GrAF-grounded Representation of Dependency Structures

L10-1312  [bib]: Marc Carmen; Paul Felt; Robbie Haertel; Deryle Lonsdale; Peter McClanahan; Owen Merkling; Eric Ringger; Kevin Seppi
Tag Dictionaries Accelerate Manual Annotation

L10-1313  [bib]: Stasinos Konstantopoulos
Learning Language Identification Models: A Comparative Analysis of the Distinctive Features of Names and Common Words

L10-1314  [bib]: Chris Irwin Davis; Dan Moldovan
Feasibility of Automatically Bootstrapping a Persian WordNet

L10-1315  [bib]: Aditi Sharma Grover; Gerhard B. van Huyssteen; Marthinus W. Pretorius
The South African Human Language Technologies Audit

L10-1316  [bib]: Massimo Poesio; Marco Baroni; Oswald Lanz; Alessandro Lenci; Alexandros Potamianos; Hinrich Schütze; Sabine Schulte im Walde; Luca Surian
BabyExp: Constructing a Huge Multimodal Resource to Acquire Commonsense Knowledge Like Children Do

L10-1317  [bib]: Iñaki Sainz; Eva Navas; Inma Hernáez; Antonio Bonafonte; Francisco Campillo
TTS Evaluation Campaign with a Common Spanish Database

L10-1318  [bib]: Diana Santos; Cristina Mota
Experiments in Human-computer Cooperation for the Semantic Annotation of Portuguese Corpora

L10-1319  [bib]: Koichiro Honda; Tomoyosi Akiba
Language Modeling Approach for Retrieving Passages in Lecture Audio Data

L10-1320  [bib]: Pamela Forner; Danilo Giampiccolo; Bernardo Magnini; Anselmo Peñas; Álvaro Rodrigo; Richard Sutcliffe
Evaluating Multilingual Question Answering Systems at CLEF

L10-1321  [bib]: Veronika Vincze; Dóra Szauter; Attila Almási; György Móra; Zoltán Alexin; János Csirik
Hungarian Dependency Treebank

L10-1322  [bib]: Francesca Fallucchi; Maria Teresa Pazienza; Fabio Massimo Zanzotto
Generic Ontology Learners on Application Domains

L10-1323  [bib]: Alessandro Oltramari; Guido Vetere; Maurizio Lenzerini; Aldo Gangemi; Nicola Guarino
Senso Comune

L10-1324  [bib]: Rogelio Nazar; Maarten Janssen
Combining Resources: Taxonomy Extraction from Multiple Dictionaries

L10-1325  [bib]: Yan Zhao; Gertjan van Noord
POS Multi-tagging Based on Combined Models

L10-1326  [bib]: Ibon Saratxaga; Inmaculada Hernáez; Eva Navas; Iñaki Sainz; Iker Luengo; Jon Sanchez; Igor Odriozola; Daniel Erro
AhoTransf: A Tool for Multiband Excitation Based Speech Analysis and Modification

L10-1327  [bib]: Louise Deléger; Pierre Zweigenbaum
Identifying Paraphrases between Technical and Lay Corpora

L10-1328  [bib]: Stavros Ntalampiras; Todor Ganchev; Ilyas Potamitis; Nikos Fakotakis
Heterogeneous Sensor Database in Support of Human Behaviour Analysis in Unrestricted Environments: The Audio Part

L10-1329  [bib]: Khalil Dahab; Anja Belz
A Game-based Approach to Transcribing Images of Text

L10-1330  [bib]: Nicolas Serrano; Francisco Castro; Alfons Juan
The RODRIGO Database

L10-1331  [bib]: Luisa Bentivogli; Elena Cabrio; Ido Dagan; Danilo Giampiccolo; Medea Lo Leggio; Bernardo Magnini
Building Textual Entailment Specialized Data Sets: a Methodology for Isolating Linguistic Phenomena Relevant to Inference

L10-1332  [bib]: Amal Al-Saif; Katja Markert
The Leeds Arabic Discourse Treebank: Annotating Discourse Connectives for Arabic

L10-1333  [bib]: Ioana Vasilescu; Sophie Rosset; Martine Adda-Decker
On the Role of Discourse Markers in Interactive Spoken Question Answering Systems

L10-1334  [bib]: Björn Schuller; Riccardo Zaccarelli; Nicolas Rollet; Laurence Devillers
CINEMO ― A French Spoken Language Resource for Complex Emotions: Facts and Baselines

L10-1335  [bib]: Sara Romano; Francesco Cutugno
New Features in Spoken Language Search Hawk (SpLaSH): Query Language and Query Sequence

L10-1336  [bib]: Claire Mouton; Gaël de Chalendar; Benoît Richert
FrameNet Translation Using Bilingual Dictionaries with Evaluation on the English-French Pair

L10-1337  [bib]: Jiří Mírovský; Petr Pajas; Anna Nedoluzhko
Annotation Tool for Extended Textual Coreference and Bridging Anaphora

L10-1338  [bib]: Azad Abad; Luisa Bentivogli; Ido Dagan; Danilo Giampiccolo; Shachar Mirkin; Emanuele Pianta; Asher Stern
A Resource for Investigating the Impact of Anaphora and Coreference on Inference

L10-1339  [bib]: Robert Remus; Uwe Quasthoff; Gerhard Heyer
SentiWS - A Publicly Available German-language Resource for Sentiment Analysis

L10-1340  [bib]: Polina Panicheva; John Cardiff; Paolo Rosso
Personal Sense and Idiolect: Combining Authorship Attribution and Opinion Analysis

L10-1341  [bib]: Dirk Goldhahn; Uwe Quasthoff
Automatic Annotation of Co-Occurrence Relations

L10-1342  [bib]: Max Jakob; Markéta Lopatková; Valia Kordoni
Mapping between Dependency Structures and Compositional Semantic Representations

L10-1343  [bib]: Danielle Ben-Gera; Yi Zhang; Valia Kordoni
Semantic Feature Engineering for Enhancing Disambiguation Performance in Deep Linguistic Processing

L10-1344  [bib]: Nuria Gala; Véronique Rey; Michael Zock
A Tool for Linking Stems and Conceptual Fragments to Enhance Word Access

L10-1345  [bib]: Valia Kordoni; Yi Zhang
Disambiguating Compound Nouns for a Dynamic HPSG Treebank of Wall Street Journal Texts

L10-1346  [bib]: Lukas Michelbacher; Florian Laws; Beate Dorow; Ulrich Heid; Hinrich Schütze
Building a Cross-lingual Relatedness Thesaurus using a Graph Similarity Measure

L10-1347  [bib]: Samuel Broscheit; Simone Paolo Ponzetto; Yannick Versley; Massimo Poesio
Extending BART to Provide a Coreference Resolution System for German

L10-1348  [bib]: Ulrich Heid; Helmut Schmid; Kerstin Eckart; Erhard Hinrichs
A Corpus Representation Format for Linguistic Web Services: The D-SPIN Text Corpus Format and its Relationship with ISO Standards

L10-1349  [bib]: Lucia Specia; Nicola Cancedda; Marc Dymetman
A Dataset for Assessing Machine Translation Evaluation Metrics

L10-1350  [bib]: Jakob Halskov; Dorte Haltrup Hansen; Anna Braasch; Sussi Olsen
Quality Indicators of LSP Texts ― Selection and Measurements Measuring the Terminological Usefulness of Documents for an LSP Corpus

L10-1351  [bib]: Gregor Bertrand; Florian Nothdurft; Steffen Walter; Andreas Scheck; Henrik Kessler; Wolfgang Minker
Towards Investigating Effective Affective Dialogue Strategies

L10-1352  [bib]: Rüdiger Gleim; Alexander Mehler
Computational Linguistics for Mere Mortals - Powerful but Easy-to-use Linguistic Processing for Scientists in the Humanities

L10-1353  [bib]: Maria Holmqvist
Heuristic Word Alignment with Parallel Phrases

L10-1354  [bib]: Martijn Goudbeek; Mirjam Broersma
The Demo / Kemo Corpus: A Principled Approach to the Study of Cross-cultural Differences in the Vocal Expression and Perception of Emotion

L10-1355  [bib]: Armando Stellato; Heiko Stoermer; Stefano Bortoli; Noemi Scarpato; Andrea Turbati; Paolo Bouquet; Maria Teresa Pazienza
Maskkot ― An Entity-centric Annotation Platform

L10-1356  [bib]: Petr Pollák; Josef Rajnoha
Multi-Channel Database of Spontaneous Czech with Synchronization of Channels Recorded by Independent Devices

L10-1357  [bib]: Guillaume Bernard; Sophie Rosset; Martine Adda-Decker; Olivier Galibert
A Question-answer Distance Measure to Investigate QA System Progress

L10-1358  [bib]: Andre Blessing; Hinrich Schütze
Fine-Grained Geographical Relation Extraction from Wikipedia

L10-1359  [bib]: Sarra El Ayari; Brigitte Grau; Anne-Laure Ligozat
Fine-grained Linguistic Evaluation of Question Answering Systems

L10-1360  [bib]: Nao Tatsumi; Jun Okamoto; Shun Ishizaki
Evaluating Semantic Relations and Distances in the Associative Concept Dictionary using NIRS-imaging

L10-1361  [bib]: Danica Damljanovic; Milan Agatonovic; Hamish Cunningham
Identification of the Question Focus: Combining Syntactic Analysis and Ontology-based Lookup through the User Interaction

L10-1362  [bib]: Arnaud Grappy; Brigitte Grau; Olivier Ferret; Cyril Grouin; Véronique Moriceau; Isabelle Robba; Xavier Tannier; Anne Vilnat; Vincent Barbier
A Corpus for Studying Full Answer Justification

L10-1363  [bib]: Silvana Marianela Bernaola Biggio; Manuela Speranza; Roberto Zanoli
Entity Mention Detection using a Combination of Redundancy-Driven Classifiers

L10-1364  [bib]: Gabor Recski; András Rung; Attila Zséder; András Kornai
NP Alignment in Bilingual Corpora

L10-1365  [bib]: Andrew Gargett; Konstantina Garoufi; Alexander Koller; Kristina Striegnitz
The GIVE-2 Corpus of Giving Instructions in Virtual Environments

L10-1366  [bib]: Alexander Vorwerk; Xiaohui Wang; Dorothea Kolossa; Steffen Zeiler; Reinhold Orglmeister
WAPUSK20 - A Database for Robust Audiovisual Speech Recognition

L10-1367  [bib]: Eneko Agirre; Montse Cuadros; German Rigau; Aitor Soroa
Exploring Knowledge Bases for Similarity

L10-1368  [bib]: Cristina Sánchez-Marco; Gemma Boleda; Josep Maria Fontana; Judith Domingo
Annotation and Representation of a Diachronic Corpus of Spanish

L10-1369  [bib]: Ghulam Raza
Inferring Subcat Frames of Verbs in Urdu

L10-1370  [bib]: Romaric Besançon; Gaël de Chalendar; Olivier Ferret; Faiza Gara; Olivier Mesnard; Meriama Laïb; Nasredine Semmar
LIMA: A Multilingual Framework for Linguistic Analysis and Linguistic Resources Development and Evaluation

L10-1371  [bib]: Grzegorz Chrupała; Dietrich Klakow
A Named Entity Labeler for German: Exploiting Wikipedia and Distributional Clusters

L10-1372  [bib]: Magnus Rosell
Text Cluster Trimming for Better Descriptions and Improved Quality

L10-1373  [bib]: Jesús González-Rubio; Jorge Civera; Alfons Juan; Francisco Casacuberta
Saturnalia: A Latin-Catalan Parallel Corpus for Statistical MT

L10-1374  [bib]: Emilia Apostolova; Sean Neilan; Gary An; Noriko Tomuro; Steven Lytinen
Djangology: A Light-weight Web-based Tool for Distributed Collaborative Text Annotation

L10-1375  [bib]: Leon Derczynski; Robert Gaizauskas
Analysing Temporally Annotated Corpora with CAVaT

L10-1376  [bib]: Martin Reynaert; Nelleke Oostdijk; Orphée De Clercq; Henk van den Heuvel; Franciska de Jong
Balancing SoNaR: IPR versus Processing Issues in a 500-Million-Word Written Dutch Reference Corpus

L10-1377  [bib]: Samuel Cruz-Lara; Gil Francopoulo; Laurent Romary; Nasredine Semmar
MLIF: A Metamodel to Represent and Exchange Multilingual Textual Information

L10-1378  [bib]: Josef Ruppenhofer; Jonas Sunde; Manfred Pinkal
Generating FrameNets of Various Granularities: The FrameNet Transformer

L10-1379  [bib]: Francesca Bonin; Felice Dell'Orletta; Simonetta Montemagni; Giulia Venturi
A Contrastive Approach to Multi-word Extraction from Domain-specific Corpora

L10-1380  [bib]: Olivier Blanc; Matthieu Constant; Anne Dister; Patrick Watrin
Partial Parsing of Spontaneous Spoken French

L10-1381  [bib]: Annelies Braffort; Laurence Bolot; Emilie Chételat-Pelé; Annick Choisier; Maxime Delorme; Michael Filhol; Jérémie Segouat; Cyril Verrecchia; Flora Badin; Nadège Devos
Sign Language Corpora for Analysis, Processing and Evaluation

L10-1382  [bib]: Sara Tonelli; Emanuele Pianta; Rodolfo Delmonte; Michele Brunelli
VenPro: A Morphological Analyzer for Venetan

L10-1383  [bib]: Mohamed Maamouri; Ann Bies; Seth Kulick; Wajdi Zaghouani; Dave Graff; Mike Ciul
From Speech to Trees: Applying Treebank Annotation to Arabic Broadcast News

L10-1384  [bib]: Enikő Héja
The Role of Parallel Corpora in Bilingual Lexicography

L10-1385  [bib]: Harry Bunt; Jan Alexandersson; Jean Carletta; Jae-Woong Choe; Alex Chengyu Fang; Koiti Hasida; Kiyong Lee; Volha Petukhova; Andrei Popescu-Belis; Laurent Romary; Claudia Soria; David Traum
Towards an ISO Standard for Dialogue Act Annotation

L10-1386  [bib]: Archna Bhatia; Rajesh Bhatt; Bhuvana Narasimhan; Martha Palmer; Owen Rambow; Dipti Misra Sharma; Michael Tepper; Ashwini Vaidya; Fei Xia
Empty Categories in a Hindi Treebank

L10-1387  [bib]: Marina Lloberes; Irene Castellón; Lluís Padró
Spanish FreeLing Dependency Grammar

L10-1388  [bib]: Magali Sanches Duran; Marcelo Adriano Amâncio; Sandra Maria Aluísio
Assigning Wh-Questions to Verbal Arguments: Annotation Tools Evaluation and Corpus Building

L10-1389  [bib]: Ralph Grishman
The Impact of Task and Corpus on Event Extraction Systems

L10-1390  [bib]: Seth Kulick; Ann Bies; Mohamed Maamouri
Consistent and Flexible Integration of Morphological Annotation in the Arabic Treebank

L10-1391  [bib]: Andrea Zaninello; Malvina Nissim
Creation of Lexical Resources for a Characterisation of Multiword Expressions in Italian

L10-1392  [bib]: Jana Šindlerová; Ondřej Bojar
Building a Bilingual ValLex Using Treebank Token Alignment: First Observations

L10-1393  [bib]: Alberto Díaz; Pablo Gervás; Antonio García; Laura Plaza
Development and Use of an Evaluation Collection for Personalisation of Digital Newspapers

L10-1394  [bib]: Jonathan H. Clark; Alon Lavie
LoonyBin: Keeping Language Technologists Sane through Automated Management of Experimental (Hyper)Workflows

L10-1395  [bib]: Keith J. Miller; Sarah McLeod; Elizabeth Schroeder; Mark Arehart; Kenneth Samuel; James Finley; Vanesa Jurica; John Polk
Improving Personal Name Search in the TIGR System

L10-1396  [bib]: Kornel Laskowski; Jens Edlund
A Snack Implementation and Tcl/Tk Interface to the Fundamental Frequency Variation Spectrum Algorithm

L10-1397  [bib]: Jette Viethen; Simon Zwarts; Robert Dale; Markus Guhe
Dialogue Reference in a Visual Domain

L10-1398  [bib]: Sunao Hara; Norihide Kitaoka; Kazuya Takeda
Estimation Method of User Satisfaction Using N-gram-based Dialog History Model for Spoken Dialog System

L10-1399  [bib]: Peng-Wen Chen; Snehal Kumar Chennuru; Ying Zhang
A Language Approach to Modeling Human Behaviors

L10-1400  [bib]: Masaki Murata; Tomohiro Ohno; Shigeki Matsubara; Yasuyoshi Inagaki
Construction of Chunk-Aligned Bilingual Lecture Corpus for Simultaneous Machine Translation

L10-1401  [bib]: Stergos Afantenos; Pascal Denis; Philippe Muller; Laurence Danlos
Learning Recursive Segments for Discourse Parsing

L10-1402  [bib]: Shu Zhang; Wenjie Jia; Yingju Xia; Yao Meng; Hao Yu
Extracting Product Features and Sentiments from Chinese Customer Reviews

L10-1403  [bib]: Bernd Bohnet; Leo Wanner
Open Source Graph Transducer Interpreter and Grammar Development Environment

L10-1404  [bib]: Min-Jae Kwon; Hae-Yun Lee; Hee-Rahk Chae
Linking Korean Words with an Ontology

L10-1405  [bib]: Sabine Ploux; Armelle Boussidan; Hyungsuk Ji
The Semantic Atlas: an Interactive Model of Lexical Representation

L10-1406  [bib]: Roberto P. A. Araujo; Rafael L. de Oliveira; Eder M. de Novais; Thiago D. Tadeu; Daniel B. Pereira; Ivandré Paraboni
SINotas: the Evaluation of a NLG Application

L10-1407  [bib]: Isabella Poggi; Francesca D'Errico; Laura Vincze
Types of Nods. The Polysemy of a Social Signal

L10-1408  [bib]: Toomas Altosaar; Louis ten Bosch; Guillaume Aimetti; Christos Koniaris; Kris Demuynck; Henk van den Heuvel
A Speech Corpus for Modeling Language Acquisition: CAREGIVER

L10-1409  [bib]: Sanja Seljan; Marko Tadić; Željko Agić; Jan Šnajder; Bojana Dalbelo Bašić; Vjekoslav Osmann
Corpus Aligner (CorAl) Evaluation on English-Croatian Parallel Corpora

L10-1410  [bib]: Siim Orasmaa; Reina Käärik; Jaak Vilo; Tiit Hennoste
Information Retrieval of Word Form Variants in Spoken Language Corpora Using Generalized Edit Distance

L10-1411  [bib]: Montserrat Marimon
The Spanish Resource Grammar

L10-1412  [bib]: Anne Vilnat; Patrick Paroubek; Eric Villemonte de la Clergerie; Gil Francopoulo; Marie-Laure Guénot
PASSAGE Syntactic Representation: a Minimal Common Ground for Evaluation

L10-1413  [bib]: Mark Fishel; Harri Kirik
Linguistically Motivated Unsupervised Segmentation for Machine Translation

L10-1414  [bib]: Carlo Strapparava; Marco Guerini; Oliviero Stock
Predicting Persuasiveness in Political Discourses

L10-1415  [bib]: Didier Cadic; Cédric Boidin; Christophe d'Alessandro
Towards Optimal TTS Corpora

L10-1416  [bib]: Claudia Borg; Mike Rosner; Gordon J. Pace
Automatic Grammar Rule Extraction and Ranking for Definitions

L10-1417  [bib]: Manny Rayner; Pierrette Bouillon; Nikos Tsourakis; Johanna Gerlach; Maria Georgescul; Yukie Nakao; Claudia Baur
A Multilingual CALL Game Based on Speech Translation

L10-1418  [bib]: Peter Adolphs; Xiwen Cheng; Tina Klüwer; Hans Uszkoreit; Feiyu Xu
Question Answering Biographic Information and Social Network Powered by the Semantic Web

L10-1419  [bib]: Ekaterina Shutova; Simone Teufel
Metaphor Corpus Annotated for Source - Target Domain Mappings

L10-1420  [bib]: Federico Sangati; Willem Zuidema; Rens Bod
Efficiently Extract Recurring Tree Fragments from Large Treebanks

L10-1421  [bib]: Alexandros Lazaridis; Theodoros Kostoulas; Todor Ganchev; Iosif Mporas; Nikos Fakotakis
Vergina: A Modern Greek Speech Database for Speech Synthesis

L10-1422  [bib]: Vivi Nastase; Michael Strube; Benjamin Boerschinger; Caecilia Zirn; Anas Elghafari
WikiNet: A Very Large Scale Multi-Lingual Concept Network

L10-1423  [bib]: Niraj Aswani; Robert Gaizauskas
Developing Morphological Analysers for South Asian Languages: Experimenting with the Hindi and Gujarati Languages

L10-1424  [bib]: Hiroki Hanaoka; Hideki Mima; Jun'ichi Tsujii
A Japanese Particle Corpus Built by Example-Based Annotation

L10-1425  [bib]: Caroline Sporleder; Linlin Li; Philip Gorinski; Xaver Koch
Idioms in Context: The IDIX Corpus

L10-1426  [bib]: Theodoros Kostoulas; Otilia Kocsis; Todor Ganchev; Fernando Fernández-Aranda; Juan J. Santamaría; Susana Jiménez-Murcia; Maher Ben Moussa; Nadia Magnenat-Thalmann; Nikos Fakotakis
The PlayMancer Database: A Multimodal Affect Database in Support of Research and Development Activities in Serious Game Environment

L10-1427  [bib]: Swaran Lata; Somnath Chandra Vijay Kumar
Development of Linguistic Resources and Tools for Providing Multilingual Solutions in Indian Languages ― A Report on National Initiative

L10-1428  [bib]: Cristina Nicolae; Gabriel Nicolae; Kirk Roberts
C-3: Coherence and Coreference Corpus

L10-1429  [bib]: Stephen A. Boxwell; Chris Brew
A Pilot Arabic CCGbank

L10-1430  [bib]: Cécile Fougeron; Lise Crevier-Buchman; Corinne Fredouille; Alain Ghio; Christine Meunier; Claude Chevrie-Muller; Jean-Francois Bonastre; Antonia Colazo-Simon; Céline Delooze; Danielle Duez; Cédric Gendrot; Thierry Legou; Nathalie Lévêque; Claire Pillot-Loiseau; Serge Pinto; Gilles Pouchoulin; Danièle Robert; Jacqueline Vaissière; François Viallet; Coralie Vincent
The DesPho-APaDy Project: Developing an Acoustic-phonetic Characterization of Dysarthric Speech in French

L10-1431  [bib]: Mithun Balakrishna; Dan Moldovan; Marta Tatu; Marian Olteanu
Semi-Automatic Domain Ontology Creation from Text Resources

L10-1432  [bib]: Maite Melero; Gemma Boleda; Montse Cuadros; Cristina España-Bonet; Lluís Padró; Martí Quixal; Carlos Rodríguez; Roser Saurí
Language Technology Challenges of a ‘Small’ Language (Catalan)

L10-1433  [bib]: John Lee; Dag Haug
Porting an Ancient Greek and Latin Treebank

L10-1434  [bib]: Sabine Schulte im Walde
Comparing Computational Models of Selectional Preferences - Second-order Co-Occurrence vs. Latent Semantic Clusters

L10-1435  [bib]: Paul McNamee; Hoa Trang Dang; Heather Simpson; Patrick Schone; Stephanie M. Strassel
An Evaluation of Technologies for Knowledge Base Population

L10-1436  [bib]: Óscar Ferrández; Michael Ellsworth; Rafael Muñoz; Collin F. Baker
Aligning FrameNet and WordNet based on Semantic Neighborhoods

L10-1437  [bib]: Hassina Aliane; Zaia Alimazighi; Ahmed Cherif Mazari
Al-Khalil: The Arabic Linguistic Ontology Project

L10-1438  [bib]: Marc Kemps-Snijders; Thomas Koller; Han Sloetjes; Huib Verwey
LAT Bridge: Bridging Tools for Annotation and Exploration of Rich Linguistic Data

L10-1439  [bib]: Ondřej Bojar; Adam Liška; Zdeněk Žabokrtský
Evaluating Utility of Data Sources in a Large Parallel Czech-English Corpus CzEng 0.9

L10-1440  [bib]: Maria Liakata; Simone Teufel; Advaith Siddharthan; Colin Batchelor
Corpora for the Conceptualisation and Zoning of Scientific Papers

L10-1441  [bib]: Alberto Tretti; Barbara Di Eugenio
Analysis and Presentation of Results for Mobile Local Search

L10-1442  [bib]: Yannick Estève; Thierry Bazillon; Jean-Yves Antoine; Frédéric Béchet; Jérôme Farinas
The EPAC Corpus: Manual and Automatic Annotations of Conversational Speech in French Broadcast News

L10-1443  [bib]: Katrin Tomanek; Udo Hahn
Annotation Time Stamps ― Temporal Metadata from the Linguistic Annotation Process

L10-1444  [bib]: Stuart Moore; Sabine Buchholz; Anna Korhonen
Annotating the Enron Email Corpus with Number Senses

L10-1445  [bib]: Piroska Lendvai; Thierry Declerck; Sándor Darányi; Pablo Gervás; Raquel Hervás; Scott Malec; Federico Peinado
Integration of Linguistic Markup into Semantic Models of Folk Narratives: The Fairy Tale Use Case

L10-1446  [bib]: Bora Savas; Yoshihiko Hayashi; Monica Monachini; Claudia Soria; Nicoletta Calzolari
An LMF-based Web Service for Accessing WordNet-type Semantic Lexicons

L10-1447  [bib]: Jordi Atserias; Giuseppe Attardi; Maria Simi; Hugo Zaragoza
Active Learning for Building a Corpus of Questions for Parsing

L10-1448  [bib]: Paul Cook; Suzanne Stevenson
Automatically Identifying Changes in the Semantic Orientation of Words

L10-1449  [bib]: Anton Leuski; David Traum
NPCEditor: A Tool for Building Question-Answering Characters

L10-1450  [bib]: Changqin Quan; Fuji Ren
Automatic Annotation of Word Emotion in Sentences Based on Ren-CECps

L10-1451  [bib]: Kseniya Zablotskaya; Steffen Walter; Wolfgang Minker
Speech Data Corpus for Verbal Intelligence Estimation

L10-1452  [bib]: Yi Liu; Pascale Fung; Yongsheng Yang; Denise DiPersio; Meghan Glenn; Stephanie Strassel; Christopher Cieri
A Very Large Scale Mandarin Chinese Broadcast Corpus for GALE Project

L10-1453  [bib]: Anca Dinu
Building a Generative Lexicon for Romanian

L10-1454  [bib]: Jerid Francom; Amy LaCross; Adam Ussishkin
How Specialized are Specialized Corpora? Behavioral Evaluation of Corpus Representativeness for Maltese

L10-1455  [bib]: Kevin Walker; Christopher Caruso; Denise DiPersio
Large Scale Multilingual Broadcast Data Collection to Support Machine Translation and Distillation Technology Development

L10-1456  [bib]: Laura Street; Nathan Michalov; Rachel Silverstein; Michael Reynolds; Lurdes Ruela; Felicia Flowers; Angela Talucci; Priscilla Pereira; Gabriella Morgon; Samantha Siegel; Marci Barousse; Antequa Anderson; Tashom Carroll; Anna Feldman
Like Finding a Needle in a Haystack: Annotating the American National Corpus for Idiomatic Expressions

L10-1457  [bib]: Wajdi Zaghouani; Bruno Pouliquen; Mohamed Ebrahim; Ralf Steinberger
Adapting a resource-light highly multilingual Named Entity Recognition system to Arabic

L10-1458  [bib]: Xuansong Li; Niyu Ge; Stephen Grimes; Stephanie M. Strassel; Kazuaki Maeda
Enriching Word Alignment with Linguistic Tags

L10-1459  [bib]: Kathleen Eberhard; Hannele Nicholson; Sandra Kübler; Susan Gundersen; Matthias Scheutz
The Indiana "Cooperative Remote Search Task" (CReST) Corpus

L10-1460  [bib]: Carl Christensen; Ross Hendrickson; Deryle Lonsdale
Principled Construction of Elicited Imitation Tests

L10-1461  [bib]: Bharat Ram Ambati; Mridul Gupta; Samar Husain; Dipti Misra Sharma
A High Recall Error Identification Tool for Hindi Treebank Validation

L10-1462  [bib]: Susan Robinson; Antonio Roque; David Traum
Dialogues in Context: An Objective User-Oriented Evaluation Approach for Virtual Human Dialogue

L10-1463  [bib]: Xuchen Yao; Pravin Bhutada; Kallirroi Georgila; Kenji Sagae; Ron Artstein; David Traum
Practical Evaluation of Speech Recognizers for Virtual Human Dialogue Systems

L10-1464  [bib]: Kiyonori Ohtake; Teruhisa Misu; Chiori Hori; Hideki Kashioka; Satoshi Nakamura
Dialogue Acts Annotation for NICT Kyoto Tour Dialogue Corpus to Construct Statistical Dialogue Systems

L10-1465  [bib]: Bal Krishna Bal; Patrick Saint Dizier
Towards Building Annotated Resources for Analyzing Opinions and Argumentation in News Editorials

L10-1466  [bib]: Iris Eshkol; Denis Maurel; Nathalie Friburger
Eslo: From Transcription to Speakers' Personal Information Annotation

L10-1467  [bib]: Peter Wittenburg; Nuria Bel; Lars Borin; Gerhard Budin; Nicoletta Calzolari; Eva Hajicova; Kimmo Koskenniemi; Lothar Lemnitzer; Bente Maegaard; Maciej Piasecki; Jean-Marie Pierrel; Stelios Piperidis; Inguna Skadina; Dan Tufis; Remco van Veenendaal; Tamas Váradi; Martin Wynne
Resource and Service Centres as the Backbone for a Sustainable Service Infrastructure

L10-1468  [bib]: Stefania Spina
The Dictionary of Italian Collocations: Design and Integration in an Online Learning Environment

L10-1469  [bib]: Suguru Matsuyoshi; Megumi Eguchi; Chitose Sao; Koji Murakami; Kentaro Inui; Yuji Matsumoto
Annotating Event Mentions in Text with Modality, Focus, and Source Information

L10-1470  [bib]: Sisay Adugna; Andreas Eisele
English ― Oromo Machine Translation: An Experiment Using a Statistical Approach

L10-1471  [bib]: Atsushi Fujii
Modeling Wikipedia Articles to Enhance Encyclopedic Search

L10-1472  [bib]: Matthias Hartung; Anette Frank
A Semi-supervised Type-based Classification of Adjectives: Distinguishing Properties and Relations

L10-1473  [bib]: Andreas Eisele; Yu Chen
MultiUN: A Multilingual Corpus from United Nation Documents

L10-1474  [bib]: Myriam Rakho; Matthieu Constant
Evaluating the Impact of Some Linguistic Information on the Performances of a Similarity-based and Translation-oriented Word-Sense Disambiguation Method

L10-1475  [bib]: Eckhard Bick
FrAG, a Hybrid Constraint Grammar Parser for French

L10-1476  [bib]: Julia Maria Schulz; Christa Womser-Hacker; Thomas Mandl
Multilingual Corpus Development for Opinion Mining

L10-1477  [bib]: Bartosz Broda; Michał Marcińczuk; Maciej Piasecki
Building a Node of the Accessible Language Technology Infrastructure

L10-1478  [bib]: Cássia Trojahn; Paulo Quaresma; Renata Vieira
An API for Multi-lingual Ontology Matching

L10-1479  [bib]: Volker Fritzsch; Stefan Scherer; Friedhelm Schwenker
An Open Source Process Engine Framework for Realtime Pattern Recognition and Information Fusion Tasks

L10-1480  [bib]: Niraj Aswani; Robert Gaizauskas
English-Hindi Transliteration using Multiple Similarity Metrics

L10-1481  [bib]: Rodrigo Agerri; Ana García-Serrano
Q-WordNet: Extracting Polarity from WordNet Senses

L10-1482  [bib]: Taiji Nagasaka; Ran Shimanouchi; Akiko Sakamoto; Takafumi Suzuki; Yohei Morishita; Takehito Utsuro; Suguru Matsuyoshi
Utilizing Semantic Equivalence Classes of Japanese Functional Expressions in Translation Rule Acquisition from Parallel Patent Sentences

L10-1483  [bib]: Simon Mille; Leo Wanner
Syntactic Dependencies for Multilingual and Multilevel Corpus Annotation

L10-1484  [bib]: Hiroaki Sato
How FrameSQL Shows the Japanese FrameNet Data

L10-1485  [bib]: Aya Nishikawa; Ryo Nishimura; Yasuhiko Watanabe; Yoshihiro Okada
A Context Sensitive Variant Dictionary for Supporting Variant Selection

L10-1486  [bib]: Benoît Sagot; Géraldine Walther
A Morphological Lexicon for the Persian Language

L10-1487  [bib]: Benoît Sagot
The Lefff, a Freely Available and Large-coverage Morphological and Syntactic Lexicon for French

L10-1488  [bib]: Montse Cuadros; Egoitz Laparra; German Rigau; Piek Vossen; Wauter Bosma
Integrating a Large Domain Ontology of Species into WordNet

L10-1489  [bib]: Jean-Luc Rouas; Mayumi Beppu; Martine Adda-Decker
Comparison of Spectral Properties of Read, Prepared and Casual Speech in French

L10-1490  [bib]: Svetla Koeva
Lexicon and Grammar in Bulgarian FrameNet

L10-1491  [bib]: Peter Spyns; Elisabeth D'Halleweyn
Flemish-Dutch HLT Policy: Evolving to New Forms of Collaboration

L10-1492  [bib]: Elisabetta Jezek; Valeria Quochi
Capturing Coercions in Texts: a First Annotation Exercise

L10-1493  [bib]: Brigitte Jörg; Hans Uszkoreit; Alastair Burt
LT World: Ontology and Reference Information Portal

L10-1494  [bib]: Thiago D. Tadeu; Eder M. de Novais; Ivandré Paraboni
Extracting Surface Realisation Templates from Corpora

L10-1495  [bib]: Arif Bramantoro; Ulrich Schäfer; Toru Ishida
Towards an Integrated Architecture for Composite Language Services and Multiple Linguistic Processing Components

L10-1496  [bib]: Asif Ekbal; Sriparna Saha
Maximum Entropy Classifier Ensembling using Genetic Algorithm for NER in Bengali

L10-1497  [bib]: Mohamed Belgacem; Georges Antoniadis; Laurent Besacier
Automatic Identification of Arabic Dialects

L10-1498  [bib]: Sathish Pammi; Marcela Charfuelan; Marc Schröder
Multilingual Voice Creation Toolkit for the MARY TTS Platform

L10-1499  [bib]: Petya Osenova; Laska Laskova; Kiril Simov
Exploring Co-Reference Chains for Concept Annotation of Domain Texts

L10-1500  [bib]: Kathrin Spreyer; Lilja Øvrelid; Jonas Kuhn
Training Parsers on Partial Trees: A Cross-language Comparison

L10-1501  [bib]: Peter Menke; Alexander Mehler
The Ariadne System: A Flexible and Extensible Framework for the Modeling and Storage of Experimental Data in the Humanities

L10-1502  [bib]: Alessandra Giordani; Alessandro Moschitti
Corpora for Automatically Learning to Map Natural Language Questions into SQL Queries

L10-1503  [bib]: Naoki Ishikawa; Ryo Nishimura; Yasuhiko Watanabe; Yoshihiro Okada; Masaki Murata
Detection of submitters suspected of pretending to be someone else in a community site

L10-1504  [bib]: Fabienne Fritzinger; Marion Weller; Ulrich Heid
A Survey of Idiomatic Preposition-Noun-Verb Triples on Token Level

L10-1505  [bib]: Daniel Cer; Marie-Catherine de Marneffe; Dan Jurafsky; Chris Manning
Parsing to Stanford Dependencies: Trade-offs between Speed and Accuracy

L10-1506  [bib]: Antonio Reyes; Martin Potthast; Paolo Rosso; Benno Stein
Evaluating Humour Features on Web Comments

L10-1507  [bib]: Daisuke Kawahara; Sadao Kurohashi
Acquiring Reliable Predicate-argument Structures from Raw Corpora for Case Frame Compilation

L10-1508  [bib]: Emiliano Giovannetti
An Unsupervised Approach for Semantic Relation Interpretation

L10-1509  [bib]: Oi Yee Kwong
Constructing an Annotated Story Corpus: Some Observations and Issues

L10-1510  [bib]: Klaar Vanopstal; Bart Desmet; Véronique Hoste
Towards a Learning Approach for Abbreviation Detection and Resolution

L10-1511  [bib]: Catarina Magro
When CORDIAL Becomes Friendly: Endowing the CORDIAL Corpus with a Syntactic Annotation Layer

L10-1512  [bib]: Mridul Gupta; Vineet Yadav; Samar Husain; Dipti Misra Sharma
Partial Parsing as a Method to Expedite Dependency Annotation of a Hindi Treebank

L10-1513  [bib]: Marc Verhagen
The Brandeis Annotation Tool

L10-1514  [bib]: Iker Luengo; Eva Navas; Igor Odriozola; Ibon Saratxaga; Inmaculada Hernaez; Iñaki Sainz; Daniel Erro
Modified LTSE-VAD Algorithm for Applications Requiring Reduced Silence Frame Misclassification

L10-1515  [bib]: Nancy Ide; Keith Suderman; Brian Simms
ANC2Go: A Web Application for Customized Corpus Creation

L10-1516  [bib]: Shunsuke Kozawa; Hitomi Tohyama; Kiyotaka Uchimoto; Shigeki Matsubara
Collection of Usage Information for Language Resources from Academic Articles

L10-1517  [bib]: Nick Rizzolo; Dan Roth
Learning Based Java for Rapid Development of NLP Systems

L10-1518  [bib]: Jorge Vivaldi; Horacio Rodríguez
Finding Domain Terms using Wikipedia

L10-1519  [bib]: Claire Brierley; Eric Atwell
ProPOSEC: A Prosody and PoS Annotated Spoken English Corpus

L10-1520  [bib]: Margarita Alonso Ramos; Leo Wanner; Orsolya Vincze; Gerard Casamayor del Bosque; Nancy Vázquez Veiga; Estela Mosqueira Suárez; Sabela Prieto González
Towards a Motivated Annotation Schema of Collocation Errors in Learner Corpora

L10-1521  [bib]: Chi-kiu Lo; Dekai Wu
Evaluating Machine Translation Utility via Semantic Role Labels

L10-1522  [bib]: Yu Chen; Andreas Eisele
Integrating a Rule-based with a Hierarchical Translation System

L10-1523  [bib]: Massimo Poesio; Olga Uryupina; Yannick Versley
Creating a Coreference Resolution System for Italian

L10-1524  [bib]: Ondřej Bojar; Pavel Straňák; Daniel Zeman
Data Issues in English-to-Hindi Machine Translation

L10-1525  [bib]: Tom Vanallemeersch
Belgisch Staatsblad Corpus: Retrieving French-Dutch Sentences from Official Documents

L10-1526  [bib]: Šárka Zikánová; Lucie Mladová; Jiří Mírovský; Pavlína Jínová
Typical Cases of Annotators’ Disagreement in Discourse Annotations in Prague Dependency Treebank

L10-1527  [bib]: Yu Fu; Feiyu Xu; Hans Uszkoreit
Determining the Origin and Structure of Person Names

L10-1528  [bib]: Arndt Riester; David Lorenz; Nina Seemann
A Recursive Annotation Scheme for Referential Information Status

L10-1529  [bib]: Jaouad Mousser
A Large Coverage Verb Taxonomy for Arabic

L10-1530  [bib]: Helena Spilková; Daniel Brenner; Anton Öttl; Pavel Vondřička; Wim van Dommelen; Mirjam Ernestus
The Kachna L1/L2 Picture Replication Corpus

L10-1531  [bib]: Stefano Baccianella; Andrea Esuli; Fabrizio Sebastiani
SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining

L10-1532  [bib]: Pierre Tirilly; Vincent Claveau; Patrick Gros
News Image Annotation on a Large Parallel Text-image Corpus

L10-1533  [bib]: Diego De Cao; Danilo Croce; Roberto Basili
Extensive Evaluation of a FrameNet-WordNet mapping resource

L10-1534  [bib]: Djamé Seddah
Exploring the Spinal-STIG Model for Parsing French

L10-1535  [bib]: Tommaso Caselli; Irina Prodanof
Annotating Event Anaphora: A Case Study

L10-1536  [bib]: Bente Maegaard; Mohamed Attia; Khalid Choukri; Olivier Hamon; Steven Krauwer; Mustafa Yaseen
Cooperation for Arabic Language Resources and Tools ― The MEDAR Project

L10-1537  [bib]: Katerina Pastra; Christian Wallraven; Michael Schultze; Argyro Vataki; Kathrin Kaulard
The POETICON Corpus: Capturing Language Use and Sensorimotor Experience in Everyday Interaction

L10-1538  [bib]: Philippe Blache; Roxane Bertrand; Mathilde Guardiola; Marie-Laure Guénot; Christine Meunier; Irina Nesterenko; Berthille Pallaud; Laurent Prévot; Béatrice Priego-Valverde; Stéphane Rauzy
The OTIM Formal Annotation Model: A Preliminary Step before Annotation Scheme

L10-1539  [bib]: Sanaz Jabbari; Mark Hepple; Louise Guthrie
Evaluating Lexical Substitution: Analysis and New Measures

L10-1540  [bib]: Mehrnoush Shamsfard; Hakimeh Fadaei; Elham Fekri
Extracting Lexico-conceptual Knowledge for Developing Persian WordNet

L10-1541  [bib]: Paula Vaz Lobo; David Martins de Matos
Fairy Tale Corpus Organization Using Latent Semantic Mapping and an Item-to-item Top-n Recommendation Algorithm

L10-1542  [bib]: Alistair Willis; David King; David Morse; Anton Dil; Chris Lyal; Dave Roberts
From XML to XML: The Why and How of Making the Biodiversity Literature Accessible to Researchers

L10-1543  [bib]: Linda Brandschain; David Graff; Christopher Cieri; Kevin Walker; Chris Caruso; Abby Neely
Greybeard Longitudinal Speech Study

L10-1544  [bib]: Francisco Campillo; Daniela Braga; Ana Belén Mourín; Carmen García-Mateo; Pedro Silva; Miguel Sales Dias; Francisco Méndez
Building High Quality Databases for Minority Languages such as Galician

L10-1545  [bib]: William D. Lewis; Chris Wendt; David Bullock
Achieving Domain Specificity in SMT without Overt Siloing

L10-1546  [bib]: Linda Brandschain; David Graff; Chris Cieri; Kevin Walker; Chris Caruso; Abby Neely
Mixer 6

L10-1547  [bib]: Markus Egg; Gisela Redeker
How Complex is Discourse Structure?

L10-1548  [bib]: Mohammed Attia; Antonio Toral; Lamia Tounsi; Monica Monachini; Josef van Genabith
An Automatically Built Named Entity Lexicon for Arabic

L10-1549  [bib]: Zhiyi Song; Stephanie Strassel; Gary Krug; Kazuaki Maeda
Enhanced Infrastructure for Creation and Collection of Translation Resources

L10-1550  [bib]: Egoitz Laparra; German Rigau
eXtended WordFrameNet

L10-1551  [bib]: Barbara Plank
Improved Statistical Measures to Assess Natural Language Parser Performance across Domains

L10-1552  [bib]: Heng Ji; Xiang Li; Angelo Lucia; Jianting Zhang
Annotating Event Chains for Carbon Sequestration Literature

L10-1553  [bib]: Carlos Ramisch; Aline Villavicencio; Christian Boitet
mwetoolkit: a Framework for Multiword Expression Identification

L10-1554  [bib]: Ines Rehbein; Josef Ruppenhofer
There’s no Data like More Data? Revisiting the Impact of Data Size on a Classification Task

L10-1555  [bib]: Jirka Hana; Anna Feldman
A Positional Tagset for Russian

L10-1556  [bib]: Georgios Petasis; Dimitrios Petasis
BlogBuster: A Tool for Extracting Corpora from the Blogosphere

L10-1557  [bib]: Mehrnoush Shamsfard; Hoda Sadat Jafari; Mahdi Ilbeygi
STeP-1: A Set of Fundamental Tools for Persian Text Processing

L10-1558  [bib]: Drahomíra "johanka" Spoustová; Miroslav Spousta; Pavel Pecina
Building a Web Corpus of Czech

L10-1559  [bib]: Cristina Vertan
Towards the Integration of Language Tools Within Historical Digital Libraries

L10-1560  [bib]: Adriane Boyd
EAGLE: an Error-Annotated Corpus of Beginning Learner German

L10-1561  [bib]: Olivier Ferret
Testing Semantic Similarity Measures for Extracting Synonyms from a Corpus

L10-1562  [bib]: Ernesto William De Luca
A Corpus for Evaluating Semantic Multilingual Web Retrieval Systems: The Sense Folder Corpus

L10-1563  [bib]: Roberta Catizone; Alexiei Dingli; Robert Gaizauskas
Using Dialogue Corpora to Extend Information Extraction Patterns for Natural Language Understanding of Dialogue

L10-1564  [bib]: Lamia Tounsi; Josef van Genabith
Arabic Parsing Using Grammar Transforms

L10-1565  [bib]: Rui Wang; Caroline Sporleder
Constructing a Textual Semantic Relation Corpus Using a Discourse Treebank

L10-1566  [bib]: Na-Rae Han; Joel Tetreault; Soo-Hwa Lee; Jin-Young Ha
Using an Error-Annotated Learner Corpus to Develop an ESL/EFL Error Correction System

L10-1567  [bib]: Ian McGraw; Chia-ying Lee; Lee Hetherington; Stephanie Seneff; Jim Glass
Collecting Voices from the Cloud

L10-1568  [bib]: Aurélien Max; Josep Maria Crego; François Yvon
Contrastive Lexical Evaluation of Machine Translation

L10-1569  [bib]: Elaine Uí Dhonnchadha; Josef Van Genabith
Partial Dependency Parsing for Irish

L10-1570  [bib]: Paola Monachesi; Thomas Markus
Socially Driven Ontology Enrichment for eLearning

L10-1571  [bib]: Aurélien Max; Guillaume Wisniewski
Mining Naturally-occurring Corrections and Paraphrases from Wikipedia’s Revision History

L10-1572  [bib]: Sara Rosenthal; William Lipovsky; Kathleen McKeown; Kapil Thadani; Jacob Andreas
Towards Semi-Automated Annotation for Prepositional Phrase Attachment

L10-1573  [bib]: Patrice Lopez; Laurent Romary
GRISP: A Massive Multilingual Terminological Database for Scientific and Technical Domains

L10-1574  [bib]: Rita Marinelli
Lexical Resources and Ontological Classifications for the Recognition of Proper Names Sense Extension

L10-1575  [bib]: Thierry Declerck; Piroska Lendvai
Towards a Standardized Linguistic Annotation of the Textual Content of Labels in Knowledge Representation Systems

L10-1576  [bib]: Yohei Murakami; Donghui Lin; Masahiro Tanaka; Takao Nakaguchi; Toru Ishida
Language Service Management with the Language Grid

L10-1577  [bib]: Kristina Vučković; Željko Agić; Marko Tadić
Improving Chunking Accuracy on Croatian Texts by Morphosyntactic Tagging

L10-1578  [bib]: David K. Elson; Kathleen R. McKeown
Building a Bank of Semantically Encoded Narratives

L10-1579  [bib]: Billy Tak-Ming Wong
Semantic Evaluation of Machine Translation

L10-1580  [bib]: Bento Carlos Dias-da-Silva; Ariani Di-Felippo
REBECA: Turning WordNet Databases into "Ontolexicons"

L10-1581  [bib]: Athanasios Karasimos; Evanthia Petropoulou
A Crash Test with Linguistica in Modern Greek: The Case of Derivational Affixes and Bound Stems

L10-1582  [bib]: Christian Federmann; Thierry Declerck
Extraction, Merging, and Monitoring of Company Data from Heterogeneous Sources

L10-1583  [bib]: Rui Wang; Yi Zhang
Hybrid Constituent and Dependency Parsing with Tsinghua Chinese Treebank

L10-1584  [bib]: Parisa Kordjamshidi; Martijn Van Otterlo; Marie-Francine Moens
Spatial Role Labeling: Task Definition and Annotation Scheme

L10-1585  [bib]: Irene Russo
Discovering Polarity for Ambiguous and Objective Adjectives through Adverbial Modification

L10-1586  [bib]: Kiril Simov; Petya Osenova
Constructing of an Ontology-based Lexicon for Bulgarian

L10-1587  [bib]: Meghan Lammie Glenn; Stephanie M. Strassel; Haejoong Lee; Kazuaki Maeda; Ramez Zakhary; Xuansong Li
Transcription Methods for Consistency, Volume and Efficiency

L10-1588  [bib]: Claudiu Mihăilă; Iustina Ilisei; Diana Inkpen
Romanian Zero Pronoun Distribution: A Comparative Study

L10-1589  [bib]: Renata Savy
Pr.A.Ti.D: A Coding Scheme for Pragmatic Annotation of Dialogues

L10-1590  [bib]: Prasanth Kolachina; Sudheer Kolachina; Anil Kumar Singh; Samar Husain; Viswanath Naidu; Rajeev Sangal; Akshar Bharati
Grammar Extraction from Treebanks for Hindi and Telugu

L10-1591  [bib]: Andrejs Vasiljevs; Kaspars Balodis
Corpus Based Analysis for Multilingual Terminology Entry Compounding

L10-1592  [bib]: Kazuaki Maeda; Haejoong Lee; Stephen Grimes; Jonathan Wright; Robert Parker; David Lee; Andrea Mazzucchi
Technical Infrastructure at Linguistic Data Consortium: Software and Hardware Resources for Linguistic Data Creation

L10-1593  [bib]: José M. García-Miguel; Gael Vaamonde; Fita González Domínguez
ADESSE, a Database with Syntactic and Semantic Annotation of a Corpus of Spanish

L10-1594  [bib]: David Guthrie; Mark Hepple; Wei Liu
Efficient Minimal Perfect Hash Language Models

L10-1595  [bib]: Stephanie Strassel; Dan Adams; Henry Goldberg; Jonathan Herr; Ron Keesing; Daniel Oblinger; Heather Simpson; Robert Schrag; Jonathan Wright
The DARPA Machine Reading Program - Encouraging Linguistic and Reasoning Research with a Series of Reading Tasks

L10-1596  [bib]: Heather Simpson; Stephanie Strassel; Robert Parker; Paul McNamee
Wikipedia and the Web of Confusable Entities: Experience from Entity Linking Query Creation for TAC 2009 Knowledge Base Population

L10-1597  [bib]: Damien Nouvel; Jean-Yves Antoine; Nathalie Friburger; Denis Maurel
An Analysis of the Performances of the CasEN Named Entities Recognition System in the Ester2 Evaluation Campaign

L10-1598  [bib]: Jiří Materna; Karel Pala
Using Ontologies for Semi-automatic Linking VerbaLex with FrameNet

L10-1599  [bib]: Thepchai Supnithi; Taneth Ruangrajitpakorn; Kanokorn Trakultaweekool; Peerachet Porkaew
AutoTagTCG: A Framework for Automatic Thai CG Tagging

L10-1600  [bib]: Helena Blancafort
Learning Morphology of Romance, Germanic and Slavic Languages with the Tool Linguistica

L10-1601  [bib]: Noureddine Loukil; Kais Haddar; Abdelmajid Benhamadou
A Syntactic Lexicon for Arabic Verbs

L10-1602  [bib]: Girish Nath Jha
The TDIL Program and the Indian Language Corpora Initiative (ILCI)

L10-1603  [bib]: Željko Agić; Nikola Ljubešić; Marko Tadić
Towards Sentiment Analysis of Financial Texts in Croatian

L10-1604  [bib]: Sylwia Ozdowska; Vincent Claveau
Inferring Syntactic Rules for Word Alignment through Inductive Logic Programming

L10-1605  [bib]: Agata Savary; Jakub Waszczuk; Adam Przepiórkowski
Towards the Annotation of Named Entities in the National Corpus of Polish

L10-1606  [bib]: Avaré Stewart; Kerstin Denecke; Wolfgang Nejdl
Cross-Corpus Textual Entailment for Sublanguage Analysis in Epidemic Intelligence

L10-1607  [bib]: Javier Couto; Helena Blancafort; Somara Seng; Nicolas Kuchmann-Beauger; Anass Talby; Claude de Loupy
OAL: A NLP Architecture to Improve the Development of Linguistic Resources for NLP

L10-1608  [bib]: Karel Pala; Christiane Fellbaum; Sonja Bosch
Lexical Resources for Noun Compounds in Czech, English and Zulu

L10-1609  [bib]: Dietrich Rebholz-Schuhmann; Antonio José Jimeno-Yepes; Erik M. van Mulligen; Ning Kang; Jan Kors; David Milward; Peter Corbett; Ekaterina Buyko; Katrin Tomanek; Elena Beisswanger; Udo Hahn
The CALBC Silver Standard Corpus for Biomedical Named Entities ― A Study in Harmonizing the Contributions from Four Independent Named Entity Taggers

L10-1610  [bib]: Gabor Melli
Concept Mentions within KDD-2009 Abstracts (kdd09cma1) Linked to a KDD Ontology (kddo1)

L10-1611  [bib]: Petra-Maria Strauß; Stefan Scherer; Georg Layher; Holger Hoffmann
Evaluation of the PIT Corpus Or What a Difference a Face Makes?

L10-1612  [bib]: Luka Nerima; Eric Wehrli; Violeta Seretan
A Recursive Treatment of Collocations

L10-1613  [bib]: Bonaventura Coppola; Alessandro Moschitti
A General Purpose FrameNet-based Shallow Semantic Parser

L10-1614  [bib]: Timo Sowa; Fiorenza Arisio; Luca Cristoforetti
DICIT: Evaluation of a Distant-talking Speech Interface for Television

L10-1615  [bib]: Arianne Reimerink; Pilar León Araúz; Pedro J. Magaña Redondo
EcoLexicon: An Environmental TKB

L10-1616  [bib]: José João Almeida; André Santos; Alberto Simões
Bigorna -- A Toolkit for Orthography Migration Challenges

L10-1617  [bib]: Jan Jona Javoršek; Tomaž Erjavec
Experimental Deployment of a Grid Virtual Organization for Human Language Technologies

L10-1618  [bib]: Eric Charton; Juan-Manuel Torres-Moreno
NLGbAse: A Free Linguistic Resource for Natural Language Processing Systems

L10-1619  [bib]: Wauter Bosma; Piek Vossen
Bootstrapping Language Neutral Term Extraction

L10-1620  [bib]: Jinho D. Choi; Claire Bonial; Martha Palmer
Propbank Instance Annotation Guidelines Using a Dedicated Editor, Jubilee

L10-1621  [bib]: Kumutha Swampillai; Mark Stevenson
Inter-sentential Relations in Information Extraction Corpora

L10-1622  [bib]: Peter Nabende
Applying a Dynamic Bayesian Network Framework to Transliteration Identification

L10-1623  [bib]: Alexandra Balahur; Ralf Steinberger; Mijail Kabadjov; Vanni Zavarella; Erik van der Goot; Matina Halkia; Bruno Pouliquen; Jenya Belyaeva
Sentiment Analysis in the News

L10-1624  [bib]: Daniel Sonntag; Bogdan Sacaleanu
Speech Grammars for Textual Entailment Patterns in Multimodal Question Answering

L10-1625  [bib]: Jan Strunk
Enriching a Treebank to Investigate Relative Clause Extraposition in German

L10-1626  [bib]: Claude de Loupy; Marie Guégan; Christelle Ayache; Somara Seng; Juan-Manuel Torres Moreno
A French Human Reference Corpus for Multi-Document Summarization and Sentence Compression

L10-1627  [bib]: Fei Xia; Carrie Lewis; William D. Lewis
The Problems of Language Identification within Hugely Multilingual Data Sets

L10-1628  [bib]: Rebecca J. Passonneau; Ansaf Salleb-Aoussi; Vikas Bhardwaj; Nancy Ide
Word Sense Annotation of Polysemous Words by Multiple Annotators

L10-1629  [bib]: Michael Gasser
Expanding the Lexicon for a Resource-Poor Language Using a Morphological Analyzer and a Web Crawler

L10-1630  [bib]: Susan Windisch Brown; Travis Rood; Martha Palmer
Number or Nuance: Which Factors Restrict Reliable Word Sense Annotation?

L10-1631  [bib]: Joshua B. Gordon; Rebecca J. Passonneau
An Evaluation Framework for Natural Language Understanding in Spoken Dialogue Systems

L10-1632  [bib]: Andrew Hickl; Arnold Jung; Ying Shi
Multilingual Question Generation

L10-1633  [bib]: René Witte; Ninus Khamis; Juergen Rilling
Flexible Ontology Population from Text: The OwlExporter

L10-1634  [bib]: Rashmi Prasad; Aravind Joshi; Bonnie Webber
Exploiting Scope for Shallow Discourse Parsing

L10-1635  [bib]: Pushpak Bhattacharyya
IndoWordNet

L10-1636  [bib]: Kirk Roberts; Srikanth Gullapalli; Cosmin Adrian Bejan; Sanda Harabagiu
A Linguistic Resource for Semantic Parsing of Motion Events

L10-1637  [bib]: Jennifer DeCamp
Language Technology Resource Center

L10-1638  [bib]: Manuela Sassi; Gabriella Pardelli; Stefania Biagioni; Carlo Carlesi; Sara Goggi
A Digital Archive of Research Papers in Computer Science

L10-1639  [bib]: Zygmunt Vetulani; Marek Kubis; Tomasz Obrębski
PolNet ― Polish WordNet: Data and Tools

L10-1640  [bib]: Victoria Arranz; Khalid Choukri
ELRA’s Services 15 Years on...Sharing and Anticipating the Community

L10-1641  [bib]: Youssef Aït Ouguengay; Aïcha Bouhjar
For Standardised Amazigh Linguistic Resources

L10-1642  [bib]: Christopher Cieri; Khalid Choukri; Nicoletta Calzolari; D. Terence Langendoen; Johannes Leveling; Martha Palmer; Nancy Ide; James Pustejovsky
A Road Map for Interoperable Language Resource Metadata

L10-1643  [bib]: Quan Nguyen; Michael Kipp
Annotation of Human Gesture using 3D Skeleton Controls

L10-1644  [bib]: Michal Gishri; Vered Silber-Varod; Ami Moyal
Lexicon Design for Transcription of Spontaneous Voice Messages

L10-1645  [bib]: Christopher Cieri; Mark Liberman
Adapting to Trends in Language Resource Development: A Progress Report on LDC Activities