ACL Anthology
A Digital Archive of Research Papers in Computational Linguistics

Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC-2014)

L14-1001  [bib]: Ben Verhoeven; Walter Daelemans
CLiPS Stylometry Investigation (CSI) corpus: A Dutch corpus for the detection of age, gender, personality, sentiment and deception in text

L14-1002  [bib]: Vasile Rus; Rajendra Banjade; Mihai Lintean
On Paraphrase Identification Corpora

L14-1003  [bib]: Wiltrud Kessler; Jonas Kuhn
A Corpus of Comparisons in Product Reviews

L14-1004  [bib]: Vincent Claveau; Ewa Kijak
Generating and using probabilistic morphological resources for the biomedical domain

L14-1005  [bib]: Kiril Simov; Iliana Simova; Ginka Ivanova; Maria Mateva; Petya Osenova
A System for Experiments with Dependency Parsers

L14-1006  [bib]: Thamar Solorio; Ragib Hasan; Mainul Mizan
Sockpuppet Detection in Wikipedia: A Corpus of Real-World Deceptive Writing for Linking Identities

L14-1007  [bib]: Andre Blessing; Jonas Kuhn
Textual Emigration Analysis (TEA)

L14-1008  [bib]: Xiaoyun Wang; Jinsong Zhang; Masafumi Nishida; Seiichi Yamamoto
Phoneme Set Design Using English Speech Database by Japanese for Dialogue-Based English CALL Systems

L14-1009  [bib]: Amel Fraisse; Patrick Paroubek
Toward a unifying model for Opinion, Sentiment and Emotion information extraction

L14-1010  [bib]: Thorsten Trippel; Daan Broeder; Matej Durco; Oddrun Ohren
Towards automatic quality assessment of component metadata

L14-1011  [bib]: Claire Bonial; Julia Bonn; Kathryn Conger; Jena D. Hwang; Martha Palmer
PropBank: Semantics of New Predicate Types

L14-1012  [bib]: Jonathan Wright
RESTful Annotation and Efficient Collaboration

L14-1013  [bib]: Hendrik Buschmeier; Zofia Malisz; Joanna Skubisz; Marcin Wlodarczak; Ipke Wachsmuth; Stefan Kopp; Petra Wagner
ALICO: a multimodal corpus for the study of active listening

L14-1014  [bib]: Łukasz Kobyliński
PoliTa: A multitagger for Polish

L14-1015  [bib]: Siddharth Jain; Archna Bhatia; Angelique Rein; Eduard Hovy
A Corpus of Participant Roles in Contentious Discussions

L14-1016  [bib]: John Richardson; Toshiaki Nakazawa; Sadao Kurohashi
Bilingual Dictionary Construction with Transliteration Filtering

L14-1017  [bib]: Vera Cabarrão; Helena Moniz; Fernando Batista; Ricardo Ribeiro; Nuno Mamede; Hugo Meinedo; Isabel Trancoso; Ana Isabel Mata; David Martins de Matos
Revising the annotation of a Broadcast News corpus: a linguistic approach

L14-1018  [bib]: Pyry Takala; Pekka Malo; Ankur Sinha; Oskar Ahlgren
Gold-standard for Topic-specific Sentiment Analysis of Economic Texts

L14-1019  [bib]: Damir Cavar; Malgorzata Cavar
Visualization of Language Relations and Families: MultiTree

L14-1020  [bib]: Goran Glavaš; Jan Šnajder; Marie-Francine Moens; Parisa Kordjamshidi
HiEve: A Corpus for Extracting Event Hierarchies from News Stories

L14-1021  [bib]: James Pustejovsky; Zachary Yocum
Image Annotation with ISO-Space: Distinguishing Content from Structure

L14-1022  [bib]: Olivier Galibert; Jeremy Leixa; Gilles Adda; Khalid Choukri; Guillaume Gravier
The ETAPE speech processing evaluation

L14-1023  [bib]: David Kamholz; Jonathan Pool; Susan Colowick
PanLex: Building a Resource for Panlingual Lexical Translation

L14-1024  [bib]: Dan Tufiș
Large SMT data-sets extracted from Wikipedia

L14-1025  [bib]: Valeria de Paiva; Livy Real; Alexandre Rademaker; Gerard de Melo
NomLex-PT: A Lexicon of Portuguese Nominalizations

L14-1026  [bib]: Elisabet Comelles; Jordi Atserias; Victoria Arranz; Irene Castellon; Jordi Sesé
VERTa: Facing a Multilingual Experience of a Linguistically-based MT Evaluation

L14-1027  [bib]: Yifan He; Adam Meyers
Corpus and Method for Identifying Citations in Non-Academic Text

L14-1028  [bib]: Vanessa Loza; Shibamouli Lahiri; Rada Mihalcea; Po-Hsiang Lai
Building a Dataset for Summarization and Keyword Extraction from Emails

L14-1029  [bib]: Jordan Schmidek; Denilson Barbosa
Improving Open Relation Extraction via Sentence Re-Structuring

L14-1030  [bib]: Juan Soler; Leo Wanner
How to Use less Features and Reach Better Performance in Author Gender Identification

L14-1031  [bib]: Michael Mohler; Marc Tomlinson; David Bracewell; Bryan Rink
Semi-supervised methods for expanding psycholinguistics norms by integrating distributional similarity with the structure of WordNet

L14-1032  [bib]: Dagmar Jung; Katarzyna Klessa; Zsuzsa Duray; Beatrix Oszkó; Mária Sipos; Sándor Szeverényi; Zsuzsa Várnai; Paul Trilsbeek; Tamás Váradi
Languagesindanger.eu - Including Multimedia Language Resources to disseminate Knowledge and Create Educational Material on less-Resourced Languages

L14-1033  [bib]: Bistra Andreeva; William Barry; Jacques Koreman
A Cross-language Corpus for Studying the Phonetics and Phonology of Prominence

L14-1034  [bib]: Benoît Sagot
DeLex, a freely-available, large-scale and linguistically grounded morphological lexicon for German

L14-1035  [bib]: Peter Baumann; Janet Pierrehumbert
Using Resource-Rich Languages to Improve Morphological Analysis of Under-Resourced Languages

L14-1036  [bib]: Clara Bacciu; Angelica Lo Duca; Andrea Marchetti; Maurizio Tesconi
Accommodations in Tuscany as Linked Data

L14-1037  [bib]: Przemyslaw Lenkiewicz; Olha Shkaravska; Twan Goosen; Daan Broeder; Menzo Windhouwer; Stephanie Roth; Olof Olsson
The DWAN framework: Application of a web annotation framework for the general humanities to the domain of language resources

L14-1038  [bib]: Arjan van Hessen; Franciska de Jong; Stef Scagliola; Tanja Petrovic
Croatian Memories

L14-1039  [bib]: Deryle Lonsdale; Carl Christensen
Combining elicited imitation and fluency features for oral proficiency measurement

L14-1040  [bib]: Pavel Smrz; Jan Kouril
Semantic Search in Documents Enriched by LOD-based Annotations

L14-1041  [bib]: Manuel Fiorelli; Maria Teresa Pazienza; Armando Stellato
A Meta-data Driven Platform for Semi-automatic Configuration of Ontology Mediators

L14-1042  [bib]: Huijing Deng; Grzegorz Chrupała
Semantic approaches to software component retrieval with English queries

L14-1043  [bib]: Georgios Petasis
The Ellogon Pattern Engine: Context-free Grammars over Annotations

L14-1044  [bib]: Friedel Wolff; Laurette Pretorius; Paul Buitelaar
Missed opportunities in translation memory matching

L14-1045  [bib]: Marie-Catherine de Marneffe; Timothy Dozat; Natalia Silveira; Katri Haverinen; Filip Ginter; Joakim Nivre; Christopher D. Manning
Universal Stanford dependencies: A cross-linguistic typology

L14-1046  [bib]: Reid Swanson; Stephanie Lukin; Luke Eisenberg; Thomas Corcoran; Marilyn Walker
Getting Reliable Annotations for Sarcasm in Online Dialogues

L14-1047  [bib]: Johannes Hellrich; Simon Clematide; Udo Hahn; Dietrich Rebholz-Schuhmann
Collaboratively Annotating Multilingual Parallel Corpora in the Biomedical Domain―some MANTRAs

L14-1048  [bib]: Heeyoung Lee; Mihai Surdeanu; Bill MacCartney; Dan Jurafsky
On the Importance of Text Analysis for Stock Price Prediction

L14-1049  [bib]: David Escudero; Lourdes Aguilar-Cuevas; César González-Ferreras; Yurena Gutiérrez-González; Valentín Cardeñoso-Payo
On the use of a fuzzy classifier to speed up the Sp ToBI labeling of the Glissando Spanish corpus

L14-1050  [bib]: Antonio San Martín; Marie-Claude L'Homme
Definition patterns for predicative terms in specialized lexical resources

L14-1051  [bib]: Xiao Jiang; Yufan Guo; Jeroen Geertzen; Dora Alexopoulou; Lin Sun; Anna Korhonen
Native Language Identification Using Large, Longitudinal Data

L14-1052  [bib]: Juan Luo; Yves Lepage
Production of Phrase Tables in 11 European Languages using an Improved Sub-sentential Aligner

L14-1053  [bib]: Anne Garcia-Fernandez; Anne-Laure Ligozat; Anne Vilnat
Construction and Annotation of a French Folkstale Corpus

L14-1054  [bib]: Yuri Bizzoni; Federico Boschetti; Harry Diakoff; Riccardo Del Gratta; Monica Monachini; Gregory Crane
The Making of Ancient Greek WordNet

L14-1055  [bib]: Fei Xia; William Lewis; Michael Wayne Goodman; Joshua Crowgey; Emily M. Bender
Enriching ODIN

L14-1056  [bib]: Ozlem Cetinoglu
Turkish Treebank as a Gold Standard for Morphological Disambiguation and Its Influence on Parsing

L14-1057  [bib]: Krešimir Šojat; Matea Srebačić; Marko Tadić; Tin Pavelić
CroDeriV: a new resource for processing Croatian morphology

L14-1058  [bib]: Masaya Yamaguchi
Building a Database of Japanese Adjective Examples from Special Purpose Web Corpora

L14-1059  [bib]: George Christodoulides
Praaline: Integrating Tools for Speech Corpus Research

L14-1060  [bib]: Dana Dannells; Normunds Gruzitis
Extracting a bilingual semantic grammar from FrameNet-annotated corpora

L14-1061  [bib]: Fabienne Braune; Daniel Bauer; Kevin Knight
Mapping Between English Strings and Reentrant Semantic Graphs

L14-1062  [bib]: Ines Rehbein; Sören Schalowski; Heike Wiese
The KiezDeutsch Korpus (KiDKo) Release 1.0

L14-1063  [bib]: Gerard de Melo
Etymological Wordnet: Tracing The History of Words

L14-1064  [bib]: Victoria Rosén; Petter Haugereid; Martha Thunes; Gyri S. Losnegaard; Helge Dyvik
The Interplay Between Lexical and Syntactic Resources in Incremental Parsebanking

L14-1065  [bib]: Rafal Rak; Jacob Carter; Andrew Rowley; Riza Theresa Batista-Navarro; Sophia Ananiadou
Interoperability and Customisation of Annotation Schemata in Argo

L14-1066  [bib]: Maciej Ogrodniczuk; Mateusz Kopeć; Agata Savary
Polish Coreference Corpus in Numbers

L14-1067  [bib]: Natalia Silveira; Timothy Dozat; Marie-Catherine de Marneffe; Samuel Bowman; Miriam Connor; John Bauer; Chris Manning
A Gold Standard Dependency Corpus for English

L14-1068  [bib]: Jan Šnajder
DerivBase.hr: A High-Coverage Derivational Morphology Resource for Croatian

L14-1069  [bib]: Simon Scerri; Behrang Q. Zadeh; Maciej Dabrowski; Ismael Rivera
Extracting Information for Context-aware Meeting Preparation

L14-1070  [bib]: Kai Hong; John Conroy; Benoit Favre; Alex Kulesza; Hui Lin; Ani Nenkova
A Repository of State of the Art and Competitive Baseline Summaries for Generic News Summarization

L14-1071  [bib]: Zhiyi Song; Stephanie Strassel; Haejoong Lee; Kevin Walker; Jonathan Wright; Jennifer Garland; Dana Fore; Brian Gainor; Preston Cabe; Thomas Thomas; Brendan Callahan; Ann Sawyer
Collecting Natural SMS and Chat Conversations in Multiple Languages: The BOLT Phase 2 Corpus

L14-1072  [bib]: Bruno Laranjeira; Viviane Moreira; Aline Villavicencio; Carlos Ramisch; Maria José Finatto
Comparing the Quality of Focused Crawlers and of the Translation Resources Obtained from them

L14-1073  [bib]: Yulia Tsvetkov; Nathan Schneider; Dirk Hovy; Archna Bhatia; Manaal Faruqui; Chris Dyer
Augmenting English Adjective Senses with Supersenses

L14-1074  [bib]: Christian Buck; Kenneth Heafield; Bas van Ooyen
N-gram Counts and Language Models from the Common Crawl

L14-1075  [bib]: Tim vor der Brück; Alexander Mehler; Zahurul Islam
ColLex.en: Automatically Generating and Evaluating a Full-form Lexicon for English

L14-1076  [bib]: Tamara Polajnar; Laura Rimell; Stephen Clark
Evaluation of Simple Distributional Compositional Operations on Longer Texts

L14-1077  [bib]: Horacio Saggion
Creating Summarization Systems with SUMMA

L14-1078  [bib]: Antske Fokkens; Serge Ter Braake; Niels Ockeloen; Piek Vossen; Susan Legêne; Guus Schreiber
BiographyNet: Methodological Issues when NLP supports historical research

L14-1079  [bib]: Anthony Rousseau; Paul Deléglise; Yannick Estève
Enhancing the TED-LIUM Corpus with Selected Data for Language Modeling and More TED Talks

L14-1080  [bib]: Ajay Dubey; Parth Gupta; Vasudeva Varma; Paolo Rosso
Enrichment of Bilingual Dictionary through News Stream Data

L14-1081  [bib]: Darja Fišer; Aleš Tavčar; Tomaž Erjavec
sloWCrowd: A crowdsourcing tool for lexicographic tasks

L14-1082  [bib]: Tomohide Shibata; Shotaro Kohama; Sadao Kurohashi
A Large Scale Database of Strongly-related Events in Japanese

L14-1083  [bib]: Thomas Francois; Nùria Gala; Patrick Watrin; Cédrick Fairon
FLELex: a graded lexical resource for French foreign learners

L14-1084  [bib]: Lucas Hilgert; Lucelene Lopes; Artur Freitas; Renata Vieira; Denise Hogetop; Aline Vanin
Building Domain Specific Bilingual Dictionaries

L14-1085  [bib]: Guillaume Wisniewski; Natalie Kübler; François Yvon
A Corpus of Machine Translation Errors Extracted from Translation Students Exercises

L14-1086  [bib]: Clare Voss; Stephen Tratz; Jamal Laoudi; Douglas Briesch
Finding Romanized Arabic Dialect in Code-Mixed Tweets

L14-1087  [bib]: Nicolas Auguin; Pascale Fung
Co-Training for Classification of Live or Studio Music Recordings

L14-1088  [bib]: Marc Tomlinson; David Bracewell; Wayne Krug; David Hinote
#mygoal: Finding Motivations on Twitter

L14-1089  [bib]: David Graff; Kevin Walker; Stephanie Strassel; Xiaoyi Ma; Karen Jones; Ann Sawyer
The RATS Collection: Supporting HLT Research with Degraded Audio Data

L14-1090  [bib]: Chris Hokamp; Rada Mihalcea; Peter Schuelke
Modeling Language Proficiency Using Implicit Feedback

L14-1091  [bib]: Kevin Reschke; Martin Jankowiak; Mihai Surdeanu; Christopher Manning; Daniel Jurafsky
Event Extraction Using Distant Supervision

L14-1092  [bib]: Stefan Ultes; Hüseyin Dikme; Wolfgang Minker
First Insight into Quality-Adaptive Dialogue

L14-1093  [bib]: Antonio Toral
TLAXCALA: a multilingual corpus of independent news

L14-1094  [bib]: Sander Wubben; Antal van den Bosch; Emiel Krahmer
Creating and using large monolingual parallel corpora for sentential paraphrase generation

L14-1095  [bib]: Jayendra Rakesh Yeka; Prasanth Kolachina; Dipti Misra Sharma
Benchmarking of English-Hindi parallel corpora

L14-1096  [bib]: Mark Dilsizian; Polina Yanovich; Shu Wang; Carol Neidle; Dimitris Metaxas
A New Framework for Sign Language Recognition based on 3D Handshape Identification and Linguistic Modeling

L14-1097  [bib]: Karteek Addanki; Dekai Wu
Evaluating Improvised Hip Hop Lyrics - Challenges and Observations

L14-1098  [bib]: Nathan Green; Septina Dian Larasati
Votter Corpus: A Corpus of Social Polling Language

L14-1099  [bib]: Chen Chen; Vincent Ng
SinoCoreferencer: An End-to-End Chinese Event Coreference Resolver

L14-1100  [bib]: Mohamed Maamouri; Ann Bies; Seth Kulick; Michael Ciul; Nizar Habash; Ramy Eskander
Developing an Egyptian Arabic Treebank: Impact of Dialectal Morphology on Annotation and Tool Development

L14-1101  [bib]: Tatjana Scheffler
A German Twitter Snapshot

L14-1102  [bib]: Helen Hastie; Anja Belz
A Comparative Evaluation Methodology for NLG in Interactive Systems

L14-1103  [bib]: Kyoko Ohara
Relating Frames and Constructions in Japanese FrameNet

L14-1104  [bib]: Hans-Ulrich Krieger; Thierry Declerck
TMO ― The Federated Ontology of the TrendMiner Project

L14-1105  [bib]: Gemma Bel Enguix; Reinhard Rapp; Michael Zock
A Graph-Based Approach for Computing Free Word Associations

L14-1106  [bib]: Roald Eiselen; Martin Puttkammer
Developing Text Resources for Ten South African Languages

L14-1107  [bib]: Paul Felt; Robbie Haertel; Eric Ringger; Kevin Seppi
Momresp: A Bayesian Model for Multi-Annotator Document Labeling

L14-1108  [bib]: Jessica Ouyang; Kathy McKeown
Towards Automatic Detection of Narrative Structure

L14-1109  [bib]: Anabela Barreiro; Fernando Batista; Ricardo Ribeiro; Helena Moniz; Isabel Trancoso
OpenLogos Semantico-Syntactic Knowledge-Rich Bilingual Dictionaries

L14-1110  [bib]: Tilia Ellendorff; Fabio Rinaldi; Simon Clematide
Using Large Biomedical Databases as Gold Annotations for Automatic Relation Extraction

L14-1111  [bib]: Rachele Sprugnoli; Alessandro Lenci
Crowdsourcing for the identification of event nominals: an experiment

L14-1112  [bib]: Jianqiang Ma
Automatic Refinement of Syntactic Categories in Chinese Word Structures

L14-1113  [bib]: Ann Bies; Justin Mott; Seth Kulick; Jennifer Garland; Colin Warner
Incorporating Alternate Translations into English Translation Treebank

L14-1114  [bib]: Rico Sennrich; Beat Kunz
Zmorge: A German Morphological Lexicon Extracted from Wiktionary

L14-1115  [bib]: Mona Diab; Mohamed AlBadrashiny; Maryam Aminian; Mohammed Attia; Heba Elfardy; Nizar Habash; Abdelati Hawwari; Wael Salloum; Pradeep Dasigi; Ramy Eskander
Tharwa: A Large Scale Dialectal Arabic - Standard Arabic - English Lexicon

L14-1116  [bib]: Thierry Declerck; Karlheinz Mörth; Eveline Wandl-Vogt
A SKOS-based Schema for TEI encoded Dictionaries at ICLTT

L14-1117  [bib]: Milen Kouylekov; Stephan Oepen
Semantic Technologies for Querying Linguistic Annotations: An Experiment Focusing on Graph-Structured Data

L14-1118  [bib]: Heather Pon-Barry; Stuart Shieber; Nicholas Longenbaugh
Eliciting and Annotating Uncertainty in Spoken Language

L14-1119  [bib]: Martin Gleize; Brigitte Grau
A hierarchical taxonomy for classifying hardness of inference tasks

L14-1120  [bib]: Michael Rosner; Kurt Sultana
Automatic Methods for the Extension of a Bilingual Dictionary using Comparable Corpora

L14-1121  [bib]: Yutaka Mitsuishi; Vit Novacek; Pierre-Yves Vandenbussche
A Method for Building Burst-Annotated Co-Occurrence Networks for Analysing Trends in Textual Data

L14-1122  [bib]: José Pedro Ferreira; Cristiano Chesi; Daan Baldewijns; Fernando Miguel Pinto; Margarita Correia; Daniela Braga; Hyongsil Cho; Amadeu Ferreira; Miguel Dias
Casa de la Lhéngua: a set of language resources and natural language processing tools for Mirandese

L14-1123  [bib]: Jan Strunk; Florian Schiel; Frank Seifart
Untrained Forced Alignment of Transcriptions and Audio for Language Documentation Corpora using WebMAUS

L14-1124  [bib]: Lars Hellan; Dorothee Beermann; Tore Bruland; Mary Esther Kropp Dakubu; Montserrat Marimon
MultiVal - towards a multilingual valence lexicon

L14-1125  [bib]: Michel Vacher; Benjamin Lecouteux; Pedro Chahuara; François Portet; Brigitte Meillon; Nicolas Bonnefond
The Sweet-Home speech and multimodal corpus for home automation interaction

L14-1126  [bib]: David Lewis; Rob Brennan; Leroy Finn; Dominic Jones; Alan Meehan; Declan O'Sullivan; Sebastian Hellmann; Felix Sasaki
Global Intelligent Content: Active Curation of Language Resources using Linked Data

L14-1127  [bib]: Liviu Dinu; Alina Maria Ciobanu
On the Romance Languages Mutual Intelligibility

L14-1128  [bib]: Anca Dinu; Liviu Dinu; Ionut Sorodoc
Aggregation methods for efficient collocation detection

L14-1129  [bib]: Jennifer D'Souza; Vincent Ng
Annotating Inter-Sentence Temporal Relations in Clinical Notes

L14-1130  [bib]: Juris Borzovs; Ilze Ilziņa; Iveta Keiša; Mārcis Pinnis; Andrejs Vasiļjevs
Terminology localization guidelines for the national scenario

L14-1131  [bib]: Claire Brierley; Majdi Sawalha; Eric Atwell
Tools for Arabic Natural Language Processing: a case study in qalqalah prosody

L14-1132  [bib]: Ana Isabel Mata; Helena Moniz; Fernando Batista; Julia Hirschberg
Teenage and adult speech in school context: building and processing a corpus of European Portuguese

L14-1133  [bib]: Archna Bhatia; Mandy Simons; Lori Levin; Yulia Tsvetkov; Chris Dyer; Jordan Bender
A Unified Annotation Scheme for the Semantic/Pragmatic Components of Definiteness

L14-1134  [bib]: Michaela Regneri; Rui Wang; Manfred Pinkal
Aligning Predicate-Argument Structures for Paraphrase Fragment Extraction

L14-1135  [bib]: Sandra Antunes; Amália Mendes
An evaluation of the role of statistical measures and frequency for MWE identification

L14-1136  [bib]: Chi-kiu Lo; Dekai Wu
On the reliability and inter-annotator agreement of human semantic MT evaluation via HMEANT

L14-1137  [bib]: Shikun Zhang; Wang Ling; Chris Dyer
Dual Subtitles as Parallel Corpora

L14-1138  [bib]: Peter Exner; Pierre Nugues
REFRACTIVE: An Open Source Tool to Extract Knowledge from Syntactic and Semantic Relations

L14-1139  [bib]: Guiyao Ke; Pierre-Francois Marteau; Gildas Menier
Variations on quantitative comparability measures and their evaluations on synthetic French-English comparable corpora

L14-1140  [bib]: Liviu Dinu; Alina Maria Ciobanu; Ioana Chitoran; Vlad Niculae
Using a machine learning model to assess the complexity of stress systems

L14-1141  [bib]: Ana Isabel Mata; Helena Moniz; Telmo Móia; Anabela Gonçalves; Fátima Silva; Fernando Batista; Inês Duarte; Fátima Oliveira; Isabel Falé
Prosodic, syntactic, semantic guidelines for topic structures across domains and corpora

L14-1142  [bib]: Kevin Black; Eric Ringger; Paul Felt; Kevin Seppi; Kristian Heal; Deryle Lonsdale
Evaluating Lemmatization Models for Machine-Assisted Corpus-Dictionary Linkage

L14-1143  [bib]: Jonathan Washington; Ilnar Salimzyanov; Francis Tyers
Finite-state morphological transducers for three Kypchak languages

L14-1144  [bib]: Antoni Oliver; Salvador Climent
Automatic creation of WordNets from parallel corpora

L14-1145  [bib]: Maciej Ogrodniczuk; Mateusz Kopeć
The Polish Summaries Corpus

L14-1146  [bib]: Tanja Schultz; Tim Schlippe
GlobalPhone: Pronunciation Dictionaries in 20 Languages

L14-1147  [bib]: Alexandru Ceausu; Sabine Hunsicker
Pre-ordering of phrase-based machine translation input in translation workflow

L14-1148  [bib]: Emanuele Di Buccio; Giorgio Maria Di Nunzio; Gianmaria Silvello
A Vector Space Model for Syntactic Distances Between Dialects

L14-1149  [bib]: Christian Curtis
A finite-state morphological analyzer for a Lakota HPSG grammar

L14-1150  [bib]: Jennifer Drexler; Pushpendre Rastogi; Jacqueline Aguilar; Benjamin Van Durme; Matt Post
A Wikipedia-based Corpus for Contextualized Machine Translation

L14-1151  [bib]: Spandana Gella; Carlo Strapparava; Vivi Nastase
Mapping WordNet Domains, WordNet Topics and Wikipedia Categories to Generate Multilingual Domain Specific Resources

L14-1152  [bib]: Sophia Lee; Shoushan Li; Chu-Ren Huang
Annotating Events in an Emotion Corpus

L14-1153  [bib]: Shyam Sundar Agrawal; Abhimanue; Shweta Bansal; Minakshi Mahajan
Statistical Analysis of Multilingual Text Corpus and Development of Language Models

L14-1154  [bib]: Christopher Cieri; Denise DiPersio; Mark Liberman; Andrea Mazzucchi; Stephanie Strassel; Jonathan Wright
New Directions for Language Resource Development and Distribution

L14-1155 : Joseph Mariani; Patrick Paroubek; Gil Francopoulo; Olivier Hamon
Rediscovering 15 Years of Discoveries in Language Resources and Evaluation: The LREC Anthology Analysis

L14-1156  [bib]: Kirk Roberts; Kate Masterton; Marcelo Fiszman; Halil Kilicoglu; Dina Demner-Fushman
Annotating Question Decomposition on Complex Medical Questions

L14-1157  [bib]: Clarissa Xavier; Vera Lima
Boosting Open Information Extraction with Noun-Based Relations

L14-1158  [bib]: Krasimir Angelov
Bootstrapping Open-Source English-Bulgarian Computational Dictionary

L14-1159  [bib]: Mathieu Mangeot
MotàMot project: conversion of a French-Khmer published dictionary for building a multilingual lexical system

L14-1160  [bib]: Sérgio Curto; Ana C. Mendes; Pedro Curto; Luísa Coheur; Angela Costa
JUST.ASK, a QA system that learns to answer new questions from previous interactions

L14-1161  [bib]: Manjira Sinha; Tirthankar Dasgupta; Anupam Basu
Design and Development of an Online Computational Framework to Facilitate Language Comprehension Research on Indian Languages

L14-1162  [bib]: Mirjam Ernestus; Lucie Kočková-Amortová; Petr Pollak
The Nijmegen Corpus of Casual Czech

L14-1163  [bib]: Yan Song; Fei Xia
Modern Chinese Helps Archaic Chinese Processing: Finding and Exploiting the Shared Properties

L14-1164  [bib]: Włodzimierz Gruszczyński; Maciej Ogrodniczuk
Digital Library 2.0: Source of Knowledge and Research Collaboration Platform

L14-1165  [bib]: Kristiina Jokinen
Open-domain Interaction and Online Content in the Sami Language

L14-1166  [bib]: Akira Utsumi
A Character-based Approach to Distributional Semantic Models: Exploiting Kanji Characters for Constructing Japanese Word Vectors

L14-1167  [bib]: Dasha Bogdanova; Angeliki Lazaridou
Cross-Language Authorship Attribution

L14-1168  [bib]: Paul Felt; Eric Ringger; Kevin Seppi; Kristian Heal
Using Transfer Learning to Assist Exploratory Corpus Annotation

L14-1169  [bib]: Judith Muzerelle; Anaïs Lefeuvre; Emmanuel Schang; Jean-Yves Antoine; Aurore Pelletier; Denis Maurel; Iris Eshkol; Jeanne Villaneau
ANCOR_Centre, a large free spoken French coreference corpus: description of the resource and reliability measures

L14-1170  [bib]: Alex Rudnick; Taylor Skidmore; Alberto Samaniego; Michael Gasser
Guampa: a Toolkit for Collaborative Translation

L14-1171  [bib]: Daan Broeder; Ineke Schuurman; Menzo Windhouwer
Experiences with the ISOcat Data Category Registry

L14-1172  [bib]: Menzo Windhouwer; Justin Petro; Shakila Shayan
RELISH LMF: Unlocking the Full Power of the Lexical Markup Framework

L14-1173  [bib]: Ryu Iida; Takenobu Tokunaga
Building a Corpus of Manually Revised Texts from Discourse Perspective

L14-1174  [bib]: Matej Durco; Menzo Windhouwer
The CMD Cloud

L14-1175  [bib]: Lars Borin; Anju Saxena; Taraka Rama; Bernard Comrie
Linguistic landscaping of South Asia using digital language resources: Genetic vs. areal linguistics

L14-1176  [bib]: Anabela Barreiro; Johanna Monti; Brigitte Orliac; Susanne Preuß; Kutz Arrieta; Wang Ling; Fernando Batista; Isabel Trancoso
Linguistic Evaluation of Support Verb Constructions by OpenLogos and Google Translate

L14-1177  [bib]: Michael Kipp; Levin Freiherr von Hollen; Michael Christopher Hrstka; Franziska Zamponi
Single-Person and Multi-Party 3D Visualizations for Nonverbal Communication Analysis

L14-1178  [bib]: Hiroaki Shimizu; Graham Neubig; Sakriani Sakti; Tomoki Toda; Satoshi Nakamura
Collection of a Simultaneous Translation Corpus for Comparative Analysis

L14-1179  [bib]: Huseyin Cakmak; Jerome Urbain; Thierry Dutoit; Joelle Tilmanne
The AV-LASYN Database: A synchronous corpus of audio and 3D facial marker data for audio-visual laughter synthesis

L14-1180  [bib]: Volha Petukhova; Andrei Malchanau; Harry Bunt
Interoperability of Dialogue Corpora through ISO 24617-2-based Querying

L14-1181  [bib]: Catia Cucchiarini; Steve Bodnar; Bart Penning de Vries; Roeland Van Hout; Helmer Strik
ASR-based CALL systems and learner speech data: new resources and opportunities for research and development in second language learning

L14-1182  [bib]: Volha Petukhova; Martin Gropp; Dietrich Klakow; Gregor Eigner; Mario Topf; Stefan Srb; Petr Motlicek; Blaise Potard; John Dines; Olivier Deroo; Ronny Egeler; Uwe Meinz; Steffen Liersch; Anna Schmidt
The DBOX Corpus Collection of Spoken Human-Human and Human-Machine Dialogues

L14-1183  [bib]: Thomas Schmidt
The Database for Spoken German ― DGD2

L14-1184  [bib]: Liviu Dinu; Alina Maria Ciobanu
Building a Dataset of Multilingual Cognates for the Romanian Lexicon

L14-1185  [bib]: Giuseppe Rizzo; Marieke van Erp; Raphaël Troncy
Benchmarking the Extraction and Disambiguation of Named Entities on the Semantic Web

L14-1186  [bib]: Ting Liu; Kit Cho; G. Aaron Broadwell; Samira Shaikh; Tomek Strzalkowski; John Lien; Sarah Taylor; Laurie Feldman; Boris Yamrom; Nick Webb; Umit Boz; Ignacio Cases; Ching-Sheng Lin
Automatic Expansion of the MRC Psycholinguistic Database Imageability Ratings

L14-1187  [bib]: Riyaz Ahmad Bhat; Shahid Mushtaq Bhat; Dipti Misra Sharma
Towards building a Kashmiri Treebank: Setting up the Annotation Pipeline

L14-1188  [bib]: Olga Uryupina; Barbara Plank; Aliaksei Severyn; Agata Rotondi; Alessandro Moschitti
SenTube: A Corpus for Sentiment Analysis on YouTube Social Media

L14-1189  [bib]: Carlos Daniel Hernandez Mena; Abel Herrera Camacho
CIEMPIESS: A New Open-Sourced Mexican Spanish Radio Corpus

L14-1190  [bib]: Arantza del Pozo; Carlo Aliprandi; Aitor Álvarez; Carlos Mendes; Joao P. Neto; Sérgio Paulo; Nicola Piccinini; Matteo Raffaelli
SAVAS: Collecting, Annotating and Sharing Audiovisual Language Resources for Automatic Subtitling

L14-1191  [bib]: Peter Fankhauser; Jörg Knappen; Elke Teich
Exploring and Visualizing Variation in Language Resources

L14-1192  [bib]: Kareem Darwish; Wei Gao
Simple Effective Microblog Named Entity Recognition: Arabic as an Example

L14-1193  [bib]: Miguel B. Almeida; Mariana S. C. Almeida; André F. T. Martins; Helena Figueira; Pedro Mendes; Cláudia Pinto
Priberam Compressive Summarization Corpus: A New Multi-Document Summarization Corpus for European Portuguese

L14-1194  [bib]: Chantal van Son; Marieke van Erp; Antske Fokkens; Piek Vossen
Hope and Fear: How Opinions Influence Factuality

L14-1195  [bib]: Vincent Vandeghinste; Ineke Schuurman
Linking Pictographs to Synsets: Sclera2Cornetto

L14-1196  [bib]: Mohamed Morchid; Georges Linares; Richard Dufour
Characterizing and Predicting Bursty Events: The Buzz Case Study on Twitter

L14-1197  [bib]: Hans-Ulrich Krieger; Christian Spurk; Hans Uszkoreit; Feiyu Xu; Yi Zhang; Frank Müller; Thomas Tolxdorff
Information Extraction from German Patient Records via Hybrid Parsing and Relation Extraction Strategies

L14-1198  [bib]: Dietmar Schabus; Michael Pucher; Phil Hoole
The MMASCS multi-modal annotated synchronous corpus of audio, video, facial motion and tongue motion data of normal, fast and slow speech

L14-1199  [bib]: Lucie Poláková; Pavlína Jínová; Jiří Mírovský
Genres in the Prague Discourse Treebank

L14-1200  [bib]: Joris Pelemans; Kris Demuynck; Hugo Van hamme; Patrick Wambacq
Speech Recognition Web Services for Dutch

L14-1201  [bib]: Angela Costa; Tiago Luís; Luísa Coheur
Translation errors from English to Portuguese: an annotated corpus

L14-1202  [bib]: Fadoua Ataa Allah; Siham Boulaknadel
Amazigh Verb Conjugator

L14-1203  [bib]: Pawel Kamocki
The liability of service providers in e-Research Infrastructures: killing the messenger?

L14-1204  [bib]: Quentin Pradet; Laurence Danlos; Gaël de Chalendar
Adapting VerbNet to French using existing resources

L14-1205  [bib]: Sharid Loaiciga; Thomas Meyer; Andrei Popescu-Belis
English-French Verb Phrase Alignment in Europarl for Tense Translation Modeling

L14-1206  [bib]: Nelleke Oostdijk; Henk van den Heuvel
The evolving infrastructure for language resources and the role for data scientists

L14-1207  [bib]: Attila Novák
A New Form of Humor ― Mapping Constraint-Based Computational Morphologies to a Finite-State Representation

L14-1208  [bib]: Matti Karppa; Ville Viitaniemi; Marcos Luzardo; Jorma Laaksonen; Tommi Jantunen
SLMotion - An extensible sign language oriented video analysis tool

L14-1209  [bib]: Chenhui Chu; Toshiaki Nakazawa; Sadao Kurohashi
Constructing a Chinese―Japanese Parallel Corpus from Wikipedia

L14-1210  [bib]: Michael Carl; Mercedes Martínez García; Bartolomé Mesa-Lao
CFT13: A resource for research into the post-editing process

L14-1211  [bib]: Ludger Zeevaert
Mörkum Njálu. An annotated corpus to analyse and explain grammatical divergences between 14th-century manuscripts of Njál's saga.

L14-1212  [bib]: Jean-Philippe Goldman; Adrian Leemann; Marie-José Kolly; Ingrid Hove; Ibrahim Almajai; Volker Dellwo; Steven Moran
A Crowdsourcing Smartphone Application for Swiss German: Putting Language Documentation in the Hands of the Users

L14-1213  [bib]: Steven Bethard; Philip Ogren; Lee Becker
ClearTK 2.0: Design Patterns for Machine Learning in UIMA

L14-1214  [bib]: Inès Zribi; Rahma Boujelbane; Abir Masmoudi; Mariem Ellouze; Lamia Belguith; Nizar Habash
A Conventional Orthography for Tunisian Arabic

L14-1215  [bib]: Thomas Mayer; Michael Cysouw
Creating a massively parallel Bible corpus

L14-1216  [bib]: Reinhard Rapp
Corpus-Based Computation of Reverse Associations

L14-1217  [bib]: Palmira Marrafa; Raquel Amaro; Sara Mendes
LexTec ― a rich language resource for technical domains in Portuguese

L14-1218  [bib]: Blanca Arias; Nuria Bel; Mercè Lorente; Montserrat Marimón; Alba Milà; Jorge Vivaldi; Muntsa Padró; Marina Fomicheva; Imanol Larrea
Boosting the creation of a treebank

L14-1219  [bib]: Eric Sanders; Ineke van de Craats; Vanja de Lint
The Dutch LESLLA Corpus

L14-1220  [bib]: Tomáš Jelínek
Improvements to Dependency Parsing Using Automatic Simplification of Data

L14-1221  [bib]: Claudiu Mihăilă; Sophia Ananiadou
The Meta-knowledge of Causality in Biomedical Scientific Discourse

L14-1222  [bib]: Wolfgang Maier; Miriam Kaeshammer; Peter Baumann; Sandra Kübler
Discosuite - A parser test suite for German discontinuous structures

L14-1223  [bib]: Francesco Barbieri; Horacio Saggion
Modelling Irony in Twitter: Feature Analysis and Evaluation

L14-1224  [bib]: Gregor Titze; Volha Bryl; Cäcilia Zirn; Simone Paolo Ponzetto
DBpedia Domains: augmenting DBpedia with domain information

L14-1225  [bib]: Mathieu Chollet; Magalie Ochs; Catherine Pelachaud
Mining a multimodal corpus for non-verbal behavior sequences conveying attitudes

L14-1226  [bib]: Cyril Grouin
Biomedical entity extraction using machine-learning based approaches

L14-1227  [bib]: Achim Stein
Parsing Heterogeneous Corpora with a Rich Dependency Grammar

L14-1228  [bib]: Eckhard Bick
ML-Optimization of Ported Constraint Grammars

L14-1229  [bib]: Samira Shaikh; Tomek Strzalkowski; Ting Liu; George Aaron Broadwell; Boris Yamrom; Sarah Taylor; Laurie Feldman; Kit Cho; Umit Boz; Ignacio Cases; Yuliya Peshkova; Ching-Sheng Lin
A Multi-Cultural Repository of Automatically Discovered Linguistic and Conceptual Metaphors

L14-1230  [bib]: Haritz Salaberri; Olatz Arregi; Beñat Zapirain
First approach toward Semantic Role Labeling for Basque

L14-1231  [bib]: Lianet Sepúlveda Torres; Magali Sanches Duran; Sandra Aluísio
Generating a Lexicon of Errors in Portuguese to Support an Error Identification System for Spanish Native Learners

L14-1232  [bib]: Lei Zhang; Michael Färber; Achim Rettinger
xLiD-Lexica: Cross-lingual Linked Data Lexica

L14-1233  [bib]: Yuan Luo; Thomas Boucher; Tolga Oral; David Osofsky; Sara Weber
A Study on Expert Sourcing Enterprise Question Collection and Classification

L14-1234  [bib]: Hong Li; Sebastian Krause; Feiyu Xu; Hans Uszkoreit; Robert Hummel; Veselina Mironova
Annotating Relation Mentions in Tabloid Press

L14-1235  [bib]: Chetana Gavankar; Ashish Kulkarni; Ganesh Ramakrishnan
Efficient Reuse of Structured and Unstructured Resources for Ontology Population

L14-1236  [bib]: Marie Kopřivová; Hana Goláňová; Petra Klimešová; David Lukeš
Mapping Diatopic and Diachronic Variation in Spoken Czech: The ORTOFON and DIALEKT Corpora

L14-1237  [bib]: Patrick Schone; Heath Nielson; Mark Ward
Corpus and Evaluation of Handwriting Recognition of Historical Genealogical Records

L14-1238  [bib]: Ildikó Pilán; Elena Volodina
Reusing Swedish FrameNet for training semantic roles

L14-1239  [bib]: Kay Berkling; Johanna Fay; Masood Ghayoomi; Katrin Hein; Rémi Lavalley; Ludwig Linhuber; Sebastian Stüker
A Database of Freely Written Texts of German School Students for the Purpose of Automatic Spelling Error Classification

L14-1240  [bib]: Christian Haenig; Andreas Niekler; Carsten Wuensch
PACE Corpus: a multilingual corpus of Polarity-annotated textual data from the domains Automotive and CEllphone

L14-1241  [bib]: Veronika Vincze; Viktor Varga; Katalin Ilona Simkó; János Zsibrita; Ágoston Nagy; Richárd Farkas; János Csirik
Szeged Corpus 2.5: Morphological Modifications in a Manually POS-tagged Hungarian Corpus

L14-1242  [bib]: Pierre André Ménard; Caroline Barriere
Linked Open Data and Web Corpus Data for noun compound bracketing

L14-1243  [bib]: João Freitas; António Teixeira; Miguel Dias
Multimodal Corpora for Silent Speech Interaction

L14-1244  [bib]: Tomoko Izumi; Tomohide Shibata; Hisako Asano; Yoshihiro Matsuo; Sadao Kurohashi
Constructing a Corpus of Japanese Predicate Phrases for Synonym/Antonym Relations

L14-1245  [bib]: Evelina Rennes; Arne Jonsson
The Impact of Cohesion Errors in Extraction Based Summaries

L14-1246  [bib]: Lanjun Zhou; Binyang Li; Zhongyu Wei; Kam-Fai Wong
The CUHK Discourse TreeBank for Chinese: Annotating Explicit Discourse Connectives for the Chinese TreeBank

L14-1247  [bib]: Kugatsu Sadamitsu; Ryuichiro Higashinaka; Yoshihiro Matsuo
Extraction of Daily Changing Words for Question Answering

L14-1248  [bib]: Sandipan Dandapat; Declan Groves
MTWatch: A Tool for the Analysis of Noisy Parallel Data

L14-1249  [bib]: Martin Riedl; Richard Steuer; Chris Biemann
Distributed Distributional Similarities of Google Books over the Centuries

L14-1250  [bib]: Saba Urooj; Sarmad Hussain; Asad Mustafa; Rahila Parveen; Farah Adeeba; Tafseer Ahmed Khan; Miriam Butt; Annette Hautli
The CLE Urdu POS Tagset

L14-1251  [bib]: Darina Benikova; Chris Biemann; Marc Reznicek
NoSta-D Named Entity Annotation for German: Guidelines and Dataset

L14-1252  [bib]: Yi-Fen Liu; Shu-Chuan Tseng; J.-S Roger Jang
Phone Boundary Annotation in Conversational Speech

L14-1253  [bib]: Mayumi Bono; Kouhei Kikuchi; Paul Cibulka; Yutaka Osugi
A Colloquial Corpus of Japanese Sign Language: Linguistic Resources for Observing Sign Language Conversations

L14-1254  [bib]: Adam Przepiórkowski; Elżbieta Hajnicz; Agnieszka Patejuk; Marcin Woliński; Filip Skwarski; Marek Świdziński
Walenty: Towards a comprehensive valence dictionary of Polish

L14-1255  [bib]: Balamurali A.R
Can the Crowd be Controlled?: A Case Study on Crowd Sourcing and Automatic Validation of Completed Tasks based on User Modeling

L14-1256  [bib]: Thomas Bögel; Jannik Strötgen; Michael Gertz
Computational Narratology: Extracting Tense Clusters from Narrative Texts

L14-1257  [bib]: Mārcis Pinnis; Ilze Auziņa; Kārlis Goba
Designing the Latvian Speech Recognition Corpus

L14-1258  [bib]: Pavel Vondřička
Aligning parallel texts with InterText

L14-1259  [bib]: Panot Chaimongkol; Akiko Aizawa; Yuka Tateisi
Corpus for Coreference Resolution on Scientific Papers

L14-1260  [bib]: Ingrid Falk; Delphine Bernhard; Christophe Gérard
From Non Word to New Word: Automatically Identifying Neologisms in French Newspapers

L14-1261  [bib]: Nancy Underwood; Bartolomé Mesa-Lao; Mercedes García Martínez; Michael Carl; Vicent Alabau; Jesús González-Rubio; Luis A. Leiva; Germán Sanchis-Trilles; Daniel Ortíz-Martínez; Francisco Casacuberta
Evaluating the effects of interactivity in a post-editing workbench

L14-1262 : Georgios Paltoglou
Using Twitter and Sentiment Analysis for event detection

L14-1263  [bib]: Thomas Schmidt
The Research and Teaching Corpus of Spoken German ― FOLK

L14-1264  [bib]: Stefania Degaetano-Ortlieb; Peter Fankhauser; Hannah Kermes; Ekaterina Lapshinova-Koltunski; Noam Ordan; Elke Teich
Data Mining with Shallow vs. Linguistic Features to Study Diversification of Scientific Registers

L14-1265  [bib]: Hassan Saif; Miriam Fernandez; Yulan He; Harith Alani
On Stopwords, Filtering and Data Sparsity for Sentiment Analysis of Twitter

L14-1266  [bib]: Patrik Lambert; Carlos Rodriguez-Penagos
Adapting Freely Available Resources to Build an Opinion Mining Pipeline in Portuguese

L14-1267  [bib]: Milena Hnátková; Michal Křen; Pavel Procházka; Hana Skoumalová
The SYN-series corpora of written Czech

L14-1268  [bib]: Liane Guillou; Christian Hardmeier; Aaron Smith; Jörg Tiedemann; Bonnie Webber
ParCor 1.0: A Parallel Pronoun-Coreference Corpus to Support Statistical MT

L14-1269  [bib]: Johann-Mattis List; Jelena Prokić
A Benchmark Database of Phonetic Alignments in Historical Linguistics and Dialectology

L14-1270  [bib]: Xavier Tannier
Extracting News Web Page Creation Time with DCTFinder

L14-1271  [bib]: Karel Kučera; Martin Stluka
Corpus of 19th-century Czech Texts: Problems and Solutions

L14-1272  [bib]: Julien Velcin; Young-Min Kim; Caroline Brun; Jean-Yves Dormagen; Eric SanJuan; Leila Khouas; Anne Peradotto; Stéphane Bonnevay; Claude Roux; Julien Boyadjian; Alejandro Molina; Marie Neihouser
Investigating the Image of Entities in Social Media: Dataset Design and First Results

L14-1273  [bib]: Per Erik Solberg; Arne Skjærholt; Lilja Øvrelid; Kristin Hagen; Janne Bondi Johannessen
The Norwegian Dependency Treebank

L14-1274  [bib]: Maik Stührenberg
Extending standoff annotation

L14-1275  [bib]: László Laki; György Orosz
An efficient language independent toolkit for complete morphological disambiguation

L14-1276  [bib]: Peter Spyns; Remco van Veenendaal
A decade of HLT Agency activities in the Low Countries: from resource maintenance (BLARK) to service offerings (BLAISE)

L14-1277  [bib]: Eunah Cho; Sarah Fünfer; Sebastian Stüker; Alex Waibel
A Corpus of Spontaneous Speech in Lectures: The KIT Lecture Corpus for Spoken Language Processing and Translation

L14-1278  [bib]: Niklas Vanhainen; Giampiero Salvi
Free Acoustic and Language Models for Large Vocabulary Continuous Speech Recognition in Swedish

L14-1279  [bib]: Begum Erten; Cem Bozsahin; Deniz Zeyrek
Turkish Resources for Visual Word Recognition

L14-1280  [bib]: Eshrag Refaee; Verena Rieser
An Arabic Twitter Corpus for Subjectivity and Sentiment Analysis

L14-1281  [bib]: Massimo Moneglia; Susan Brown; Francesca Frontini; Gloria Gagliardi; Fahad Khan; Monica Monachini; Alessandro Panunzi
The IMAGACT Visual Ontology. An Extendable Multilingual Infrastructure for the representation of lexical encoding of Action

L14-1282  [bib]: Martin Benjamin
Collaboration in the Production of a Massively Multilingual Lexicon

L14-1283  [bib]: François Salmon; Félicien Vallet
An Effortless Way To Create Large-Scale Datasets For Famous Speakers

L14-1284  [bib]: Bogdan Ludusan; Maarten Versteegh; Aren Jansen; Guillaume Gravier; Xuan-Nga Cao; Mark Johnson; Emmanuel Dupoux
Bridging the gap between speech technology and natural language processing: an evaluation toolbox for term discovery systems

L14-1285  [bib]: Dietmar Rösner; Rafael Friesen; Stephan Günther; Rico Andrich
Modeling and evaluating dialog success in the LAST MINUTE corpus

L14-1286  [bib]: Maxim Sidorov; Stefan Ultes; Alexander Schmitt
Comparison of Gender- and Speaker-adaptive Emotion Recognition

L14-1287  [bib]: Sian Alsop; Hilary Nesi
The pragmatic annotation of a corpus of academic lectures

L14-1288  [bib]: Dorte Haltrup Hansen; Lene Offersgaard; Sussi Olsen
Using TEI, CMDI and ISOcat in CLARIN-DK

L14-1289  [bib]: Sabrina Campano; Jessica Durand; Chloé Clavel
Comparative analysis of verbal alignment in human-human and human-agent interactions

L14-1290  [bib]: Petic Mircea; Daniela Gîfu
Transliteration and alignment of parallel texts from Cyrillic to Latin

L14-1291  [bib]: Corina Dima; Verena Henrich; Erhard Hinrichs; Christina Hoppermann
How to Tell a Schneemann from a Milchmann: An Annotation Scheme for Compound-Internal Relations

L14-1292  [bib]: Susana Bautista; Horacio Saggion
Can Numerical Expressions Be Simpler? Implementation and Demonstration of a Numerical Simplification System for Spanish

L14-1293  [bib]: Anita Rácz; István Nagy T.; Veronika Vincze
4FX: Light Verb Constructions in a Multilingual Parallel Corpus

L14-1294  [bib]: Fritz Kliche; Andre Blessing; Dr. Ulrich Heid; Jonathan Sonntag
The eIdentity Text Exploration Workbench

L14-1295  [bib]: Nesrine Fourati; Catherine Pelachaud
Emilya: Emotional body expression in daily actions database

L14-1296  [bib]: Kareem Darwish; Ahmed Abdelali; Hamdy Mubarak
Using Stem-Templates to Improve Arabic POS and Gender/Number Tagging

L14-1297  [bib]: Shaoda He; Xiaojun Zou; Liumingjing Xiao; Junfeng Hu
Construction of Diachronic Ontologies from People's Daily of Fifty Years

L14-1298  [bib]: Jonathan Chevelu; Gwénolé Lecorvé; Damien Lolive
ROOTS: a toolkit for easy, fast and consistent processing of large sequential annotated data collections

L14-1299  [bib]: Martin Jansche
Computer-Aided Quality Assurance of an Icelandic Pronunciation Dictionary

L14-1300  [bib]: Ismail El Maarouf; Jane Bradbury; Vít Baisa; Patrick Hanks
Disambiguating Verbs by Collocation: Corpus Lexicography meets Natural Language Processing

L14-1301  [bib]: Veronika Vincze; János Zsibrita; Péter Durst; Martina Katalin Szabó
Automatic Error Detection concerning the Definite and Indefinite Conjugation in the HunLearner Corpus

L14-1302  [bib]: Maxim Sidorov; Christina Brester; Wolfgang Minker; Eugene Semenkin
Speech-Based Emotion Recognition: Feature Selection by Self-Adaptive Multi-Criteria Genetic Algorithm

L14-1303  [bib]: Stefan Höfler; Kyoko Sugisaki
Constructing and exploiting an automatically annotated resource of legislative texts

L14-1304  [bib]: Roman Schneider
GenitivDB ― a Corpus-Generated Database for German Genitive Classification

L14-1305  [bib]: Jana Sindlerova; Zdenka Uresova; Eva Fucikova
Resources in Conflict: A Bilingual Valency Lexicon vs. a Bilingual Treebank vs. a Linguistic Theory

L14-1306  [bib]: Roser Saurí; Judith Domingo; Toni Badia
The NewSoMe Corpus: A Unifying Opinion Annotation Framework across Genres and in Multiple Languages

L14-1307  [bib]: Nianwen Xue; Yuchen Zhang
Buy one get one free: Distant annotation of Chinese tense, event type and modality

L14-1308  [bib]: Kodai Takahashi; Masashi Inoue
Multimodal dialogue segmentation with gesture post-processing

L14-1309  [bib]: André Bittar; Luca Dini; Sigrid Maurel; Mathieu Ruhlmann
The Dangerous Myth of the Star System

L14-1310  [bib]: Haibo Li; Masato Hagiwara; Qi Li; Heng Ji
Comparison of the Impact of Word Segmentation on Name Tagging for Chinese and Japanese

L14-1311  [bib]: Verginica Barbu Mititelu; Elena Irimia; Dan Tufiș
CoRoLa ― The Reference Corpus of Contemporary Romanian Language

L14-1312  [bib]: Tibor Kiss; Francis Jeffry Pelletier; Tobias Stadtfeld
Building a reference lexicon for countability in English

L14-1313  [bib]: Gaël de Chalendar
The LIMA Multilingual Analyzer Made Free: FLOSS Resources Adaptation and Correction

L14-1314  [bib]: Marco Marelli; Stefano Menini; Marco Baroni; Luisa Bentivogli; Raffaella Bernardi; Roberto Zamparelli
A SICK cure for the evaluation of compositional distributional semantic models

L14-1315  [bib]: Bayu Rahayudi; Ronald Poppe; Dirk Heylen
Twente Debate Corpus ― A Multimodal Corpus for Head Movement Analysis

L14-1316  [bib]: Annika Hämäläinen; Jairo Avelar; Silvia Rodrigues; Miguel Sales Dias; Artur Kolesiński; Tibor Fegyó; Géza Németh; Petra Csobánka; Karine Lan; David Hewson
The EASR Corpora of European Portuguese, French, Hungarian and Polish Elderly Speech

L14-1317  [bib]: Matteo Abrate; Angelo Mario Del Grosso; Emiliano Giovannetti; Angelica Lo Duca; Damiana Luzzi; Lorenzo Mancini; Andrea Marchetti; Irene Pedretti; Silvia Piccini
Sharing Cultural Heritage: the Clavius on the Web Project

L14-1318  [bib]: Bo Liu; Jingjing Liu; Xiang Yu; Dimitris Metaxas; Carol Neidle
3D Face Tracking and Multi-Scale, Spatio-temporal Analysis of Linguistically Significant Facial Expressions and Head Positions in ASL

L14-1319  [bib]: Leah Geer; Jonathan Keane
Exploring factors that contribute to successful fingerspelling comprehension

L14-1320  [bib]: Nobal Niraula; Vasile Rus; Rajendra Banjade; Dan Stefanescu; William Baggett; Brent Morgan
The DARE Corpus: A Resource for Anaphora Resolution in Dialogue Based Intelligent Tutoring Systems

L14-1321  [bib]: Mikel Forcada
On the annotation of TMX translation memories for advanced leveraging in computer-aided translation

L14-1322  [bib]: Shannon Hennig; Ryad Chellali; Nick Campbell
The D-ANS corpus: the Dublin-Autonomous Nervous System corpus of biosignal and multimodal recordings of conversational speech

L14-1323  [bib]: Andrea Moro; Roberto Navigli; Francesco Maria Tucci; Rebecca J. Passonneau
Annotating the MASC Corpus with BabelNet

L14-1324  [bib]: Joost Bastings; Khalil Sima'an
All Fragments Count in Parser Evaluation

L14-1325  [bib]: Juan-María Garrido; Yesika Laplaza; Benjamin Kolz; Miquel Cornudella
TexAFon 2.0: A text processing tool for the generation of expressive speech in TTS applications

L14-1326  [bib]: Mojgan Seraji; Carina Jahani; Beáta Megyesi; Joakim Nivre
A Persian Treebank with Stanford Typed Dependencies

L14-1327  [bib]: Marion Baranes; Benoît Sagot
A Language-independent Approach to Extracting Derivational Relations from an Inflectional Lexicon

L14-1328  [bib]: Dilek Kucuk; Guillaume Jacquet; Ralf Steinberger
Named Entity Recognition on Turkish Tweets

L14-1329  [bib]: Anne Lacheret; Sylvain Kahane; Julie Beliao; Anne Dister; Kim Gerdes; Jean-Philippe Goldman; Nicolas Obin; Paola Pietrandrea; Atanas Tchobanov
Rhapsodie: a Prosodic-Syntactic Treebank for Spoken French

L14-1330  [bib]: Montserrat Marimon; Núria Bel; Beatriz Fisas; Blanca Arias; Silvia Vázquez; Jorge Vivaldi; Carlos Morell; Mercè Lorente
The IULA Spanish LSP Treebank

L14-1331  [bib]: Maria Goryainova; Cyril Grouin; Sophie Rosset; Ioana Vasilescu
Morpho-Syntactic Study of Errors from Speech Recognition System

L14-1332  [bib]: Nianwen Xue; Ondrej Bojar; Jan Hajic; Martha Palmer; Zdenka Uresova; Xiuhong Zhang
Not an Interlingua, But Close: Comparison of English AMRs to Chinese and Czech

L14-1333  [bib]: Adam Meyers; Giancarlo Lee; Angus Grieve-Smith; Yifan He; Harriet Taber
Annotating Relations in Scientific Articles

L14-1334  [bib]: Prescott Klassen; Fei Xia; Lucy Vanderwende; Meliha Yetisgen
Annotating Clinical Events in Text Snippets for Phenotype Detection

L14-1335  [bib]: Pablo Ruiz; Aitor Álvarez; Haritz Arzelus
Phoneme Similarity Matrices to Improve Long Audio Alignment for Automatic Subtitling

L14-1336  [bib]: Maria Evangelia Chatzimina; Cyril Grouin; Pierre Zweigenbaum
Use of unsupervised word classes for entity recognition: Application to the detection of disorders in clinical reports

L14-1337  [bib]: Eva Hajičová
Three dimensions of the so-called "interoperability" of annotation schemes

L14-1338  [bib]: Miriam Kaeshammer; Anika Westburg
On Complex Word Alignment Configurations

L14-1339  [bib]: Dimitrios Kokkinakis; Jyrki Niemi; Sam Hardwick; Krister Lindén; Lars Borin
HFST-SweNER ― A New NER Resource for Swedish

L14-1340  [bib]: Motaz Saad; David Langlois; Kamel Smaili
Building and Modelling Multilingual Subjective Corpora

L14-1341  [bib]: Barbara Schuppler; Martin Hagmueller; Juan A. Morales-Cordovilla; Hannes Pessentheiner
GRASS: the Graz corpus of Read And Spontaneous Speech

L14-1342  [bib]: Menzo Windhouwer; Ineke Schuurman
Linguistic resources and cats: how to use ISOcat, RELcat and SCHEMAcat

L14-1343  [bib]: Lars Borin; Jens Allwood; Gerard de Melo
Bring vs. MTRoget: Evaluating automatic thesaurus translation

L14-1344  [bib]: Paula Lopez-Otero; Laura Docio-Fernandez; Carmen Garcia-Mateo
Introducing a Framework for the Evaluation of Music Detection Tools

L14-1345  [bib]: Tristan Miller; Iryna Gurevych
WordNet―Wikipedia―Wiktionary: Construction of a Three-way Alignment

L14-1346  [bib]: Cristina Grisot; Thomas Meyer
Cross-linguistic annotation of narrativity for English/French verb tense disambiguation

L14-1347  [bib]: Eleftherios Avramidis; Aljoscha Burchardt; Sabine Hunsicker; Maja Popović; Cindy Tscherwinka; David Vilar; Hans Uszkoreit
The taraXÜ corpus of human-annotated machine translations

L14-1348  [bib]: Mahmoud El-Haj; Paul Rayson; Steve Young; Martin Walker
Detecting Document Structure in a Very Large Corpus of UK Financial Reports

L14-1349  [bib]: Dan Stefanescu; Rajendra Banjade; Vasile Rus
Latent Semantic Analysis Models on Wikipedia and TASA

L14-1350  [bib]: Georg Rehm; Hans Uszkoreit; Sophia Ananiadou; Núria Bel; Audroné Bielevičiené; Lars Borin; António Branco; Gerhard Budin; Nicoletta Calzolari; Walter Daelemans; Radovan Garabík; Marko Grobelnik; Carmen Garcia-Mateo; Josef van Genabith; Jan Hajic; Inma Hernaez; John Judge; Svetla Koeva; Simon Krek; Cvetana Krstev; Krister Linden; Bernardo Magnini; Joseph Mariani; John McNaught; Maite Melero; Monica Monachini; Asuncion Moreno; Jan Odijk; Maciej Ogrodniczuk; Piotr Pezik; Stelios Piperidis; Adam Przepiórkowski; Eiríkur Rögnvaldsson; Michael Rosner; Bolette Pedersen; Inguna Skadina; Koenraad De Smedt; Marko Tadić; Paul Thompson; Dan Tufiș; Tamás Váradi; Andrejs Vasiļjevs; Kadri Vider; Jolanta Zabarskaite
The Strategic Impact of META-NET on the Regional, National and International Level

L14-1351  [bib]: Alan Akbik; Thilo Michael
The Weltmodell: A Data-Driven Commonsense Knowledge Base

L14-1352  [bib]: Florian Schiel; Thomas Kisler
German Alcohol Language Corpus - the Question of Dialect

L14-1353  [bib]: Koenraad De Smedt; Erhard Hinrichs; Detmar Meurers; Inguna Skadina; Bolette Pedersen; Costanza Navarretta; Núria Bel; Krister Linden; Marketa Lopatkova; Jan Hajic; Gisle Andersen; Przemyslaw Lenkiewicz
CLARA: A New Generation of Researchers in Common Language Resources and Their Applications

L14-1354  [bib]: Nathan Hartmann; Lucas Avanço; Pedro Balage; Magali Duran; Maria das Graças Volpe Nunes; Thiago Pardo; Sandra Aluísio
A Large Corpus of Product Reviews in Portuguese: Tackling Out-Of-Vocabulary Words

L14-1355  [bib]: Anoop Kunchukuttan; Abhijit Mishra; Rajen Chatterjee; Ritesh Shah; Pushpak Bhattacharyya
Shata-Anuvadak: Tackling Multiway Translation of Indian Languages

L14-1356  [bib]: Erhard Hinrichs; Steven Krauwer
The CLARIN Research Infrastructure: Resources and Tools for eHumanities Scholars

L14-1357  [bib]: Renlong Ai; Marcela Charfuelan; Walter Kasper; Tina Klüwer; Hans Uszkoreit; Feiyu Xu; Sandra Gasber; Philip Gienandt
Sprinter: Language Technologies for Interactive and Multimedia Language Learning

L14-1358  [bib]: Wushouer Mairidan; Toru Ishida; Donghui Lin; Katsutoshi Hirayama
Bilingual Dictionary Induction as an Optimization Problem

L14-1359  [bib]: Brian MacWhinney; Davida Fromm
Two Approaches to Metaphor Detection

L14-1360  [bib]: Shinsuke Mori; Hideki Ogura; Tetsuro Sasada
A Japanese Word Dependency Corpus

L14-1361  [bib]: Hege Fromreide; Dirk Hovy; Anders Søgaard
Crowdsourcing and annotating NER for Twitter #drift

L14-1362  [bib]: Sebastian Krause; Hong Li; Feiyu Xu; Hans Uszkoreit; Robert Hummel; Luise Spielhagen
Language Resources and Annotation Tools for Cross-Sentence Relation Extraction

L14-1363  [bib]: Alain Couillault; Karën Fort; Gilles Adda; Hugues de Mazancourt
Evaluating corpora documentation with regards to the Ethics and Big Data Charter

L14-1364  [bib]: Ahmet Aker; Monica Paramita; Emma Barker; Robert Gaizauskas
Bootstrapping Term Extractors for Multiple Languages

L14-1365  [bib]: Els Lefever; Marjan Van de Kauter; Veronique Hoste
Evaluation of Automatic Hypernym Extraction from Technical Corpora in English and Dutch

L14-1366  [bib]: Bartosz Broda; Bartłomiej Nitoń; Włodzimierz Gruszczyński; Maciej Ogrodniczuk
Measuring Readability of Polish Texts: Baseline Experiments

L14-1367  [bib]: Raphael Winkelmann; Georg Raess
Introducing a web application for labeling, visualizing speech and correcting derived speech signals

L14-1368  [bib]: Lise Rebout; Philippe Langlais
An Iterative Approach for Mining Parallel Sentences in a Comparable Corpus

L14-1369  [bib]: Mohamed Elmahdy; Mark Hasegawa-Johnson; Eiman Mustafawi
Development of a TV Broadcasts Speech Recognition System for Qatari Arabic

L14-1370  [bib]: Wajdi Zaghouani; Kais Dukes
Can Crowdsourcing be used for Effective Annotation of Arabic?

L14-1371  [bib]: Hanae Koiso; Yasuharu Den; Ken'ya Nishikawa; Kikuo Maekawa
Design and development of an RDB version of the Corpus of Spontaneous Japanese

L14-1372  [bib]: Mohamed Elmahdy; Mark Hasegawa-Johnson; Eiman Mustafawi
Automatic Long Audio Alignment and Confidence Scoring for Conversational Arabic Speech

L14-1373  [bib]: Dirk Goldhahn; Uwe Quasthoff
Vocabulary-Based Language Similarity using Web Corpora

L14-1374  [bib]: Piek Vossen; German Rigau; Luciano Serafini; Pim Stouten; Francis Irving; Willem Van Hage
NewsReader: recording history from daily news streams

L14-1375  [bib]: Çağrı Çöltekin
A set of open source tools for Turkish natural language processing

L14-1376  [bib]: Tjerk Hagemeijer; Michel Généreux; Iris Hendrickx; Amália Mendes; Abigail Tiny; Armando Zamora
The Gulf of Guinea Creole Corpora

L14-1377  [bib]: Ville Viitaniemi; Tommi Jantunen; Leena Savolainen; Matti Karppa; Jorma Laaksonen
S-pot - a benchmark in spotting signs within continuous signing

L14-1378  [bib]: Masood Ghayoomi; Jonas Kuhn
Converting an HPSG-based Treebank into its Parallel Dependency-based Treebank

L14-1379  [bib]: Iñaki Alegria; Nora Aranberri; Pere Comas; Victor Fresno; Pablo Gamallo; Lluís Padró; Iñaki San Vicente; Jordi Turmo; Arkaitz Zubiaga
TweetNorm_es: an annotated corpus for Spanish microtext normalization

L14-1380  [bib]: Elżbieta Hajnicz
The Procedure of Lexico-Semantic Annotation of Składnica Treebank

L14-1381  [bib]: Júlia Pajzs; Ralf Steinberger; Maud Ehrmann; Mohamed Ebrahim; Leonida Della Rocca; Stefano Bucci; Eszter Simon; Tamás Váradi
Media monitoring and information extraction for the highly inflected agglutinative language Hungarian

L14-1382  [bib]: Véronique Moriceau; Xavier Tannier
French Resources for Extraction and Normalization of Temporal Expressions with HeidelTime

L14-1383  [bib]: Maarten Truyens; Patrick Van Eecke
Legal aspects of text mining

L14-1384  [bib]: Angelina Ivanova; Gertjan van Noord
Treelet Probabilities for HPSG Parsing and Error Correction

L14-1385  [bib]: Abir Masmoudi; Mariem Ellouze Khmekhem; Yannick Esteve; Lamia Hadrich Belguith; Nizar Habash
A Corpus and Phonetic Dictionary for Tunisian Arabic Speech Recognition

L14-1386  [bib]: Marie-Claude L'Homme; Benoît Robichaud; Carlos Subirats Rüggeberg
Discovering frames in specialized domains

L14-1387  [bib]: Lori Levin; Teruko Mitamura; Brian MacWhinney; Davida Fromm; Jaime Carbonell; Weston Feely; Robert Frederking; Anatole Gershman; Carlos Ramirez
Resources for the Detection of Conventionalized Metaphors in Four Languages

L14-1388  [bib]: Jan Odijk
CLARIN-NL: Major results

L14-1389  [bib]: Hugo Gonçalo Oliveira; Inês Coelho; Paulo Gomes
Exploiting Portuguese Lexical Knowledge Bases for Answering Open Domain Cloze Questions Automatically

L14-1390  [bib]: Yuka Tateisi; Yo Shidahara; Yusuke Miyao; Akiko Aizawa
Annotation of Computer Science Papers for Semantic Relation Extraction

L14-1391  [bib]: Wan Yu Ho; Christine Kng; Shan Wang; Francis Bond
Identifying Idioms in Chinese Translations

L14-1392  [bib]: Thierry Etchegoyhen; Lindsay Bywood; Mark Fishel; Panayota Georgakopoulou; Jie Jiang; Gerard van Loenhout; Arantza del Pozo; Mirjam Sepesy Maucec; Anja Turner; Martin Volk
Machine Translation for Subtitling: A Large-Scale Evaluation

L14-1393  [bib]: Elisabetta Jezek; Bernardo Magnini; Anna Feltracco; Alessia Bianchini; Octavian Popescu
T-PAS; A resource of Typed Predicate Argument Structures for linguistic analysis and semantic processing

L14-1394  [bib]: Kara Warburton
Narrowing the Gap between Termbases and Corpora in Commercial Environments

L14-1395  [bib]: Subhabrata Mukherjee; Sachindra Joshi
Author-Specific Sentiment Aggregation for Polarity Prediction of Reviews

L14-1396  [bib]: Guillaume Jacquet; Maud Ehrmann; Ralf Steinberger
Clustering of Multi-Word Named Entity variants: Multilingual Evaluation

L14-1397  [bib]: Richard Sproat; Bruno Cartoni; HyunJeong Choe; David Huynh; Linne Ha; Ravindran Rajakumar; Evelyn Wenzel-Grondie
A Database for Measuring Linguistic Information Content

L14-1398  [bib]: Noushin Rezapour Asheghi; Serge Sharoff; Katja Markert
Designing and Evaluating a Reliable Corpus of Web Genres via Crowd-Sourcing

L14-1399  [bib]: Héctor Martínez Alonso; Lauren Romeo
Crowdsourcing as a preprocessing for complex semantic annotation tasks

L14-1400  [bib]: Marco Turchi; Matteo Negri
Automatic Annotation of Machine Translation Datasets with Binary Quality Judgements

L14-1401  [bib]: Marianna Apidianaki; Emilia Verzeni; Diana McCarthy
Semantic Clustering of Pivot Paraphrases

L14-1402  [bib]: Dirk Hovy; Barbara Plank; Anders Søgaard
When POS data sets don’t add up: Combatting sample bias

L14-1403  [bib]: Matthew Shardlow
Out in the Open: Finding and Categorising Errors in the Lexical Simplification Pipeline

L14-1404  [bib]: Mark Finlayson; Jeffry Halverson; Steven Corman
The N2 corpus: A semantically annotated collection of Islamist extremist stories

L14-1405  [bib]: Robert Remus; Dominique Ziegelmayer
Learning from Domain Complexity

L14-1406  [bib]: Ahmed Abbasi; Ammar Hassan; Milan Dhar
Benchmarking Twitter Sentiment Analysis Tools

L14-1407  [bib]: Camille Fauth; Anne Bonneau; Frank Zimmerer; Juergen Trouvain; Bistra Andreeva; Vincent Colotte; Dominique Fohr; Denis Jouvet; Jeanin Jügler; Yves Laprie; Odile Mella; Bernd Möbius
Designing a Bilingual Speech Corpus for French and German Language Learners: a Two-Step Process

L14-1408  [bib]: Carlo Strapparava; Lorenzo Gatti; Marco Guerini; Oliviero Stock
Creative language explorations through a high-expressivity N-grams query language

L14-1409  [bib]: Reinhard Rapp
Using Word Familiarities and Word Associations to Measure Corpus Representativeness

L14-1410  [bib]: Marie Candito; Guy Perrier; Bruno Guillaume; Corentin Ribeyre; Karën Fort; Djamé Seddah; Eric de la Clergerie
Deep Syntax Annotation of the Sequoia French Treebank

L14-1411  [bib]: Marie Candito; Pascal Amsili; Lucie Barque; Farah Benamara; Gaël de Chalendar; Marianne Djemaa; Pauline Haas; Richard Huyghe; Yvette Yannick Mathieu; Philippe Muller; Benoît Sagot; Laure Vieu
Developing a French FrameNet: Methodology and First results

L14-1412  [bib]: Marta Sabou; Kalina Bontcheva; Leon Derczynski; Arno Scharl
Corpus Annotation through Crowdsourcing: Towards Best Practice Guidelines

L14-1413  [bib]: Ioannis Korkontzelos; Sophia Ananiadou
Locating Requests among Open Source Software Communication Messages

L14-1414  [bib]: Katerina Rysova; Jiří Mírovský
Valency and Word Order in Czech ― A Corpus Probe

L14-1415  [bib]: Thierry Declerck; Hans-Ulrich Krieger
Harmonization of German Lexical Resources for Opinion Mining

L14-1416  [bib]: Magda Sevcikova; Zdenek Zabokrtsky
Word-Formation Network for Czech

L14-1417  [bib]: Jamie Bost; Johanna Moore
An Analysis of Older Users' Interactions with Spoken Dialogue Systems

L14-1418  [bib]: Martin Volk; Johannes Graën; Elena Callegaro
Innovations in Parallel Corpus Search Tools

L14-1419  [bib]: Joaquim Moré; Salvador Climent
Machine Translationness: Machine-likeness in Machine Translation Evaluation

L14-1420  [bib]: Mikaël Morardo; Eric De La Clergerie
Towards an environment for the production and the validation of lexical semantic resources

L14-1421  [bib]: Jonathan Gratch; Ron Artstein; Gale Lucas; Giota Stratou; Stefan Scherer; Angela Nazarian; Rachel Wood; Jill Boberg; David DeVault; Stacy Marsella; David Traum; Albert "Skip" Rizzo; Louis-Philippe Morency
The Distress Analysis Interview Corpus of human and computer interviews

L14-1422  [bib]: Brigitte Bigi; Tatsuya Watanabe; Laurent Prévot
Representing Multimodal Linguistic Annotated data

L14-1423  [bib]: Timur Gilmanov; Olga Scrivner; Sandra Kübler
SWIFT Aligner, A Multifunctional Tool for Parallel Corpora: Visualization, Word Alignment, and (Morpho)-Syntactic Cross-Language Transfer

L14-1424  [bib]: Rosemary Orr; Marijn Huijbregts; Roeland van Beek; Lisa Teunissen; Kate Backhouse; David van Leeuwen
Semi-automatic annotation of the UCU accents speech corpus

L14-1425  [bib]: Daniela Amaral; Evandro Fonseca; Lucelene Lopes; Renata Vieira
Comparative Analysis of Portuguese Named Entities Recognition Tools

L14-1426  [bib]: Ana Lúcia Santos; Michel Généreux; Aida Cardoso; Celina Agostinho; Silvana Abalada
A corpus of European Portuguese child and child-directed speech

L14-1427  [bib]: Guntis Barzdins; Didzis Gosko; Laura Rituma; Peteris Paikens
Using C5.0 and Exhaustive Search for Boosting Frame-Semantic Parsing Accuracy

L14-1428  [bib]: Verena Lyding; Lionel Nicolas; Egon Stemle
'interHist' - an interactive visual interface for corpus exploration

L14-1429  [bib]: Rodrigo Boos; Kassius Prestes; Aline Villavicencio
Identification of Multiword Expressions in the brWaC

L14-1430  [bib]: Lis Pereira; Elga Strafella; Yuji Matsumoto
Collocation or Free Combination? ― Applying Machine Translation Techniques to identify collocations in Japanese

L14-1431  [bib]: Adam Kilgarriff; Pavel Rychlý; Milos Jakubicek; Vojtěch Kovář; Vit Baisa; Lucia Kocincová
Extrinsic Corpus Evaluation with a Collocation Dictionary Task

L14-1432  [bib]: Dominique Estival; Steve Cassidy; Felicity Cox; Denis Burnham
AusTalk: an audio-visual corpus of Australian English

L14-1433  [bib]: Nathan Schneider; Spencer Onuffer; Nora Kazour; Emily Danchik; Michael T. Mordowanec; Henrietta Conrad; Noah A. Smith
Comprehensive Annotation of Multiword Expressions in a Social Web Corpus

L14-1434  [bib]: Leonardo Sameshima Taba; Helena Caseli
Automatic semantic relation extraction from Portuguese texts

L14-1435  [bib]: Houda Bouamor; Nizar Habash; Kemal Oflazer
A Multidialectal Parallel Corpus of Arabic

L14-1436  [bib]: Costanza Navarretta; Magdalena Lis
Transfer learning of feedback head expressions in Danish and Polish comparable multimodal corpora

L14-1437  [bib]: Miquel Esplà-Gomis; Filip Klubička; Nikola Ljubešić; Sergio Ortiz-Rojas; Vassilis Papavassiliou; Prokopis Prokopidis
Comparing two acquisition systems for automatically building an English―Croatian parallel corpus from multilingual websites

L14-1438  [bib]: Fabrizio Gotti; Philippe Langlais; Atefeh Farzindar
Hashtag Occurrences, Layout and Translation: A Corpus-driven Analysis of Tweets Published by the Canadian Government

L14-1439  [bib]: Siim Orasmaa
Towards an Integration of Syntactic and Temporal Annotations in Estonian

L14-1440  [bib]: Emanuele Bastianelli; Giuseppe Castellucci; Danilo Croce; Luca Iocchi; Roberto Basili; Daniele Nardi
HuRIC: a Human Robot Interaction Corpus

L14-1441  [bib]: Joke Daems; Lieve Macken; Sonia Vandepitte
On the origin of errors: A fine-grained analysis of MT and PE errors and their relationship

L14-1442  [bib]: Giampiero Salvi; Niklas Vanhainen
The WaveSurfer Automatic Speech Recognition Plugin

L14-1443  [bib]: Matěj Korvas; Ondřej Plátek; Ondřej Dušek; Lukáš Žilka; Filip Jurčíček
Free English and Czech telephone speech corpus shared under the CC-BY-SA 3.0 license

L14-1444  [bib]: Antje Schlaf; Claudia Bobach; Matthias Irmer
Creating a Gold Standard Corpus for the Extraction of Chemistry-Disease Relations from Patent Texts

L14-1445  [bib]: Anna Polychroniou; Hugues Salamin; Alessandro Vinciarelli
The SSPNet-Mobile Corpus: Social Signal Processing Over Mobile Phones.

L14-1446  [bib]: Alina Wróblewska; Adam Przepiórkowski
Projection-based Annotation of a Polish Dependency Treebank

L14-1447  [bib]: Gianluca Lebani; Veronica Viola; Alessandro Lenci
Bootstrapping an Italian VerbNet: data-driven analysis of verb alternations

L14-1448  [bib]: Arda Celebi; Arzucan Özgür
Self-training a Constituency Parser using n-gram Trees

L14-1449  [bib]: Bushra Jawaid; Amir Kamran; Ondrej Bojar
A Tagged Corpus and a Tagger for Urdu

L14-1450  [bib]: Kostadin Cholakov; Chris Biemann; Judith Eckle-Kohler; Iryna Gurevych
Lexical Substitution Dataset for German

L14-1451  [bib]: Lauren Romeo; Sara Mendes; Núria Bel
A cascade approach for complex-type classification

L14-1452  [bib]: Cédric Lopez; Frédérique Segond; Olivier Hondermarck; Paolo Curtoni; Luca Dini
Generating a Resource for Products and Brandnames Recognition. Application to the Cosmetic Domain.

L14-1453  [bib]: Louise Deleger; Anne-Laure Ligozat; Cyril Grouin; Pierre Zweigenbaum; Aurelie Neveol
Annotation of specialized corpora using a comprehensive entity and relation scheme

L14-1454  [bib]: Katarzyna Klessa; Dafydd Gibbon
Annotation Pro + TGA: automation of speech timing analysis

L14-1455  [bib]: Francesca Frontini; Valeria Quochi; Sebastian Padó; Monica Monachini; Jason Utt
Polysemy Index for Nouns: an Experiment on Italian using the PAROLE SIMPLE CLIPS Lexical Database

L14-1456  [bib]: Ahmed Salama; Houda Bouamor; Behrang Mohit; Kemal Oflazer
YouDACC: the Youtube Dialectal Arabic Comment Corpus

L14-1457  [bib]: Dan Flickinger; Emily M. Bender; Stephan Oepen
Towards an Encyclopedia of Compositional Semantics: Documenting the Interface of the English Resource Grammar

L14-1458  [bib]: Tommaso Caselli; Laure Vieu; Carlo Strapparava; Guido Vetere
Enriching the "Senso Comune" Platform with Automatically Acquired Data

L14-1459  [bib]: Christoph Draxler
Online experiments with the Percy software framework - experiences and some early results

L14-1460  [bib]: Onno Crasborn; Han Sloetjes
Improving the exploitation of linguistic annotations in ELAN

L14-1461  [bib]: Simon Fuller; Phil Maguire; Philippe Moser
A Deep Context Grammatical Model For Authorship Attribution

L14-1462  [bib]: Teresa Herrmann; Jan Niehues; Alex Waibel
Manual Analysis of Structurally Informed Reordering in German-English Machine Translation

L14-1463  [bib]: Gabriele Pallotti; Francesca Frontini; Fabio Affè; Monica Monachini; Stefania Ferrari
Presenting a system of human-machine interaction for performing map tasks.

L14-1464  [bib]: Moritz Wittmann; Marion Weller; Sabine Schulte im Walde
Automatic Extraction of Synonyms for German Particle Verbs from Parallel Data with Distributional Similarity as a Re-Ranking Feature

L14-1465  [bib]: Layla El Asri; Rémi Lemonnier; Romain Laroche; Olivier Pietquin; Hatim Khouzaimi
NASTIA: Negotiating Appointment Setting Interface

L14-1466  [bib]: Layla El Asri; Romain Laroche; Olivier Pietquin
DINASTI: Dialogues with a Negotiating Appointment Setting Interface

L14-1467  [bib]: Annemarie Friedrich; Marina Valeeva; Alexis Palmer
LQVSumm: A Corpus of Linguistic Quality Violations in Multi-Document Summarization

L14-1468  [bib]: Manfred Stede; Arne Neumann
Potsdam Commentary Corpus 2.0: Annotation for Discourse Research

L14-1469  [bib]: Nabil Hathout; Franck Sajous; Basilio Calderone
GLÀFF, a Large Versatile French Lexicon

L14-1470  [bib]: Ahti Lohk; Kaarel Allik; Heili Orav; Leo Võhandu
Dense Components in the Structure of WordNet

L14-1471  [bib]: Lauren Romeo; Gianluca Lebani; Núria Bel; Alessandro Lenci
Choosing which to use? A study of distributional models for nominal lexical semantic classification

L14-1472  [bib]: Jens Forster; Christoph Schmidt; Oscar Koller; Martin Bellgardt; Hermann Ney
Extensions of the Sign Language Recognition and Translation Corpus RWTH-PHOENIX-Weather

L14-1473  [bib]: Sara Candeias; Dirce Celorico; Jorge Proença; Arlindo Veiga; Carla Lopes; Fernando Perdigão
HESITA(te) in Portuguese

L14-1474  [bib]: Sameh Alansary
MUHIT: A Multilingual Harmonized Dictionary

L14-1475  [bib]: Maddalen Lopez de Lacalle; Egoitz Laparra; German Rigau
Predicate Matrix: extending SemLink through WordNet mappings

L14-1476  [bib]: AiTi Aw; Sharifah Mahani Aljunied; Nattadaporn Lertcheva; Sasiwimon Kalunsima
TaLAPi ― A Thai Linguistically Annotated Corpus for Language Processing

L14-1477  [bib]: Felice Dell'Orletta; Giulia Venturi; Andrea Cimino; Simonetta Montemagni
T2K^2: a System for Automatically Extracting and Organizing Knowledge from Texts

L14-1478  [bib]: Giovanni Costantini; Iacopo Iaderola; Andrea Paoloni; Massimiliano Todisco
EMOVO Corpus: an Italian Emotional Speech Database

L14-1479  [bib]: Arfath Pasha; Mohamed Al-Badrashiny; Mona Diab; Ahmed El Kholy; Ramy Eskander; Nizar Habash; Manoj Pooleery; Owen Rambow; Ryan Roth
MADAMIRA: A Fast, Comprehensive Tool for Morphological Analysis and Disambiguation of Arabic

L14-1480  [bib]: Ritesh Kumar
Developing Politeness Annotated Corpus of Hindi Blogs

L14-1481  [bib]: Weston Feely; Mehdi Manshadi; Robert Frederking; Lori Levin
The CMU METAL Farsi NLP Approach

L14-1482  [bib]: Bart Desmet; Véronique Hoste
Recognising suicidal messages in Dutch social media

L14-1483  [bib]: Maximilian Köper; Sabine Schulte im Walde
A Rank-based Distance Measure to Detect Polysemy and to Determine Salient Vector-Space Features for German Prepositions

L14-1484  [bib]: Rosalee Wolfe; John McDonald; Larwan Berke; Marie Stumbo
Expanding n-gram analytics in ELAN and a case study for sign synthesis

L14-1485  [bib]: Hen-Hsen Huang; Huan-Yuan Chen; Chang-Sheng Yu; Hsin-Hsi Chen; Po-Ching Lee; Chun-Hsun Chen
Sentence Rephrasing for Parsing Sentences with OOV Words

L14-1486  [bib]: Bruno Guillaume; Karën Fort; Guy Perrier; Paul Bédaride
Mapping the Lexique des Verbes du Français (Lexicon of French Verbs) to a NLP lexicon using examples

L14-1487  [bib]: Aurelie Neveol; Julien Grosjean; Stéfan Darmoni; Pierre Zweigenbaum
Language Resources for French in the Biomedical Domain

L14-1488  [bib]: Adriane Boyd; Jirka Hana; Lionel Nicolas; Detmar Meurers; Katrin Wisniewski; Andrea Abel; Karin Schöne; Barbora Štindlová; Chiara Vettori
The MERLIN corpus: Learner language and the CEFR

L14-1489  [bib]: Yvonne Adesam; Malin Ahlberg; Peter Andersson; Gerlof Bouma; Markus Forsberg; Mans Hulden
Computer-aided morphology expansion for Old Swedish

L14-1490  [bib]: Bushra Jawaid; Ondrej Bojar
Two-Step Machine Translation with Lattices

L14-1491  [bib]: Björn Schuller; Felix Friedmann; Florian Eyben
The Munich Biovoice Corpus: Effects of Physical Exercising, Heart Rate, and Skin Conductance on Human Speech Production

L14-1492  [bib]: Luz Rello; Ricardo Baeza-Yates; Joaquim Llisterri
DysList: An Annotated Resource of Dyslexic Errors

L14-1493  [bib]: Raymond Shen; Hideaki Kikuchi
Estimation of Speaking Style in Speech Corpora Focusing on speech transcriptions

L14-1494  [bib]: Anne Garcia-Fernandez; Olivier Ferret; Marco Dinarelli
Evaluation of different strategies for domain adaptation in opinion mining

L14-1495  [bib]: Travis Goodwin; Sanda Harabagiu
Clinical Data-Driven Probabilistic Graph Processing

L14-1496  [bib]: Muntsa Padró; Marco Idiart; Aline Villavicencio; Carlos Ramisch
Comparing Similarity Measures for Distributional Thesauri

L14-1497  [bib]: Cheikh M. Bamba Dione
Pruning the Search Space of the Wolof LFG Grammar Using a Probabilistic and a Constraint Grammar Parser

L14-1498  [bib]: Maha Althobaiti; Udo Kruschwitz; Massimo Poesio
AraNLP: a Java-based Library for the Processing of Arabic Text.

L14-1499  [bib]: Jena D. Hwang; Annie Zaenen; Martha Palmer
Criteria for Identifying and Annotating Caused Motion Constructions in Corpus Data

L14-1500  [bib]: Yoshihiko Hayashi
Web-imageability of the Behavioral Features of Basic-level Concepts

L14-1501  [bib]: Steve Cassidy; Dominique Estival; Timothy Jones; Denis Burnham; Jared Burghold
The Alveo Virtual Laboratory: A Web Based Repository API

L14-1502  [bib]: Chris Culy; Marco Passarotti; Ulla König-Cardanobile
A Compact Interactive Visualization of Dependency Treebank Query Results

L14-1503  [bib]: Irina Temnikova; Andrea Varga; Dogan Biyikli
Building a Crisis Management Term Resource for Social Media: The Case of Floods and Protests

L14-1504  [bib]: Hao Wu; Zhiye Fei; Aaron Dai; Mark Sammons; Dan Roth; Stephen Mayhew
ILLINOISCLOUDNLP: Text Analytics Services in the Cloud

L14-1505  [bib]: Satoshi Sato
Text Readability and Word Distribution in Japanese

L14-1506  [bib]: Julie Hochgesang
The Use of a FileMaker Pro Database in Evaluating Sign Language Notation Systems

L14-1507  [bib]: Octavian Popescu; Martha Palmer; Patrick Hanks
Mapping CPA Patterns onto OntoNotes Senses

L14-1508  [bib]: Emily M. Bender
Language CoLLAGE: Grammatical Description with the LinGO Grammar Matrix

L14-1509  [bib]: Silvia Rodríguez Vázquez; Pierrette Bouillon; Anton Bolfing
Applying Accessibility-Oriented Controlled Language (CL) Rules to Improve Appropriateness of Text Alternatives for Images: an Exploratory Study

L14-1510  [bib]: Ryan Cotterell; Chris Callison-Burch
A Multi-Dialect, Multi-Genre Corpus of Informal Written Arabic

L14-1511  [bib]: Elisa Omodei; Jean-Philippe Cointet; Thierry Poibeau
Reconstructing the Semantic Landscape of Natural Language Processing

L14-1512  [bib]: Marieke van Erp; Gleb Satyukov; Piek Vossen; Marit Nijsen
Discovering and Visualising Stories in News

L14-1513  [bib]: Zhengzhong Liu; Jun Araki; Eduard Hovy; Teruko Mitamura
Supervised Within-Document Event Coreference using Information Propagation

L14-1514  [bib]: Ana Aguiar; Mariana Kaiseler; Hugo Meinedo; Pedro Almeida; Mariana Cunha; Jorge Silva
VOCE Corpus: Ecologically Collected Speech Annotated with Physiological and Psychological Stress Assessments

L14-1515  [bib]: Shinsuke Mori; Graham Neubig
Language Resource Addition: Dictionary or Corpus?

L14-1516  [bib]: Luca Cristoforetti; Mirco Ravanelli; Maurizio Omologo; Alessandro Sosi; Alberto Abad; Martin Hagmueller; Petros Maragos
The DIRHA simulated corpus

L14-1517  [bib]: Daniel Hladek; Jan Stas; Jozef Juhar
The Slovak Categorized News Corpus

L14-1518  [bib]: Uwe Quasthoff; Dirk Goldhahn; Thomas Eckart; Erla Hallsteinsdóttir; Sabine Fiedler
High Quality Word Lists as a Resource for Multiple Purposes

L14-1519  [bib]: Varvara Logacheva; Lucia Specia
A Quality-based Active Sample Selection Strategy for Statistical Machine Translation

L14-1520  [bib]: Juri Ganitkevitch; Chris Callison-Burch
The Multilingual Paraphrase Database

L14-1521  [bib]: Menno van Zaanen; Gerhard Van Huyssteen; Suzanne Aussems; Chris Emmery; Roald Eiselen
The Development of Dutch and Afrikaans Language Resources for Compound Boundary Analysis.

L14-1522  [bib]: Lluís Padró; Zeljko Agic; Xavier Carreras; Blaz Fortuna; Esteban García-Cuesta; Zhixing Li; Tadej Stajner; Marko Tadić
Language Processing Infrastructure in the XLike Project

L14-1523  [bib]: Gregor Thurmair
Conceptual transfer: Using local classifiers for transfer selection

L14-1524  [bib]: Marta Villegas; Maite Melero; Núria Bel
Metadata as Linked Open Data: mapping disparate XML metadata registries into one RDF/OWL registry.

L14-1525  [bib]: Grégoire Détrez; Víctor M. Sánchez-Cartagena; Aarne Ranta
Sharing resources between free/open-source rule-based machine translation systems: Grammatical Framework and Apertium

L14-1526  [bib]: Georgios Petasis
Annotating Arguments: The NOMAD Collaborative Annotation Tool

L14-1527  [bib]: Diana Maynard; Mark Greenwood
Who cares about Sarcastic Tweets? Investigating the Impact of Sarcasm on Sentiment Analysis.

L14-1528  [bib]: Xabier Artola; Zuhaitz Beloki; Aitor Soroa
A stream computing approach towards scalable NLP

L14-1529  [bib]: Þórdís Úlfarsdóttir
ISLEX ― a Multilingual Web Dictionary

L14-1530  [bib]: Manuela Sanguinetti; Cristina Bosco; Loredana Cupi
Exploiting catenae in a parallel treebank alignment

L14-1531  [bib]: Irina Temnikova; William A. Baumgartner Jr.; Negacy D. Hailu; Ivelina Nikolova; Tony McEnery; Adam Kilgarriff; Galia Angelova; K. Bretonnel Cohen
Sublanguage Corpus Analysis Toolkit: A tool for assessing the representativeness and sublanguage characteristics of corpora

L14-1532  [bib]: Violeta Seretan; Pierrette Bouillon; Johanna Gerlach
A Large-Scale Evaluation of Pre-editing Strategies for Improving User-Generated Content Translation

L14-1533  [bib]: Sigrún Helgadóttir; Hrafn Loftsson; Eiríkur Rögnvaldsson
Correcting Errors in a New Gold Standard for Tagging Icelandic Text

L14-1534  [bib]: Béatrice Daille; Amir Hazem
Semi-compositional Method for Synonym Extraction of Multi-Word Terms

L14-1535  [bib]: Matus Pleva; Jozef Juhar
TUKE-BNews-SK: Slovak Broadcast News Corpus Construction and Evaluation

L14-1536  [bib]: Csaba Oravecz; Tamás Váradi; Bálint Sass
The Hungarian Gigaword Corpus

L14-1537  [bib]: Kunal Sachdeva; Rishabh Srivastava; Sambhav Jain; Dipti Sharma
Hindi to English Machine Translation: Using Effective Selection in Multi-Model SMT

L14-1538  [bib]: Maria Pia di Buono; Mario Monteleone
From Natural Language to Ontology Population in the Cultural Heritage Domain. A Computational Linguistics-based approach.

L14-1539  [bib]: Stephen Wattam; Paul Rayson; Marc Alexander; Jean Anderson
Experiences with Parallelisation of an Existing NLP Pipeline: Tagging Hansard

L14-1540  [bib]: Younggyun Hahm; Jungyeul Park; Kyungtae Lim; Youngsik Kim; Dosam Hwang; Key-Sun Choi
Named Entity Corpus Construction using Wikipedia and DBpedia Ontology

L14-1541  [bib]: Zoraida Callejas; Brian Ravenet; Magalie Ochs; Catherine Pelachaud
A model to generate adaptive multimodal job interviews with a virtual recruiter

L14-1542  [bib]: Željko Agić; Nikola Ljubešić
The SETimes.HR Linguistically Annotated Corpus of Croatian

L14-1543  [bib]: Jerid Francom; Mans Hulden; Adam Ussishkin
ACTIV-ES: a comparable, cross-dialect corpus of ‘everyday’ Spanish from Argentina, Mexico, and Spain

L14-1544  [bib]: Van-Minh Pho; Thibault André; Anne-Laure Ligozat; Brigitte Grau; Gabriel Illouz; Thomas Francois
Multiple Choice Question Corpus Analysis for Distractor Characterization

L14-1545  [bib]: Željko Agić; Daša Berović; Danijela Merkler; Marko Tadić
Croatian Dependency Treebank 2.0: New Annotation Guidelines for Improved Parsing

L14-1546  [bib]: Roberto Gretter
Euronews: a multilingual speech corpus for ASR

L14-1547  [bib]: Masood Ghayoomi; Kiril Simov; Petya Osenova
Constituency Parsing of Bulgarian: Word- vs Class-based Parsing

L14-1548  [bib]: Maike Paetzel; David Nicolas Racca; David DeVault
A Multimodal Corpus of Rapid Dialogue Games

L14-1549  [bib]: Juan Rafael Orozco-Arroyave; Julián David Arias-Londoño; Jesús Francisco Vargas-Bonilla; María Claudia Gonzalez-Rátiva; Elmar Nöth
New Spanish speech corpus database for the analysis of people suffering from Parkinson's disease

L14-1550  [bib]: Scott Martens; Marco Passarotti
Thomas Aquinas in the TüNDRA: Integrating the Index Thomisticus Treebank into CLARIN-D

L14-1551  [bib]: Peter Anick; Marc Verhagen; James Pustejovsky
Identification of Technology Terms in Patents

L14-1552  [bib]: Tomáš Kliegr; Ondřej Zamazal
Towards Linked Hypernyms Dataset 2.0: complementing DBpedia with hypernym discovery

L14-1553  [bib]: Eduard Bejček; Václava Kettnerová; Marketa Lopatkova
Automatic Mapping Lexical Resources: A Lexical Unit as the Keystone

L14-1554  [bib]: Kris Heylen; Stephen Bond; Dirk De Hertog; Ivan Vulić; Hendrik Kockaert
TermWise: A CAT-tool with Context-Sensitive Terminological Support.

L14-1555  [bib]: Irina Galinskaya; Valentin Gusev; Elena Mescheryakova; Mariya Shmatova
Measuring the Impact of Spelling Errors on the Quality of Machine Translation

L14-1556  [bib]: Sakriani Sakti; Keigo Kubo; Sho Matsumiya; Graham Neubig; Tomoki Toda; Satoshi Nakamura; Fumihiro Adachi; Ryosuke Isotani
Towards Multilingual Conversations in the Medical Domain: Development of Multilingual Medical Data and A Network-based ASR System

L14-1557  [bib]: Brigitte Bigi; Roxane Bertrand; Mathilde Guardiola
Automatic detection of other-repetition occurrences: application to French conversational Speech

L14-1558  [bib]: Andrej Zgank; Ana Zwitter Vitez; Darinka Verdonik
The Slovene BNSI Broadcast News database and reference speech corpus GOS: Towards the uniform guidelines for future work

L14-1559  [bib]: Roberto Bartolini; Valeria Quochi; Irene De Felice; Irene Russo; Monica Monachini
From Synsets to Videos: Enriching ItalWordNet Multimodally

L14-1560  [bib]: Vidas Daudaravicius
Language Editing Dataset of Academic Texts

L14-1561  [bib]: Matti Varjokallio; Mikko Kurimo
A Toolkit for Efficient Learning of Lexical Units for Speech Recognition

L14-1562  [bib]: Yuichi Ishimoto; Tomoyuki Tsuchiya; Hanae Koiso; Yasuharu Den
Towards Automatic Transformation between Different Transcription Conventions: Prediction of Intonation Markers from Linguistic and Acoustic Features

L14-1563  [bib]: Hiroaki Noguchi; Yasuhiro Katagiri; Yasuharu Den
Japanese conversation corpus for training and evaluation of backchannel prediction model.

L14-1564  [bib]: Jan Gorisch; Corine Astésano; Ellen Gurman Bard; Brigitte Bigi; Laurent Prévot
Aix Map Task corpus: The French multimodal corpus of task-oriented dialogue

L14-1565  [bib]: Heike Zinsmeister; Ulrich Heid; Kathrin Beck
Adapting a part-of-speech tagset to non-standard text: The case of STTS

L14-1566  [bib]: Milan Rusko; Sakhia Darjaa; Marian Trnka; Marian Ritomsky; Robert Sabo
Alert!... Calm Down, There is Nothing to Worry About. Warning and Soothing Speech Synthesis.

L14-1567  [bib]: Valia Kordoni; Iliana Simova
Multiword Expressions in Machine Translation

L14-1568  [bib]: Christian Girardi; Manuela Speranza; Rachele Sprugnoli; Sara Tonelli
CROMER: a Tool for Cross-Document Event and Entity Coreference

L14-1569  [bib]: Tiberiu Boroș; Adriana Stan; Oliver Watts; Stefan Daniel Dumitrescu
RSS-TOBI - A Prosodically Enhanced Romanian Speech Corpus

L14-1570  [bib]: Arturs Znotins; Peteris Paikens
Coreference Resolution for Latvian

L14-1571  [bib]: Elena Mitocariu; Daniel Anechitei; Dan Cristea
How Could Veins Speed Up The Process Of Discourse Parsing

L14-1572  [bib]: Nitsan Chrizman; Alon Itai
How to construct a multi-lingual domain ontology

L14-1573  [bib]: Thomas Lavergne; Gilles Adda; Martine Adda-Decker; Lori Lamel
Automatic language identity tagging on word and sentence-level in multilingual text sources: a case-study on Luxembourgish

L14-1574  [bib]: Orphee De Clercq; Sarah Schulz; Bart Desmet; Veronique Hoste
Towards Shared Datasets for Normalization Research

L14-1575  [bib]: Nicolas Pécheux; Alexander Allauzen; François Yvon
Rule-based Reordering Space in Statistical Machine Translation

L14-1576  [bib]: Luis Javier Rodriguez-Fuentes; Mikel Penagarikano; Amparo Varona; Mireia Diez; German Bordel
KALAKA-3: a database for the recognition of spoken European languages on YouTube audios

L14-1577  [bib]: Andrew Gargett; John Barnden
Mining Online Discussion Forums for Metaphors

L14-1578  [bib]: Theodosia Togia; Ann Copestake
TagNText: A parallel corpus for the induction of resource-specific non-taxonomical relations from tagged images

L14-1579  [bib]: Carmen Garcia-Mateo; Antonio Cardenal; Xose Luis Regueira; Elisa Fernández Rei; Marta Martinez; Roberto Seara; Rocío Varela; Noemí Basanta
CORILGA: a Galician Multilevel Annotated Speech Corpus for Linguistic Analysis

L14-1580  [bib]: Akira Fujita; Akihiro Kameda; Ai Kawazoe; Yusuke Miyao
Overview of Todai Robot Project and Evaluation Framework of its NLP-based Problem Solving

L14-1581  [bib]: Virginie Demulier; Elisabetta Bevacqua; Florian Focone; Tom Giraud; Pamela Carreno; Brice Isableu; Sylvie Gibet; Pierre De Loor; Jean-Claude Martin
A Database of Full Body Virtual Interactions Annotated with Expressivity Scores

L14-1582  [bib]: Piotr Banski; Nils Diewald; Michael Hanl; Marc Kupietz; Andreas Witt
Access control by query rewriting: the case of KorAP

L14-1583  [bib]: Igor Odriozola; Inma Hernaez; María Inés Torres; Luis Javier Rodriguez-Fuentes; Mikel Penagarikano; Eva Navas
Basque Speecon-like and Basque SpeechDat MDB-600: speech databases for the development of ASR technology for Basque

L14-1584  [bib]: Andrew Gargett; Sam Hellmuth; Ghazi AlGethami
DiVE-Arabic: Gulf Arabic Dialogue in a Virtual Environment

L14-1585  [bib]: Coline Claude-Lachenaud; Eric Charton; Benoit Ozell; Michel Gagnon
A multimodal interpreter for 3D visualization and animation of verbal concepts

L14-1586  [bib]: Tobias Bocklet; Andreas Maier; Korbinian Riedhammer; Ulrich Eysholdt; Elmar Nöth
Erlangen-CLP: A Large Annotated Corpus of Speech from Children with Cleft Lip and Palate

L14-1587  [bib]: Elena Cabrio; Serena Villata; Fabien Gandon
Classifying Inconsistencies in DBpedia Language Specific Chapters

L14-1588  [bib]: Anindya Roy; Camille Guinaudeau; Herve Bredin; Claude Barras
TVD: A Reproducible and Multiply Aligned TV Series Dataset

L14-1589  [bib]: Cédric Lopez; Reda Bestandji; Mathieu Roche; Rachel Panckhurst
Towards Electronic SMS Dictionary Construction: An Alignment-based Approach

L14-1590  [bib]: Olivier Ferret
Compounds and distributional thesauri

L14-1591  [bib]: Antonio Balvet; Dejan Stosic; Aleksandra Miletic
TALC-sef A Manually-Revised POS-TAgged Literary Corpus in Serbian, English and French

L14-1592  [bib]: Shinsuke Goto; Donghui Lin; Toru Ishida
Crowdsourcing for Evaluating Machine Translation Quality

L14-1593  [bib]: Billy T.M. Wong; Ian C. Chow; Jonathan J. Webster; Hengbin Yan
The Halliday Centre Tagger: An Online Platform for Semi-automatic Text Annotation and Analysis

L14-1594  [bib]: Shinsuke Mori; Hirokuni Maeta; Yoko Yamakata; Tetsuro Sasada
Flow Graph Corpus from Recipe Texts

L14-1595  [bib]: Johannes Kirschnick; Alan Akbik; Holmer Hemsen
Freepal: A Large Collection of Deep Lexico-Syntactic Patterns for Relation Extraction

L14-1596  [bib]: Rachel Bawden; Marie-Amélie Botalla; Kim Gerdes; Sylvain Kahane
Correcting and Validating Syntactic Dependency in the Spoken French Treebank Rhapsodie

L14-1597  [bib]: Marcin Woliński
Morfeusz Reloaded

L14-1598  [bib]: Mauro Dragoni; Alessio Bosca; Matteo Casu; Andi Rexha
Modeling, Managing, Exposing, and Linking Ontologies with a Wiki-based Tool

L14-1599  [bib]: Kasia Budzynska; Mathilde Janier; Chris Reed; Patrick Saint-Dizier; Manfred Stede; Olena Yaskorska
A Model for Processing Illocutionary Structures and Argumentation in Debates

L14-1600  [bib]: Deryle Lonsdale; Benjamin Millard
Student achievement and French sentence repetition test scores

L14-1601  [bib]: Daniel Luzzati; Cyril Grouin; Ioana Vasilescu; Martine Adda-Decker; Eric Bilinski; Nathalie Camelin; Juliette Kahn; Carole Lailler; Lori Lamel; Sophie Rosset
Human annotation of ASR error regions: Is "gravity" a sharable concept for human annotators?

L14-1602  [bib]: Yves Scherrer; Luka Nerima; Lorenza Russo; Maria Ivanova; Eric Wehrli
SwissAdmin: A multilingual tagged parallel corpus of press releases

L14-1603  [bib]: Anna Vernerová; Václava Kettnerová; Marketa Lopatkova
To Pay or to Get Paid: Enriching a Valency Lexicon with Diatheses

L14-1604  [bib]: Liang Tian; Derek F. Wong; Lidia S. Chao; Paulo Quaresma; Francisco Oliveira; Lu Yi
UM-Corpus: A Large English-Chinese Parallel Corpus for Statistical Machine Translation

L14-1605  [bib]: Rodrigo Agerri; Josu Bermudez; German Rigau
IXA pipeline: Efficient and Ready to Use Multilingual NLP tools

L14-1606  [bib]: Suguru Matsuyoshi; Ryo Otsuki; Fumiyo Fukumoto
Annotating the Focus of Negation in Japanese Text

L14-1607  [bib]: Mohamed Sherif; Sandro Coelho; Ricardo Usbeck; Sebastian Hellmann; Jens Lehmann; Martin Brümmer; Andreas Both
NIF4OGGD - NLP Interchange Format for Open German Governmental Data

L14-1608  [bib]: Alessio Bosca; Matteo Casu; Mauro Dragoni; Nikolaos Marianos
A Gold Standard for CLIR evaluation in the Organic Agriculture Domain

L14-1609  [bib]: Senka Drobac; Krister Lindén; Tommi Pirinen; Miikka Silfverberg
Heuristic Hyper-minimization of Finite State Lexicons

L14-1610  [bib]: Stelios Piperidis; Harris Papageorgiou; Christian Spurk; Georg Rehm; Khalid Choukri; Olivier Hamon; Nicoletta Calzolari; Riccardo Del Gratta; Bernardo Magnini; Christian Girardi
META-SHARE: One year after

L14-1611  [bib]: Claudia Baur; Manny Rayner; Nikos Tsourakis
Using a Serious Game to Collect a Child Learner Speech Corpus

L14-1612  [bib]: Riccardo Del Gratta; Gabriella Pardelli; Sara Goggi
The LRE Map disclosed

L14-1613  [bib]: Evgeny Stepanov; Giuseppe Riccardi; Ali Orkan Bayer
The Development of the Multilingual LUNA Corpus for Spoken Language System Porting

L14-1614  [bib]: Magdalena Rysova
Verbs of Saying with a Textual Connecting Function in the Prague Discourse Treebank

L14-1615  [bib]: Marc Poch; Núria Bel; Sergio Espeja; Felipe Navio
Ranking Job Offers for Candidates: learning hidden knowledge from Big Data

L14-1616  [bib]: Valérie Hanoka; Benoît Sagot
An Open-Source Heavily Multilingual Translation Graph Extracted from Wiktionaries and Parallel Corpora

L14-1617  [bib]: Claudia Borg; Albert Gatt
Crowd-sourcing evaluation of automatically acquired, morphologically related word groupings

L14-1618  [bib]: Auður Hauksdóttir
An Innovative World Language Centre: Challenges for the Use of Language Technology

L14-1619  [bib]: Yves Scherrer; Benoît Sagot
A language-independent and fully unsupervised approach to lexicon induction and part-of-speech tagging for closely related languages

L14-1620  [bib]: David Tavarez; Eva Navas; Daniel Erro; Ibon Saratxaga; Inma Hernaez
New bilingual speech databases for audio diarization

L14-1621  [bib]: Mohamed Morchid; Richard Dufour; Georges Linares
A LDA-Based Topic Classification Approach from Highly Imperfect Automatic Transcriptions

L14-1622  [bib]: Cristina Sánchez Marco
An open source part-of-speech tagger for Norwegian: Building on existing language resources

L14-1623  [bib]: Ahmet Aker; Monica Paramita; Marcis Pinnis; Robert Gaizauskas
Bilingual dictionaries for all EU languages

L14-1624  [bib]: Martin Reynaert
Synergy of Nederlab and

L14-1625  [bib]: Raphael Rubino; Antonio Toral; Nikola Ljubešić; Gema Ramírez-Sánchez
Quality Estimation for Synthetic Parallel Data Generation

L14-1626 : Janine Pimentel
Adding a Third Language to a Lexical Resource Describing Legal Terminology: the assignment of equivalents

L14-1627  [bib]: Wolfgang Seeker; Jonas Kuhn
An Out-of-Domain Test Suite for Dependency Parsing of German

L14-1628  [bib]: Maud Ehrmann; Francesco Cecconi; Daniele Vannella; John Philip McCrae; Philipp Cimiano; Roberto Navigli
Representing Multilingual Data as Linked Data: the Case of BabelNet 2.0

L14-1629  [bib]: George Kiomourtzis; George Giannakopoulos; Georgios Petasis; Pythagoras Karampiperis; Vangelis Karkaletsis
NOMAD: Linguistic Resources and Tools Aimed at Policy Formulation and Validation

L14-1630  [bib]: Lina Henriksen; Dorte Haltrup Hansen; Bente Maegaard; Bolette Sandford Pedersen; Claus Povlsen
Encompassing a spectrum of LT users in the CLARIN-DK Infrastructure

L14-1631  [bib]: Alexis Nasr; Frederic Bechet; Benoit Favre; Thierry Bazillon; Jose Deulofeu; Andre Valli
Automatically enriching spoken corpora with syntactic information for linguistic studies

L14-1632  [bib]: Mathieu Lafourcade; Karën Fort
Propa-L: a semantic filtering service from a lexical network created using Games With A Purpose

L14-1633  [bib]: Maria Simi; Cristina Bosco; Simonetta Montemagni
Less is More? Towards a Reduced Inventory of Categories for Training a Parser for the Italian Stanford Dependencies

L14-1634  [bib]: Mark Cieliebak; Oliver Dürr; Fatih Uzdilli
Meta-Classifiers Easily Improve Commercial Sentiment Detection Tools

L14-1635  [bib]: Kyle Richardson; Jonas Kuhn
UnixMan Corpus: A Resource for Language Learning in the Unix Domain

L14-1636  [bib]: Jonathan Sonntag; Manfred Stede
GraPAT: a Tool for Graph Annotations

L14-1637  [bib]: Antonio Pareja-Lora; Guillermo Cárcamo-Escorza; Alicia Ballesteros-Calvo
Standardisation and Interoperation of Morphosyntactic and Syntactic Annotation Tools for Spanish and their Annotations

L14-1638  [bib]: Gongye Jin; Daisuke Kawahara; Sadao Kurohashi
A Framework for Compiling High Quality Knowledge Resources From Raw Corpora

L14-1639  [bib]: Jason Utt; Sylvia Springorum; Maximilian Köper; Sabine Schulte im Walde
Fuzzy V-Measure - An Evaluation Method for Cluster Analyses of Ambiguous Data

L14-1640  [bib]: Guoyu Tang; Yunqing Xia; Weizhi Wang; Raymond Lau; Fang Zheng
Clustering tweets using Wikipedia concepts

L14-1641  [bib]: Maria Koutsombogera; Samer Al Moubayed; Bajibabu Bollepalli; Ahmed Hussen Abdelaziz; Martin Johansson; José David Aguas Lopes; Jekaterina Novikova; Catharine Oertel; Kalin Stefanov; Gül Varol
The Tutorbot Corpus ― A Corpus for Studying Tutoring Behaviour in Multiparty Face-to-Face Spoken Dialogue

L14-1642  [bib]: Nikola Ljubešić; Darja Fišer; Tomaž Erjavec
TweetCaT: a tool for building Twitter corpora of smaller languages

L14-1643  [bib]: Ondrej Bojar; Vojtěch Diatka; Pavel Rychlý; Pavel Stranak; Vit Suchomel; Aleš Tamchyna; Daniel Zeman
HindEnCorp - Hindi-English and Hindi-only Corpus for Machine Translation

L14-1644  [bib]: Silvia Necsulescu; Sara Mendes; Núria Bel
Combining dependency information and generalization in a pattern-based approach to the classification of lexical-semantic relation instances

L14-1645  [bib]: Aimilios Chalamandaris; Pirros Tsiakoulis; Sotiris Karabetsos; Spyros Raptis
Using Audio Books for Training a Text-to-Speech System

L14-1646  [bib]: Agata Cybulska; Piek Vossen
Using a sledgehammer to crack a nut? Lexical diversity and event coreference resolution

L14-1647  [bib]: Nikola Ljubešić; Antonio Toral
caWaC -- A web corpus of Catalan and its application to language modeling and machine translation

L14-1648  [bib]: Marc Kupietz; Harald Lüngen
Recent Developments in DeReKo

L14-1649  [bib]: Shu-Kai Hsieh
Why Chinese Web-as-Corpus is Wacky? Or: How Big Data is Killing Chinese Corpus Linguistics

L14-1650  [bib]: Tafseer Ahmed Khan
Automatic acquisition of Urdu nouns (along with gender and irregular plurals)

L14-1651  [bib]: Clare Llewellyn; Claire Grover; Jon Oberlander; Ewan Klein
Re-using an Argument Corpus to Aid in the Curation of Social Media Collections

L14-1652  [bib]: Raivis Skadiņš; Jörg Tiedemann; Roberts Rozis; Daiga Deksne
Billions of Parallel Words for Free: Building and Using the EU Bookshop Corpus

L14-1653  [bib]: Isa Maks; Ruben Izquierdo; Francesca Frontini; Rodrigo Agerri; Piek Vossen; Andoni Azpeitia
Generating Polarity Lexicons with WordNet propagation in 5 languages

L14-1654  [bib]: Mara Chinea-Rios; Germán Sanchis Trilles; Daniel Ortiz-Martínez; Francisco Casacuberta
Online optimisation of log-linear weights in interactive machine translation

L14-1655  [bib]: Jannik Strötgen; Thomas Bögel; Julian Zell; Ayser Armiti; Tran Van Canh; Michael Gertz
Extending HeidelTime for Temporal Expressions Referring to Historic Dates

L14-1656  [bib]: Roman Klinger; Philipp Cimiano
The USAGE review corpus for fine grained multi lingual opinion analysis

L14-1657  [bib]: Nadjet Bouayad-Agha; Alicia Burga; Gerard Casamayor; Joan Codina; Rogelio Nazar; Leo Wanner
An Exercise in Reuse of Resources: Adapting General Discourse Coreference Resolution for Detecting Lexical Chains in Patent Documentation

L14-1658  [bib]: Bernardo Severo; Cassia Trojahn; Renata Vieira
VOAR: A Visual and Integrated Ontology Alignment Environment

L14-1659  [bib]: Thomas Eckart; Erla Hallsteinsdóttir; Sigrún Helgadóttir; Uwe Quasthoff; Dirk Goldhahn
A 500 Million Word POS-Tagged Icelandic Corpus

L14-1660  [bib]: Chahinez Benkoussas; Hussam Hamdan; Patrice Bellot; Frédéric Béchet; Elodie Faath
A Collection of Scholarly Book Reviews from the Platforms of electronic sources in Humanities and Social Sciences OpenEdition.org

L14-1661  [bib]: Anton Karl Ingason; Hrafn Loftsson; Eiríkur Rögnvaldsson; Einar Freyr Sigurðsson; Joel C. Wallenberg
Rapid Deployment of Phrase Structure Parsing for Related Languages: A Case Study of Insular Scandinavian

L14-1662  [bib]: Michael Röder; Ricardo Usbeck; Sebastian Hellmann; Daniel Gerber; Andreas Both
N³ - A Collection of Datasets for Named Entity Recognition and Disambiguation in the NLP Interchange Format

L14-1663  [bib]: Valentín Cardeñoso-Payo; César González-Ferreras; David Escudero
Assessment of Non-native Prosody for Spanish as L2 using quantitative scores and perceptual evaluation

L14-1664  [bib]: Michael Stadtschnitzer; Jochen Schwenninger; Daniel Stein; Joachim Koehler
Exploiting the large-scale German Broadcast Corpus to boost the Fraunhofer IAIS Speech Recognition System

L14-1665  [bib]: Natalia Loukachevitch; Aleksey Alekseev
Summarizing News Clusters on the Basis of Thematic Chains

L14-1666  [bib]: Kilian A. Foth; Arne Köhn; Niels Beuck; Wolfgang Menzel
Because Size Does Matter: The Hamburg Dependency Treebank

L14-1667  [bib]: Vincenzo Galatà; Alberto Benin; Piero Cosi; Giuseppe Riccardo Leone; Giulio Paci; Giacomo Sommavilla; Fabio Tesser
Discovering the Italian literature: interactive access to audio indexed text resources

L14-1668  [bib]: Jorge Gracia; Elena Montiel-Ponsoda; Daniel Vila-Suero; Guadalupe Aguado-de-Cea
Enabling Language Resources to Expose Translations as Linked Data on the Web

L14-1669  [bib]: Judit Ács
Pivot-based multilingual dictionary building using Wiktionary

L14-1670  [bib]: Andrea Glaser; Jonas Kuhn
Exploring the utility of coreference chains for improved identification of personal names

L14-1671  [bib]: Tatiana Erekhinskaya; Meghana Satpute; Dan Moldovan
Multilingual eXtended WordNet Knowledge Base: Semantic Parsing and Translation of Glosses

L14-1672  [bib]: Manel Zarrouk; Mathieu Lafourcade
Relation Inference in Lexical Networks ... with Refinements

L14-1673  [bib]: Veronica Perez-Rosas; Rada Mihalcea; Alexis Narvaez; Mihai Burzo
A Multimodal Dataset for Deception Detection

L14-1674  [bib]: Jean-Philippe Goldman; Tea Prsir; Antoine Auchlin
C-PhonoGenre: a 7-hours corpus of 7 speaking styles in French: relations between situational features and prosodic properties

L14-1675  [bib]: Ahmed Abdelali; Francisco Guzman; Hassan Sajjad; Stephan Vogel
The AMARA Corpus: Building Parallel Language Resources for the Educational Domain

L14-1676  [bib]: Lauma Pretkalniņa; Artūrs Znotiņš; Laura Rituma; Didzis Goško
Dependency parsing representation effects on the accuracy of semantic applications ― an example of an inflective language

L14-1677  [bib]: Guiyao Ke; Pierre-Francois Marteau
Co-clustering of bilingual datasets as a mean for assisting the construction of thematic bilingual comparable corpora

L14-1678  [bib]: Paweł Kędzia; Maciej Piasecki
Ruled-based, Interlingual Motivated Mapping of plWordNet onto SUMO Ontology

L14-1679  [bib]: Pollet Samvelian; Pegah Faghiri; Sarra El Ayari
Extending the coverage of a MWE database for Persian CPs exploiting valency alternations

L14-1680  [bib]: Andrea Horbach; Alexis Palmer; Magdalena Wolska
Finding a Tradeoff between Accuracy and Rater's Workload in Grading Clustered Short Answers

L14-1681  [bib]: Ilaine Wang; Sylvain Kahane; Isabelle Tellier
Macrosyntactic Segmenters of a French Spoken Corpus

L14-1682  [bib]: Jetske Klatter; Roeland Van Hout; Henk van den Heuvel; Paula Fikkert; Anne Baker; Jan De Jong; Frank Wijnen; Eric Sanders; Paul Trilsbeek
Vulnerability in Acquisition, Language Impairments in Dutch: Creating a VALID Data Archive

L14-1683  [bib]: Anders Björkelund; Kerstin Eckart; Arndt Riester; Nadja Schauffler; Katrin Schweitzer
The Extended DIRNDL Corpus as a Resource for Coreference and Bridging Resolution

L14-1684  [bib]: Elena Volodina; Ildikó Pilán; Lars Borin; Therese Lindström Tiedemann
A flexible language learning platform based on language resources and web services

L14-1685  [bib]: Christian Chiarcos
Towards interoperable discourse annotation. Discourse features in the Ontologies of Linguistic Annotation

L14-1686  [bib]: Patrick Littell; Kaitlyn Price; Lori Levin
Morphological parsing of Swahili using crowdsourced lexical resources

L14-1687  [bib]: Eric Charton; Marie-Jean Meurs; Ludovic Jean-Louis; Michel Gagnon
Improving Entity Linking using Surface Form Refinement

L14-1688  [bib]: Victoria Arranz; Khalid Choukri; Valérie Mapelli; Hélène Mazo
ELRA's Consolidated Services for the HLT Community

L14-1689  [bib]: Daisuke Kawahara; Martha Palmer
Single Classifier Approach for Verb Sense Disambiguation based on Generalized Features

L14-1690  [bib]: Raquel Amaro
Extracting semantic relations from Portuguese corpora using lexical-syntactic patterns

L14-1691  [bib]: Artem Ostankov; Florian Röhrbein; Ulli Waltinger
LinkedHealthAnswers: Towards Linked Data-driven Question Answering for the Health Care Domain

L14-1692  [bib]: David Jurgens
An analysis of ambiguity in word sense annotations

L14-1693  [bib]: Iolanda Alfano; Francesco Cutugno; Aurelio De Rosa; Claudio Iacobini; Renata Savy; Miriam Voghera
VOLIP: a corpus of spoken Italian and a virtuous example of reuse of linguistic resources

L14-1694  [bib]: Carla Parra Escartín
Chasing the Perfect Splitter: A Comparison of Different Compound Splitting Tools

L14-1695  [bib]: Homa B. Hashemi; Rebecca Hwa
A Comparison of MT Errors and ESL Errors

L14-1696  [bib]: Philippe Martin
New functions for a multipurpose multimodal tool for phonetic and linguistic analysis of very large speech corpora

L14-1697  [bib]: Paul Buitelaar; Georgeta Bordea; Barry Coughlan
Hot Topics and Schisms in NLP: Community and Trend Analysis with Saffron on ACL and LREC Proceedings

L14-1698  [bib]: Ann Irvine; Joshua Langfus; Chris Callison-Burch
The American Local News Corpus

L14-1699  [bib]: Rudolf Rosa; Jan Mašek; David Mareček; Martin Popel; Daniel Zeman; Zdeněk Žabokrtský
HamleDT 2.0: Thirty Dependency Treebanks Stanfordized

L14-1700  [bib]: Shan Wang; Francis Bond
Building The Sense-Tagged Multilingual Parallel Corpus

L14-1701  [bib]: Marcos Garcia; Pablo Gamallo
Multilingual corpora with coreferential annotation of person entities

L14-1702  [bib]: Muhammad Abdul-Mageed; Mona Diab
SANA: A Large Scale Multi-Genre, Multi-Dialect Lexicon for Arabic Subjectivity and Sentiment Analysis

L14-1703  [bib]: Behrang Zadeh; Siegfried Handschuh
Evaluation of Technology Term Recognition with Random Indexing

L14-1704  [bib]: Stefan Bott; Sabine Schulte im Walde
Optimizing a Distributional Semantic Model for the Prediction of German Particle Verb Compositionality

L14-1705  [bib]: Anik Dey; Pascale Fung
A Hindi-English Code-Switching Corpus

L14-1706  [bib]: Nancy Ide; James Pustejovsky; Christopher Cieri; Eric Nyberg; Di Wang; Keith Suderman; Marc Verhagen; Jonathan Wright
The Language Application Grid

L14-1707  [bib]: George Christodoulides; Mathieu Avanzi; Jean-Philippe Goldman
DisMo: A Morphosyntactic, Disfluency and Multi-Word Unit Annotator. An Evaluation on a Corpus of French Spontaneous and Read Speech

L14-1708  [bib]: Trang Mai Xuan; Yohei Murakami; Donghui Lin; Toru Ishida
Integration of Workflow and Pipeline for Language Service Composition

L14-1709  [bib]: Klim Peshkov; Laurent Prévot
Segmentation evaluation metrics, a comparison grounded on prosodic and discourse units

L14-1710  [bib]: Andrea Abel; Aivars Glaznieks; Lionel Nicolas; Egon Stemle
KoKo: an L1 Learner Corpus for German

L14-1711  [bib]: Petra Barancikova; Rudolf Rosa; Ales Tamchyna
Improving Evaluation of English-Czech MT through Paraphrasing

L14-1712  [bib]: Erik Faessler; Johannes Hellrich; Udo Hahn
Disclose Models, Hide the Data - How to Make Use of Confidential Corpora without Seeing Sensitive Raw Data

L14-1713  [bib]: Mitesh M. Khapra; Ananthakrishnan Ramanathan; Anoop Kunchukuttan; Karthik Visweswariah; Pushpak Bhattacharyya
When Transliteration Met Crowdsourcing: An Empirical Study of Transliteration via Crowdsourcing using Efficient, Non-redundant and Fair Quality Control

L14-1714  [bib]: Frederik Baumgardt; Giuseppe Celano; Gregory R. Crane; Stella Dee; Maryam Foradi; Emily Franzini; Greta Franzini; Monica Lent; Maria Moritz; Simona Stoyanova
Open Philology at the University of Leipzig

L14-1715 : Marco Del Tredici; Malvina Nissim
A Modular System for Rule-based Text Categorisation

L14-1716  [bib]: Najeh Hajlaoui; David Kolovratnik; Jaakko Väyrynen; Ralf Steinberger; Daniel Varga
DCEP - Digital Corpus of the European Parliament

L14-1717  [bib]: Joseph Mariani; Christopher Cieri; Gil Francopoulo; Patrick Paroubek; Marine Delaborde
Facing the Identification Problem in Language-Related Scientific Data Analysis

L14-1718  [bib]: Mariette Soury; Laurence Devillers
Smile and Laughter in Human-Machine Interaction: a study of engagement

L14-1719  [bib]: Livio Robaldo; Guido Boella; Luigi Di Caro; Andrea Violato
Exploiting networks in Law

L14-1720  [bib]: Kristín Bjarnadóttir; Jón Daðason
Utilizing constituent structure for compound analysis

L14-1721  [bib]: Wajdi Zaghouani; Behrang Mohit; Nizar Habash; Ossama Obeid; Nadi Tomeh; Alla Rozovskaya; Noura Farra; Sarah Alkuhlani; Kemal Oflazer
Large Scale Arabic Error Annotation: Guidelines and Framework

L14-1722  [bib]: Thomas Pellegrini; Vahid Hedayati; Angela Costa
El-WOZ: a client-server wizard-of-oz interface

L14-1723  [bib]: Fei Cheng; Kevin Duh; Yuji Matsumoto
Parsing Chinese Synthetic Words with a Character-based Dependency Model

L14-1724  [bib]: Mohamed Ben Jannet; Martine Adda-Decker; Olivier Galibert; Juliette Kahn; Sophie Rosset
ETER: a new metric for the evaluation of hierarchical named entity recognition

L14-1725  [bib]: Jun Araki; Zhengzhong Liu; Eduard Hovy; Teruko Mitamura
Detecting Subevent Structure for Event Coreference Resolution

L14-1726  [bib]: Kashif Shah; Marco Turchi; Lucia Specia
An efficient and user-friendly tool for machine translation quality estimation

L14-1727  [bib]: Alexandra Balahur; Marco Turchi; Ralf Steinberger; Jose Manuel Perea-Ortega; Guillaume Jacquet; Dilek Kucuk; Vanni Zavarella; Adil El Ghali
Resource Creation and Evaluation for Multilingual Sentiment Analysis in Social Media Texts

L14-1728  [bib]: Joachim Bingel; Thomas Haider
Named Entity Tagging a Very Large Unbalanced Corpus: Training and Evaluating NE Classifiers

L14-1729  [bib]: Ophélie Lacroix; Denis Béchet
Validation Issues induced by an Automatic Pre-Annotation Mechanism in the Building of Non-projective Dependency Treebanks

L14-1730  [bib]: Renlong Ai; Marcela Charfuelan
MAT: a tool for L2 pronunciation errors annotation

L14-1731  [bib]: Kalliopi Zervanou; Elias Iosif; Alexandros Potamianos
Word Semantic Similarity for Morphologically Rich Languages

L14-1732  [bib]: Joshua Elliot; Logan Kearsley; Jason Housley; Alan Melby
LexTerm Manager: Design for an Integrated Lexicography and Terminology System

L14-1733  [bib]: Daniel Peterson; Martha Palmer; Shumin Wu
Focusing Annotation for Semantic Role Labeling

L14-1734  [bib]: Emanuele Lapponi; Erik Velldal; Stephan Oepen; Rune Lain Knudsen
Off-Road LAF: Encoding and Processing Annotations in NLP Workflows

L14-1735  [bib]: Penny Labropoulou; Christopher Cieri; Maria Gavrilidou
Developing a Framework for Describing Relations among Language Resources

L14-1736  [bib]: Clément de Groc; Xavier Tannier
Evaluating Web-as-corpus Topical Document Retrieval with an Index of the OpenDirectory

L14-1737  [bib]: Santanu Pal; Sudip Kumar Naskar; Sivaji Bandyopadhyay
Word Alignment-Based Reordering of Source Chunks in PB-SMT

L14-1738  [bib]: Frank Landsbergen; Carole Tiberius; Roderik Dernison
Taalportaal: an online grammar of Dutch and Frisian

L14-1739  [bib]: Andrew Yates; Jon Parker; Nazli Goharian; Ophir Frieder
A Framework for Public Health Surveillance

L14-1740  [bib]: Zdenka Uresova; Jan Hajic; Pavel Pecina; Ondrej Dusek
Multilingual Test Sets for Machine Translation of Search Queries for Cross-Lingual Information Retrieval in the Medical Domain

L14-1741  [bib]: Axel-Cyrille Ngonga Ngomo; Norman Heino; René Speck; Prodromos Malakasiotis
A tool suite for creating question answering benchmarks

L14-1742  [bib]: Clément de Groc; Xavier Tannier; Claude de Loupy
Thematic Cohesion: measuring terms discriminatory power toward themes

L14-1743  [bib]: Tatiana Gornostay; Andrejs Vasiļjevs
Terminology Resources and Terminology Work Benefit from Cloud Services

L14-1744  [bib]: Munshi Asadullah; Patrick Paroubek; Anne Vilnat
Bidirectionnal converter between syntactic annotations: from French Treebank Dependencies to PASSAGE annotations, and back

L14-1745  [bib]: Marcos Zampieri; Binyam Gebre
VarClass: An Open-source Language Identification Tool for Language Varieties

L14-1746  [bib]: Achim Rettinger; Lei Zhang; Daša Berović; Danijela Merkler; Matea Srebačić; Marko Tadić
RECSA: Resource for Evaluating Cross-lingual Semantic Annotation