ACL Logo ACL Anthology
A Digital Archive of Research Papers in Computational Linguistics

Google search the Anthology

Proceedings of the Third International Conference on Language Resources and Evaluation (LREC'02)

L02-1001 : Susana Afonso; Eckhard Bick; Renato Haber; Diana Santos
Floresta Sintá(c)tica: A treebank for Portuguese.

L02-1002 : Hiroyuki Shinnou
Learning of word sense disambiguation rules by Co-training, checking co-occurrence of features.

L02-1003 : Lorna Balkan; Ken Miller; Birgit Austin; Anne Etheridge; Myriam Garcia Bernabé; Pam Miller
ELSST: a broad-based Multilingual Thesaurus for the Social Sciences.

L02-1004 : Vincent Vandeghinste
Lexicon Optimization: Maximizing Lexical Coverage in Speech Recognition through Automated Compounding.

L02-1005 : Eduard Hovy; Margaret King; Andrei Popescu-Belis
Computer-Aided Specification of Quality Models for Machine Translation Evaluation.

L02-1006 : Serge Sharoff
Meaning as use: exploitation of aligned corpora for the contrastive study of lexical semantics.

L02-1007 : Min-Yen Kan; Judith L. Klavans; Kathleen R. McKeown
Using the Annotated Bibliography as a Resource for Indicative Summarization.

L02-1008 : Choy-Kim Chuah; Zaharin Yusoff
Computational Linguistics at Universiti Sains Malaysia

L02-1009 : Judit Feliu; Jorge Vivaldi; M. Teresa Cabré
Towards an Ontology for a Human Genome Knowledge Base

L02-1010 : Tom Laureys; Kris Demuynck; Jacques Duchateau; Patrick Wambacq
An Improved Algorithm for the Automatic Segmentation of Speech Corpora.

L02-1011 : Katja Markert; Malvina Nissim
Towards a Corpus Annotated for Metonymies: the Case of Location Names.

L02-1012 : Philippe Langlais; Marie Loranger; Guy Lapalme
Translators at work with TRANSTYPE: Resource and Evaluation. 

L02-1013 : Qiang Zhou; Elliott Franco Drabek; Fuji Ren
Annotating the functional chunks in Chinese sentences.

L02-1014 : Hisao Kuwabara; Shuich Itahashi; Mikio Yamamoto; Toshiyuki Takezawa; Satoshi Nakamura; Kazuya Takeda
The Present Status of Speech Database in Japan: Development, Management, and Application to Speech Research.

L02-1015 : Diana Santos; Caroline Gasperin
Evaluation of parsed corpora: Experiments in user-transparent and user-visible evaluation.

L02-1016 : Laura Docío-Fernández; Carmen García-Mateo
Acoustic Modeling and Training of a Bilingual ASR System when a Minority Language is Involved.

L02-1017 : Steven Bird; Hans Uszkoreit; Gary Simons
The Open Language Archives Community

L02-1018 : Jakub Piskorski; Witold Drożdżyński; Oliver Scherf; Feiyu Xu
A Flexible XML-based Regular Compiler for Creation and Conversion of Linguistic Resources.

L02-1019 : Robert Modic; Bojan Petek
A Contrastive Acoustic-Phonetic Analysis of Slovenian and English Diphthongs.

L02-1020 : Christoph Draxler; Florian Schiel
Three New Corpora at the Bavarian Archive for Speech Signals – and a First Step Towards Distributed Web-Based Recording.

L02-1021 : René Schneider
n-grams of Seeds: A Hybrid System for Corpus-Based Text Summarization.

L02-1022 : Barry Schiffman
Building a Resource for Evaluating the Importance of Sentences.

L02-1023 : Sabine Schulte im Walde
A Subcategorisation Lexicon for German Verbs induced from a Lexicalised PCFG.

L02-1024 : Ingunn Amdal; Torbjørn Svendsen
Evaluation of Pronunciation Variants in the ASR Lexicon for Different Speaking Styles.

L02-1025 : Andrea Bozzi
LAperLA: an integrated graphical-linguistic System for old printed Latin Texts.

L02-1026 : Pascale Bernard; Josette Lecomte; Jacques Dendien; Jean-Marie Pierrel
Computerized linguistic resources of the research laboratory ATILF for lexical and textual analysis: Frantext, TLFi, and the software Stella.

L02-1027 : Masaki Murata; Hitoshi Isahara
Automatic extraction of differences between spoken and written languages, and automatic translation from the written to the spoken language.

L02-1028 : Fabio Tamburini
Automatic detection of prosodic prominence in continuous speech.

L02-1029 : Fabio Tamburini
A dynamic model for reference corpora structure definition

L02-1030 : Daniela Alderuccio; Luciana Bordoni
An ontology-based approach in the literary research: two case-studies

L02-1031 : Javier Caminero; Joaquín González-Rodríguez; Javier Ortega-García; Daniel Tapias; Pedro M. Ruz; Mercedes Solá
A Multilingual Speaker Verification System: Architecture and Performance Evaluation.

L02-1032 : Dan Tufiş; Ana-Maria Barbu
Lexical token alignment: experiments, results and applications.

L02-1033 : Achim F. Müller; Janez Stergar; Bogomir Horvat
Designing Prosodic Databases for Automatic Modeling of Slovenian Language in a Multilingual TTS System.

L02-1034 : Nadjet Bouayad-Agha; Richard Power; Donia Scott; Anja Belz
PILLS:  Multilingual generation of medical information documents with overlapping content.

L02-1035 : Felix Sasaki; Claudia Wegener; Andreas Witt; Dieter Metzing; Jens Pönninghaus
Co-reference annotation and resources: A multilingual corpus of typologically diverse languages.

L02-1036 : Udo Hahn; Stefan Schulz
Towards Very Large Ontologies for Medical Language Processing.

L02-1037 : Enrique Alfonseca; Suresh Manandhar
Improving an Ontology Refinement Method with Hyponymy Patterns.

L02-1038 : Enrique Alfonseca; Suresh Manandhar
Proposal for Evaluating Ontology Refinement Methods.

L02-1039 : Matthieu Constant
Methods for Constructing Lexicon-Grammar Resources: The Example of Measure Expressions.

L02-1040 : Kristina Nilsson; Lars Borin
Living off the land: The Web as a source of practice texts for learners of less prevalent languages.

L02-1041 : Sebastian Möller; Ergina Kavallieratou
Diagnostic Assessment of Telephone Transmission Impact on ASR Performance and Human-to-Human Speech Quality.

L02-1042 : Carlos D. Martínez-Hinarejos; Emilio Sanchís; Fernando García-Granada; Pablo Aibar
A Labelling Proposal to Annotate Dialogues.

L02-1043 : Simone Teufel; Noemie Elhadad
Collection and linguistic processing of a large-scale corpus of medical articles.

L02-1044 : Tokunaga Takenobu; Okumura Manabu; Saitô Suguru; Tanaka Hozumi
Constructing a lexicon of action.

L02-1045 : Birte Lönneker
Building Concept Frames based on Text Corpora.

L02-1046 : I. Hernáez; E. Navas; J. Sánchez; I. Madariaga; I. Gaminde; X. Zalbide
BIZKAIFON: A sound archive of dialectal varieties of spoken Basque.

L02-1047 : Roberto Navigli; Paola Velardi
Automatic Adaptation of WordNet to Domains.

L02-1048 : Marta Villegas; Nuria Bel
From DTD to relational dB. An automatic generation of a lexicographical station out off ISLE guidelines.

L02-1049 : Florian Schiel; Silke Steininger; Ulrich Türk
The SmartKom Multimodal Corpus at BAS.

L02-1050 : Nicole Beringer; Katerina Louka; Victoria Penide-Lopez; Uli Türk
End-to-End Evaluation of Multimodal Dialogue Systems – can we Transfer Established Methods?

L02-1051 : Antonio Molina; Ferran Pla; Encarna Segarra; Lidia Moreno
Word Sense Disambiguation using Statistical Models and WordNet.

L02-1052 : Hans C. Boas
Bilingual FrameNet Dictionaries for Machine Translation.

L02-1053 : Yllias Chali
Experiments in Topic Detection.

L02-1054 : Gosse Bouma; Geert Kloosterman
Querying Dependency Treebanks in XML.

L02-1055 : Marianne Starlander; Andrei Popescu-Belis
Corpus-based Evaluation of a French Spelling and Grammar Checker.

L02-1056 : Adam Meyers; Ralph Grishman; Michiko Kosaka
Formal Mechanisms for Capturing Regularizations.

L02-1057 : Erhard W. Hinrichs; Sandra Kübler; Frank H. Müller; Tylman Ule
A Hybrid Architecture for Robust Parsing of German.

L02-1058 : Rainer Siemund; Barbara Heuft; Khalid Choukri; Ossama Emam; Emmanuel Maragoudakis; Herbert Tropf; Oren Gedge; Sherrie Shammass; Asuncion Moreno; Albino Nogueiras Rodriguez; Imed Zitouni; Dorota Iskra
OrienTel - Multilingual access to interactive communication services for the Mediterranean and the Middle East.

L02-1059 : Kazutaka Takao; Kenji Imamura; Hideki Kashioka
Comparing and Extracting Paraphrasing Words with 2-Way Bilingual Dictionaries.

L02-1060 : Reinhard Rapp
A Part-of-Speech-Based Search Algorithm for Translation Memories.

L02-1061 : Sabine Brants; Silvia Hansen
Developments in the TIGER Annotation Scheme and their Realization in the Corpus.

L02-1062 : António Branco; José Leitão; João Silva; Luís Gomes
Nexing Corpus: a corpus of verbal protocols on syllogistic reasoning.

L02-1063 : Eva Hajičová; Ivona Kučerová
Argument/Valency Structure in PropBank, LCS Database and Prague Dependency Treebank: A Comparative Pilot Study.

L02-1064 : Karl Weilhammer; Uwe Reichel; Florian Schiel
Multi-Tier Annotations in the Verbmobil Corpus.

L02-1065 : Stefan Schaden
A Database for the Analysis of Cross-Lingual Pronunciation Variants of European City Names.

L02-1066 : Hatem Ghorbel; Giovanni Coray; André Linden
SAM: System for Multi-criteria Text Alignment. 

L02-1067 : Pius ten Hacken
Word Formation and the Validation of Lexical Resources.

L02-1068 : A. Cappelli; M. N. Catarsi; P. Michelassi; L. Moretti; M. Baglioni; F. Turini; M. Tavoni
Knowledge Mining and Discovery for Searching in Literary Texts.

L02-1069 : A. Lavelli; F. Pianesi; E. Maci; I. Prodanof; L. Dini; G. Mazzini
SiSSA: An Infrastructure for Developing NLP Applications.

L02-1070 : Kiril Simov; Petya Osenova; Milena Slavcheva; Sia Kolkovska; Elisaveta Balabanova; Dimitar Doikoff; Krassimira Ivanova; Alexander Simov; Milen Kouylekov
Building a Linguistically Interpreted Corpus of Bulgarian: the BulTreeBank

L02-1071 : Ton van der Wouden; Heleen Hoekstra; Michael Moortgat; Bram Renmans; Ineke Schuurman
Syntactic Analysis in the Spoken Dutch Corpus (CGN).

L02-1072 : Andrei Popescu-Belis; Susan Armstrong; Gilbert Robert
Electronic Dictionaries - from Publisher Data to a Distribution Server: the DicoPro, DicoEast and RERO Projects.

L02-1073 : Claudia Kunze; Lothar Lemnitzer
GermaNet - representation, visualization, application.

L02-1074 : Petra Geutner; Frank Steffens; Dietrich Manstetten
Design of the VICO Spoken Dialogue System: Evaluation of User Expectations by Wizard-of-Oz Experiments.

L02-1075 : Nadia Mana; Ornella Corazzari
The Lexico-semantic Annotation of an Italian Treebank

L02-1076 : Bernardo Magnini; Matteo Negri; Roberto Prevete; Hristo Tanev
Towards Automatic Evaluation of Question/Answering Systems

L02-1077 : Martin Rajman; Anthony Hartley
Automatic Ranking of MT Systems.

L02-1078 : Luisa Bentivogli; Emanuele Pianta
Opportunistic Semantic Tagging.

L02-1079 : Petr Pollák; Václav Hanžl
Tool for Czech Pronunciation Generation Combining Fixed Rules with Pronunciation Lexicon and Lexicon Management Tool

L02-1080 : Tony Rose; Mark Stevenson; Miles Whitehead
The Reuters Corpus Volume 1 -from Yesterday's News to Tomorrow's Language Resources.

L02-1081 : Tilly Dutilh; Truus Kruyt
Implementation and Evaluation of PAROLE PoS in a National Context

L02-1082 : Zdeněk Žabokrtský; Petr Sgall; Sašo Džeroski
A Machine Learning Approach to Automatic Functor Assignment in the Prague Dependency Treebank.

L02-1083 : Carole Tiberius
How to build a multilingual inheritance-based lexicon.

L02-1084 : Carole Tiberius; Dunstan Brown; Greville Corbett.
A typological database of agreement

L02-1085 : Jimmy Lin
The Web as a Resource for Question Answering: Perspectives and Challenges.

L02-1086 : Mitsuo Shimohata; Eiichiro Sumita
Automatic paraphrasing based on parallel corpus for normalization.

L02-1087 : Gianni Lazzari
Speech to Speech Translation: Present and Future Challenges.

L02-1088 : Ivan Kopeček; Karel Pala
Databases of Heterogeneous Segments for Concatenative Speech Synthesis.

L02-1089 : Andrej Žgank; Zdravko Kačič; Bogomir Horvat
Preliminary Evaluation of Slovenian Mobile Database PoliDat.

L02-1090 : Thierry Poibeau; Dominique Dutoit; Sophie Bizouard
Evaluating resource acquisition tools for Information Extraction.

L02-1091 : Dominique Dutoit; Pierre Nugues
An Algorithm to Find Words from Definitions.

L02-1092 : Algimantas Rudzionis; Vytautas Rudzionis
Lithuanian Speech Database LTDIGITS

L02-1093 : Olivier Ferret; Christian Fluhr; Françoise Rousseau-Hans; Jean-Luc Simoni
Building domain specific lexical hierarchies from corpora.

L02-1094 : Walter Daelemans; Véronique Hoste
Evaluation of Machine Learning Methods for Natural Language Processing Tasks.

L02-1095 : Tristan Van Rullen; Philippe Blache
An evaluation of different symbolic shallow parsing techniques.

L02-1096 : Jeska Buhmann; Johanneke Caspers; Vincent J. van Heuven; Heleen Hoekstra; Jean-Pierre Martens; Marc Swerts
Annotation of prominent words, prosodic boundaries and segmental lengthening by non-expert transcribers in the Spoken Dutch Corpus.

L02-1097 : Jean-Pierre Martens; Diana Binnenpoorte; Kris Demuynck; Ruben Van Parys; Tom Laureys; Wim Goedertier; Jacques Duchateau
Word Segmentation in the Spoken Dutch Corpus.

L02-1098 : Nelleke Oostdijk; Wim Goedertier; Frank van Eynde; Louis Boves; Jean-Pierre Martens; Michael Moortgat; Harald Baayen
Experiences from the Spoken Dutch Corpus Project.

L02-1099 : George Mikros
Quantitative parameters in corpus design: Estimating the optimum text size in Modern Greek language.

L02-1100 : Pierrette Bouillon; Vincent Claveau; Cécile Fabre; Pascale Sébillot
Acquisition of Qualia Elements from Corpora - Evaluation of a Symbolic Learning Method

L02-1101 : Michelina Savino; Mario Refice; Domenico Daleno
Methods and Tools for Prosodic Analysis of a Spoken Italian Corpus.

L02-1102 : Oliver Lemon; Alexander Gruenstein
Language Resources for Multi-Modal Dialogue Systems. 

L02-1103 : Dominic Widdows; Beate Dorow; Chiu-Ki Chan
Using Parallel Corpora to enrich Multilingual Lexical Resources.

L02-1104 : Ariadna Font Llitjós; Alan W Black
Evaluation and collection of proper name pronunciations online.

L02-1105 : Toshifumi Tanabe; Yasuo Koyama; Kenji Yoshimura; Kosho Shudo
Modal Expressions in Natural Language Sentence and Their Similarity.

L02-1106 : Alex Alsina; Toni Badia; Gemma Boleda; Stefan Bott; Àngel Gil; Martí Quixal; Oriol Valentín
CATCG: a general purpose parsing tool applied.

L02-1107 : Alexander Raake
Does the Content of Speech Influence its Perceived Sound Quality?.

L02-1108 : Monica Ward
Issues in the design, construction and use of Language Resources (LR) for Endangered Languages (Els).

L02-1109 : Aoife Cahill; Josef van Genabith
TTS - A Treebank Tool Suite.

L02-1110 : Rudolf Muhr; Robert Hölrdich; Eva Wächter-Kollpache
The Pronouncing Dictionary of Austrian German and the other Major Varieties of German - A Phonetic Resources Database on the Pronunciation of German.

L02-1111 : Pascale Nicolas; Sabine Letellier-Zarshenas; Igor Schadle; Jean-Yves Antoine; Jean Caelen
Towards a large corpus of spoken dialogue in French that will be freely available: the "Parole Publique" project and its first realisations.

L02-1112 : Steve Cassidy
XQuery as an Annotation Query Language: a Use Case Analysis.

L02-1113 : Constantin Orasan; Ramesh Krishnamurthy
A corpus-based investigation of junk emails.

L02-1114 : Constantin Orasan
Building annotated resources for automatic text summarisation.

L02-1115 : Lynne Bowker; Peter Bennison
Translation Tracking System: A tool for managing translation archives.

L02-1116 : Ilona Steiner; Laura Kallmeyer
VIQTORYA -- A Visual Query Tool for Syntactically Annotated Corpora

L02-1117 : Massimo Poesio; Tomonori Ishikawa; Sabine Schulte im Walde; Renata Vieira
Acquiring Lexical Knowledge for Anaphora Resolution.

L02-1118 : Mike Maxwell
Resources for Morphology Learning and Evaluation.

L02-1119 : Chikashi Nobata; Satoshi Sekine; Hitoshi Isahara; Ralph Grishman
Summarization System Integrated with Named Entity Tagging and IE pattern Discovery.

L02-1120 : Satoshi Sekine; Kiyoshi Sudo; Chikashi Nobata
Extended Named Entity Hierarchy.

L02-1121 : Nick Campbell
Recording techniques for capturing natural every-day speech.

L02-1122 : Kenji Matsumoto; Hideki Tanaka
Automatic Alignment of Japanese and English Newspaper Articles using an MT System and a Bilingual Company Name Dictionary.

L02-1123 : Satoshi Shirai; Kazuhide Yamamoto; Francis Bond; Hozumi Tanaka
Towards a Thesaurus of Predicates.

L02-1124 : Yong-Ju Lee; Bong-Wan Kim; Yongnam Um
Speech Information Technology & Industry Promotion Center in Korea: Activities and Directions.

L02-1125 : Oren Gedge; Christophe Couvreur; Klaus Linhard; Shaunie Shammass; Ami Moyal
Database Adaptation for Speech Recognition in Cross-Environmental Conditions.

L02-1126 : Manolis Maragoudakis; Katia Kermanidis; Nikos Fakotakis; George Kokkinakis
Combining Bayesian and Support Vector Machines Learning to automatically complete Syntactical Information for HPSG-like Formalisms.

L02-1127 : Keiji Yasuda; Fumiaki Sugaya; Toshiyuki Takezawa; Seiichi Yamamoto; Masuzo Yanagida
Automatic machine translation selection scheme to output the best result.

L02-1128 : Aristomenis Thanopoulos; Nikos Fakotakis; George Kokkinakis
Comparative Evaluation of Collocation Extraction Metrics.

L02-1129 : Christophe Laprun; Jonathan G. Fiscus; John Garofolo; Sylvain Pajot
A Pratical Introduction to ATLAS

L02-1130 : John Garofolo; Jonathan G. Fiscus; Alvin Martin; David Pallett; Mark Przybocki
NIST Rich Transcription 2002 Evaluation: A Preview.

L02-1131 : Paloma Martínez; Ana García-Serrano; Alberto Ruiz-Cristina
Integrating Spanish Linguistic Resources in a Web Site Assistant.

L02-1132 : Angelo Dalli
Creation and Evaluation of Extensible Language Resources for Maltese.

L02-1133 : Gregory Grefenstette; Yan Qu; David A. Evans
Expanding lexicons by inducing paradigms and  validating attested forms.

L02-1134 : Taro Watanabe; Mitsuo Shimohata; Eiichiro Sumita
Statistical Machine Translation on Paraphrased Corpora.

L02-1135 : Alejandro Bia; Manuel Sánchez Quero
Building ancient Spanish dictionaries for spell-checking of DL texts

L02-1136 : Hideki Kashioka
Translation Unit Concerning Timing of Simultaneous Translation.

L02-1137 : Masumi Narita; Kazuya Kurokawa; Takehito Utsuro
A Web-based English Abstract Writing Tool Using a Tagged E-J Parallel Corpus.

L02-1138 : Ricardo Ribeiro; Luís Oliveira; Isabel Trancoso
Morphosyntactic Disambiguation for TTS Systems.

L02-1139 : Charles J. Fillmore; Collin F. Baker; Hiroaki Sato
Seeing Arguments through Transparent Structures.

L02-1140 : Charles J. Fillmore; Collin F. Baker; Hiroaki Sato
The FrameNet Database and Software Tools.

L02-1141 : Xiaoyi Ma; Haejoong Lee; Steven Bird; Kazuaki Maeda
Models and Tools for Collaborative Annotation

L02-1142 : Doroteo Torre Toledano; Luis A. Hernández Gómez
HMMs for Automatic Phonetic Segmentation

L02-1143 : Helen Wright Hastie; Rashmi Prasad; Marilyn Walker
Automatic Evaluation: Using a DATE Dialogue Act Tagger for User Satisfaction and Task Completion Prediction.

L02-1144 : Nuria Bel; Javier Caminero; Luis Hernández; Montserrat Marimón; José F. Morlesín; Josep M. Otero; José Relaño; M. Carmen Rodríguez; Pedro M. Ruz; Daniel Tapias
Design and Evaluation of a SLDS for E-Mail Access through the Telephone.

L02-1145 : Ann Copestake; Fabre Lambeau; Aline Villavicencio; Francis Bond; Timothy Baldwin; Ivan A. Sag; Dan Flickinger
Multiword expressions: linguistic precision and reusability.

L02-1146 : Keita Tsuji; Beatrice Daille; Kyo Kageura
Extracting French-Japanese Word Pairs from Bilingual Corpora based on Transliteration Rules.

L02-1147 : Elaine Uí Dhonnchadha
A Two-level Morphological Analyser and Generator for Irish using Finite-State Transducers.

L02-1148 : Smaranda Muresan; Judith Klavans
A Method for Automatically Building and Evaluating Dictionary Resources.

L02-1149 : Violeta Seretan; Dan Cristea
The Use of Referential Constraints in Structuring Discourse.

L02-1150 : Kiyoaki Shirai
Construction of a Word Sense Tagged Corpus for SENSEVAL-2 Japanese Dictionary Task.

L02-1151 : Chung-hye Han; Na-Rare Han; Eon-Suk Ko; Martha Palmer
Development and Evaluation of a Korean Treebank and its Application to NLP

L02-1152 : Alexandra Kinyon; Carlos A. Prolo
Identifying Verb Arguments and their Syntactic Function in the Penn Treebank.

L02-1153 : Parham Mokhtari; Nick Campbell
Automatic Detection of Acoustic Centres of Reliability for Tagging Paralinguistic Information in Expressive Speech.

L02-1154 : Jong-Hoon Oh; Saim Shin; Yong-Seok Choi; Key-Sun Choi
Word Sense Disambiguation with Information Retrieval Technique.

L02-1155 : N. Minematsu; Y. Tomiyama; K. Yoshimoto; K. Shimizu; S. Nakagawa; M. Dantsuji; S. Makino
English Speech Database Read by Japanese Learners for CALL System Development.

L02-1156 : Erica Costantini; Susanne Burger; Fabio Pianesi
NESPOLE!’s Multilingual and Multimodal Corpus.

L02-1157 : Horacio Saggion; Hamish Cunningham; Diana Maynard; Kalina Bontcheva; Oana Hamza; Christian Ursu; Yorick Wilks
Extracting Information for Automatic Indexing of Multimedia Material.

L02-1158 : Horacio Saggion; Dragomir Radev; Simone Teufel; Wai Lam; Stephanie M. Strassel
Developing Infrastructure for the Evaluation of Single and Multi-document Summarization Systems in a Cross-lingual Environment.

L02-1159 : Harris Papageorgiou; Prokopis Prokopidis; Voula Giouli; Iason Demiros; Alexis Konstantinidis; Stelios Piperidis
Multi-level XML-based Corpus Annotation.

L02-1160 : Harald Höge
Project Proposal TC-STAR - Make Speech to Speech Translation Real

L02-1161 : Igor Boguslavsky; Ivan Chardin; Svetlana Grigorieva; Nikolai Grigoriev; Leonid Iomdin; Leonid Kreidlin; Nadezhda Frid
Development of a Dependency Treebank for Russian and its Possible Applications in NLP.

L02-1162 : Stefan Rapp; Michael Strube
An Iterative Data Collection Approach for Multimodal Dialogue Systems.

L02-1163 : Carol Peters; Martin Braschler
The Importance of Evaluation for Cross-Language System Development: the CLEF Experience.

L02-1164 : R. Muñoz; R. Mitkov; M. Palomar; J. Peral; R. Evans; L. Moreno; C. Orasan; M. Saiz-Noeda; A. Ferrández; C. Barbu; P. Martínez-Barco; A. Suárez
Bilingual alignment of anaphoric expressions.

L02-1165 : Heiki-Jaan Kaalep; Kadri Muischnek
Using the Text Corpus to Create a Comprehensive List of Phrasal Verbs

L02-1166 : Diana Raileanu; Paul Buitelaar; Spela Vintar; Jörg Bay
Evaluation Corpora for Sense Disambiguation in the Medical Domain.

L02-1167 : Špela Vintar; Paul Buitelaar; Bärbel Ripplinger; Bogdan Sacaleanu; Diana Raileanu; Detlef Prescher
An Efficient and Flexible Format for Linguistic and Semantic Annotation.

L02-1168 : Markéta Straňáková-Lopatková; Zdenĕk Žabokrtský
Valency Dictionary of Czech Verbs: Complex Tectogrammatical Annotation.

L02-1169 : Darren Pearce
A Comparative Evaluation of Collocation Extraction Techniques

L02-1170 : Marko Tadić
Building the Croatian National Corpus.

L02-1171 : Thorsten Trippel; Dafydd Gibbon
Annotation Driven Concordancing: the PAX Toolkit.

L02-1172 : Katia Lida Kermanidis; Nikos Fakotakis; George Kokkinakis
DELOS: An Automatically Tagged Economic Corpus for Modern Greek.

L02-1173 : Henk van den Heuvel; Khalid Choukri; Harald Höge
Give me a bug. a framework for a bug report service

L02-1174 : Vladimir Hozjan; Zdravko Kacic; Asunción Moreno; Antonio Bonafonte; Albino Nogueiras
Interface Databases: Design and Collection of a Multilingual Emotional Speech Database.

L02-1175 : Vladimir Hozjan; Zdravko Kacic
Objective analysis of emotional speech for English and Slovenian Interface emotional speech databases.

L02-1176 : Konstantin Biatov; Joachim Köhler
Methods and Tools for Speech Data Acquisition exploiting a Database of German Parliamentary Speeches and Transcripts from the Internet.

L02-1177 : Dorota Iskra; Beate Grosskopf; Krzysztof Marasek; Henk van den Heuvel; Frank Diehl; Andreas Kiessling
SPEECON – Speech Databases for Consumer Devices: Database Specification and Validation.

L02-1178 : James Dowdall; Michael Hess; Neeme Kahusk; Kaarel Kaljurand; Mare Koit; Fabio Rinaldi; Kadri Vider
Technical Terminology as a Critical Resource.

L02-1179 : Eva Anna Lenz; Angelika Storrer
Converting a Corpus into a Hypertext: An Approach Using XML Topic Maps and XSLT.

L02-1180 : Rickard Domeij; Ola Knutsson; Kerstin Severinson Eklundh
Different Ways of Evaluating a Swedish Grammar Checker.

L02-1181 : Antonio Moreno Ortiz; Victor Raskin; Sergei Nirenburg
New Developments in Ontological Semantics.

L02-1182 : Amalia Todirascu; Eric Kow; Laurent Romary
Towards Reusable NLP Components.

L02-1183 : Judita Preiss; Anna Korhonen; Ted Briscoe
Subcategorization Acquisition as an Evaluation Method for WSD.

L02-1184 : Shoichiro Hara; Hisashi Yasunaga
Resource Sharing System for Humanity Researches

L02-1185 : A. Benabbou; N. Chenfour; A. Mouradi
Study and quantification of the declination for the Arabic speech synthesis system PARADIS. 

L02-1186 : Matej Rojc; Zdravko Kačič; Darinka Verdonik
Design and Implementation of the Slovenian Phonetic and Morphology Lexicons for the Use in Spoken Language Applications.

L02-1187 : Nabil Hathout; Ludovic Tanguy
Webaffix: Discovering Morphological Links on the WWW.

L02-1188 : Natalia V. Loukachevitch; Boris V. Dobrov
Evaluation of Thesaurus on Sociopolitical Life as Information-Retrieval Tool.

L02-1189 : Nabil Hathout
From WordNet to CELEX: acquiring morphological links from dictionaries of synonyms.

L02-1190 : Ulrich Heid; Bettina Säuberlich; Arne Fitschen
Using Descriptive Generalisations in the Acquisition of Lexical Data for Word Formation.

L02-1191 : Masayuki Asahara; Ryuichi Yoneda; Akiko Yamashita; Yasuharu Den; Yuji Matsumoto
Use of XML and Relational Databases for Consistent Development and Maintenance of Lexicons and Annotated Corpora.

L02-1192 : Laura Pecchia; Giuseppe Cappelli; Elisabetta Guazzini
Linguistic and Computational Problems for the Creation of an Italian Children's Corpus of Spoken Language.

L02-1193 : Lars Ahrenberg ; Mikael Andersson; Magnus Merkel
A System for Incremental and Interactive Word Linking.

L02-1194 : Kiril Ribarov
Old Sources and Modern Procedures: Computer Processing of Old-Church Slavonic.

L02-1195 : José Miguel Aguilar Río
Compiling an Interactive Literary Translation Web Site for Education Purposes.

L02-1196 : Thierry Hamon; Olivier Hû
How to evaluate necessary cooperative systems of terminology building?.

L02-1197 : Nilda Ruimy; Monica Monachini; Raffaella Distante; Elisabetta Guazzini; Stefano Molino; Marisa Ulivieri; Nicoletta Calzolari; Antonio Zampolli
CLIPS, a Multi-level Italian Computational Lexicon: a Glimpse to Data.

L02-1198 : Susanne Salmon-Alt; Renata Vieira
Nominal Expressions in Multilingual Corpora: Definites and Demonstratives.

L02-1199 : Jerker Järborg; Dimitrios Kokkinakis; Maria Toporowska Gronostaj
Lexical and Textual Resources for Sense Recognition and Description.

L02-1200 : X. Artola; A. Díaz de Ilarraza; N. Ezeiza; K. Gojenola; G. Hernández; A. Soroa
A Class Library for the Integration of NLP Tools: Definition and implementation of an Abstract Data Type Collection for the manipulation of SGML documents in a context of stand-off linguistic annotation

L02-1201 : Csaba Oravecz; Péter Dienes
Efficient Stochastic Part-of-Speech Tagging for Hungarian.

L02-1202 : Hannah Kermes; Stefan Evert
YAC - A Recursive Chunker for Unrestricted German Text.

L02-1203 : Koji Eguchi; Kazuko Kuriyama; Noriko Kando
Sensitivity of IR systems Evaluation to Topic Difficulty.

L02-1204 : Brian Mitchell; Robert Gaizauskas
A Comparison of Machine Learning Algorithms for Prepositional Phrase Attachment.

L02-1205 : Dan Cristea; Oana-Diana Postolache; Gabriela-Eugenia Dima; Cătălina Barbu
AR-Engine - a framework for unrestricted co-reference resolution.

L02-1206 : Cătălina Barbu; Richard Evans; Ruslan Mitkov
A corpus based investigation of morphological disagreement in anaphoric relations.

L02-1207 : Cătălina Barbu
Error analysis in anaphora resolution.

L02-1208 : Jean-Yves Antoine; Caroline Bousquet-Vernhettes; Jérôme Goulian; Mohamed Zakaria Kurdi; Sophie Rosset; Nadine Vigouroux; Jeanne Villaneau
Predictive and objective evaluation of speech understanding: the “challenge” evaluation campaign of the I3 speech workgroup of the French CNRS.

L02-1209 : Michael Moortgat; Richard Moot
Using the Spoken Dutch Corpus for type-logical grammar induction.

L02-1210 : Bolette S. Pedersen; Patrizia Paggio
Semantic Lexical Resources Applied to Content-based Querying - the OntoQuery Project.

L02-1211 : Georgios Petasis; Vangelis Karkaletsis; Georgios Paliouras; Ion Androutsopoulos; Constantine D. Spyropoulos
Ellogon: A New Text Engineering Platform.

L02-1212 : Antonio S. Valderrábanos; Alexander Belskis; Luis Iraola Moreno
Multilingual Terminology Extraction and Validation.

L02-1213 : Laila Dybkjær; Niels Ole Bernsen
Natural Interactivity Resources – Data, Annotation Schemes and Tools.

L02-1214 : Niels Ole Bernsen; Laila Dybkjær; Mykola Kolodnytsky
THE NITE WORKBENCH. A Tool for Annotation of Natural Interactivity and Multimodal Data

L02-1215 : Valentin Tablan; Cristian Ursu; Kalina Bontcheva; Hamish Cunningham; Diana Maynard; Oana Hamza; Tony McEnery; Paul Baker; Mark Leisher
A Unicode-based Environment for Creation and Use of Language Resources.

L02-1216 : Dimitra Farmakiotou; Vangelis Karkaletsis; Ioannis Koutsias; George Petasis; Constantine D. Spyropoulos
PatEdit: An Information Extraction Pattern Editor for Fast System Customization.

L02-1217 : Tamás Váradi
The Hungarian National Corpus.

L02-1218 : Paul Clough; Robert Gaizauskas; S. L. Piao
Building and annotating a corpus for the study of journalistic text reuse

L02-1219 : Hennie Brugman; Harriet Spenke; Markus Kramer; Alexander Klassmann
Multimedia Annotation with Multilingual Input Methods and Search Support.

L02-1220 : P. Wittenburg; W. Peters; S. Drude
Analysis of Lexical Structures from Field Linguistics and Language Engineering.

L02-1221 : P. Wittenburg; U. Mosel; A. Dwyer
Methods of Language Documentation in the DOBES project.

L02-1222 : P. Wittenburg; W. Peters; D. Broeder
Metadata Proposals for Corpora and Lexica.

L02-1223 : P. Wittenburg; St. Levinson; S. Kita; H. Brugman
Multimodal Annotations in Gesture and Sign Language Studies

L02-1224 : Daan Broeder; Freddy Offenga; Don Willems
Metadata Tools Supporting Controlled Vocabulary Services.

L02-1225 : Daan Broeder; Peter Wittenburg; Thierry Declerck; Laurent Romary
LREP: A Language Repository Exchange Protocol.

L02-1226 : Caroline Hagège; Claude Roux
A Robust and Flexible Platform for Dependency Extraction.

L02-1227 : Klaus-Dirk Schmitz
Subject-field-specific Ontologies and Terminologies for the Web Community.

L02-1228 : Steve Whittaker; Marilyn Walker; Johanna Moore
Fish or Fowl:A Wizard of Oz Evaluation of Dialogue Strategies in the Restaurant Domain.

L02-1229 : Adriana Roventini; Marisa Ulivieri; Nicoletta Calzolari
Integrating Two Semantic Lexicons, SIMPLE and ItalWordNet: What Can We Gain?.

L02-1230 : Rita Marinelli; Adriana Roventini  
Proper Names In A Semantic Database.

L02-1231 : Leonardo Lesmo; Vincenzo Lombardo
Transformed Subcategorization Frames in Chunk Parsing.

L02-1232 : Gabriela Cavaglià
Measuring corpus homogeneity using a range of measures for inter-document distance.

L02-1233 : Claire Grover; Scott McDonald; Donnla Nic Gearailt; Vangelis Karkaletsis; Dimitra Farmakiotou; Georgios Samaritakis; Georgios Petasis; Maria Teresa Pazienza; Michele Vindigni; Frantz Vichot; Francis Wolinski
Multilingual XML-Based Named Entity Recognition for E-Retail Domains.

L02-1234 : Janienke Sturm; Ilse Bakx; Bert Cranen; Jacques Terken; Fusi Wang
Usability Evaluation of a Dutch Multimodal System for Train Timetable Information.

L02-1235 : Katerina Pastra; Diana Maynard; Oana Hamza; Hamish Cunningham; Yorick Wilks
How feasible is the reuse of grammars for Named Entity Recognition?.

L02-1236 : Claudia Soria; Niels Ole Bernsen; Niels Cadée; Jean Carletta; Laila Dybkjær; Stefan Evert; Ulrich Heid; Amy Isard; Mykola Kolodnytsky; Christoph Lauer; Wolfgang Lezius; Lucas P.J.J. Noldus; Vito Pirrelli; Norbert Reithinger; Andreas Vögele
Advanced Tools for the Study of Natural Interactivity.

L02-1237 : Roldano Cattoni; Morena Danieli; Vanessa Sandrini; Claudia Soria
ADAM: The SI-TAL Corpus of Annotated Dialogues.

L02-1238 : Jason Baldridge; John Dowding; Susana Early  
Leo: an Architecture for Sharing Resources for Unification-Based Grammars.

L02-1239 : Irena Spasić; Goran Nenadić; Sophia Ananiadou
Tuning Context Features with Genetic Algorithms.

L02-1240 : Goran Nenadić; Irena Spasić; Sophia Ananiadou
Automatic Acronym Acquisition and Term Variation Management within Domain-Specific Texts.

L02-1241 : Sanni Nimb
Adverbs in Semantic Lexica for NLP - The extension of the Danish SIMPLE lexicon with Time Adverbs.

L02-1242 : Anna Sågvall Hein; Eva Forsbom; Jörg Tiedemann; Per Weijnitz; Ingrid Almqvist; Leif-Jöran Olsson; Sten Thaning
Scaling Up an MT Prototype for Industrial Use - Databases and Data Flow.

L02-1243 : Xavier Carreras; Lluís Padró
A Flexible Distributed Architecture for Natural Language Analyzers.

L02-1244 : Eugenio Picchi; Eva Sassolini; Ouafae Nahli; Sebastiana Cucurullo; M. Isabel Vargas
Italian arabic linguistic tools.

L02-1245 : Christopher Cieri; Mark Liberman
Language Resource Creation and Distribution at the Linguistic Data Consortium: A Progress Report.

L02-1246 : Claudia Sassen; Dafydd Gibbon
Enhanced Dialogue Markup for Crisis Talk Scenario Resources

L02-1247 : Jörg Tiedemann
MatsLex - a Multilingual Lexical Database for Machine Translation

L02-1248 : Maria Rzewuska
Terminology Resources in the Context of a Major Translation Project.

L02-1249 : Hartmut R. Pfitzinger
Reducing Segmental Duration Variation by Local Speech Rate Normalization of Large Spoken Language Resources.

L02-1250 : Ted Briscoe; John Carroll
Robust Accurate Statistical Annotation of General Text.

L02-1251 : Catia Cucchiarini; Elisabeth D'Halleweyn; Lisanne Teunissen
A Human Language Technologies Platform for the Dutch language: awareness, management maintenance and distribution.

L02-1252 : D. Binnenpoorte; F. De Vriend; J. Sturm; W. Daelemans; H. Strik; C. Cucchiarini
A Field Survey for Establishing Priorities in the Development of HLT Resources for Dutch.

L02-1253 : Ana M. García-Serrano; Luis Rodrigo-Aguado; Javier Calle  
Natural Language Dialogue in a Virtual Assistant Interface.

L02-1254 : Jesús Cardeñosa; Edmundo Tovar; Carolina Gallardo
The UNL System.

L02-1255 : Dieter Maas; Nuebel Rita; Catherine Pease; Paul Schmidt
Bilingual Indexing for Information Retrieval with AUTINDEX.

L02-1256 : Michael Rosner
The Future of Maltilex.

L02-1257 : Nicoletta Calzolari; Ralph Grishman; Marta Palmer
Standards & best practice for multilingual computational lexicons: ISLE MILE  and more”

L02-1258 : Sue Atkins; Nuria Bel; Francesca Bertagna; Pierrette Bouillon; Nicoletta Calzolari; Christiane Fellbaum; Ralph Grishman; Alessandro Lenci; Catherine MacLeod; Martha Palmer; Gregor Thurmair; Marta Villegas; Antonio Zampolli
From Resources to Applications. Designing the Multilingual ISLE Lexical Entry.

L02-1259 : Nicoletta Calzolari; Charles J. Fillmore; Ralph Grishman; Nancy Ide; Alessandro Lenci; Catherine MacLeod; Antonio Zampolli
Towards Best Practice for Multiword Expressions in Computational Lexicons.

L02-1260 : Alessandro Lenci; Roberto Bartolini; Nicoletta Calzolari; Ana Agua; Stephan Busemann; Emmanuel Cartier; Karine Chevreau; José Coch
Multilingual Summarization by Integrating Linguistic Resources in the MLIS-MUSI Project.

L02-1261 : Anna Braasch
Current Developments of STO - the Danish Lexicon Project for NLP and HLT Applications.

L02-1262 : Robert E. Frederking; Alan W Black; Ralf D. Brown; John Moody; Eric Steinbrecher
Field Testing the Tongues Speech-to-Speech Machine Translation System.

L02-1263 : Julia Hockenmaier; Mark Steedman
Acquiring Compact Lexicalized Grammars from a Cleaner Treebank.

L02-1264 : F. de Vriend; P.A. Coppen; W. Haeseryn
Using Grammatical Description as a Metalanguage Resource.

L02-1265 : Hélèn François; Olivier Boëffard
The Greedy Algorithm and its Application to the Construction of a Continuous Speech Database.

L02-1266 : Martine Hurault-Plantet; Laura Monceaux
Cooperation between black box and glass box approaches for the evaluation of a question answering system.

L02-1267 : Adán Cassán; Sergi Cervell; Mireia Colom; Rafael Marín; Josep M. Merenciano; Gema Pérez; Lluís Valentín
BDCon: A Spanish knowledge database.

L02-1268 : Adán Cassán; Sergi Cervell; Mireia Colom; Rafael Marín; Josep M. Merenciano; Gema Pérez; Lluís Valentín
A step forward to hypertext.

L02-1269 : Asunción Moreno; Oren Gedge; Henk van den Heuvel; Harald Höge; Sabine Horbach; Patricia Martin; Elisabeth Pinto; Antonio Rincón; Franco Senia; Rafid Sukkar
SpeechDat across all America: SALA II.

L02-1270 : Maya Ando; Jun Okamoto; Shun Ishizaki
Extraction of Associative Attributes from Nouns and Quantitative Expression of Prototype Concept.

L02-1271 : Martin Wynne
The Language Resource Archive of the 21st Century.

L02-1272 : Nordine Fourour; Emmanuel Morin; Béatrice Daille
Incremental Recognition and Referential Categorization of French Proper Names.

L02-1273 : Shigeki Matsubara; Akira Takagi; Nobuo Kawaguchi; Yasuyoshi Inagaki
Bilingual Spoken Monologue Corpus for Simultaneous Machine Interpretation Research.

L02-1274 : Marcela Charfuelán; Luis Hernández Gómez; Cristina Esteban López; Holmer Hemsen
A XML-based tool for evaluation of SLDS.

L02-1275 : K. López de Ipiña; N. Ezeiza; G. Bordel
Automatic Morphological Segmentation for Continuous Speech Recognition of Basque.

L02-1276 : Richard F. E. Sutcliffe; Kieran White
Searching via Keywords or Concept Hierarchies - Which is Better?.

L02-1277 : Juliana Galvani Greghi; Ronaldo Teixeira Martins; Maria das Graças Volpe Nunes
DIADORIM - A Lexical Database for Brazilian Portuguese.

L02-1278 : Mónica Caballero; José B. Mariño; Asunción Moreno
Multidialectal Spanish Modeling for ASR.

L02-1279 : Paola Monachesi; Alexis Dimitriadis; Rob Goedemans; Anne-Marie Mineur
A unified system for accessing typological databases.

L02-1280 : Klára Osolsobĕ; Karel Pala; Radek Sedláček; Marek Veber
A Procedure for Word Derivational Processes Concerning Lexicon Extension in Highly Inflected Languages

L02-1281 : Heli Uibo
Experimental Two-Level Morphology of Estonian.

L02-1282 : Marianne Dabbadie; Widad Mustafa El Hadi; Ismaïl Timimi
Terminological Enrichment for non-Interactive MT Evaluation

L02-1283 : Paul Kingsbury; Martha Palmer
From TreeBank to PropBank.

L02-1284 : Almudena Ballester; Ángel Martín Municio; Fernando Pardos; Jordi Porta Zamorano; Rafael J. Ruiz Ureña; Fernando Sánchez León
Combining statistics on n-grams for automatic term recognition.

L02-1285 : Steven Bird; Kazuaki Maeda; Xiaoyi Ma; Haejoong Lee; Beth Randall; Salim Zayat
TableTrans, MultiTrans, InterTrans and TreeTrans: Diverse Tools Built on the Annotation Graph Toolkit.

L02-1286 : Kazauki Maeda; Steven Bird; Xiaoyi Ma; Haejoong Lee
Creating Annotation Tools with the Annotation Graph Toolkit.

L02-1287 : Nobuo Kawaguchi; Shigeki Matsubara; Kazuya Takeda; Fumitada Itakura
Multi-Dimensional Data Acquisition for Integrated Acoustic Information Research.

L02-1288 : Laurence Devillers; Sophie Rosset; Hélèn Bonneau-Maynard; Lori Lamel
Annotations for Dynamic Diagnosis of the Dialog State

L02-1289 : Jean-Claude Martin; Michael Kipp
Annotating and Measuring Multimodal Behaviour – Tycoon Metrics in the Anvil Tool.

L02-1290 : Emanuela Cresti; Massimo Moneglia; Fernanda Bacelar do Nascimento; Antonio Moreno Sandoval; Jean Veronis; Philippe Martin; Kalid Choukri; Valerie Mapelli; Daniele Falavigna; Antonio Cid; Claude Blum
The C-ORAL-ROM Project. New methods for spoken language archives in a multilingual romance corpus.

L02-1291 : Christopher Cieri; Mark Liberman
TIDES Language Resources: A Resource Map for Translingual Information Access.

L02-1292 : Stefan Eickeler; Martha Larson; Wolff Rüter; Joachim Köhler
Creation of an Annotated German Broadcast Speech Database for Spoken Document Retrieval.

L02-1293 : Jan-Torsten Milde; Ulrike Gut
The TASX-environment: an XML-based toolset for time aligned speech corpora

L02-1294 : Scott Cotton; Steven Bird
An integrated framework for treebanks and multilayer annotations

L02-1295 : Florence Duclaye; François Yvon; Olivier Collin
Using the Web as a Linguistic Resource for Learning Reformulations Automatically.

L02-1296 : Christoph Müller; Michael Strube
An API for Discourse-level Access to XML-encoded Corpora.

L02-1297 : Hidetsugu Nanba; Manabu Okumura
Some Examinations of Intrinsic Methods for Summary Evaluation Based on the Text Summarization Challenge (TSC).

L02-1298 : Mohamed-Zakaria Kurdi; Mohamed Ahafhaf
Toward an objective and generic Method for Spoken Language Understanding Systems Evaluation: an extension of the DCR method.

L02-1299 : Gerhard Heyer; Uwe Quasthoff; Christian Wolff
Information Extraction from Text Corpora: Using Filters on Collocation Sets.

L02-1300 : Christopher Cieri; Stephanie Strassel
The DASL Project: a Case Study in Data Re-Annotation and Re-Use.

L02-1301 : Dragomir R. Radev; Hong Qi; Harris Wu; Weiguo Fan
Evaluating Web-based Question Answering Systems.

L02-1302 : Daisuke Kawahara; Sadao Kurohashi; Kôiti Hasida
Construction of a Japanese Relevance-tagged Corpus.

L02-1303 : Nancy Ide; Randi Reppen; Keith Suderman
The American National Corpus: More Than the Web Can Provide.

L02-1304 : Craig Martell
FORM: An Extensible, Kinematically-based Gesture Annotation Scheme. 

L02-1305 : Toshiyuki Takezawa; Eiichiro Sumita; Fumiaki Sugaya; Hirofumi Yamamoto; Seiichi Yamamoto
Toward a Broad-coverage Bilingual Corpus for Speech Translation of Travel Conversations in the Real World.

L02-1306 : Michelle Vanni; Keith Miller
Scaling the ISLE Framework: Use of Existing Corpus Resources for Validation of MT Evaluation Metrics across Languages.

L02-1307 : Le An Ha
Learning description of term patterns using glossary resources.

L02-1308 : Barbara Di Eugenio; Michael Glass; Michael J. Scott
The binomial cumulative distribution function, or, is my system better than yours?.

L02-1309 : Fumiaki Suyaga; Toshiyuki Takezawa; Genichiro Kikui; Seiichi Yamamoto
Proposal of a very-large-corpus acquisition method by cell-formed registration.

L02-1310 : Rada F. Mihalcea
Bootstrapping Large Sense Tagged Corpora.

L02-1311 : Takano Ogino; Hitoshi Isahara; Kazuhiro Kobayashi
The Valence Patterns of Japanese Verbs Extracted From The EDR Corpus.

L02-1312 : Jean-Claude Martin; Jean-Hugues Réty; Nelly Bensimon
Multimodal and Adaptative Pedagogical Resources.

L02-1313 : Timothy Baldwin; Slaven Bilac; Ryo Okumura; Takenobu Tokunaga; Hozumi Tanaka
Enhanced Japanese Electronic Dictionary Look-up.

L02-1314 : Romaric Besançon; Martin Rajman
Evaluation of a Vector Space Similarity Measure in a Multilingual Framework.

L02-1315 : Serge A. Yablonsky
Corpora as Object-Oriented System. From UML-notation to Implementation.

L02-1316 : Roberto Bartolini; Alessandro Lenci; Simonetta Montemagni; Vito Pirrelli
The Lexicon-Grammar Balance in Robust Parsing of Italian.

L02-1317 : Daniel Jung
Humans as Corpus - Language Learning Strategies in Virtually Mediated Authentic Environments.

L02-1318 : Atsuko; Koizumi; Hirohiko Sagawa; Masaru Takeuchi
An Annotated Japanese Sign Language Corpus.

L02-1319 : Paul Baker; Andrew Hardie; Tony McEnery; Hamish Cunningham; Rob Gaizauskas
EMILLE, A 67-Million Word Corpus of Indic Languages: Data Collection, Mark-up and Harmonisation.

L02-1320 : Angelika Salmen
Multi-Modal Menus And Traffic Interaction. Timing As A Crucial Factor For User Driven Mode Decision

L02-1321 : Constantin Orăsan; Richard Evans
Assessing the difficulty of finding people in texts.

L02-1322 : Sussi Olsen
Lemma selection in domain specific computational lexica - some specific problems

L02-1323 : Gábor Prószéky; Márton Miháltz
Automatism and User Interaction: Building a Hungarian WordNet.

L02-1324 : Véronique Gendner
Comparative study of oral and written French automatically tagged with morpho-syntactic information.

L02-1325 : Owen Rambow; Cassandre Creswell; Rachel Szekely; Harriet Taber; Marilyn Walker
A Dependency Treebank for English.

L02-1326 : Ganesh Ramesh; Amit Bagga
A Text-based for Detection and Filtering of Commercial Segments in Broadcast News.

L02-1327 : Nancy Ide; Laurent Romary
Standards for Language Resources.

L02-1328 : Nigel Collier; Koichi Takeuchi
PIA-Core: Semantic Annotation through Example-based Learning

L02-1329 : Nigel Collier; Koichi Takeuchi; Chikashi Nobata; Junichi Fukumoto; Norihiro Ogata
Progress on Multi-lingual Named Entity Annotation Guidelines using RDF (S).

L02-1330 : Thomas Hanke
iLex - A tool for Sign Language Lexicography and Corpus Analysis.

L02-1331 : Joanne Capstick; Hans Uszkoreit; Wolfgang Wahlster; Thierry Declerck; Gregor Erbach; Anthony Jameson; Brigitte Jorg; Reinhard Karger; Tillmann Wegst
COLLATE: Competence Center in Speech and Language Technology.

L02-1332 : Emiko Suzuki; Kyoko Kakihana
Japanese and American Sign Language Dictionary System for Japanese and English Users.

L02-1333 : Primož Jakopin
The feasibility of a complete text corpus.

L02-1334 : Catherine Macleod
Lexical Annotation for Multi-word Entries Containing Nominalizations.

L02-1335 : Silja Huttunen; Roman Yangarber; Ralph Grishman
Diversity of Scenarios in Information extraction.

L02-1336 : Mark T. Maybury
Multimodal Systems, Resources and Evaluation.

L02-1337 : Hiromichi Kawanami; Tsuyoshi Masuda; Tomoki Toda; Kiyohiro Shikano
Designing speech database with prosodic variety for expressive TTS system.

L02-1338 : Atsushi Fujii; Katunobu Itou; Tetsuya Ishikawa
Producing a Large-scale Encyclopedic Corpus over the Web.

L02-1339 : Akinobu Lee; Tatsuya Kawahara; Kazuya Takeda; Masato Mimura; Atsushi Yamada; Akinori Ito; Katsunobu Itou; Kiyohiro Shikano
Continuous Speech Recognition Consortium  an Open Repository for CSR Tools and Models.

L02-1340 : Tony McEnery
Ethical and legal issues in corpus construction

L02-1341 : Antonietta Alonge; Margherita Castelli
Which way should we go? Metaphoric expressions in lexical resources.

L02-1342 : Chai Wutiwiwatchai; Patcharika Cotsomrong; Sinaporn Suebvisai; Supphanat Kanokphara
Phonetically Distributed Continuous Speech Corpus for Thai Language.

L02-1343 : Matthias Denecke
Signatures, Typed Feature Structures and RDFS.

L02-1344 : Marie-Jeanne Derouin; Dr. André Le Meur
Report on the Revision of the Lexicographical Standard ISO 1951 Presentation/Representation of Entries in Dictionaries.

L02-1345 : Véronique Gendner; Gabriel Illouz; Michèle Jardino; Laura Monceaux; Patrick Paroubek; Isabelle Robba; Anne Vilnat
A Protocol for Evaluating Analyzers of Syntax (PEAS).

L02-1346 : Mark T. Maybury; Antonio Zampolli
Language Resources and Evaluation: International Strategy Panel.

L02-1347 : Kishore Papineni
Machine Translation Evaluation: N-grams to the Rescue.

L02-1348 : Michael Kluck; Christa Womser-Hacker
Inside the Evaluation Process of the Cross-Language Evaluation Forum (CLEF): Issues of Multilingual Topic Creation and Multilingual Relevance Assessment.

L02-1349 : Andrew Finch; Ezra Black; Ringo Wathelet
Beyond Tag Trigrams: New Local Features for Tagging.

L02-1350 : Sanda Harabagiu; Finley Lacatusu; Paul Morarescu
Multidocument Summarization with GISTexter.

L02-1351 : Feiyu Xu; Daniela Kurz; Jakub Piskorski; Sven Schmeier
A Domain Adaptive Approach to Automatic Acquisition of Domain Relevant Terms and their Relations with Bootstrapping

L02-1352 : Silke Steininger; Florian Schiel; Angelika Glesner
User-State Labeling Procedures For The Multimodal Data Collection Of SmartKom.

L02-1353 : James Pustejovsky
Creating Domain-specific Information Servers.

L02-1354 : Mathieu Lafourcade; Christian Boitet
UNL Lexical Selection with Conceptual Vectors.