Proceedings of the Third International Conference on Language Resources and Evaluation (LREC'02)
L02-1001
: Susana Afonso; Eckhard Bick; Renato Haber; Diana Santos
Floresta Sintá(c)tica: A treebank for Portuguese.
L02-1002
: Hiroyuki Shinnou
Learning of word sense disambiguation rules by Co-training, checking co-occurrence of features.
L02-1003
: Lorna Balkan; Ken Miller; Birgit Austin; Anne Etheridge; Myriam Garcia Bernabé; Pam Miller
ELSST: a broad-based Multilingual Thesaurus for the Social Sciences.
L02-1004
: Vincent Vandeghinste
Lexicon Optimization: Maximizing Lexical Coverage in Speech Recognition through Automated Compounding.
L02-1005
: Eduard Hovy; Margaret King; Andrei Popescu-Belis
Computer-Aided Specification of Quality Models for Machine Translation Evaluation.
L02-1006
: Serge Sharoff
Meaning as use: exploitation of aligned corpora for the contrastive study of lexical semantics.
L02-1007
: Min-Yen Kan; Judith L. Klavans; Kathleen R. McKeown
Using the Annotated Bibliography as a Resource for Indicative Summarization.
L02-1008
: Choy-Kim Chuah; Zaharin Yusoff
Computational Linguistics at Universiti Sains Malaysia
L02-1009
: Judit Feliu; Jorge Vivaldi; M. Teresa Cabré
Towards an Ontology for a Human Genome Knowledge Base
L02-1010
: Tom Laureys; Kris Demuynck; Jacques Duchateau; Patrick Wambacq
An Improved Algorithm for the Automatic Segmentation of Speech Corpora.
L02-1011
: Katja Markert; Malvina Nissim
Towards a Corpus Annotated for Metonymies: the Case of Location Names.
L02-1012
: Philippe Langlais; Marie Loranger; Guy Lapalme
Translators at work with TRANSTYPE: Resource and Evaluation.
L02-1013
: Qiang Zhou; Elliott Franco Drabek; Fuji Ren
Annotating the functional chunks in Chinese sentences.
L02-1014
: Hisao Kuwabara; Shuich Itahashi; Mikio Yamamoto; Toshiyuki Takezawa; Satoshi Nakamura; Kazuya Takeda
The Present Status of Speech Database in Japan: Development, Management, and Application to Speech Research.
L02-1015
: Diana Santos; Caroline Gasperin
Evaluation of parsed corpora: Experiments in user-transparent and user-visible evaluation.
L02-1016
: Laura Docío-Fernández; Carmen García-Mateo
Acoustic Modeling and Training of a Bilingual ASR System when a Minority Language is Involved.
L02-1017
: Steven Bird; Hans Uszkoreit; Gary Simons
The Open Language Archives Community
L02-1018
: Jakub Piskorski; Witold Drożdżyński; Oliver Scherf; Feiyu Xu
A Flexible XML-based Regular Compiler for Creation and Conversion of Linguistic Resources.
L02-1019
: Robert Modic; Bojan Petek
A Contrastive Acoustic-Phonetic Analysis of Slovenian and English Diphthongs.
L02-1020
: Christoph Draxler; Florian Schiel
Three New Corpora at the Bavarian Archive for Speech Signals and a First Step Towards Distributed Web-Based Recording.
L02-1021
: René Schneider
n-grams of Seeds: A Hybrid System for Corpus-Based Text Summarization.
L02-1022
: Barry Schiffman
Building a Resource for Evaluating the Importance of Sentences.
L02-1023
: Sabine Schulte im Walde
A Subcategorisation Lexicon for German Verbs induced from a Lexicalised PCFG.
L02-1024
: Ingunn Amdal; Torbjørn Svendsen
Evaluation of Pronunciation Variants in the ASR Lexicon for Different Speaking Styles.
L02-1025
: Andrea Bozzi
LAperLA: an integrated graphical-linguistic System for old printed Latin Texts.
L02-1026
: Pascale Bernard; Josette Lecomte; Jacques Dendien; Jean-Marie Pierrel
Computerized linguistic resources of the research laboratory ATILF for lexical and textual analysis: Frantext, TLFi, and the software Stella.
L02-1027
: Masaki Murata; Hitoshi Isahara
Automatic extraction of differences between spoken and written languages, and automatic translation from the written to the spoken language.
L02-1028
: Fabio Tamburini
Automatic detection of prosodic prominence in continuous speech.
L02-1029
: Fabio Tamburini
A dynamic model for reference corpora structure definition
L02-1030
: Daniela Alderuccio; Luciana Bordoni
An ontology-based approach in the literary research: two case-studies
L02-1031
: Javier Caminero; Joaquín González-Rodríguez; Javier Ortega-García; Daniel Tapias; Pedro M. Ruz; Mercedes Solá
A Multilingual Speaker Verification System: Architecture and Performance Evaluation.
L02-1032
: Dan Tufiş; Ana-Maria Barbu
Lexical token alignment: experiments, results and applications.
L02-1033
: Achim F. Müller; Janez Stergar; Bogomir Horvat
Designing Prosodic Databases for Automatic Modeling of Slovenian Language in a Multilingual TTS System.
L02-1034
: Nadjet Bouayad-Agha; Richard Power; Donia Scott; Anja Belz
PILLS: Multilingual generation of medical information documents with overlapping content.
L02-1035
: Felix Sasaki; Claudia Wegener; Andreas Witt; Dieter Metzing; Jens Pönninghaus
Co-reference annotation and resources: A multilingual corpus of typologically diverse languages.
L02-1036
: Udo Hahn; Stefan Schulz
Towards Very Large Ontologies for Medical Language Processing.
L02-1037
: Enrique Alfonseca; Suresh Manandhar
Improving an Ontology Refinement Method with Hyponymy Patterns.
L02-1038
: Enrique Alfonseca; Suresh Manandhar
Proposal for Evaluating Ontology Refinement Methods.
L02-1039
: Matthieu Constant
Methods for Constructing Lexicon-Grammar Resources: The Example of Measure Expressions.
L02-1040
: Kristina Nilsson; Lars Borin
Living off the land: The Web as a source of practice texts for learners of less prevalent languages.
L02-1041
: Sebastian Möller; Ergina Kavallieratou
Diagnostic Assessment of Telephone Transmission Impact on ASR Performance and Human-to-Human Speech Quality.
L02-1042
: Carlos D. Martínez-Hinarejos; Emilio Sanchís; Fernando García-Granada; Pablo Aibar
A Labelling Proposal to Annotate Dialogues.
L02-1043
: Simone Teufel; Noemie Elhadad
Collection and linguistic processing of a large-scale corpus of medical articles.
L02-1044
: Tokunaga Takenobu; Okumura Manabu; Saitô Suguru; Tanaka Hozumi
Constructing a lexicon of action.
L02-1045
: Birte Lönneker
Building Concept Frames based on Text Corpora.
L02-1046
: I. Hernáez; E. Navas; J. Sánchez; I. Madariaga; I. Gaminde; X. Zalbide
BIZKAIFON: A sound archive of dialectal varieties of spoken Basque.
L02-1047
: Roberto Navigli; Paola Velardi
Automatic Adaptation of WordNet to Domains.
L02-1048
: Marta Villegas; Nuria Bel
From DTD to relational dB. An automatic generation of a lexicographical station out off ISLE guidelines.
L02-1049
: Florian Schiel; Silke Steininger; Ulrich Türk
The SmartKom Multimodal Corpus at BAS.
L02-1050
: Nicole Beringer; Katerina Louka; Victoria Penide-Lopez; Uli Türk
End-to-End Evaluation of Multimodal Dialogue Systems can we Transfer Established Methods?
L02-1051
: Antonio Molina; Ferran Pla; Encarna Segarra; Lidia Moreno
Word Sense Disambiguation using Statistical Models and WordNet.
L02-1052
: Hans C. Boas
Bilingual FrameNet Dictionaries for Machine Translation.
L02-1053
: Yllias Chali
Experiments in Topic Detection.
L02-1054
: Gosse Bouma; Geert Kloosterman
Querying Dependency Treebanks in XML.
L02-1055
: Marianne Starlander; Andrei Popescu-Belis
Corpus-based Evaluation of a French Spelling and Grammar Checker.
L02-1056
: Adam Meyers; Ralph Grishman; Michiko Kosaka
Formal Mechanisms for Capturing Regularizations.
L02-1057
: Erhard W. Hinrichs; Sandra Kübler; Frank H. Müller; Tylman Ule
A Hybrid Architecture for Robust Parsing of German.
L02-1058
: Rainer Siemund; Barbara Heuft; Khalid Choukri; Ossama Emam; Emmanuel Maragoudakis; Herbert Tropf; Oren Gedge; Sherrie Shammass; Asuncion Moreno; Albino Nogueiras Rodriguez; Imed Zitouni; Dorota Iskra
OrienTel - Multilingual access to interactive communication services for the Mediterranean and the Middle East.
L02-1059
: Kazutaka Takao; Kenji Imamura; Hideki Kashioka
Comparing and Extracting Paraphrasing Words with 2-Way Bilingual Dictionaries.
L02-1060
: Reinhard Rapp
A Part-of-Speech-Based Search Algorithm for Translation Memories.
L02-1061
: Sabine Brants; Silvia Hansen
Developments in the TIGER Annotation Scheme and their Realization in the Corpus.
L02-1062
: António Branco; José Leitão; João Silva; Luís Gomes
Nexing Corpus: a corpus of verbal protocols on syllogistic reasoning.
L02-1063
: Eva Hajičová; Ivona Kučerová
Argument/Valency Structure in PropBank, LCS Database and Prague Dependency Treebank: A Comparative Pilot Study.
L02-1064
: Karl Weilhammer; Uwe Reichel; Florian Schiel
Multi-Tier Annotations in the Verbmobil Corpus.
L02-1065
: Stefan Schaden
A Database for the Analysis of Cross-Lingual Pronunciation Variants of European City Names.
L02-1066
: Hatem Ghorbel; Giovanni Coray; André Linden
SAM: System for Multi-criteria Text Alignment.
L02-1067
: Pius ten Hacken
Word Formation and the Validation of Lexical Resources.
L02-1068
: A. Cappelli; M. N. Catarsi; P. Michelassi; L. Moretti; M. Baglioni; F. Turini; M. Tavoni
Knowledge Mining and Discovery for Searching in Literary Texts.
L02-1069
: A. Lavelli; F. Pianesi; E. Maci; I. Prodanof; L. Dini; G. Mazzini
SiSSA: An Infrastructure for Developing NLP Applications.
L02-1070
: Kiril Simov; Petya Osenova; Milena Slavcheva; Sia Kolkovska; Elisaveta Balabanova; Dimitar Doikoff; Krassimira Ivanova; Alexander Simov; Milen Kouylekov
Building a Linguistically Interpreted Corpus of Bulgarian: the BulTreeBank
L02-1071
: Ton van der Wouden; Heleen Hoekstra; Michael Moortgat; Bram Renmans; Ineke Schuurman
Syntactic Analysis in the Spoken Dutch Corpus (CGN).
L02-1072
: Andrei Popescu-Belis; Susan Armstrong; Gilbert Robert
Electronic Dictionaries - from Publisher Data to a Distribution Server: the DicoPro, DicoEast and RERO Projects.
L02-1073
: Claudia Kunze; Lothar Lemnitzer
GermaNet - representation, visualization, application.
L02-1074
: Petra Geutner; Frank Steffens; Dietrich Manstetten
Design of the VICO Spoken Dialogue System: Evaluation of User Expectations by Wizard-of-Oz Experiments.
L02-1075
: Nadia Mana; Ornella Corazzari
The Lexico-semantic Annotation of an Italian Treebank
L02-1076
: Bernardo Magnini; Matteo Negri; Roberto Prevete; Hristo Tanev
Towards Automatic Evaluation of Question/Answering Systems
L02-1077
: Martin Rajman; Anthony Hartley
Automatic Ranking of MT Systems.
L02-1078
: Luisa Bentivogli; Emanuele Pianta
Opportunistic Semantic Tagging.
L02-1079
: Petr Pollák; Václav Hanl
Tool for Czech Pronunciation Generation Combining Fixed Rules with Pronunciation Lexicon and Lexicon Management Tool
L02-1080
: Tony Rose; Mark Stevenson; Miles Whitehead
The Reuters Corpus Volume 1 -from Yesterday's News to Tomorrow's Language Resources.
L02-1081
: Tilly Dutilh; Truus Kruyt
Implementation and Evaluation of PAROLE PoS in a National Context
L02-1082
: Zdeněk abokrtský; Petr Sgall; Sao Deroski
A Machine Learning Approach to Automatic Functor Assignment in the Prague Dependency Treebank.
L02-1083
: Carole Tiberius
How to build a multilingual inheritance-based lexicon.
L02-1084
: Carole Tiberius; Dunstan Brown; Greville Corbett.
A typological database of agreement
L02-1085
: Jimmy Lin
The Web as a Resource for Question Answering: Perspectives and Challenges.
L02-1086
: Mitsuo Shimohata; Eiichiro Sumita
Automatic paraphrasing based on parallel corpus for normalization.
L02-1087
: Gianni Lazzari
Speech to Speech Translation: Present and Future Challenges.
L02-1088
: Ivan Kopeček; Karel Pala
Databases of Heterogeneous Segments for Concatenative Speech Synthesis.
L02-1089
: Andrej gank; Zdravko Kačič; Bogomir Horvat
Preliminary Evaluation of Slovenian Mobile Database PoliDat.
L02-1090
: Thierry Poibeau; Dominique Dutoit; Sophie Bizouard
Evaluating resource acquisition tools for Information Extraction.
L02-1091
: Dominique Dutoit; Pierre Nugues
An Algorithm to Find Words from Definitions.
L02-1092
: Algimantas Rudzionis; Vytautas Rudzionis
Lithuanian Speech Database LTDIGITS
L02-1093
: Olivier Ferret; Christian Fluhr; Françoise Rousseau-Hans; Jean-Luc Simoni
Building domain specific lexical hierarchies from corpora.
L02-1094
: Walter Daelemans; Véronique Hoste
Evaluation of Machine Learning Methods for Natural Language Processing Tasks.
L02-1095
: Tristan Van Rullen; Philippe Blache
An evaluation of different symbolic shallow parsing techniques.
L02-1096
: Jeska Buhmann; Johanneke Caspers; Vincent J. van Heuven; Heleen Hoekstra; Jean-Pierre Martens; Marc Swerts
Annotation of prominent words, prosodic boundaries and segmental lengthening by non-expert transcribers in the Spoken Dutch Corpus.
L02-1097
: Jean-Pierre Martens; Diana Binnenpoorte; Kris Demuynck; Ruben Van Parys; Tom Laureys; Wim Goedertier; Jacques Duchateau
Word Segmentation in the Spoken Dutch Corpus.
L02-1098
: Nelleke Oostdijk; Wim Goedertier; Frank van Eynde; Louis Boves; Jean-Pierre Martens; Michael Moortgat; Harald Baayen
Experiences from the Spoken Dutch Corpus Project.
L02-1099
: George Mikros
Quantitative parameters in corpus design: Estimating the optimum text size in Modern Greek language.
L02-1100
: Pierrette Bouillon; Vincent Claveau; Cécile Fabre; Pascale Sébillot
Acquisition of Qualia Elements from Corpora - Evaluation of a Symbolic Learning Method
L02-1101
: Michelina Savino; Mario Refice; Domenico Daleno
Methods and Tools for Prosodic Analysis of a Spoken Italian Corpus.
L02-1102
: Oliver Lemon; Alexander Gruenstein
Language Resources for Multi-Modal Dialogue Systems.
L02-1103
: Dominic Widdows; Beate Dorow; Chiu-Ki Chan
Using Parallel Corpora to enrich Multilingual Lexical Resources.
L02-1104
: Ariadna Font Llitjós; Alan W Black
Evaluation and collection of proper name pronunciations online.
L02-1105
: Toshifumi Tanabe; Yasuo Koyama; Kenji Yoshimura; Kosho Shudo
Modal Expressions in Natural Language Sentence and Their Similarity.
L02-1106
: Alex Alsina; Toni Badia; Gemma Boleda; Stefan Bott; Àngel Gil; Martí Quixal; Oriol Valentín
CATCG: a general purpose parsing tool applied.
L02-1107
: Alexander Raake
Does the Content of Speech Influence its Perceived Sound Quality?.
L02-1108
: Monica Ward
Issues in the design, construction and use of Language Resources (LR) for Endangered Languages (Els).
L02-1109
: Aoife Cahill; Josef van Genabith
TTS - A Treebank Tool Suite.
L02-1110
: Rudolf Muhr; Robert Hölrdich; Eva Wächter-Kollpache
The Pronouncing Dictionary of Austrian German and the other Major Varieties of German - A Phonetic Resources Database on the Pronunciation of German.
L02-1111
: Pascale Nicolas; Sabine Letellier-Zarshenas; Igor Schadle; Jean-Yves Antoine; Jean Caelen
Towards a large corpus of spoken dialogue in French that will be freely available: the "Parole Publique" project and its first realisations.
L02-1112
: Steve Cassidy
XQuery as an Annotation Query Language: a Use Case Analysis.
L02-1113
: Constantin Orasan; Ramesh Krishnamurthy
A corpus-based investigation of junk emails.
L02-1114
: Constantin Orasan
Building annotated resources for automatic text summarisation.
L02-1115
: Lynne Bowker; Peter Bennison
Translation Tracking System: A tool for managing translation archives.
L02-1116
: Ilona Steiner; Laura Kallmeyer
VIQTORYA -- A Visual Query Tool for Syntactically Annotated Corpora
L02-1117
: Massimo Poesio; Tomonori Ishikawa; Sabine Schulte im Walde; Renata Vieira
Acquiring Lexical Knowledge for Anaphora Resolution.
L02-1118
: Mike Maxwell
Resources for Morphology Learning and Evaluation.
L02-1119
: Chikashi Nobata; Satoshi Sekine; Hitoshi Isahara; Ralph Grishman
Summarization System Integrated with Named Entity Tagging and IE pattern Discovery.
L02-1120
: Satoshi Sekine; Kiyoshi Sudo; Chikashi Nobata
Extended Named Entity Hierarchy.
L02-1121
: Nick Campbell
Recording techniques for capturing natural every-day speech.
L02-1122
: Kenji Matsumoto; Hideki Tanaka
Automatic Alignment of Japanese and English Newspaper Articles using an MT System and a Bilingual Company Name Dictionary.
L02-1123
: Satoshi Shirai; Kazuhide Yamamoto; Francis Bond; Hozumi Tanaka
Towards a Thesaurus of Predicates.
L02-1124
: Yong-Ju Lee; Bong-Wan Kim; Yongnam Um
Speech Information Technology & Industry Promotion Center in Korea: Activities and Directions.
L02-1125
: Oren Gedge; Christophe Couvreur; Klaus Linhard; Shaunie Shammass; Ami Moyal
Database Adaptation for Speech Recognition in Cross-Environmental Conditions.
L02-1126
: Manolis Maragoudakis; Katia Kermanidis; Nikos Fakotakis; George Kokkinakis
Combining Bayesian and Support Vector Machines Learning to automatically complete Syntactical Information for HPSG-like Formalisms.
L02-1127
: Keiji Yasuda; Fumiaki Sugaya; Toshiyuki Takezawa; Seiichi Yamamoto; Masuzo Yanagida
Automatic machine translation selection scheme to output the best result.
L02-1128
: Aristomenis Thanopoulos; Nikos Fakotakis; George Kokkinakis
Comparative Evaluation of Collocation Extraction Metrics.
L02-1129
: Christophe Laprun; Jonathan G. Fiscus; John Garofolo; Sylvain Pajot
A Pratical Introduction to ATLAS
L02-1130
: John Garofolo; Jonathan G. Fiscus; Alvin Martin; David Pallett; Mark Przybocki
NIST Rich Transcription 2002 Evaluation: A Preview.
L02-1131
: Paloma Martínez; Ana García-Serrano; Alberto Ruiz-Cristina
Integrating Spanish Linguistic Resources in a Web Site Assistant.
L02-1132
: Angelo Dalli
Creation and Evaluation of Extensible Language Resources for Maltese.
L02-1133
: Gregory Grefenstette; Yan Qu; David A. Evans
Expanding lexicons by inducing paradigms and validating attested forms.
L02-1134
: Taro Watanabe; Mitsuo Shimohata; Eiichiro Sumita
Statistical Machine Translation on Paraphrased Corpora.
L02-1135
: Alejandro Bia; Manuel Sánchez Quero
Building ancient Spanish dictionaries for spell-checking of DL texts
L02-1136
: Hideki Kashioka
Translation Unit Concerning Timing of Simultaneous Translation.
L02-1137
: Masumi Narita; Kazuya Kurokawa; Takehito Utsuro
A Web-based English Abstract Writing Tool Using a Tagged E-J Parallel Corpus.
L02-1138
: Ricardo Ribeiro; Luís Oliveira; Isabel Trancoso
Morphosyntactic Disambiguation for TTS Systems.
L02-1139
: Charles J. Fillmore; Collin F. Baker; Hiroaki Sato
Seeing Arguments through Transparent Structures.
L02-1140
: Charles J. Fillmore; Collin F. Baker; Hiroaki Sato
The FrameNet Database and Software Tools.
L02-1141
: Xiaoyi Ma; Haejoong Lee; Steven Bird; Kazuaki Maeda
Models and Tools for Collaborative Annotation
L02-1142
: Doroteo Torre Toledano; Luis A. Hernández Gómez
HMMs for Automatic Phonetic Segmentation
L02-1143
: Helen Wright Hastie; Rashmi Prasad; Marilyn Walker
Automatic Evaluation: Using a DATE Dialogue Act Tagger for User Satisfaction and Task Completion Prediction.
L02-1144
: Nuria Bel; Javier Caminero; Luis Hernández; Montserrat Marimón; José F. Morlesín; Josep M. Otero; José Relaño; M. Carmen Rodríguez; Pedro M. Ruz; Daniel Tapias
Design and Evaluation of a SLDS for E-Mail Access through the Telephone.
L02-1145
: Ann Copestake; Fabre Lambeau; Aline Villavicencio; Francis Bond; Timothy Baldwin; Ivan A. Sag; Dan Flickinger
Multiword expressions: linguistic precision and reusability.
L02-1146
: Keita Tsuji; Beatrice Daille; Kyo Kageura
Extracting French-Japanese Word Pairs from Bilingual Corpora based on Transliteration Rules.
L02-1147
: Elaine Uí Dhonnchadha
A Two-level Morphological Analyser and Generator for Irish using Finite-State Transducers.
L02-1148
: Smaranda Muresan; Judith Klavans
A Method for Automatically Building and Evaluating Dictionary Resources.
L02-1149
: Violeta Seretan; Dan Cristea
The Use of Referential Constraints in Structuring Discourse.
L02-1150
: Kiyoaki Shirai
Construction of a Word Sense Tagged Corpus for SENSEVAL-2 Japanese Dictionary Task.
L02-1151
: Chung-hye Han; Na-Rare Han; Eon-Suk Ko; Martha Palmer
Development and Evaluation of a Korean Treebank and its Application to NLP
L02-1152
: Alexandra Kinyon; Carlos A. Prolo
Identifying Verb Arguments and their Syntactic Function in the Penn Treebank.
L02-1153
: Parham Mokhtari; Nick Campbell
Automatic Detection of Acoustic Centres of Reliability for Tagging Paralinguistic Information in Expressive Speech.
L02-1154
: Jong-Hoon Oh; Saim Shin; Yong-Seok Choi; Key-Sun Choi
Word Sense Disambiguation with Information Retrieval Technique.
L02-1155
: N. Minematsu; Y. Tomiyama; K. Yoshimoto; K. Shimizu; S. Nakagawa; M. Dantsuji; S. Makino
English Speech Database Read by Japanese Learners for CALL System Development.
L02-1156
: Erica Costantini; Susanne Burger; Fabio Pianesi
NESPOLE!s Multilingual and Multimodal Corpus.
L02-1157
: Horacio Saggion; Hamish Cunningham; Diana Maynard; Kalina Bontcheva; Oana Hamza; Christian Ursu; Yorick Wilks
Extracting Information for Automatic Indexing of Multimedia Material.
L02-1158
: Horacio Saggion; Dragomir Radev; Simone Teufel; Wai Lam; Stephanie M. Strassel
Developing Infrastructure for the Evaluation of Single and Multi-document Summarization Systems in a Cross-lingual Environment.
L02-1159
: Harris Papageorgiou; Prokopis Prokopidis; Voula Giouli; Iason Demiros; Alexis Konstantinidis; Stelios Piperidis
Multi-level XML-based Corpus Annotation.
L02-1160
: Harald Höge
Project Proposal TC-STAR - Make Speech to Speech Translation Real
L02-1161
: Igor Boguslavsky; Ivan Chardin; Svetlana Grigorieva; Nikolai Grigoriev; Leonid Iomdin; Leonid Kreidlin; Nadezhda Frid
Development of a Dependency Treebank for Russian and its Possible Applications in NLP.
L02-1162
: Stefan Rapp; Michael Strube
An Iterative Data Collection Approach for Multimodal Dialogue Systems.
L02-1163
: Carol Peters; Martin Braschler
The Importance of Evaluation for Cross-Language System Development: the CLEF Experience.
L02-1164
: R. Muñoz; R. Mitkov; M. Palomar; J. Peral; R. Evans; L. Moreno; C. Orasan; M. Saiz-Noeda; A. Ferrández; C. Barbu; P. Martínez-Barco; A. Suárez
Bilingual alignment of anaphoric expressions.
L02-1165
: Heiki-Jaan Kaalep; Kadri Muischnek
Using the Text Corpus to Create a Comprehensive List of Phrasal Verbs
L02-1166
: Diana Raileanu; Paul Buitelaar; Spela Vintar; Jörg Bay
Evaluation Corpora for Sense Disambiguation in the Medical Domain.
L02-1167
: pela Vintar; Paul Buitelaar; Bärbel Ripplinger; Bogdan Sacaleanu; Diana Raileanu; Detlef Prescher
An Efficient and Flexible Format for Linguistic and Semantic Annotation.
L02-1168
: Markéta Straňáková-Lopatková; Zdenĕk abokrtský
Valency Dictionary of Czech Verbs: Complex Tectogrammatical Annotation.
L02-1169
: Darren Pearce
A Comparative Evaluation of Collocation Extraction Techniques
L02-1170
: Marko Tadić
Building the Croatian National Corpus.
L02-1171
: Thorsten Trippel; Dafydd Gibbon
Annotation Driven Concordancing: the PAX Toolkit.
L02-1172
: Katia Lida Kermanidis; Nikos Fakotakis; George Kokkinakis
DELOS: An Automatically Tagged Economic Corpus for Modern Greek.
L02-1173
: Henk van den Heuvel; Khalid Choukri; Harald Höge
Give me a bug. a framework for a bug report service
L02-1174
: Vladimir Hozjan; Zdravko Kacic; Asunción Moreno; Antonio Bonafonte; Albino Nogueiras
Interface Databases: Design and Collection of a Multilingual Emotional Speech Database.
L02-1175
: Vladimir Hozjan; Zdravko Kacic
Objective analysis of emotional speech for English and Slovenian Interface emotional speech databases.
L02-1176
: Konstantin Biatov; Joachim Köhler
Methods and Tools for Speech Data Acquisition exploiting a Database of German Parliamentary Speeches and Transcripts from the Internet.
L02-1177
: Dorota Iskra; Beate Grosskopf; Krzysztof Marasek; Henk van den Heuvel; Frank Diehl; Andreas Kiessling
SPEECON Speech Databases for Consumer Devices: Database Specification and Validation.
L02-1178
: James Dowdall; Michael Hess; Neeme Kahusk; Kaarel Kaljurand; Mare Koit; Fabio Rinaldi; Kadri Vider
Technical Terminology as a Critical Resource.
L02-1179
: Eva Anna Lenz; Angelika Storrer
Converting a Corpus into a Hypertext: An Approach Using XML Topic Maps and XSLT.
L02-1180
: Rickard Domeij; Ola Knutsson; Kerstin Severinson Eklundh
Different Ways of Evaluating a Swedish Grammar Checker.
L02-1181
: Antonio Moreno Ortiz; Victor Raskin; Sergei Nirenburg
New Developments in Ontological Semantics.
L02-1182
: Amalia Todirascu; Eric Kow; Laurent Romary
Towards Reusable NLP Components.
L02-1183
: Judita Preiss; Anna Korhonen; Ted Briscoe
Subcategorization Acquisition as an Evaluation Method for WSD.
L02-1184
: Shoichiro Hara; Hisashi Yasunaga
Resource Sharing System for Humanity Researches
L02-1185
: A. Benabbou; N. Chenfour; A. Mouradi
Study and quantification of the declination for the Arabic speech synthesis system PARADIS.
L02-1186
: Matej Rojc; Zdravko Kačič; Darinka Verdonik
Design and Implementation of the Slovenian Phonetic and Morphology Lexicons for the Use in Spoken Language Applications.
L02-1187
: Nabil Hathout; Ludovic Tanguy
Webaffix: Discovering Morphological Links on the WWW.
L02-1188
: Natalia V. Loukachevitch; Boris V. Dobrov
Evaluation of Thesaurus on Sociopolitical Life as Information-Retrieval Tool.
L02-1189
: Nabil Hathout
From WordNet to CELEX: acquiring morphological links from dictionaries of synonyms.
L02-1190
: Ulrich Heid; Bettina Säuberlich; Arne Fitschen
Using Descriptive Generalisations in the Acquisition of Lexical Data for Word Formation.
L02-1191
: Masayuki Asahara; Ryuichi Yoneda; Akiko Yamashita; Yasuharu Den; Yuji Matsumoto
Use of XML and Relational Databases for Consistent Development and Maintenance of Lexicons and Annotated Corpora.
L02-1192
: Laura Pecchia; Giuseppe Cappelli; Elisabetta Guazzini
Linguistic and Computational Problems for the Creation of an Italian Children's Corpus of Spoken Language.
L02-1193
: Lars Ahrenberg ; Mikael Andersson; Magnus Merkel
A System for Incremental and Interactive Word Linking.
L02-1194
: Kiril Ribarov
Old Sources and Modern Procedures: Computer Processing of Old-Church Slavonic.
L02-1195
: José Miguel Aguilar Río
Compiling an Interactive Literary Translation Web Site for Education Purposes.
L02-1196
: Thierry Hamon; Olivier Hû
How to evaluate necessary cooperative systems of terminology building?.
L02-1197
: Nilda Ruimy; Monica Monachini; Raffaella Distante; Elisabetta Guazzini; Stefano Molino; Marisa Ulivieri; Nicoletta Calzolari; Antonio Zampolli
CLIPS, a Multi-level Italian Computational Lexicon: a Glimpse to Data.
L02-1198
: Susanne Salmon-Alt; Renata Vieira
Nominal Expressions in Multilingual Corpora: Definites and Demonstratives.
L02-1199
: Jerker Järborg; Dimitrios Kokkinakis; Maria Toporowska Gronostaj
Lexical and Textual Resources for Sense Recognition and Description.
L02-1200
: X. Artola; A. Díaz de Ilarraza; N. Ezeiza; K. Gojenola; G. Hernández; A. Soroa
A Class Library for the Integration of NLP Tools: Definition and implementation of an Abstract Data Type Collection for the manipulation of SGML documents in a context of stand-off linguistic annotation
L02-1201
: Csaba Oravecz; Péter Dienes
Efficient Stochastic Part-of-Speech Tagging for Hungarian.
L02-1202
: Hannah Kermes; Stefan Evert
YAC - A Recursive Chunker for Unrestricted German Text.
L02-1203
: Koji Eguchi; Kazuko Kuriyama; Noriko Kando
Sensitivity of IR systems Evaluation to Topic Difficulty.
L02-1204
: Brian Mitchell; Robert Gaizauskas
A Comparison of Machine Learning Algorithms for Prepositional Phrase Attachment.
L02-1205
: Dan Cristea; Oana-Diana Postolache; Gabriela-Eugenia Dima; Cătălina Barbu
AR-Engine - a framework for unrestricted co-reference resolution.
L02-1206
: Cătălina Barbu; Richard Evans; Ruslan Mitkov
A corpus based investigation of morphological disagreement in anaphoric relations.
L02-1207
: Cătălina Barbu
Error analysis in anaphora resolution.
L02-1208
: Jean-Yves Antoine; Caroline Bousquet-Vernhettes; Jérôme Goulian; Mohamed Zakaria Kurdi; Sophie Rosset; Nadine Vigouroux; Jeanne Villaneau
Predictive and objective evaluation of speech understanding: the challenge evaluation campaign of the I3 speech workgroup of the French CNRS.
L02-1209
: Michael Moortgat; Richard Moot
Using the Spoken Dutch Corpus for type-logical grammar induction.
L02-1210
: Bolette S. Pedersen; Patrizia Paggio
Semantic Lexical Resources Applied to Content-based Querying - the OntoQuery Project.
L02-1211
: Georgios Petasis; Vangelis Karkaletsis; Georgios Paliouras; Ion Androutsopoulos; Constantine D. Spyropoulos
Ellogon: A New Text Engineering Platform.
L02-1212
: Antonio S. Valderrábanos; Alexander Belskis; Luis Iraola Moreno
Multilingual Terminology Extraction and Validation.
L02-1213
: Laila Dybkjær; Niels Ole Bernsen
Natural Interactivity Resources Data, Annotation Schemes and Tools.
L02-1214
: Niels Ole Bernsen; Laila Dybkjær; Mykola Kolodnytsky
THE NITE WORKBENCH. A Tool for Annotation of Natural Interactivity and Multimodal Data
L02-1215
: Valentin Tablan; Cristian Ursu; Kalina Bontcheva; Hamish Cunningham; Diana Maynard; Oana Hamza; Tony McEnery; Paul Baker; Mark Leisher
A Unicode-based Environment for Creation and Use of Language Resources.
L02-1216
: Dimitra Farmakiotou; Vangelis Karkaletsis; Ioannis Koutsias; George Petasis; Constantine D. Spyropoulos
PatEdit: An Information Extraction Pattern Editor for Fast System Customization.
L02-1217
: Tamás Váradi
The Hungarian National Corpus.
L02-1218
: Paul Clough; Robert Gaizauskas; S. L. Piao
Building and annotating a corpus for the study of journalistic text reuse
L02-1219
: Hennie Brugman; Harriet Spenke; Markus Kramer; Alexander Klassmann
Multimedia Annotation with Multilingual Input Methods and Search Support.
L02-1220
: P. Wittenburg; W. Peters; S. Drude
Analysis of Lexical Structures from Field Linguistics and Language Engineering.
L02-1221
: P. Wittenburg; U. Mosel; A. Dwyer
Methods of Language Documentation in the DOBES project.
L02-1222
: P. Wittenburg; W. Peters; D. Broeder
Metadata Proposals for Corpora and Lexica.
L02-1223
: P. Wittenburg; St. Levinson; S. Kita; H. Brugman
Multimodal Annotations in Gesture and Sign Language Studies
L02-1224
: Daan Broeder; Freddy Offenga; Don Willems
Metadata Tools Supporting Controlled Vocabulary Services.
L02-1225
: Daan Broeder; Peter Wittenburg; Thierry Declerck; Laurent Romary
LREP: A Language Repository Exchange Protocol.
L02-1226
: Caroline Hagège; Claude Roux
A Robust and Flexible Platform for Dependency Extraction.
L02-1227
: Klaus-Dirk Schmitz
Subject-field-specific Ontologies and Terminologies for the Web Community.
L02-1228
: Steve Whittaker; Marilyn Walker; Johanna Moore
Fish or Fowl:A Wizard of Oz Evaluation of Dialogue Strategies in the Restaurant Domain.
L02-1229
: Adriana Roventini; Marisa Ulivieri; Nicoletta Calzolari
Integrating Two Semantic Lexicons, SIMPLE and ItalWordNet: What Can We Gain?.
L02-1230
: Rita Marinelli; Adriana Roventini
Proper Names In A Semantic Database.
L02-1231
: Leonardo Lesmo; Vincenzo Lombardo
Transformed Subcategorization Frames in Chunk Parsing.
L02-1232
: Gabriela Cavaglià
Measuring corpus homogeneity using a range of measures for inter-document distance.
L02-1233
: Claire Grover; Scott McDonald; Donnla Nic Gearailt; Vangelis Karkaletsis; Dimitra Farmakiotou; Georgios Samaritakis; Georgios Petasis; Maria Teresa Pazienza; Michele Vindigni; Frantz Vichot; Francis Wolinski
Multilingual XML-Based Named Entity Recognition for E-Retail Domains.
L02-1234
: Janienke Sturm; Ilse Bakx; Bert Cranen; Jacques Terken; Fusi Wang
Usability Evaluation of a Dutch Multimodal System for Train Timetable Information.
L02-1235
: Katerina Pastra; Diana Maynard; Oana Hamza; Hamish Cunningham; Yorick Wilks
How feasible is the reuse of grammars for Named Entity Recognition?.
L02-1236
: Claudia Soria; Niels Ole Bernsen; Niels Cadée; Jean Carletta; Laila Dybkjær; Stefan Evert; Ulrich Heid; Amy Isard; Mykola Kolodnytsky; Christoph Lauer; Wolfgang Lezius; Lucas P.J.J. Noldus; Vito Pirrelli; Norbert Reithinger; Andreas Vögele
Advanced Tools for the Study of Natural Interactivity.
L02-1237
: Roldano Cattoni; Morena Danieli; Vanessa Sandrini; Claudia Soria
ADAM: The SI-TAL Corpus of Annotated Dialogues.
L02-1238
: Jason Baldridge; John Dowding; Susana Early
Leo: an Architecture for Sharing Resources for Unification-Based Grammars.
L02-1239
: Irena Spasić; Goran Nenadić; Sophia Ananiadou
Tuning Context Features with Genetic Algorithms.
L02-1240
: Goran Nenadić; Irena Spasić; Sophia Ananiadou
Automatic Acronym Acquisition and Term Variation Management within Domain-Specific Texts.
L02-1241
: Sanni Nimb
Adverbs in Semantic Lexica for NLP - The extension of the Danish SIMPLE lexicon with Time Adverbs.
L02-1242
: Anna Sågvall Hein; Eva Forsbom; Jörg Tiedemann; Per Weijnitz; Ingrid Almqvist; Leif-Jöran Olsson; Sten Thaning
Scaling Up an MT Prototype for Industrial Use - Databases and Data Flow.
L02-1243
: Xavier Carreras; Lluís Padró
A Flexible Distributed Architecture for Natural Language Analyzers.
L02-1244
: Eugenio Picchi; Eva Sassolini; Ouafae Nahli; Sebastiana Cucurullo; M. Isabel Vargas
Italian arabic linguistic tools.
L02-1245
: Christopher Cieri; Mark Liberman
Language Resource Creation and Distribution at the Linguistic Data Consortium: A Progress Report.
L02-1246
: Claudia Sassen; Dafydd Gibbon
Enhanced Dialogue Markup for Crisis Talk Scenario Resources
L02-1247
: Jörg Tiedemann
MatsLex - a Multilingual Lexical Database for Machine Translation
L02-1248
: Maria Rzewuska
Terminology Resources in the Context of a Major Translation Project.
L02-1249
: Hartmut R. Pfitzinger
Reducing Segmental Duration Variation by Local Speech Rate Normalization of Large Spoken Language Resources.
L02-1250
: Ted Briscoe; John Carroll
Robust Accurate Statistical Annotation of General Text.
L02-1251
: Catia Cucchiarini; Elisabeth D'Halleweyn; Lisanne Teunissen
A Human Language Technologies Platform for the Dutch language: awareness, management maintenance and distribution.
L02-1252
: D. Binnenpoorte; F. De Vriend; J. Sturm; W. Daelemans; H. Strik; C. Cucchiarini
A Field Survey for Establishing Priorities in the Development of HLT Resources for Dutch.
L02-1253
: Ana M. García-Serrano; Luis Rodrigo-Aguado; Javier Calle
Natural Language Dialogue in a Virtual Assistant Interface.
L02-1254
: Jesús Cardeñosa; Edmundo Tovar; Carolina Gallardo
The UNL System.
L02-1255
: Dieter Maas; Nuebel Rita; Catherine Pease; Paul Schmidt
Bilingual Indexing for Information Retrieval with AUTINDEX.
L02-1256
: Michael Rosner
The Future of Maltilex.
L02-1257
: Nicoletta Calzolari; Ralph Grishman; Marta Palmer
Standards & best practice for multilingual computational lexicons: ISLE MILE and more
L02-1258
: Sue Atkins; Nuria Bel; Francesca Bertagna; Pierrette Bouillon; Nicoletta Calzolari; Christiane Fellbaum; Ralph Grishman; Alessandro Lenci; Catherine MacLeod; Martha Palmer; Gregor Thurmair; Marta Villegas; Antonio Zampolli
From Resources to Applications. Designing the Multilingual ISLE Lexical Entry.
L02-1259
: Nicoletta Calzolari; Charles J. Fillmore; Ralph Grishman; Nancy Ide; Alessandro Lenci; Catherine MacLeod; Antonio Zampolli
Towards Best Practice for Multiword Expressions in Computational Lexicons.
L02-1260
: Alessandro Lenci; Roberto Bartolini; Nicoletta Calzolari; Ana Agua; Stephan Busemann; Emmanuel Cartier; Karine Chevreau; José Coch
Multilingual Summarization by Integrating Linguistic Resources in the MLIS-MUSI Project.
L02-1261
: Anna Braasch
Current Developments of STO - the Danish Lexicon Project for NLP and HLT Applications.
L02-1262
: Robert E. Frederking; Alan W Black; Ralf D. Brown; John Moody; Eric Steinbrecher
Field Testing the Tongues Speech-to-Speech Machine Translation System.
L02-1263
: Julia Hockenmaier; Mark Steedman
Acquiring Compact Lexicalized Grammars from a Cleaner Treebank.
L02-1264
: F. de Vriend; P.A. Coppen; W. Haeseryn
Using Grammatical Description as a Metalanguage Resource.
L02-1265
: Hélèn François; Olivier Boëffard
The Greedy Algorithm and its Application to the Construction of a Continuous Speech Database.
L02-1266
: Martine Hurault-Plantet; Laura Monceaux
Cooperation between black box and glass box approaches for the evaluation of a question answering system.
L02-1267
: Adán Cassán; Sergi Cervell; Mireia Colom; Rafael Marín; Josep M. Merenciano; Gema Pérez; Lluís Valentín
BDCon: A Spanish knowledge database.
L02-1268
: Adán Cassán; Sergi Cervell; Mireia Colom; Rafael Marín; Josep M. Merenciano; Gema Pérez; Lluís Valentín
A step forward to hypertext.
L02-1269
: Asunción Moreno; Oren Gedge; Henk van den Heuvel; Harald Höge; Sabine Horbach; Patricia Martin; Elisabeth Pinto; Antonio Rincón; Franco Senia; Rafid Sukkar
SpeechDat across all America: SALA II.
L02-1270
: Maya Ando; Jun Okamoto; Shun Ishizaki
Extraction of Associative Attributes from Nouns and Quantitative Expression of Prototype Concept.
L02-1271
: Martin Wynne
The Language Resource Archive of the 21st Century.
L02-1272
: Nordine Fourour; Emmanuel Morin; Béatrice Daille
Incremental Recognition and Referential Categorization of French Proper Names.
L02-1273
: Shigeki Matsubara; Akira Takagi; Nobuo Kawaguchi; Yasuyoshi Inagaki
Bilingual Spoken Monologue Corpus for Simultaneous Machine Interpretation Research.
L02-1274
: Marcela Charfuelán; Luis Hernández Gómez; Cristina Esteban López; Holmer Hemsen
A XML-based tool for evaluation of SLDS.
L02-1275
: K. López de Ipiña; N. Ezeiza; G. Bordel
Automatic Morphological Segmentation for Continuous Speech Recognition of Basque.
L02-1276
: Richard F. E. Sutcliffe; Kieran White
Searching via Keywords or Concept Hierarchies - Which is Better?.
L02-1277
: Juliana Galvani Greghi; Ronaldo Teixeira Martins; Maria das Graças Volpe Nunes
DIADORIM - A Lexical Database for Brazilian Portuguese.
L02-1278
: Mónica Caballero; José B. Mariño; Asunción Moreno
Multidialectal Spanish Modeling for ASR.
L02-1279
: Paola Monachesi; Alexis Dimitriadis; Rob Goedemans; Anne-Marie Mineur
A unified system for accessing typological databases.
L02-1280
: Klára Osolsobĕ; Karel Pala; Radek Sedláček; Marek Veber
A Procedure for Word Derivational Processes Concerning Lexicon Extension in Highly Inflected Languages
L02-1281
: Heli Uibo
Experimental Two-Level Morphology of Estonian.
L02-1282
: Marianne Dabbadie; Widad Mustafa El Hadi; Ismaïl Timimi
Terminological Enrichment for non-Interactive MT Evaluation
L02-1283
: Paul Kingsbury; Martha Palmer
From TreeBank to PropBank.
L02-1284
: Almudena Ballester; Ángel Martín Municio; Fernando Pardos; Jordi Porta Zamorano; Rafael J. Ruiz Ureña; Fernando Sánchez León
Combining statistics on n-grams for automatic term recognition.
L02-1285
: Steven Bird; Kazuaki Maeda; Xiaoyi Ma; Haejoong Lee; Beth Randall; Salim Zayat
TableTrans, MultiTrans, InterTrans and TreeTrans: Diverse Tools Built on the Annotation Graph Toolkit.
L02-1286
: Kazauki Maeda; Steven Bird; Xiaoyi Ma; Haejoong Lee
Creating Annotation Tools with the Annotation Graph Toolkit.
L02-1287
: Nobuo Kawaguchi; Shigeki Matsubara; Kazuya Takeda; Fumitada Itakura
Multi-Dimensional Data Acquisition for Integrated Acoustic Information Research.
L02-1288
: Laurence Devillers; Sophie Rosset; Hélèn Bonneau-Maynard; Lori Lamel
Annotations for Dynamic Diagnosis of the Dialog State
L02-1289
: Jean-Claude Martin; Michael Kipp
Annotating and Measuring Multimodal Behaviour Tycoon Metrics in the Anvil Tool.
L02-1290
: Emanuela Cresti; Massimo Moneglia; Fernanda Bacelar do Nascimento; Antonio Moreno Sandoval; Jean Veronis; Philippe Martin; Kalid Choukri; Valerie Mapelli; Daniele Falavigna; Antonio Cid; Claude Blum
The C-ORAL-ROM Project. New methods for spoken language archives in a multilingual romance corpus.
L02-1291
: Christopher Cieri; Mark Liberman
TIDES Language Resources: A Resource Map for Translingual Information Access.
L02-1292
: Stefan Eickeler; Martha Larson; Wolff Rüter; Joachim Köhler
Creation of an Annotated German Broadcast Speech Database for Spoken Document Retrieval.
L02-1293
: Jan-Torsten Milde; Ulrike Gut
The TASX-environment: an XML-based toolset for time aligned speech corpora
L02-1294
: Scott Cotton; Steven Bird
An integrated framework for treebanks and multilayer annotations
L02-1295
: Florence Duclaye; François Yvon; Olivier Collin
Using the Web as a Linguistic Resource for Learning Reformulations Automatically.
L02-1296
: Christoph Müller; Michael Strube
An API for Discourse-level Access to XML-encoded Corpora.
L02-1297
: Hidetsugu Nanba; Manabu Okumura
Some Examinations of Intrinsic Methods for Summary Evaluation Based on the Text Summarization Challenge (TSC).
L02-1298
: Mohamed-Zakaria Kurdi; Mohamed Ahafhaf
Toward an objective and generic Method for Spoken Language Understanding Systems Evaluation: an extension of the DCR method.
L02-1299
: Gerhard Heyer; Uwe Quasthoff; Christian Wolff
Information Extraction from Text Corpora: Using Filters on Collocation Sets.
L02-1300
: Christopher Cieri; Stephanie Strassel
The DASL Project: a Case Study in Data Re-Annotation and Re-Use.
L02-1301
: Dragomir R. Radev; Hong Qi; Harris Wu; Weiguo Fan
Evaluating Web-based Question Answering Systems.
L02-1302
: Daisuke Kawahara; Sadao Kurohashi; Kôiti Hasida
Construction of a Japanese Relevance-tagged Corpus.
L02-1303
: Nancy Ide; Randi Reppen; Keith Suderman
The American National Corpus: More Than the Web Can Provide.
L02-1304
: Craig Martell
FORM: An Extensible, Kinematically-based Gesture Annotation Scheme.
L02-1305
: Toshiyuki Takezawa; Eiichiro Sumita; Fumiaki Sugaya; Hirofumi Yamamoto; Seiichi Yamamoto
Toward a Broad-coverage Bilingual Corpus for Speech Translation of Travel Conversations in the Real World.
L02-1306
: Michelle Vanni; Keith Miller
Scaling the ISLE Framework: Use of Existing Corpus Resources for Validation of MT Evaluation Metrics across Languages.
L02-1307
: Le An Ha
Learning description of term patterns using glossary resources.
L02-1308
: Barbara Di Eugenio; Michael Glass; Michael J. Scott
The binomial cumulative distribution function, or, is my system better than yours?.
L02-1309
: Fumiaki Suyaga; Toshiyuki Takezawa; Genichiro Kikui; Seiichi Yamamoto
Proposal of a very-large-corpus acquisition method by cell-formed registration.
L02-1310
: Rada F. Mihalcea
Bootstrapping Large Sense Tagged Corpora.
L02-1311
: Takano Ogino; Hitoshi Isahara; Kazuhiro Kobayashi
The Valence Patterns of Japanese Verbs Extracted From The EDR Corpus.
L02-1312
: Jean-Claude Martin; Jean-Hugues Réty; Nelly Bensimon
Multimodal and Adaptative Pedagogical Resources.
L02-1313
: Timothy Baldwin; Slaven Bilac; Ryo Okumura; Takenobu Tokunaga; Hozumi Tanaka
Enhanced Japanese Electronic Dictionary Look-up.
L02-1314
: Romaric Besançon; Martin Rajman
Evaluation of a Vector Space Similarity Measure in a Multilingual Framework.
L02-1315
: Serge A. Yablonsky
Corpora as Object-Oriented System. From UML-notation to Implementation.
L02-1316
: Roberto Bartolini; Alessandro Lenci; Simonetta Montemagni; Vito Pirrelli
The Lexicon-Grammar Balance in Robust Parsing of Italian.
L02-1317
: Daniel Jung
Humans as Corpus - Language Learning Strategies in Virtually Mediated Authentic Environments.
L02-1318
: Atsuko; Koizumi; Hirohiko Sagawa; Masaru Takeuchi
An Annotated Japanese Sign Language Corpus.
L02-1319
: Paul Baker; Andrew Hardie; Tony McEnery; Hamish Cunningham; Rob Gaizauskas
EMILLE, A 67-Million Word Corpus of Indic Languages: Data Collection, Mark-up and Harmonisation.
L02-1320
: Angelika Salmen
Multi-Modal Menus And Traffic Interaction. Timing As A Crucial Factor For User Driven Mode Decision
L02-1321
: Constantin Orăsan; Richard Evans
Assessing the difficulty of finding people in texts.
L02-1322
: Sussi Olsen
Lemma selection in domain specific computational lexica - some specific problems
L02-1323
: Gábor Prószéky; Márton Miháltz
Automatism and User Interaction: Building a Hungarian WordNet.
L02-1324
: Véronique Gendner
Comparative study of oral and written French automatically tagged with morpho-syntactic information.
L02-1325
: Owen Rambow; Cassandre Creswell; Rachel Szekely; Harriet Taber; Marilyn Walker
A Dependency Treebank for English.
L02-1326
: Ganesh Ramesh; Amit Bagga
A Text-based for Detection and Filtering of Commercial Segments in Broadcast News.
L02-1327
: Nancy Ide; Laurent Romary
Standards for Language Resources.
L02-1328
: Nigel Collier; Koichi Takeuchi
PIA-Core: Semantic Annotation through Example-based Learning
L02-1329
: Nigel Collier; Koichi Takeuchi; Chikashi Nobata; Junichi Fukumoto; Norihiro Ogata
Progress on Multi-lingual Named Entity Annotation Guidelines using RDF (S).
L02-1330
: Thomas Hanke
iLex - A tool for Sign Language Lexicography and Corpus Analysis.
L02-1331
: Joanne Capstick; Hans Uszkoreit; Wolfgang Wahlster; Thierry Declerck; Gregor Erbach; Anthony Jameson; Brigitte Jorg; Reinhard Karger; Tillmann Wegst
COLLATE: Competence Center in Speech and Language Technology.
L02-1332
: Emiko Suzuki; Kyoko Kakihana
Japanese and American Sign Language Dictionary System for Japanese and English Users.
L02-1333
: Primo Jakopin
The feasibility of a complete text corpus.
L02-1334
: Catherine Macleod
Lexical Annotation for Multi-word Entries Containing Nominalizations.
L02-1335
: Silja Huttunen; Roman Yangarber; Ralph Grishman
Diversity of Scenarios in Information extraction.
L02-1336
: Mark T. Maybury
Multimodal Systems, Resources and Evaluation.
L02-1337
: Hiromichi Kawanami; Tsuyoshi Masuda; Tomoki Toda; Kiyohiro Shikano
Designing speech database with prosodic variety for expressive TTS system.
L02-1338
: Atsushi Fujii; Katunobu Itou; Tetsuya Ishikawa
Producing a Large-scale Encyclopedic Corpus over the Web.
L02-1339
: Akinobu Lee; Tatsuya Kawahara; Kazuya Takeda; Masato Mimura; Atsushi Yamada; Akinori Ito; Katsunobu Itou; Kiyohiro Shikano
Continuous Speech Recognition Consortium an Open Repository for CSR Tools and Models.
L02-1340
: Tony McEnery
Ethical and legal issues in corpus construction
L02-1341
: Antonietta Alonge; Margherita Castelli
Which way should we go? Metaphoric expressions in lexical resources.
L02-1342
: Chai Wutiwiwatchai; Patcharika Cotsomrong; Sinaporn Suebvisai; Supphanat Kanokphara
Phonetically Distributed Continuous Speech Corpus for Thai Language.
L02-1343
: Matthias Denecke
Signatures, Typed Feature Structures and RDFS.
L02-1344
: Marie-Jeanne Derouin; Dr. André Le Meur
Report on the Revision of the Lexicographical Standard ISO 1951 Presentation/Representation of Entries in Dictionaries.
L02-1345
: Véronique Gendner; Gabriel Illouz; Michèle Jardino; Laura Monceaux; Patrick Paroubek; Isabelle Robba; Anne Vilnat
A Protocol for Evaluating Analyzers of Syntax (PEAS).
L02-1346
: Mark T. Maybury; Antonio Zampolli
Language Resources and Evaluation: International Strategy Panel.
L02-1347
: Kishore Papineni
Machine Translation Evaluation: N-grams to the Rescue.
L02-1348
: Michael Kluck; Christa Womser-Hacker
Inside the Evaluation Process of the Cross-Language Evaluation Forum (CLEF): Issues of Multilingual Topic Creation and Multilingual Relevance Assessment.
L02-1349
: Andrew Finch; Ezra Black; Ringo Wathelet
Beyond Tag Trigrams: New Local Features for Tagging.
L02-1350
: Sanda Harabagiu; Finley Lacatusu; Paul Morarescu
Multidocument Summarization with GISTexter.
L02-1351
: Feiyu Xu; Daniela Kurz; Jakub Piskorski; Sven Schmeier
A Domain Adaptive Approach to Automatic Acquisition of Domain Relevant Terms and their Relations with Bootstrapping
L02-1352
: Silke Steininger; Florian Schiel; Angelika Glesner
User-State Labeling Procedures For The Multimodal Data Collection Of SmartKom.
L02-1353
: James Pustejovsky
Creating Domain-specific Information Servers.
L02-1354
: Mathieu Lafourcade; Christian Boitet
UNL Lexical Selection with Conceptual Vectors.
