ACL Anthology
A Digital Archive of Research Papers in Computational Linguistics


Special Interest Group on Language Technologies for the Socio-Economic Sciences and Humanities

Table of Contents
2016 Proceedings of the 10th SIGHUM Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities
2015 Proceedings of the 9th SIGHUM Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities (LaTeCH)
2014 Proceedings of the 8th Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities (LaTeCH)
2013 Proceedings of the 7th Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities
2012 Proceedings of the 6th Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities
2011 Proceedings of the 5th ACL-HLT Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities
2010 Fourth Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities
2009 Proceedings of the EACL 2009 Workshop on Language Technology and Resources for Cultural Heritage, Social Sciences, Humanities, and Education (LaTeCH – SHELT&R 2009)
2008 Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities
2007 Proceedings of the Workshop on Language Technology for Cultural Heritage Data (LaTeCH 2007)

2016

  1. Proceedings of the 10th SIGHUM Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities

  2. W16-21: Entire Volume
  3. W16-2100: Front Matter

  4. W16-2101: Anne-Kathrin Schumann
    Brave New World: Uncovering Topical Dynamics in the ACL Anthology Reference Corpus Using Term Life Cycle Information
  5. W16-2102: Mladen Karan; Jan Šnajder; Daniela Sirinic; Goran Glavaš
    Analysis of Policy Agendas: Lessons Learned from Automatic Topic Classification of Croatian Political Texts
  6. W16-2103: Estíbaliz Iglesias-Franjo; Jesús Vilares
    Searching Four-Millenia-Old Digitized Documents: A Text Retrieval System for Egyptologists
  7. W16-2104: Yvonne Adesam; Gerlof Bouma
    Old Swedish Part-of-Speech Tagging between Variation and External Knowledge
  8. W16-2105: Sarah Schulz; Mareike Keller
    Code-Switching Ubique Est - Language Identification and Part-of-Speech Tagging for Historical Mixed Text
  9. W16-2106: Fabian Barteld; Ingrid Schröder; Heike Zinsmeister
    Dealing with word-internal modification and spelling variation in data-driven lemmatization
  10. W16-2107: Mariona Coll Ardanuy; Maarten van den Bos; Caroline Sporleder
    You Shall Know People by the Company They Keep: Person Name Disambiguation for Social Network Construction
  11. W16-2108: Juri Opitz; Anette Frank
    Deriving Players & Themes in the Regesta Imperii using SVMs and Neural Networks
  12. W16-2109: Tuomo Hiippala
    Semi-automated annotation of page-based documents within the Genre and Multimodality framework
  13. W16-2110: Marco Budassi; Marco Passarotti
    Nomen Omen. Enhancing the Latin Morphological Analyser Lemlat with an Onomasticon
  14. W16-2111: Aditya Joshi; Pushpak Bhattacharyya; Mark Carman; Jaya Saraswati; Rajita Shukla
    How Do Cultural Differences Impact the Quality of Sarcasm Annotation?: A Case Study of Indian Annotators and American Text
  15. W16-2112: Izaskun Etxeberria; Iñaki Alegria; Larraitz Uria; Mans Hulden
    Combining Phonology and Morphology for the Normalization of Historical Texts
  16. W16-2113: Çağıl Sönmez; Arzucan Özgür; Erdem Yörük
    Towards Building a Political Protest Database to Explain Changes in the Welfare State
  17. W16-2114: Johannes Hellrich; Udo Hahn
    An Assessment of Experimental Protocols for Tracing Changes in Word Semantics Relative to Accuracy and Reliability
  18. W16-2115: Eszter Simon; Veronika Vincze
    Universal Morphology for Old Hungarian
  19. W16-2116: Annika Marie Schoene; Nina Dethlefs
    Automatic Identification of Suicide Notes from Linguistic and Sentiment Features
  20. W16-2117: Dieu-Thu Le; Ngoc Thang Vu; Andre Blessing
    Towards a text analysis system for political debates
  21. W16-2118: Vilja Hulden
    Whodunit... and to Whom? Subjects, Objects, and Actions in Research Articles on American Labor Unions
  22. W16-2119: Amir Zeldes; Caroline T. Schroeder
    An NLP Pipeline for Coptic
  23. W16-2120: Micha Elsner; Emily Lane
    Automatic discovery of Latin syntactic changes
  24. W16-2121: Stefania Degaetano-Ortlieb; Elke Teich
    Information-based Modeling of Diachronic Linguistic Change: from Typicality to Productivity

2015

  1. Proceedings of the 9th SIGHUM Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities (LaTeCH)

  2. W15-3701: Yen-Fu Luo; Anna Rumshisky; Mikhail Gronas
    Catching the Red Priest: Using Historical Editions of Encyclopaedia Britannica to Track the Evolution of Reputations
  3. W15-3702: JinYeong Bak; Alice Oh
    Five Centuries of Monarchy in Korea: Mining the Text of the Annals of the Joseon Dynasty
  4. W15-3703: Yufang Hou; Anette Frank
    Analyzing Sentiment in Classical Chinese Poetry
  5. W15-3704: Victoria Anugrah Lestari; Ruli Manurung
    Measuring the Structural and Conceptual Similarity of Folktales using Plot Graphs
  6. W15-3705: Nils Reiter
    Towards Annotating Narrative Segments
  7. W15-3706: Eva Pettersson; Beáta Megyesi; Joakim Nivre
    Ranking Relevant Verb Phrases Extracted from Historical Text
  8. W15-3707: Stephen Wan; Cécile Paris
    Ranking election issues through the lens of social media
  9. W15-3708: Johannes Bjerva; Raf Praet
    Word Embeddings Pointing the Way for Late Antiquity
  10. W15-3709: Ryan Georgi; Fei Xia; William Lewis
    Enriching Interlinear Text using Automatically Constructed Annotators
  11. W15-3710: Tanja Samardzic; Robert Schikowski; Sabine Stoll
    Automatic interlinear glossing as two-level sequence classification
  12. W15-3711: Aitor Arronte Alvarez
    Enriching Digitized Medieval Manuscripts: Linking Image, Text and Lexical Knowledge
  13. W15-3712: Klemo Vladimir; Marin Silic; Nenad Romic; Goran Delac; Sinisa Srbljic
    A preliminary study on similarity-preserving digital book identifiers
  14. W15-3713: Andrea Bellandi; Davide Albanesi; Giulia Benotto; Emiliano Giovannetti; Gianfranco Di Segni
    When Translation Requires Interpretation: Collaborative Computer–Assisted Translation of Ancient Texts
  15. W15-3714: Chaya Liebeskind; Ido Dagan
    Integrating Query Performance Prediction in Term Scoring for Diachronic Thesaurus
  16. W15-3715: Tommaso Petrolito; Ruggero Petrolito; Francesco Perono Cacciafoco; Gregoire Winterstein
    Minoan linguistic resources: The Linear A Digital Corpus
  17. W15-3716: Tim vor der Brück; Steffen Eger; Alexander Mehler
    Lexicon-assisted tagging and lemmatization in Latin: A comparison of six taggers and two lemmatization models

2014

  1. Proceedings of the 8th Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities (LaTeCH)

  2. W14-0601: Jochen Tiepmar; Christoph Teichmann; Gerhard Heyer; Monica Berti; Gregory Crane
    A New Implementation for Canonical Text Services
  3. W14-0602: Thierry Declerck; Eveline Wandl-Vogt
    How to semantically relate dialectal Dictionaries in the Linked Data Framework
  4. W14-0603: Ewan Klein; Beatrice Alex; Jim Clifford
    Bootstrapping a historical commodities lexicon with SKOS and DBpedia
  5. W14-0604: Christian Chiarcos; Maria Sukhareva; Roland Mittmann; Timothy Price; Gaye Detmold; Jan Chobotsky
    New Technologies for Old Germanic. Resources and Research on Parallel Bibles in Older Continental Western Germanic
  6. W14-0605: Eva Pettersson; Beáta Megyesi; Joakim Nivre
    A Multilingual Evaluation of Three Spelling Normalisation Methods for Historical Text
  7. W14-0606: Christian Poelitz; Thomas Bartz
    Enhancing the possibilities of corpus-based investigations: Word sense disambiguation on query results of large text corpora
  8. W14-0607: Julia Efremova; Bijan Ranjbar-Sahraei; Toon Calders
    A Hybrid Disambiguation Measure for Inaccurate Cultural Heritage Data
  9. W14-0608: Kata Gábor; Benoît Sagot
    Automated Error Detection in Digitized Cultural Heritage Documents
  10. W14-0609: Mike Kestemont; Folgert Karsdorp; Marten Düring
    Mining the Twentieth Century’s History from the Time Magazine Corpus
  11. W14-0610: Thierry Poibeau; Elisa Omodei; Jean-Philippe Cointet; Yufan Guo
    Social and Semantic Diversity: Socio-semantic Representation of a Scientific Corpus
  12. W14-0611: Drayton Benner
    A Tool for a High-Carat Gold-Standard Word Alignment
  13. W14-0612: Marcel Bollmann; Florian Petran; Stefanie Dipper; Julia Krasselt
    CorA: A web-based annotation tool for historical and other non-standard language data
  14. W14-0613: Amanda Andrei; Alison Dingwall; Theresa Dillon; Jennifer Mathieu
    Developing a Tagalog Linguistic Inquiry and Word Count (LIWC) ‘Disaster’ Dictionary for Understanding Mixed Language Social Media: A Work-in-Progress Paper
  15. W14-0614: Adam Wyner; Jackson Armstrong; Andrew Mackillop; Philip Astley
    Text Analysis of Aberdeen Burgh Records 1530-1531
  16. W14-0615: Marco Passarotti
    From Syntax to Semantics. First Steps Towards Tectogrammatical Annotation of Latin
  17. W14-0616: Sergiu Nisioi
    On the syllabic structures of Aromanian
  18. W14-0617: Claire Grover; Richard Tobin
    A Gazetteer and Georeferencing for Historical English Documents
  19. W14-0618: Hadaiq Sanabila; Ruli Manurung
    Automatic Wayang Ontology Construction using Relation Extraction from Free Text

2013

  1. Proceedings of the 7th Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities

  2. W13-2701: Samuel Fernando; Paula Goodale; Paul Clough; Mark Stevenson; Mark Hall; Eneko Agirre
    Generating Paths through Cultural Heritage Collections
  3. W13-2702: Sander Wubben; Emiel Krahmer; Antal van den Bosch
    Using character overlap to improve language transformation
  4. W13-2703: Marijn Schraagen; Dionysius Huijsmans
    Comparison between historical population archives and decentralized databases
  5. W13-2704: Chaya Liebeskind; Ido Dagan; Jonathan Schler
    Semi-automatic Construction of Cross-period Thesaurus
  6. W13-2705: Simon Wibberley; David Weir; Jeremy Reffin
    Language Technology for Agile Social Media Science
  7. W13-2706: Attila Novak; György Orosz; Nóra Wenszky
    Morphological annotation of Old and Middle Hungarian corpora
  8. W13-2707: Eirini Florou; Stasinos Konstantopoulos; Antonis Koukourikos; Pythagoras Karampiperis
    Argument extraction for supporting public policy formulation
  9. W13-2708: Andre Blessing; Jonathan Sonntag; Fritz Kliche; Ulrich Heid; Jonas Kuhn; Manfred Stede
    Towards a Tool for Interactive Concept Building for Large Scale Analysis in the Humanities
  10. W13-2709: Dolf Trieschnigg; Dong Nguyen; Mariët Theune
    Learning to Extract Folktale Keywords
  11. W13-2710: Emily M. Bender; Michael Wayne Goodman; Joshua Crowgey; Fei Xia
    Towards Creating Precision Grammars from Interlinear Glossed Text: Inferring Large-Scale Typological Properties
  12. W13-2711: Marilisa Amoia; José Manuel Martínez
    Using Comparable Collections of Historical Texts for Building a Diachronic Dictionary for Spelling Normalization
  13. W13-2712: Thierry Declerck
    Integration of the Thesaurus for the Social Sciences (TheSoz) in an Information Extraction System
  14. W13-2713: Ruth Jones; Ann Irvine
    The (Un)faithful Machine Translator
  15. W13-2714: Alina Maria Ciobanu; Anca Dinu; Liviu Dinu; Vlad Niculae; Octavia-Maria Şulea
    Temporal classification for historical Romanian texts
  16. W13-2715: Dana Dannélls; Aarne Ranta; Ramona Enache; Mariana Damova; Maria Mateva
    Multilingual access to cultural heritage content on the Semantic Web

2012

  1. Proceedings of the 6th Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities

  2. W12-1001: Tom Kenter; Tomaž Erjavec; Maja Žorga Dulmin; Darja Fiser
    Lexicon Construction and Corpus Annotation of Historical Language with the CoBaLT Editor
  3. W12-1002: Mark Dingemanse; Jeremy Hammond; Herman Stehouwer; Aarthy Somasundaram; Sebastian Drude
    A high speed transcription interface for annotating primary linguistic data
  4. W12-1003: Manex Agirrezabal; Iñaki Alegria; Bertol Arrieta; Mans Hulden
    BAD: An Assistant tool for making verses in Basque
  5. W12-1004: Dana Dannélls; Lars Borin
    Toward Language Independent Methodology for Generating Artwork Descriptions – Exploring FrameNet Information
  6. W12-1005: Michael Piotrowski; Cathrin Senn
    Harvesting Indices to Grow a Controlled Vocabulary: Towards Improved Access to Historical Legal Texts
  7. W12-1006: Thierry Declerck; Nikolina Koleva; Hans-Ulrich Krieger
    Ontology-Based Incremental Annotation of Characters in Folktales
  8. W12-1007: Daniela Oelke; Dimitrios Kokkinakis; Mats Malm
    Advanced Visual Analytics Methods for Literature Analysis
  9. W12-1008: Aurélie Herbelot; Eva von Redecker; Johanna Müller
    Distributional techniques for philosophical enquiry
  10. W12-1009: Caroline Brun; Vassilina Nikoulina; Nikolaos Lagos
    Linguistically-Adapted Structural Query Annotation for Digital Libraries in the Social Sciences
  11. W12-1010: Eva Pettersson; Beáta Megyesi; Joakim Nivre
    Parsing the Past - Identification of Verb Constructions in Historical Text
  12. W12-1011: John Lee
    A Classical Chinese Corpus with Nested Part-of-Speech Tags
  13. W12-1012: Nikolaos Aletras; Mark Stevenson
    Computing Similarity between Cultural Heritage Items using Multimodal Features
  14. W12-1013: Mark Michael Hall; Oier Lopez de Lacalle; Aitor Soroa Etxabe; Paul Clough; Eneko Agirre
    Enabling the Discovery of Digital Cultural Heritage Objects through Wikipedia
  15. W12-1014: Samuel Fernando; Mark Stevenson
    Adapting Wikification to Cultural Heritage
  16. W12-1015: Vicente Bosch; Alejandro Héctor Toselli; Enrique Vidal
    Natural Language Inspired Approach for Handwritten Text Line Detection in Legacy Documents
  17. W12-1016: Alex Zhicharevich; Nachum Dershowitz
    Language Classification and Segmentation of Noisy Documents in Hebrew Scripts

2011

  1. Proceedings of the 5th ACL-HLT Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities

  2. W11-1501: Cristina Sánchez-Marco; Gemma Boleda; Lluís Padró
    Extending the tool, or how to annotate historical language varieties
  3. W11-1502: Jirka Hana; Anna Feldman; Katsiaryna Aharodnik
    A low-budget tagger for Old Czech
  4. W11-1503: Silke Scheible; Richard J. Whitt; Martin Durrell; Paul Bennett
    Evaluating an 'off-the-shelf' POS-tagger on Early Modern German text
  5. W11-1504: Dorothee Beermann; Pavel Mihaylov
    e-Research for Linguists
  6. W11-1505: Tomaž Erjavec
    Automatic linguistic annotation of historical language: ToTrTaLe and XIX century Slovene
  7. W11-1506: Agata Katarzyna Cybulska; Piek Vossen
    Historical Event Extraction from Text
  8. W11-1507: Kalliopi Zervanou; Ioannis Korkontzelos; Antal van den Bosch; Sophia Ananiadou
    Enrichment and Structuring of Archival Description Metadata
  9. W11-1508: Massimo Poesio; Eduard Barbu; Egon Stemle; Christian Girardi
    Structure-Preserving Pipelines for Digital Libraries
  10. W11-1509: Charles Hollingsworth; Stefaan Van Liefferinge; Rebecca A. Smith; Michael A. Covington; Walter D. Potter
    The ARC Project: Creating logical models of Gothic cathedrals using natural language processing
  11. W11-1510: Asad Sayeed; Bryan Rusk; Martin Petrov; Hieu Nguyen; Timothy Meyer; Amy Weinberg
    Crowdsourcing syntactic relatedness judgements for opinion mining in the study of information technology adoption
  12. W11-1511: Sravana Reddy; Kevin Knight
    What We Know About The Voynich Manuscript
  13. W11-1512: Eva Pettersson; Joakim Nivre
    Automatic Verb Extraction from Historical Swedish Texts
  14. W11-1513: Tze-I Yang; Andrew Torget; Rada Mihalcea
    Topic Modeling on Historical Newspapers
  15. W11-1514: Saif Mohammad
    From Once Upon a Time to Happily Ever After: Tracking Emotions in Novels and Fairy Tales
  16. W11-1515: Dong Nguyen; Noah A. Smith; Carolyn P. Rosé
    Author Age Prediction from Text using Linear Regression
  17. W11-1516: Nikhil Johri; Daniel Ramage; Daniel McFarland; Daniel Jurafsky
    A Study of Academic Collaborations in Computational Linguistics using a Latent Mixture of Authors Model

2010

  1. Fourth Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities


2009

  1. Proceedings of the EACL 2009 Workshop on Language Technology and Resources for Cultural Heritage, Social Sciences, Humanities, and Education (LaTeCH – SHELT&R 2009)

  2. W09-0301: Guenther Goerz; Martin Scholz
    Content Analysis of Museum Documentation in a Transdisciplinary Perspective
  3. W09-0302: Stasinos Konstantopoulos; Vangelis Karkaletsis; Dimitris Bilidas
    An Intelligent Authoring Environment for Abstract Semantic Representations of Cultural Object Descriptions
  4. W09-0303: Jelena Prokić; Martijn Wieling; John Nerbonne
    Multiple Sequence Alignments in Linguistics
  5. W09-0304: Martijn Wieling; Jelena Prokić; John Nerbonne
    Evaluating the Pairwise String Alignment of Pronunciations
  6. W09-0305: Voula Giouli; Nikos Glaros; Kiril Simov; Petya Osenova
    A Web-Enabled and Speech-Enhanced Parallel Corpus of Greek-Bulgarian Cultural Texts
  7. W09-0306: Barbara McGillivray; Marco Passarotti
    The Development of the “Index Thomisticus” Treebank Valency Lexicon
  8. W09-0307: Fei Xia; William Lewis
    Applying NLP Technologies to the Collection and Enrichment of Language Data on the Web to Aid Linguistic Research
  9. W09-0308: Marieke van Erp; Antal van den Bosch; Sander Wubben; Steve Hunt
    Instance-Driven Discovery of Ontological Relation Labels
  10. W09-0309: Milena Dobreva; Nikola Ikonomov
    The Role of Metadata in the Longevity of Cultural Heritage Resources

2008

  1. Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities


2007

  1. Proceedings of the Workshop on Language Technology for Cultural Heritage Data (LaTeCH 2007)

  2. W07-0901: Lars Borin; Dimitrios Kokkinakis; Leif-Jöran Olsson
    Naming the Past: Named Entity and Animacy Recognition in 19th Century Swedish Literature
  3. W07-0902: Alejandro H. Toselli; Verónica Romero; Enrique Vidal
    Viterbi Based Alignment between Text Images and their Transcripts
  4. W07-0903: Marieke van Erp
    Retrieving Lost Information from Textual Databases: Rediscovering Expeditions from an Animal Specimen Database
  5. W07-0904: Tandeep Sidhu; Judith Klavans; Jimmy Lin
    Concept Disambiguation for Improved Subject Access Using Multiple Knowledge Sources
  6. W07-0905: David Bamman; Gregory Crane
    The Latin Dependency Treebank in a Cultural Heritage Digital Library
  7. W07-0906: Michel Généreux
    Cultural Heritage Digital Resources: From Extraction to Querying
  8. W07-0907: Karl Grieser; Timothy Baldwin; Steven Bird
    Dynamic Path Prediction and Recommendation in a Museum Environment
  9. W07-0908: Véronique Malaisé; Antoine Isaac; Luit Gazendam; Hennie Brugman
    Anchoring Dutch Cultural Heritage Thesauri to WordNet: Two Case Studies
  10. W07-0909: Idan Szpektor; Ido Dagan; Alon Lavie; Danny Shacham; Shuly Wintner
    Cross Lingual and Semantic Retrieval for Cultural Heritage Appreciation
  11. W07-0910: Avi Arampatzis; Jaap Kamps; Marijn Koolen; Nir Nussbaum
    Deriving a Domain Specific Test Collection from a Query Log
  12. W07-0911: Gareth J. F. Jones; Ying Zhang; Eamonn Newman; Fabio Fantino; Franca Debole
    Multilingual Search for Cultural Heritage Archives via Combining Multiple Translation Resources
  13. W07-0912: Douglas W. Oard
    Invited Talk: Lessons from the MALACH Project: Applying New Technologies to Improve Intellectual Access to Large Oral History Collections