ACL Anthology
A Digital Archive of Research Papers in Computational Linguistics

Special Interest Group on Web as Corpus (SIGWAC)

2016 Proceedings of the 10th Web as Corpus Workshop
2015 Proceedings of the 6th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis
2014 Proceedings of the 9th Web as Corpus Workshop (WaC-9)
2010 Proceedings of the NAACL HLT 2010 Sixth Web as Corpus Workshop
2007 WAC3, Louvain-la-Neuve, Belgium, 15-16 September 2007
2006 Proceedings of the 2nd International Workshop on Web as Corpus
2005 WAC1, at Corpus Linguistics conference, Birmingham, UK, July 2005

2016

  1. Proceedings of the 10th Web as Corpus Workshop

  2. W16-26 [bib]: Entire Volume
  3. W16-2600 [bib]: Front Matter

  4. W16-2601 [bib]: Roland Schäfer; Felix Bildhauer
    Automatic Classification by Topic Domain for Meta Data Generation, Web Corpus Evaluation, and Corpus Comparison
  5. W16-2602 [bib]: Adrien Barbaresi
    Efficient construction of metadata-enhanced web corpora
  6. W16-2603 [bib]: Andrew Salway; Dag Elgesem; Knut Hofland; Øystein Reigem; Lubos Steskal
    Topically-focused Blog Corpora for Multiple Languages
  7. W16-2604 [bib]: Anne Krause
    The Challenges and Joys of Analysing Ongoing Language Change in Web-based Corpora: a Case Study
  8. W16-2605 [bib]: Quirin Würschinger; Mohammad Fazleh Elahi; Desislava Zhekova; Hans-Jörg Schmid
    Using the Web and Social Media as Corpora for Monitoring the Spread of Neologisms. The case of 'rapefugee', 'rapeugee', and 'rapugee'.
  9. W16-2606 [bib]: Michael Beißwenger; Sabine Bartsch; Stefan Evert; Kay-Michael Würzner
    EmpiriST 2015: A Shared Task on the Automatic Linguistic Annotation of Computer-Mediated Communication and Web Corpora
  10. W16-2607 [bib]: Thomas Proisl; Peter Uhrig
    SoMaJo: State-of-the-art tokenization for German web and social media texts
  11. W16-2608 [bib]: Jakob Prange; Andrea Horbach; Stefan Thater
    UdS-(retrain|distributional|surface): Improving POS Tagging for OOV Words in German CMC and Web Data
  12. W16-2609 [bib]: Gideon Mendels; Erica Cooper; Julia Hirschberg
    Babler - Data Collection from the Web to Support Speech Recognition and Keyword Search
  13. W16-2610 [bib]: Nikola Ljubešić; Darja Fišer
    A Global Analysis of Emoji Usage
  14. W16-2611 [bib]: Erika Dalan; Serge Sharoff
    Genre classification for a corpus of academic webpages
  15. W16-2612 [bib]: Roland Schäfer
    On Bias-free Crawling and Representative Web Corpora
  16. W16-2613 [bib]: Steffen Remus; Gerold Hintz; Chris Biemann; Christian M. Meyer; Darina Benikova; Judith Eckle-Kohler; Margot Mieskes; Thomas Arnold
    EmpiriST: AIPHES - Robust Tokenization and POS-Tagging for Different Genres
  17. W16-2614 [bib]: Egon Stemle
    bot.zen @ EmpiriST 2015 - A minimally-deep learning PoS-tagger (trained for German CMC and Web data)
  18. W16-2615 [bib]: Tobias Horsmann; Torsten Zesch
    LTL-UDE @ EmpiriST 2015: Tokenization and PoS Tagging of Social Media Text

2015

  1. Proceedings of the 6th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis

  2. W15-2901: Zornitsa Kozareva
    Multilingual Affect Polarity and Valence Prediction in Metaphors
  3. W15-2902 [attachment]: David Vilares; Miguel A. Alonso; Carlos Gómez-Rodríguez
    Sentiment Analysis on Monolingual, Multilingual and Code-Switching Twitter Corpora
  4. W15-2903: Marine Carpuat
    Connotation in Translation
  5. W15-2904: Soroush Vosoughi; Helen Zhou; Deb Roy
    Enhanced Twitter Sentiment Classification Using Contextual Information
  6. W15-2905: Anupam Khattri; Aditya Joshi; Pushpak Bhattacharyya; Mark Carman
    Your Sentiment Precedes You: Using an author’s historical tweets to predict sarcasm
  7. W15-2906: Thomas Kober; David Weir
    Optimising Agile Social Media Analysis
  8. W15-2907: Takaaki Tsunoda; Takashi Inui; Satoshi Sekine
    Utilizing review analysis to suggest product advertisement improvements
  9. W15-2908 [attachment]: Wiltrud Kessler; Roman Klinger; Jonas Kuhn
    Towards Opinion Mining from Reviews for the Prediction of Product Rankings
  10. W15-2909: Leticia Cagnina; Paolo Rosso
    Classification of deceptive opinions using a low dimensionality representation
  11. W15-2910: Josef Ruppenhofer; Jasper Brandes
    Extending effect annotation with lexical decomposition
  12. W15-2911: Lucie Flekova; Daniel Preoţiuc-Pietro; Eugen Ruppert
    Analysing domain suitability of a sentiment lexicon by identifying distributionally bipolar words
  13. W15-2912: Saif Mohammad
    Imagisaurus: An Interactive Visualizer of Valence and Emotion in the Roget's Thesaurus
  14. W15-2913: Barbara Plank; Dirk Hovy
    Personality Traits on Twitter—or—How to Get 1,500 Personality Tests in a Week
  15. W15-2914: Johan Reitan; Jørgen Faret; Björn Gambäck; Lars Bungum
    Negation Scope Detection for Twitter Sentiment Analysis
  16. W15-2915: Sebastian Ebert; Ngoc Thang Vu; Hinrich Schütze
    A Linguistically Informed Convolutional Neural Network
  17. W15-2916: Chiraag Sumanth; Diana Inkpen
    How much does word sense disambiguation help in sentiment analysis of micropost data?
  18. W15-2917: Wernard Schmit; Sander Wubben
    Predicting Ratings for New Movie Releases from Twitter Content
  19. W15-2918: Márton Miháltz; Tamás Váradi; István Csertő; Éva Fülöp; Tibor Pólya; Pál Kővágó
    Beyond Sentiment: Social Psychological Analysis of Political Facebook Comments in Hungary
  20. W15-2919: Manfred Klenner
    Verb-centered Sentiment Inference with Description Logics
  21. W15-2920: Priyanka Sinha; Lipika Dey; Pabitra Mitra; Anupam Basu
    Mining HEXACO personality traits from Enterprise Social Media
  22. W15-2921: Michael Wiegand; Marc Schulder; Josef Ruppenhofer
    Opinion Holder and Target Extraction for Verb-based Opinion Predicates – The Problem is Not Solved
  23. W15-2922: Umar Maqsud
    Synthetic Text Generation for Sentiment Analysis
  24. W15-2923: Maria Skeppstedt; Teri Schamp-Bjerede; Magnus Sahlgren; Carita Paradis; Andreas Kerren
    Detecting speculations, contrasts and conditionals in consumer reviews
  25. W15-2924: Credell Simeon; Robert Hilderman
    Using Combined Lexical Resources to Identify Hashtag Types
  26. W15-2925: Phillip Smith; Mark Lee
    Sentiment Classification via a Response Recalibration Framework

2014

  1. Proceedings of the 9th Web as Corpus Workshop (WaC-9)

  2. W14-0401: Adrien Barbaresi
    Finding Viable Seed URLs for Web Corpora: A Scouting Approach and Comparative Study of Available Sources
  3. W14-0402: Roland Schäfer; Adrien Barbaresi; Felix Bildhauer
    Focused Web Corpus Crawling
  4. W14-0403: Maik Stührenberg
    Less Destructive Cleaning of Web Documents by Using Standoff Annotation
  5. W14-0404: Magali Sanches Duran; Lucas Avanço; Sandra Aluísio; Thiago Pardo; Maria da Graça Volpe Nunes
    Some Issues on the Normalization of a Corpus of Products Reviews in Portuguese
  6. W14-0405: Nikola Ljubešić; Filip Klubička
    {bs,hr,sr}WaC - Web Corpora of Bosnian, Croatian and Serbian
  7. W14-0406: Verena Lyding; Egon Stemle; Claudia Borghetti; Marco Brunello; Sara Castagnoli; Felice Dell'Orletta; Henrik Dittmann; Alessandro Lenci; Vito Pirrelli
    The PAISÀ Corpus of Italian Web Texts

2010

  1. Proceedings of the NAACL HLT 2010 Sixth Web as Corpus Workshop

  2. W10-1501: Emiliano Raul Guevara
    NoWaC: a large web-based corpus for Norwegian
  3. W10-1502: Markus Dickinson; Ross Israel; Sun-Hee Lee
    Building a Korean Web Corpus for Analyzing Learner Language
  4. W10-1503: Amit Goyal; Jagadeesh Jagaralamudi; Hal Daumé III; Suresh Venkatasubramanian
    Sketching Techniques for Large Scale NLP
  5. W10-1504: George Dillon
    Building Webcorpora of Academic Prose with BootCaT
  6. W10-1505: Stefan Evert
    Google Web 1T 5-Grams Made Easy (but not for the computer)

2007

  1. WAC3, Louvain-la-Neuve, Belgium, 15-16 September 2007

2006

  1. Proceedings of the 2nd International Workshop on Web as Corpus

  2. W06-1701: András Kornai; Péter Halácsy; Viktor Nagy; Csaba Oravecz; Viktor Trón; Dániel Varga
    Web-based frequency dictionaries for medium density languages
  3. W06-1702: Mike Cafarella; Oren Etzioni
    BE: A search engine for NLP research
  4. W06-1703: Masatsugu Tonoike; Mitsuhiro Kida; Toshihiro Takagi; Yasuhiro Sasaki; Takehito Utsuro; S. Sato
    A comparative study on compositional translation estimation using a domain/topic-specific corpus collected from the Web
  5. W06-1704: Gemma Boleda; Stefan Bott; Rodrigo Meza; Carlos Castillo; Toni Badia; Vicente López
    CUCWeb: A Catalan corpus built from the Web
  6. W06-1705: Paul Rayson; James Walkerdine; William H. Fletcher; Adam Kilgarriff
    Annotated Web as corpus
  7. W06-1706: Arno Scharl; Albert Weichselbraun
    Web coverage of the 2004 US Presidential election
  8. W06-1707: Cédrick Fairon
    Corporator: A tool for creating RSS-based specialized corpora
  9. W06-1708: Davide Fossati; Gabriele Ghidoni; Barbara Di Eugenio; Isabel Cruz; Huiyong Xiao; Rajen Subba
    The problem of ontology alignment on the Web: A first report
  10. W06-1709: Kie Zuraw
    Using the Web as a phonological corpus: A case study from Tagalog
  11. W06-1710: Rüdiger Gleim; Alexander Mehler; Matthias Dehmer
    Web corpus mining by instance of Wikipedia

2005

  1. WAC1, at Corpus Linguistics conference, Birmingham, UK, July 2005
