Workshop on Computational Approaches to Discourse
https://sites.google.com/view/codi-2023/
The last ten years have seen a dramatic improvement in the ability of NLP systems to understand and produce words and sentences. This development has created a renewed interest in discourse phenomena as researchers move towards the processing of long-form text and conversations. There is a surge of activity in discourse parsing, coherence models, text summarization, corpora for discourse level reading comprehension, and discourse related/aided representation learning, to name a few, but the problems in computational approaches to discourse are still substantial. At this juncture, we have organized three Workshops on Computational Approaches to Discourse (CODI) at EMNLP 2020, EMNLP 2021 and COLING 2022 to bring together discourse experts and upcoming researchers. These workshops have catalyzed work to improve the speed and knowledge needed to solve such problems and have served as a forum for the discussion of suitable datasets and reliable evaluation methods.
The previous workshops on discourse in machine translation (DiscoMT), linking lexical, sentential and discourse semantics (LSDSem), discourse structure in natural language generation (DSNNLG), discourse relation parsing and treebanking (DISRPT) and coreference (CORBON/CRAC), have shown that there is considerable interest and success in bringing together the community working on specific problems in discourse. We believe that the discourse community will also benefit from a general forum where work ranging from corpus development/analysis to computational models, and evaluation is discussed, and desiderata can be drawn for future progress.
The 4th CODI workshop is planned as a 2 day event which brings together different subcommunities. It will feature invited talks and regular papers on the first day. The second day will be dedicated to shared tasks and special sessions which focus on the issues mentioned above. After a first successful iteration in 2019 and 2021 the shared task on Discourse Relation Parsing and Treebanking (DISRPT) will be held again in 2023, with three tasks: discourse segmentation, discourse connective identification and discourse relation classification, including new datasets and languages. For more information on the shared task see: https://sites.google.com/view/disrpt2023/