Difference between revisions of "BioNLP 2023"

From ACL Wiki
Jump to navigation Jump to search
Line 154: Line 154:
  
 
<h5>Task 1A. Problem List Summarization</h5>
 
<h5>Task 1A. Problem List Summarization</h5>
 +
 +
<b> The deadline for registration is March 1st, after which no further registrations will be accepted.</b>
 +
 
Automatically summarizing patients’ main problems from the daily care notes in the electronic health record can help mitigate information and cognitive overload for clinicians and provide augmented intelligence via computerized diagnostic decision support at the bedside. The task of Problem List Summarization aims to generate a list of diagnoses and problems in a patient’s daily care plan using input from the provider’s progress notes during hospitalization.This task aims to promote NLP model development for downstream applications in diagnostic decision support systems that could improve efficiency and reduce diagnostic errors in hospitals. This task will contain 768 hospital daily progress notes and 2783 diagnoses in the training set, and a new set of 300 daily progress notes will be annotated by physicians as the test set. The annotation methods and annotation quality have previously been reported [https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9354726/ here]. The goal of this shared task is to attract future research efforts in building NLP models for real-world decision support applications, where a system generating relevant and accurate diagnoses will assist the healthcare providers’ decision-making process and improve the quality of care for patients.
 
Automatically summarizing patients’ main problems from the daily care notes in the electronic health record can help mitigate information and cognitive overload for clinicians and provide augmented intelligence via computerized diagnostic decision support at the bedside. The task of Problem List Summarization aims to generate a list of diagnoses and problems in a patient’s daily care plan using input from the provider’s progress notes during hospitalization.This task aims to promote NLP model development for downstream applications in diagnostic decision support systems that could improve efficiency and reduce diagnostic errors in hospitals. This task will contain 768 hospital daily progress notes and 2783 diagnoses in the training set, and a new set of 300 daily progress notes will be annotated by physicians as the test set. The annotation methods and annotation quality have previously been reported [https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9354726/ here]. The goal of this shared task is to attract future research efforts in building NLP models for real-world decision support applications, where a system generating relevant and accurate diagnoses will assist the healthcare providers’ decision-making process and improve the quality of care for patients.
  

Revision as of 14:46, 24 February 2023

SIGBIOMED

BIONLP 2023 and Shared Tasks @ ACL 2023

The 22nd BioNLP workshop associated with the ACL SIGBIOMED special interest group is co-located with ACL 2023


IMPORTANT DATES

TENTATIVE

  • April 24, 2023: Workshop Paper Due Date. Submission site: https://softconf.com/acl2023/BioNLP2023/
  • May 29, 20232: Notification of Acceptance
  • June 6, 2023: Camera-ready papers due
  • June 12, 2023: Pre-recorded video due
  • BioNLP 2023 Workshop at ACL, July 13 OR 14, 2023, Toronto, Canada

WORKSHOP OVERVIEW AND SCOPE

The BioNLP workshop associated with the ACL SIGBIOMED special interest group has established itself as the primary venue for presenting foundational research in language processing for the biological and medical domains. The workshop is running every year since 2002 and continues getting stronger. BioNLP welcomes and encourages work on languages other than English, and inclusion and diversity. BioNLP truly encompasses the breadth of the domain and brings together researchers in bio- and clinical NLP from all over the world. The workshop will continue presenting work on a broad and interesting range of topics in NLP. The interest to biomedical language has broadened significantly due to the COVID-19 pandemic and continues to grow: as access to information becomes easier and more people generate and access health-related text, it becomes clearer that only language technologies can enable and support adequate use of the biomedical text.

BioNLP 2023 will be particularly interested in language processing that supports DEIA (Diversity, Equity, Inclusion and Accessibility). The work on detection and mitigation of bias and misinformation continues to be of interest. Research in languages other than English, particularly, under-represented languages, and health disparities are always of interest to BioNLP.

Other active areas of research include, but are not limited to:

  • Tangible results of biomedical language processing applications;
  • Entity identification and normalization (linking) for a broad range of semantic categories;
  • Extraction of complex relations and events;
  • Discourse analysis;
  • Anaphora/coreference resolution;
  • Text mining / Literature based discovery;
  • Summarization;
  • Τext simplification;
  • Question Answering;
  • Resources and strategies for system testing and evaluation;
  • Infrastructures and pre-trained language models for biomedical NLP (Processing and annotation platforms);
  • Development of synthetic data & data augmentation;
  • Translating NLP research into practice;
  • Getting reproducible results.


SUBMISSION INSTRUCTIONS

Two types of submissions are invited: full (long) papers, short papers, ST_1A, ST_1B, ST_2 shared task participants' reports.

The reports on the shared task participation will be reviewed by the task organizers.

Publication chairs for the tasks:

  • 1A: Yanjun Gao
  • 1B: Jean Benoit Delbrouck
  • 2: Chenghua Lin, Tomas Goldsack

Full (long) papers should not exceed eight (8) pages of text, plus unlimited references. Final versions of full papers will be given one additional page of content (up to 9 pages) so that reviewers' comments can be taken into account. Full papers are intended to be reports of original research.

BioNLP aims to be the forum for interesting, innovative, and promising work involving biomedicine and language technology, whether or not yielding high performance at the moment. This by no means precludes our interest in and preference for mature results, strong performance, and thorough evaluation. Both types of research and combinations thereof are encouraged.

Short papers may consist of up to four (4) pages of content, plus unlimited references. Upon acceptance, short papers will still be given up to five (5) content pages in the proceedings. Appropriate short paper topics include preliminary results, application notes, descriptions of work in progress, etc.


Electronic Submission

Submissions must be electronic and in PDF format, using the Softconf START conference management system at https://softconf.com/acl2023/BioNLP2023/

We strongly recommend consulting the ACL Policies for Submission, Review, and Citation: https://2023.aclweb.org/calls/main_conference/ and using ACL LaTeX style files tailored for this year's conference. Submissions must conform to the official style guidelines: https://2023.aclweb.org/calls/style_and_formatting/

Submissions need to be anonymous.

Dual submission policy: papers may NOT be submitted to the BioNLP 2023 workshop if they are or will be concurrently submitted to another meeting or publication.

Program Committee

 * Sophia Ananiadou, National Centre for Text Mining and University of Manchester, UK 
 * Emilia Apostolova, Anthem, Inc., USA
 * Eiji Aramaki, University of Tokyo, Japan 
 * Steven Bethard, University of Arizona, USA
 * Olivier Bodenreider, US National Library of Medicine 
 * Robert Bossy, Inrae, Université Paris Saclay, France
 * Leonardo Campillos-Llanos, Centro Superior de Investigaciones Científicas - CSIC, Spain
 * Kevin Bretonnel Cohen, University of Colorado School of Medicine, USA 
 * Brian Connolly, Ohio, USA
 * Mike Conway, University of Melbourne, Australia
 * Manirupa Das, Amazon, USA
 * Berry de Bruijn, National Research Council, Canada
 * Dina Demner-Fushman, US National Library of Medicine 
 * Bart Desmet, National Institutes of Health, USA
 * Dmitriy Dligach, Loyola University Chicago, USA
 * Kathleen C.	Fraser, National Research Council Canada
 * Travis Goodwin, Amazon Web Services (AWS), Seattle, Washington, USA
 * Natalia Grabar, CNRS, U Lille, France
 * Cyril Grouin, Université Paris-Saclay, CNRS
 * Tudor Groza, EMBL-EBI
 * Deepak Gupta, US National Library of Medicine 
 * William Hogan, UCSD, USA
 * Thierry Hamon, LIMSI-CNRS, France
 * Richard Jackson, AstraZeneca
 * Antonio Jimeno Yepes, IBM, Melbourne Area, Australia
 * Sarvnaz Karimi, CSIRO, Australia
 * Nazmul Kazi, University of North Florida, USA
 * Roman Klinger, University of Stuttgart, Germany
 * Anna Koroleva, Omdena
 * Andre Lamurias, Aalborg University, Denmark
 * Alberto Lavelli, FBK-ICT, Italy
 * Robert Leaman, US National Library of Medicine 
 * Lung-Hao Lee, National Central University, Taiwan
 * Ulf Leser, Humboldt-Universität zu Berlin, Germany 
 * Timothy Miller, Boston Childrens Hospital and Harvard Medical School, USA
 * Claire Nedellec, French national institute of agronomy (INRA)
 * Guenter Neumann, German Research Center for Artificial Intelligence (DFKI)
 * Mariana Neves, Hasso-Plattner-Institute at the University of Potsdam, Germany
 * Nhung Nguyen, National Centre for Text Mining, University of Manchester, UK
 * Aurélie Névéol, CNRS, France
 * Amandalynne	Paullada, University of Washington School of Medicine
 * Yifan Peng,  Weill Cornell Medical College, USA
 * Laura Plaza, Universidad Nacional de Educación a Distancia
 * Francisco J. Ribadas-Pena, University of Vigo, Spain
 * Anthony Rios, The University of Texas at San Antonio, USA
 * Kirk Roberts, The University of Texas Health Science Center at Houston, USA 
 * Roland Roller, DFKI, Germany
 * Mourad Sarrouti, Sumitovant Biopharma, Inc., USA
 * Diana Sousa, University of Lisbon, Portugal
 * Peng Su, University of Delaware, USA
 * Madhumita Sushil, University of California, San Francisco, USA
 * Mario Sänger, Humboldt Universität zu Berlin, Germany
 * Andrew Taylor, Yale University School of Medicine, USA
 * Karin Verspoor, RMIT University, Australia
 * Leon Weber, Humboldt Universität Berlin, Germany
 * Nathan M. White, James Cook University, Australia
 * Dustin Wright, University of Copenhagen,Denmark
 * Amelie Wührl,  University of Stuttgart, Germany
 * Dongfang Xu, Harvard University, USA
 * Jingqing Zhang,  Imperial College London, UK
 * Ayah Zirikly, Johns Hopkins Whiting School of Engineering, USA
 * Pierre Zweigenbaum, LIMSI - CNRS, France


Organizers

 * Kevin Bretonnel Cohen, University of Colorado School of Medicine
 * Dina Demner-Fushman, US National Library of Medicine
 * Sophia Ananiadou, National Centre for Text Mining and University of Manchester, UK
 * Jun-ichi Tsujii, National Institute of Advanced Industrial Science and Technology, Japan

SHARED TASKS 2023

Shared Tasks on Summarization of Clinical Notes and Scientific Articles

The first task focuses on Clinical Text.

Task 1A. Problem List Summarization

The deadline for registration is March 1st, after which no further registrations will be accepted.

Automatically summarizing patients’ main problems from the daily care notes in the electronic health record can help mitigate information and cognitive overload for clinicians and provide augmented intelligence via computerized diagnostic decision support at the bedside. The task of Problem List Summarization aims to generate a list of diagnoses and problems in a patient’s daily care plan using input from the provider’s progress notes during hospitalization.This task aims to promote NLP model development for downstream applications in diagnostic decision support systems that could improve efficiency and reduce diagnostic errors in hospitals. This task will contain 768 hospital daily progress notes and 2783 diagnoses in the training set, and a new set of 300 daily progress notes will be annotated by physicians as the test set. The annotation methods and annotation quality have previously been reported here. The goal of this shared task is to attract future research efforts in building NLP models for real-world decision support applications, where a system generating relevant and accurate diagnoses will assist the healthcare providers’ decision-making process and improve the quality of care for patients.


Shared Task 1A Registration: https://forms.gle/yp6TKD66G8KGpweN9

Please join our Google discussion group for the important update: https://groups.google.com/g/bionlp2023problemsumm

Full Task 1A Details at:

https://physionet.org/content/bionlp-workshop-2023-task-1a/1.0.0/


Important Dates:

  • Registration Started: January 13th, 2023
  • Releasing of training and validation data: January 13th, 2023
  • Releasing of test data: April 13th, 2023
  • System submission deadline: April 20th, 2023
  • System papers due date: May 4th, 2023
  • Notification of acceptance: June 1st, 2023
  • Camera-ready system papers due: June 13th, 2023
  • BioNLP Workshop Date: July 13th or 14th, 2023


Task 1A Organizers:

  • Majid Afshar, Department of Medicine University of Wisconsin - Madison.
  • Yanjun Gao, University of Wisconsin Madison.
  • Dmitriy Dligach, Department of Computer Science at Loyola University Chicago.
  • Timothy Miller, Boston Children’s Hospital and Harvard Medical School.
Task 1B. Radiology report summarization

Radiology report summarization is a growing area of research. Given the Findings and/or Background sections of a radiology report, the goal is to generate a summary (called an Impression section) that highlights the key observations and conclusions of the radiology study.

The research area of radiology report summarization currently faces an important limitation: most research is carried out on chest X-rays. To palliate these limitations, we propose two datasets: A shared summarization task that includes six different modalities and anatomies, totalling 79,779 samples, based on the MIMIC-III database.

A shared summarization task on chest x-ray radiology reports with images and a brand new out-of-domain test-set from Stanford.

Full Task 1B details at:

https://vilmedic.app/misc/bionlp23/sharedtask

Task 1B Organizers:

  • Jean-Benoit Delbrouck, Stanford University.
  • Maya Varma, Stanford University.


Task 2. Lay Summarization of Biomedical Research Articles

Biomedical publications contain the latest research on prominent health-related topics, ranging from common illnesses to global pandemics. This can often result in their content being of interest to a wide variety of audiences including researchers, medical professionals, journalists, and even members of the public. However, the highly technical and specialist language used within such articles typically makes it difficult for non-expert audiences to understand their contents.

Abstractive summarization models can be used to generate a concise summary of an article, capturing its salient point using words and sentences that aren’t used in the original text. As such, these models have the potential to help broaden access to highly technical documents when trained to generate summaries that are more readable, containing more background information and less technical terminology (i.e., a “lay summary”).

This shared task surrounds the abstractive summarization of biomedical research articles, with an emphasis on controllability and catering to non-expert audiences. Through this task, we aim to help foster increased research interest in controllable summarization that helps broaden access to technical texts and progress toward more usable abstractive summarization models in the biomedical domain.

For more information on Task 2, see:

Detailed descriptions of the motivation, the tasks, and the data are also published in:

  • Goldsack, T., Zhang, Z., Lin, C., Scarton, C.. Making Science Simple: Corpora for the Lay Summarisation of Scientific Literature. EMNLP 2022.
  • Luo, Z., Xie, Q., Ananiadou, S.. Readability Controllable Biomedical Document Summarization. EMNLP 2022 Findings.


Task 2 Organizers:

  • Chenghua Lin, Deputy Director of Research and Innovation in the Computer Science Department, University of Sheffield.
  • Sophia Ananiadou, Turing Fellow, Director of the National Centre for Text Mining and Deputy Director of the Institute of Data Science and AI at the University of Manchester.
  • Carolina Scarton, Computer Science Department at the University of Sheffield.
  • Qianqian Xie, National Centre for Text Mining (NaCTeM).
  • Tomas Goldsack, University of Sheffield.
  • Zheheng Luo, the University of Manchester.
  • Zhihao Zhang, Beihang University.