Second CFP: Advances in Language and Vision Research (ALVR) @ ACL 2024

Event Notification Type: 
Call for Papers
Location: 
Bangkok, Thailand
AttachmentSize
Image icon WechatIMG10-min.jpg800.42 KB
Thursday, 15 August 2024
Contact Email: 
Contact: 
Jing Gu
Submission Deadline: 
Friday, 17 May 2024

ALVR @ ACL 2024 - Call for Papers

Second Call for Papers for ALVR @ ACL 2024
Submission link: https://openreview.net/group?id=aclweb.org/ACL/2024/Workshop/ALVR

Important Dates

  • Paper submission deadline: May 17 (Friday), 2024
  • Notification of acceptance: June 17 (Monday), 2024
  • Camera-ready paper due: July 1 (Monday), 2024
  • Workshop dates: August 15, 2024

Workshop Overview
Third workshop on Advances in Language and Vision Research (ALVR) promote important frontier on Vision&Language and bring researchers together to discuss how to best tackle real-world problems in this area. Language&Vision research has attracted great attention from both natural language processing (NLP) and computer vision (CV) communities, with contemporary research shifting from passive perception, annotated data, templated language, and synthetic imagery to active perception, self-supervised learning, natural language, and real-world environments, and the revolution of Large Vision\&Language Model. In the last few years, we have witnessed multiple breakthroughs in language \& vision research that have a profound impact on other areas.

Submission Topics

  • Self-supervised vision and language pre-training
  • New tasks and datasets that provide real-world solutions in language and vision
  • Text-to-image/video generation and text-guided image/video editing
  • External knowledge integration in visual and language understanding
  • Visually-grounded natural language understanding and generation
  • Language-grounded visual recognition and reasoning
  • Language-grounded embodied agents, e.g., vision-and-language navigation
  • Visually-grounded multilingual study, e.g., multimodal machine translation
  • Shortcomings of the existing large vision\&language models on downstream tasks and solutions
  • Ethics and bias on large vision\&language model.
  • Multidisciplinary study that may involve linguistics, cognitive science, robotics, etc.
  • Explainability and interpretability on large vision\&language model

Submission Instructions
Long papers may consist of up to 8 pages of content, plus unlimited pages for
references and an appendix; final versions of long papers will be given one
additional page of content (up to 9 pages) so that reviewers’ comments can
be considered.
Short papers may consist of up to 4 pages of content, plus unlimited
references and an appendix. Short papers will be given 5 content pages in the
proceedings upon acceptance. Authors are encouraged to use this additional
page to address reviewers’ comments in their final versions.
We are also including a non-archival track to allow dual submission of work to ALVR 2024 and other conferences/journals. Space permitting, these submissions will still participate and present their work in the workshop and will be hosted on the workshop website but will not be included in the official proceedings. Please apply the ACL format and submit through openreview but indicate that this is a cross-submission (non-archival) at the bottom of the submission form.
Submissions should follow the ACL 2024 formatting requirements.

Organizing Committee

  • Jing Gu, University of California, Santa Cruz
  • Tsu-Jui (Ray) Fu, University of California, Santa Barbara
  • Drew Hudson, Google DeepMind
  • Asli Celikyilmaz, Fundamentals AI Research (FAIR) @ Meta
  • William Wang University of California, Santa Barbara
  • Xin (Eric) Wang, University of California, Santa Cruz

Further information can be found online at: https://alvr-workshop.github.io/