Difference between revisions of "BioNLP 2023"

Revision as of 19:24, 8 June 2023

BIONLP 2023 and Shared Tasks @ ACL 2023

The 22nd BioNLP workshop associated with the ACL SIGBIOMED special interest group is co-located with ACL 2023

IMPORTANT DATES

April 24, 2023: Workshop Paper Due Date.
Submission site for the workshop only: https://softconf.com/acl2023/BioNLP2023/
Submission site for the SHARED TASKS only: https://softconf.com/acl2023/BioNLP2023-ST
May 29, 20232: Notification of Acceptance
June 6, 2023: Camera-ready papers due
June 12, 2023: Pre-recorded video due

Video is optional. Instructions (below) are for the video only, not for the final paper submission. Video should not exceed 10 minutes.

Instructions:

 https://docs.google.com/presentation/d/1STKSZ22v3ucS9smfDfhREQhwRB9_bIwu7mnVYKUq7A8/edit?usp=sharing

Form (linked in SLIDE 4) https://acl2023workshops.paperform.co/

BioNLP 2023 Workshop at ACL, July 13, 2023, Toronto, Canada

VISA Information

ACL organizers are processing the requests.

Please see the instructions here: https://2023.aclweb.org/blog/visa-info/

Poster size:

All posters should be A0, orientation: Portrait.

BioNLP 2023: Program TENTATIVE

<

Thursday July 13, 2023
8:30–8:40	Opening remarks
	Session 1: Evaluating speech, models and literature-related tasks
8:40–9:00	Evaluating and Improving Automatic Speech Recognition using Severity Ryan Whetten and Casey Kennington, Boise State University
9:00–9:20	Is the ranking of PubMed similar articles good enough? An evaluation of text similarity methods for three datasets Mariana Neves, Ines Schadock, Beryl Eusemann, Gilbert Schönfelder, Bettina Bert, Daniel Butzke, German Federal Institute for Risk Assessment
9:20–9:40	BIOptimus: Pre-training an Optimal Biomedical Language Model with Curriculum Learning for Named Entity Recognition Vera Pavlova and Mohammed Makhlouf, rttl.ai
9:40–10:00	Promoting Fairness in Classification of Quality of Medical Evidence/i> Simon Suster¹, Timothy Baldwin², Karin Verspoor³, ¹University of Melbourne, ²MBZUAI, ³RMIT University
10:00–10:30	BioLaySumm 2023 Shared Task: Lay Summarisation of Biomedical Research Articles Tomas Goldsack¹, Zheheng Luo², Qianqian Xie², Carolina Scarton¹, Matthew Shardlow³, Sophia Ananiadou², Chenghua Lin¹, ¹University of Sheffield, ²University of Manchester, ³Manchester Metropolitan University/i>
10:30–11:00	*Coffee Break*
	Session 2: Clinical Language Processing
11:00–11:40	Invited Talk: Dementia Detection from Speech: New Developments and Future Directions Speaker: Kathleen Fraser
11:40–12:10	Overview of the Problem List Summarization (ProbSum) 2023 Shared Task on Summarizing Patients' Active Diagnoses and Problems from Electronic Health Record Progress Notes Yanjun Gao¹, Dmitriy Dligach², Timothy Miller³, Majid Afshar¹, ¹University of Wisconsin, ²Loyola University Chicago, ³Boston Children's Hospital and Harvard Medical School
12:10–12:40	Overview of the RadSum23 Shared Task on Multi-modal and Multi-anatomical Radiology Report Summarization Jean-Benoit Delbrouck, Maya Varma, Pierre Chambon, Curtis Langlotz, Stanford University
12:40–13:00	RadAdapt: Radiology Report Summarization via Lightweight Domain Adaptation of Large Language Models Dave Van Veen¹, Cara Van Uden¹, Maayane Attias¹, Anuj Pareek¹, Christian Bluethgen¹, Malgorzata Polacin², Wah Chiu¹, Jean-Benoit Delbrouck¹, Juan Zambrano Chaves¹, Curtis Langlotz¹, Akshay Chaudhari¹, John Pauly¹, ¹Stanford University, ²Stanford University, ETH Zurich
13:00–14:30	*Lunch*
14:30–17:45	Onsite Poster Session 1
	Exploring Partial Knowledge Base Inference in Biomedical Entity Linking Hongyi Yuan¹, Keming Lu², Zheng Yuan³, ¹Tsinghua University, ²University of Southern California, ³Alibaba Group
	How Much do Knowledge Graphs Impact Transformer Models for Extracting Biomedical Events? Laura Zanella and Yannick Toussaint, LORIA, Université de Lorraine
	DISTANT: Distantly Supervised Entity Span Detection and Classification Ken Yano¹, Makoto Miwa², Sophia Ananiadou³, ¹The National Institute of Advanced Industrial Science and Technology, ²Toyota Technological Institute, ³University of Manchester
	Event-independent temporal positioning: application to French clinical text Nesrine Bannour¹, Bastien Rance², Xavier Tannier³, Aurélie Névéol¹, ¹Université Paris Saclay, CNRS, LISN, ²INSERM, centre de Recherche des Cordeliers, Université Paris Cité, Sorbonne Paris Cité, AP-HP, HEGP, HeKa, Inria Paris, ³Sorbonne Université, Inserm, LIMICS
	AliBERT: A Pre-trained Language Model for French Biomedical Text Aman Berhe¹, Guillaume Draznieks², Vincent Martenot², Valentin Masdeu², Lucas Davy², Jean-Daniel Zucker³, ¹SU/IRD UMMISCO & Quinten, ²Quinten, ³SU/IRD, UMMISCO
	Building a Corpus for Biomedical Relation Extraction of Species Mentions Oumaima El Khettari, Solen Quiniou, Samuel Chaffron, Nantes Université - LS2N
	Automated Extraction of Molecular Interactions and Pathway Knowledge using Large Language Model, Galactica: Opportunities and Challenges Gilchan Park¹, Byung-Jun Yoon¹, Xihaier Luo¹, Vanessa López-Marrero¹, Patrick Johnstone¹, Shinjae Yoo², Francis Alexander¹, ¹Brookhaven National Laboratory, ²BNL
	Automatic Glossary of Clinical Terminology: a Large-Scale Dictionary of Biomedical Definitions Generated from Ontological Knowledge François Remy, Kris Demuynck, Thomas Demeester, Ghent University - imec
	Resolving Elliptical Compounds in German Medical Text Niklas Kämmer¹, Florian Borchert¹, Silvia Winkler¹, Gerard de Melo², Matthieu-P. Schapranow¹, 1Hasso Plattner Institute, University of Potsdam, 2HPI/University of Potsdam
	End-to-end clinical temporal information extraction with multi-head attention Timothy Miller¹, Steven Bethard², Dmitriy Dligach³, Guergana Savova¹, ¹Boston Children's Hospital and Harvard Medical School, ²University of Arizona, ³Loyola University Chicago
	Intermediate Domain Finetuning for Weakly Supervised Domain-adaptive Clinical NER Shilpa Suresh, Nazgol Tavabi, Shahriar Golchin, Leah Gilreath, Rafael Garcia-Andujar, Alexander Kim, Joseph Murray, Blake Bacevich, Ata Kiapour, Musculoskeletal Informatics Group, Boston Children's Hospital, Harvard Medical School
	Biomedical Language Models are Robust to Sub-optimal Tokenization Bernal Jimenez Gutierrez, Huan Sun, Yu Su, The Ohio State University
	BioNART: A Biomedical Non-AutoRegressive Transformer for Natural Language Generation Masaki Asada¹ and Makoto Miwa², ¹National Institute of Advanced Industrial Science and Technology, ²Toyota Technological Institute
	Can Social Media Inform Dietary Approaches for Health Management? A Dataset and Benchmark for Low-Carb Diet Skyler Zou, Xiang Dai, Grant Brinkworth, Pennie Taylor, Sarvnaz Karimi, CSIRO
	Hospital Discharge Summarization Data Provenance Paul Landes¹, Aaron Chase², Kunal Patel¹, Sean Huang², Barbara Di Eugenio¹, ¹University of Illinois at Chicago, ²Vanderbilt University
15:30–16:00	*Coffee Break*
14:30–17:45	Virtual Session 1
	Multi-Source (Pre-)Training for Cross-Domain Measurement, Unit and Context Extraction Yueling Li¹, Sebastian Martschat¹, Simone Paolo Ponzetto², ¹BASF SE, ²University of Mannheim
	Gaussian Distributed Prototypical Network for Few-shot Genomic Variant Detection Jiarun Cao, Niels Peek, Andrew Renehan, Sophia Ananiadou, University of Manchester
	Boosting Radiology Report Generation by Infusing Comparison Prior Sanghwan Kim¹, Farhad Nooralahzadeh², Morteza Rohanian², Koji Fujimoto³, Mizuho Nishio³, Ryo Sakamoto³, Fabio Rinaldi⁴, Michael Krauthammer², ¹ETH Zürich, ²University of Zurich, ³Kyoto University Graduate School of Medicine, ⁴IDSIA, Swiss AI Institute
	Using Bottleneck Adapters to Identify Cancer in Clinical Notes under Low-Resource Constraints Omid Rohanian, Hannah Jauncey, Mohammadmahdi Nouriborji, Vinod Kumar, Bronner P. Gonçalves, Christiana Kartsonaki, ISARIC Clinical Characterisation Group, Laura Merson, David Clifton, University of Oxford
	Zero-shot Temporal Relation Extraction with ChatGPT Chenhan Yuan, Qianqian Xie, Sophia Ananiadou, University of Manchester
	Good Data, Large Data, or No Data? Comparing Three Approaches in Developing Research Aspect Classifiers for Biomedical Papers Shreya Chandrasekhar, Chieh-Yang Huang, Ting-Hao Huang, Penn State University
	Sentiment-guided Transformer with Severity-aware Contrastive Learning for Depression Detection on Social Media Tianlin Zhang, Kailai Yang, Sophia Ananiadou, University of Manchester
	Exploring Drug Switching in Patients: A Deep Learning-based Approach to Extract Drug Changes and Reasons from Social Media Mourad Sarrouti, Carson Tao, Yoann Mamy Randriamihaja, Sumitovant Biopharma
	An end-to-end neural model based on cliques and scopes for frame extraction in long breast radiology reports Perceval Wajsburt¹ and Xavier Tannier², ¹Sorbonne Université, ²Sorbonne Université, Inserm, LIMICS
	Large Language Models as Instructors: A Study on Multilingual Clinical Entity Extraction Simon Meoni¹, Éric De la Clergerie², Théo Ryffel³, ¹Arkhn/INRIA, ²Iniria, ³Arkhn
	ADEQA: A Question Answer based approach for joint ADE-Suspect Extraction using Sequence-To-Sequence Transformers Vinayak Arannil, Tomal Deb, Atanu Roy, Amazon
	Privacy Aware Question-Answering System for Online Mental Health Risk Assessment Prateek Chhikara, Ujjwal Pasupulety, John Marshall, Dhiraj Chaurasia, Shweta Kumari, University of Southern California
	Multiple Evidence Combination for Fact-Checking of Health-Related Information Pritam Deka, Anna Jurek-Loughrey, Deepak P, Queen's University Belfast
	Comparing and combining some popular NER approaches on Biomedical tasks Harsh Verma, Sabine Bergler, Narjesossadat Tahaei, Concordia University
	Extracting Drug-Drug and Protein-Protein Interactions from Text using a Continuous Update of Tree-Transformers Sudipta Singha Roy and Robert E. Mercer, The University of Western Ontario
	Augmenting Reddit Posts to Determine Wellness Dimensions impacting Mental Health Chandreen Liyanage¹, Muskan Garg², Vijay Mago¹, Sunghwan Sohn², ¹Lakehead University, ²Mayo Clinic
	Evaluation of ChatGPT on Biomedical Tasks: A Zero-Shot Comparison with Fine-Tuned Generative Transformers Israt Jahan¹, Md Tahmid Rahman Laskar², Chun Peng¹, Jimmy Huang¹, ¹York University, ²Dialpad Inc.
	Distantly Supervised Document-Level Biomedical Relation Extraction with Neighborhood Knowledge Graphs Takuma Matsubara, Makoto Miwa, Yutaka Sasaki, Toyota Technological Institute
	Biomedical Relation Extraction with Entity Type Markers and Relation-specific Question Answering Koshi Yamada, Makoto Miwa, Yutaka Sasaki, Toyota Technological Institute
	Biomedical Document Classification with Literature Graph Representations of Bibliographies and Entities Ryuki Ida, Makoto Miwa, Yutaka Sasaki, Toyota Technological Institute
	Zero-Shot Information Extraction for Clinical Meta-Analysis using Large Language Models David Kartchner^1,3, Selvi Ramalingam², Irfan Al-Hussaini³, Olivia Kronick³, Cassie Mitchell³, ¹Enveda Biosciences, ²Emory University, ³Georgia Institute of Technology
	WeLT: Improving Biomedical Fine-tuned Pre-trained Language Models with Cost-sensitive Learning Ghadeer Mobasher^1,2, Wolfgang Müller², Olga Krebs², Michael Gertz¹ ¹Heidelberg University, ²Heidelberg Institute for Theoretical Studies – HITS gGmbH
17:45-18:00	Closing remarks

WORKSHOP OVERVIEW AND SCOPE

The BioNLP workshop associated with the ACL SIGBIOMED special interest group has established itself as the primary venue for presenting foundational research in language processing for the biological and medical domains. The workshop is running every year since 2002 and continues getting stronger. BioNLP welcomes and encourages work on languages other than English, and inclusion and diversity. BioNLP truly encompasses the breadth of the domain and brings together researchers in bio- and clinical NLP from all over the world. The workshop will continue presenting work on a broad and interesting range of topics in NLP. The interest to biomedical language has broadened significantly due to the COVID-19 pandemic and continues to grow: as access to information becomes easier and more people generate and access health-related text, it becomes clearer that only language technologies can enable and support adequate use of the biomedical text.

BioNLP 2023 will be particularly interested in language processing that supports DEIA (Diversity, Equity, Inclusion and Accessibility). The work on detection and mitigation of bias and misinformation continues to be of interest. Research in languages other than English, particularly, under-represented languages, and health disparities are always of interest to BioNLP.

Other active areas of research include, but are not limited to:

Tangible results of biomedical language processing applications;
Entity identification and normalization (linking) for a broad range of semantic categories;
Extraction of complex relations and events;
Discourse analysis;
Anaphora/coreference resolution;
Text mining / Literature based discovery;
Summarization;
Τext simplification;
Question Answering;
Resources and strategies for system testing and evaluation;
Infrastructures and pre-trained language models for biomedical NLP (Processing and annotation platforms);
Development of synthetic data & data augmentation;
Translating NLP research into practice;
Getting reproducible results.

SUBMISSION INSTRUCTIONS

Two types of submissions are invited: full (long) papers and short papers.

Submission site for the workshop only: https://softconf.com/acl2023/BioNLP2023/

Shared task participants' reports should be submitted at https://softconf.com/acl2023/BioNLP2023-ST.

The reports on the shared task participation will be reviewed by the task organizers.

Publication chairs for the tasks:

1A: Yanjun Gao
1B: Jean Benoit Delbrouck
2: Chenghua Lin, Tomas Goldsack

Full (long) papers should not exceed eight (8) pages of text, plus unlimited references. Final versions of full papers will be given one additional page of content (up to 9 pages) so that reviewers' comments can be taken into account. Full papers are intended to be reports of original research.

BioNLP aims to be the forum for interesting, innovative, and promising work involving biomedicine and language technology, whether or not yielding high performance at the moment. This by no means precludes our interest in and preference for mature results, strong performance, and thorough evaluation. Both types of research and combinations thereof are encouraged.

Short papers may consist of up to four (4) pages of content, plus unlimited references. Upon acceptance, short papers will still be given up to five (5) content pages in the proceedings. Appropriate short paper topics include preliminary results, application notes, descriptions of work in progress, etc.

Electronic Submission

Submissions must be electronic and in PDF format, using the Softconf START conference management system at https://softconf.com/acl2023/BioNLP2023/

We strongly recommend consulting the ACL Policies for Submission, Review, and Citation: https://2023.aclweb.org/calls/main_conference/ and using ACL LaTeX style files tailored for this year's conference. Submissions must conform to the official style guidelines: https://2023.aclweb.org/calls/style_and_formatting/

Submissions need to be anonymous.

Dual submission policy: papers may NOT be submitted to the BioNLP 2023 workshop if they are or will be concurrently submitted to another meeting or publication.

Program Committee

 * Sophia Ananiadou, National Centre for Text Mining and University of Manchester, UK 
 * Emilia Apostolova, Anthem, Inc., USA
 * Eiji Aramaki, University of Tokyo, Japan 
 * Saadullah Amin, Saarland University, Germany
 * Steven Bethard, University of Arizona, USA
 * Olivier Bodenreider, US National Library of Medicine 
 * Robert Bossy, Inrae, Université Paris Saclay, France
 * Leonardo Campillos-Llanos, Centro Superior de Investigaciones Científicas - CSIC, Spain
 * Kevin Bretonnel Cohen, University of Colorado School of Medicine, USA 
 * Brian Connolly, Ohio, USA
 * Mike Conway, University of Melbourne, Australia
 * Manirupa Das, Amazon, USA
 * Berry de Bruijn, National Research Council, Canada
 * Dina Demner-Fushman, US National Library of Medicine 
 * Bart Desmet, National Institutes of Health, USA
 * Dmitriy Dligach, Loyola University Chicago, USA
 * Kathleen C.	Fraser, National Research Council Canada
 * Travis Goodwin, Amazon Web Services (AWS), Seattle, Washington, USA
 * Natalia Grabar, CNRS, U Lille, France
 * Cyril Grouin, Université Paris-Saclay, CNRS
 * Tudor Groza, EMBL-EBI
 * Deepak Gupta, US National Library of Medicine 
 * William Hogan, UCSD, USA
 * Thierry Hamon, LIMSI-CNRS, France
 * Richard Jackson, AstraZeneca
 * Antonio Jimeno Yepes, IBM, Melbourne Area, Australia
 * Sarvnaz Karimi, CSIRO, Australia
 * Nazmul Kazi, University of North Florida, USA
 * Roman Klinger, University of Stuttgart, Germany
 * Anna Koroleva, Omdena
 * Majid Latifi, Department of Computer Science, University of York, York, UK
 * Andre Lamurias, Aalborg University, Denmark
 * Alberto Lavelli, FBK-ICT, Italy
 * Robert Leaman, US National Library of Medicine 
 * Lung-Hao Lee, National Central University, Taiwan
 * Ulf Leser, Humboldt-Universität zu Berlin, Germany 
 * Timothy Miller, Boston Childrens Hospital and Harvard Medical School, USA
 * Claire Nedellec, French national institute of agronomy (INRA)
 * Guenter Neumann, German Research Center for Artificial Intelligence (DFKI)
 * Mariana Neves, Hasso-Plattner-Institute at the University of Potsdam, Germany
 * Nhung Nguyen, National Centre for Text Mining, University of Manchester, UK
 * Aurélie Névéol, CNRS, France
 * Amandalynne	Paullada, University of Washington School of Medicine
 * Yifan Peng,  Weill Cornell Medical College, USA
 * Laura Plaza, Universidad Nacional de Educación a Distancia
 * Francisco J. Ribadas-Pena, University of Vigo, Spain
 * Anthony Rios, The University of Texas at San Antonio, USA
 * Kirk Roberts, The University of Texas Health Science Center at Houston, USA 
 * Roland Roller, DFKI, Germany
 * Mourad Sarrouti, Sumitovant Biopharma, Inc., USA
 * Diana Sousa, University of Lisbon, Portugal
 * Peng Su, University of Delaware, USA
 * Madhumita Sushil, University of California, San Francisco, USA
 * Mario Sänger, Humboldt Universität zu Berlin, Germany
 * Andrew Taylor, Yale University School of Medicine, USA
 * Karin Verspoor, RMIT University, Australia
 * Leon Weber, Humboldt Universität Berlin, Germany
 * Nathan M. White, James Cook University, Australia
 * Dustin Wright, University of Copenhagen,Denmark
 * Amelie Wührl,  University of Stuttgart, Germany
 * Dongfang Xu, Harvard University, USA
 * Jingqing Zhang,  Imperial College London, UK
 * Ayah Zirikly, Johns Hopkins Whiting School of Engineering, USA
 * Pierre Zweigenbaum, LIMSI - CNRS, France

Organizers

 * Kevin Bretonnel Cohen, University of Colorado School of Medicine
 * Dina Demner-Fushman, US National Library of Medicine
 * Sophia Ananiadou, National Centre for Text Mining and University of Manchester, UK
 * Jun-ichi Tsujii, National Institute of Advanced Industrial Science and Technology, Japan

SHARED TASKS 2023

Shared Tasks on Summarization of Clinical Notes and Scientific Articles

The first task focuses on Clinical Text.

Task 1A. Problem List Summarization

Codalab competition for Problem List Summarization Evaluation: https://codalab.lisn.upsaclay.fr/competitions/12388 Test Set Release: https://physionet.org/content/bionlp-workshop-2023-task-1a/1.1.0/

The deadline for registration is March 1st, after which no further registrations will be accepted.

Automatically summarizing patients’ main problems from the daily care notes in the electronic health record can help mitigate information and cognitive overload for clinicians and provide augmented intelligence via computerized diagnostic decision support at the bedside. The task of Problem List Summarization aims to generate a list of diagnoses and problems in a patient’s daily care plan using input from the provider’s progress notes during hospitalization.This task aims to promote NLP model development for downstream applications in diagnostic decision support systems that could improve efficiency and reduce diagnostic errors in hospitals. This task will contain 768 hospital daily progress notes and 2783 diagnoses in the training set, and a new set of 300 daily progress notes will be annotated by physicians as the test set. The annotation methods and annotation quality have previously been reported here. The goal of this shared task is to attract future research efforts in building NLP models for real-world decision support applications, where a system generating relevant and accurate diagnoses will assist the healthcare providers’ decision-making process and improve the quality of care for patients.

Shared Task 1A Registration: https://forms.gle/yp6TKD66G8KGpweN9

Please join our Google discussion group for the important update: https://groups.google.com/g/bionlp2023problemsumm

Full Task 1A Details at:

https://physionet.org/content/bionlp-workshop-2023-task-1a/1.0.0/

Important Dates:

~~Registration Started: January 13th, 2023~~
~~Releasing of training and validation data: January 13th, 2023~~
~~Registration stops: March 1, 2023~~
Releasing of test data: April 13th, 2023

Codalab competition for Problem List Summarization Evaluation: https://codalab.lisn.upsaclay.fr/competitions/12388 Test Set Release: https://physionet.org/content/bionlp-workshop-2023-task-1a/1.1.0/

System submission deadline: April 20th, 2023
System papers due date: April 28th, 2023
Notification of acceptance: June 1st, 2023
Camera-ready system papers due: June 6, 2023
BioNLP Workshop Date: July 13th, 2023

Task 1A Organizers:

Majid Afshar, Department of Medicine University of Wisconsin - Madison.
Yanjun Gao, University of Wisconsin Madison.
Dmitriy Dligach, Department of Computer Science at Loyola University Chicago.
Timothy Miller, Boston Children’s Hospital and Harvard Medical School.

Task 1B. Radiology report summarization

Radiology report summarization is a growing area of research. Given the Findings and/or Background sections of a radiology report, the goal is to generate a summary (called an Impression section) that highlights the key observations and conclusions of the radiology study.

The research area of radiology report summarization currently faces an important limitation: most research is carried out on chest X-rays. To palliate these limitations, we propose two datasets: A shared summarization task that includes six different modalities and anatomies, totalling 79,779 samples, based on the MIMIC-III database.

A shared summarization task on chest x-ray radiology reports with images and a brand new out-of-domain test-set from Stanford.

Full Task 1B details at:

https://vilmedic.app/misc/bionlp23/sharedtask

Task 1B Organizers:

Jean-Benoit Delbrouck, Stanford University.
Maya Varma, Stanford University.

Task 2. Lay Summarization of Biomedical Research Articles

Biomedical publications contain the latest research on prominent health-related topics, ranging from common illnesses to global pandemics. This can often result in their content being of interest to a wide variety of audiences including researchers, medical professionals, journalists, and even members of the public. However, the highly technical and specialist language used within such articles typically makes it difficult for non-expert audiences to understand their contents.

Abstractive summarization models can be used to generate a concise summary of an article, capturing its salient point using words and sentences that aren’t used in the original text. As such, these models have the potential to help broaden access to highly technical documents when trained to generate summaries that are more readable, containing more background information and less technical terminology (i.e., a “lay summary”).

This shared task surrounds the abstractive summarization of biomedical research articles, with an emphasis on controllability and catering to non-expert audiences. Through this task, we aim to help foster increased research interest in controllable summarization that helps broaden access to technical texts and progress toward more usable abstractive summarization models in the biomedical domain.

For more information on Task 2, see:

Main site: https://biolaysumm.org/
CodaLab page - subtask 1: https://codalab.lisn.upsaclay.fr/competitions/9541
CodaLab page - subtask 2: https://codalab.lisn.upsaclay.fr/competitions/9544

Detailed descriptions of the motivation, the tasks, and the data are also published in:

Goldsack, T., Zhang, Z., Lin, C., Scarton, C.. Making Science Simple: Corpora for the Lay Summarisation of Scientific Literature. EMNLP 2022.
Luo, Z., Xie, Q., Ananiadou, S.. Readability Controllable Biomedical Document Summarization. EMNLP 2022 Findings.

Task 2 Organizers:

Chenghua Lin, Deputy Director of Research and Innovation in the Computer Science Department, University of Sheffield.
Sophia Ananiadou, Turing Fellow, Director of the National Centre for Text Mining and Deputy Director of the Institute of Data Science and AI at the University of Manchester.
Carolina Scarton, Computer Science Department at the University of Sheffield.
Qianqian Xie, National Centre for Text Mining (NaCTeM).
Tomas Goldsack, University of Sheffield.
Zheheng Luo, the University of Manchester.
Zhihao Zhang, Beihang University.

@@ Line 39: / Line 39: @@
 ===BioNLP 2023: Program TENTATIVE===
+<table cellspacing="0" cellpadding="5" border="0">
+<tr><td colspan=2><h4>Thursday July 13, 2023</h4></td></tr>
+<tr><td valign=top>8:30&#8211;8:40</td><td valign=top><b> Opening remarks</b></td></tr>
+<tr><td valign=top>&nbsp;</td><td valign=top><b>Session 1: Evaluating speech, models and literature-related tasks</b></td></tr>
+<tr><td valign=top width=100>8:40&#8211;9:00</td><td valign=top align=left><i>Evaluating and Improving Automatic Speech Recognition using Severity</i><br>
+Ryan Whetten and Casey Kennington, <i>Boise State University</i></td></tr>
+<tr><td valign=top width=100>9:00&#8211;9:20</td><td valign=top align=left><i>Is the ranking of PubMed similar articles good enough? An evaluation of text similarity methods for three datasets</i><br>
+Mariana Neves, Ines Schadock, Beryl Eusemann, Gilbert Schönfelder, Bettina Bert, Daniel Butzke, <i>German Federal Institute for Risk Assessment</i></td></tr>
+<tr><td valign=top width=100>9:20&#8211;9:40</td><td valign=top align=left><i>BIOptimus: Pre-training an Optimal Biomedical Language Model with Curriculum Learning for Named Entity Recognition</i><br>
+Vera Pavlova and Mohammed Makhlouf, <i>rttl.ai</i></td></tr>
+<tr><td valign=top width=100>9:40&#8211;10:00</td><td valign=top align=left><i>Promoting Fairness in Classification of Quality of Medical Evidence/i><br>Simon Suster<sup>1</sup>, Timothy Baldwin<sup>2</sup>, Karin Verspoor<sup>3</sup>, <i><sup>1</sup>University of Melbourne, <sup>2</sup>MBZUAI, <sup>3</sup>RMIT University</i></td></tr>
+<tr><td valign=top width=100>10:00&#8211;10:30</td><td valign=top align=left><i>BioLaySumm 2023 Shared Task: Lay Summarisation of Biomedical Research Articles</i><br>
+Tomas Goldsack<sup>1</sup>, Zheheng Luo<sup>2</sup>, Qianqian Xie<sup>2</sup>, Carolina Scarton<sup>1</sup>, Matthew Shardlow<sup>3</sup>, Sophia Ananiadou<sup>2</sup>, Chenghua Lin<sup>1</sup>, <i>
+<sup>1</sup>University of Sheffield, <sup>2</sup>University of Manchester, <sup>3</sup>Manchester Metropolitan University/i></td></tr>
+<tr><td valign=top style="padding-top: 14px;"><b>10:30&#8211;11:00</b></td><td valign=top style="padding-top: 14px;"><b><em>Coffee Break</em></b></td></tr>
+<tr><td valign=top>&nbsp;</td><td valign=top><b>Session 2: Clinical Language Processing</b></td></tr>
+<tr><td valign=top style="padding-top: 14px;"><b>11:00&#8211;11:40</b></td><td valign=top style="padding-top: 14px;"><b>Invited Talk: <i>Dementia Detection from Speech: New Developments and Future Directions</i> <br> Speaker:  Kathleen Fraser</b></td></tr>
+<tr><td valign=top width=100>11:40&#8211;12:10</td><td valign=top align=left><i>Overview of the Problem List Summarization (ProbSum) 2023 Shared Task on Summarizing Patients' Active Diagnoses and Problems from Electronic Health Record Progress Notes</i><br>
+Yanjun Gao<sup>1</sup>, Dmitriy Dligach<sup>2</sup>, Timothy Miller<sup>3</sup>, Majid Afshar<sup>1</sup>, <i>
+<sup>1</sup>University of Wisconsin, <sup>2</sup>Loyola University Chicago, <sup>3</sup>Boston Children's Hospital and Harvard Medical School</i></td></tr>
+<tr><td valign=top width=100>12:10&#8211;12:40</td><td valign=top align=left><i>Overview of the RadSum23 Shared Task on Multi-modal and Multi-anatomical Radiology Report Summarization</i><br>
+Jean-Benoit Delbrouck, Maya Varma, Pierre Chambon, Curtis Langlotz, <i>Stanford University</i></td></tr>
+<tr><td valign=top width=100>12:40&#8211;13:00</td><td valign=top align=left><i>RadAdapt: Radiology Report Summarization via Lightweight Domain Adaptation of Large Language Models</i><br>
+Dave Van Veen<sup>1</sup>, Cara Van Uden<sup>1</sup>, Maayane Attias<sup>1</sup>, Anuj Pareek<sup>1</sup>, Christian Bluethgen<sup>1</sup>, Malgorzata Polacin<sup>2</sup>, Wah Chiu<sup>1</sup>, Jean-Benoit Delbrouck<sup>1</sup>, Juan Zambrano Chaves<sup>1</sup>, Curtis Langlotz<sup>1</sup>, Akshay Chaudhari<sup>1</sup>, John Pauly<sup>1</sup>, <i>
+<sup>1</sup>Stanford University, <sup>2</sup>Stanford University, ETH Zurich</i></td></tr>
+<<tr><td valign=top style="padding-top: 14px;"><b>13:00&#8211;14:30</b></td><td valign=top style="padding-top: 14px;"><b><em>Lunch</em></b></td></tr>
+<tr><td valign=top style="padding-top: 14px;"><b>14:30&#8211;17:45</b></td><td valign=top style="padding-top: 14px;"><b>Onsite Poster Session 1</b></td></tr>
+<tr><td valign=top width=100>&nbsp;</td><td valign=top align=left><i>Exploring Partial Knowledge Base Inference in Biomedical Entity Linking</i><br>Hongyi Yuan<sup>1</sup>, Keming Lu<sup>2</sup>, Zheng Yuan<sup>3</sup>, <i>
+<sup>1</sup>Tsinghua University, <sup>2</sup>University of Southern California, <sup>3</sup>Alibaba Group</i></td></tr>
+<tr><td valign=top width=100>&nbsp;</td><td valign=top align=left><i>How Much do Knowledge Graphs Impact Transformer Models for Extracting Biomedical Events?</i><br>
+Laura Zanella and Yannick Toussaint, <i>LORIA, Université de Lorraine</i></td></tr>
+<tr><td valign=top width=100>&nbsp;</td><td valign=top align=left><i>DISTANT: Distantly Supervised Entity Span Detection and Classification</i><br>
+Ken Yano<sup>1</sup>, Makoto Miwa<sup>2</sup>, Sophia Ananiadou<sup>3</sup>, <i>
+<sup>1</sup>The National Institute of Advanced Industrial Science and Technology, <sup>2</sup>Toyota Technological Institute, <sup>3</sup>University of Manchester</i></td></tr>
+<tr><td valign=top width=100>&nbsp;</td><td valign=top align=left><i>Event-independent temporal positioning: application to French clinical text</i><br>
+Nesrine Bannour<sup>1</sup>, Bastien Rance<sup>2</sup>, Xavier Tannier<sup>3</sup>, Aurélie Névéol<sup>1</sup>, <i>
+<sup>1</sup>Université Paris Saclay, CNRS, LISN, <sup>2</sup>INSERM, centre de Recherche des Cordeliers, Université Paris Cité, Sorbonne Paris Cité, AP-HP, HEGP, HeKa, Inria Paris, <sup>3</sup>Sorbonne Université, Inserm, LIMICS</i></td></tr>
+<tr><td valign=top width=100>&nbsp;</td><td valign=top align=left><i>AliBERT: A Pre-trained Language Model for French Biomedical Text</i><br>
+Aman Berhe<sup>1</sup>, Guillaume Draznieks<sup>2</sup>, Vincent Martenot<sup>2</sup>, Valentin Masdeu<sup>2</sup>, Lucas Davy<sup>2</sup>, Jean-Daniel Zucker<sup>3</sup>, <i>
+<sup>1</sup>SU/IRD UMMISCO & Quinten, <sup>2</sup>Quinten, <sup>3</sup>SU/IRD, UMMISCO</i></td></tr>
+<tr><td valign=top width=100>&nbsp;</td><td valign=top align=left><i>Building a Corpus for Biomedical Relation Extraction of Species Mentions</i><br>
+Oumaima El Khettari, Solen Quiniou, Samuel Chaffron, <i>Nantes Université - LS2N</i></td></tr>
+<tr><td valign=top width=100>&nbsp;</td><td valign=top align=left><i>Automated Extraction of Molecular Interactions and Pathway Knowledge using Large Language Model, Galactica: Opportunities and Challenges</i><br>
+Gilchan Park<sup>1</sup>, Byung-Jun Yoon<sup>1</sup>, Xihaier Luo<sup>1</sup>, Vanessa López-Marrero<sup>1</sup>, Patrick Johnstone<sup>1</sup>, Shinjae Yoo<sup>2</sup>, Francis Alexander<sup>1</sup>, <i> <sup>1</sup>Brookhaven National Laboratory, <sup>2</sup>BNL
+<tr><td valign=top width=100>&nbsp;</td><td valign=top align=left><i>Automatic Glossary of Clinical Terminology: a Large-Scale Dictionary of Biomedical Definitions Generated from Ontological Knowledge</i><br>
+François Remy, Kris Demuynck, Thomas Demeester, <i>Ghent University - imec</i></td></tr>
+<tr><td valign=top width=100>&nbsp;</td><td valign=top align=left><i>Resolving Elliptical Compounds in German Medical Text</i><br>
+Niklas Kämmer<sup>1</sup>, Florian Borchert<sup>1</sup>, Silvia Winkler<sup>1</sup>, Gerard de Melo<sup>2</sup>, Matthieu-P. Schapranow<sup>1</sup>, <i>
+Hasso Plattner Institute, University of Potsdam, 2HPI/University of Potsdam</i></td></tr>
+<tr><td valign=top width=100>&nbsp;</td><td valign=top align=left><i>End-to-end clinical temporal information extraction with multi-head attention</i><br>
+Timothy Miller<sup>1</sup>, Steven Bethard<sup>2</sup>, Dmitriy Dligach<sup>3</sup>, Guergana Savova<sup>1</sup>, <i> <sup>1</sup>Boston Children's Hospital and Harvard Medical School, <sup>2</sup>University of Arizona, <sup>3</sup>Loyola University Chicago</i></td></tr>
+<tr><td valign=top width=100>&nbsp;</td><td valign=top align=left><i>Intermediate Domain Finetuning for Weakly Supervised Domain-adaptive Clinical NER</i><br>
+Shilpa Suresh, Nazgol Tavabi, Shahriar Golchin, Leah Gilreath, Rafael Garcia-Andujar, Alexander Kim, Joseph Murray, Blake Bacevich, Ata Kiapour, <i>Musculoskeletal Informatics Group, Boston Children's Hospital, Harvard Medical School</i></td></tr>
+<tr><td valign=top width=100>&nbsp;</td><td valign=top align=left><i>Biomedical Language Models are Robust to Sub-optimal Tokenization</i><br>
+Bernal Jimenez Gutierrez, Huan Sun, Yu Su, <I>The Ohio State University</i></td></tr>
+<tr><td valign=top width=100>&nbsp;</td><td valign=top align=left><i>BioNART: A Biomedical Non-AutoRegressive Transformer for Natural Language Generation</i><br>
+Masaki Asada<sup>1</sup> and Makoto Miwa<sup>2</sup>, <i>
+<sup>1</sup>National Institute of Advanced Industrial Science and Technology, <sup>2</sup>Toyota Technological Institute</i></td></tr>
+<tr><td valign=top width=100>&nbsp;</td><td valign=top align=left><i>Can Social Media Inform Dietary Approaches for Health Management? A Dataset and Benchmark for Low-Carb Diet</i><br>
+Skyler Zou, Xiang Dai, Grant Brinkworth, Pennie Taylor, Sarvnaz Karimi, <i>CSIRO</i></td></tr>
+<tr><td valign=top width=100>&nbsp;</td><td valign=top align=left><i>Hospital Discharge Summarization Data Provenance</i><br>
+Paul Landes<sup>1</sup>, Aaron Chase<sup>2</sup>, Kunal Patel<sup>1</sup>, Sean Huang<sup>2</sup>, Barbara Di Eugenio<sup>1</sup>, <i>
+<sup>1</sup>University of Illinois at Chicago, <sup>2</sup>Vanderbilt University</i></td></tr>
+<tr><td valign=top style="padding-top: 14px;"><b>15:30&#8211;16:00</b></td><td valign=top style="padding-top: 14px;"><b><em>Coffee Break</em></b></td></tr>
+<tr><td valign=top style="padding-top: 14px;"><b>14:30&#8211;17:45</b></td><td valign=top style="padding-top: 14px;"><b>Virtual Session 1</b></td></tr>
+<tr><td valign=top width=100>&nbsp;</td><td valign=top align=left><i>Multi-Source (Pre-)Training for Cross-Domain Measurement, Unit and Context Extraction</i><br>
+Yueling Li<sup>1</sup>, Sebastian Martschat<sup>1</sup>, Simone Paolo Ponzetto<sup>2</sup>, <i>
+<sup>1</sup>BASF SE, <sup>2</sup>University of Mannheim</i></td></tr>
+<tr><td valign=top width=100>&nbsp;</td><td valign=top align=left><i>Gaussian Distributed Prototypical Network for Few-shot Genomic Variant Detection</i><br>
+Jiarun Cao, Niels Peek, Andrew Renehan, Sophia Ananiadou, <i> University of Manchester</i></td></tr>
+<tr><td valign=top width=100>&nbsp;</td><td valign=top align=left><i>Boosting Radiology Report Generation by Infusing Comparison Prior</i><br>
+Sanghwan Kim<sup>1</sup>, Farhad Nooralahzadeh<sup>2</sup>, Morteza Rohanian<sup>2</sup>, Koji Fujimoto<sup>3</sup>, Mizuho Nishio<sup>3</sup>, Ryo Sakamoto<sup>3</sup>, Fabio Rinaldi<sup>4</sup>, Michael Krauthammer<sup>2</sup>, <i>
+<sup>1</sup>ETH Zürich, <sup>2</sup>University of Zurich, <sup>3</sup>Kyoto University Graduate School of Medicine, <sup>4</sup>IDSIA, Swiss AI Institute</i></td></tr>
+<tr><td valign=top width=100>&nbsp;</td><td valign=top align=left><i>Using Bottleneck Adapters to Identify Cancer in Clinical Notes under Low-Resource Constraints</i><br>
+Omid Rohanian, Hannah Jauncey, Mohammadmahdi Nouriborji, Vinod Kumar, Bronner P. Gonçalves, Christiana Kartsonaki, ISARIC Clinical Characterisation Group, Laura Merson, David Clifton, <i>
+University of Oxford</i></td></tr>
+<tr><td valign=top width=100>&nbsp;</td><td valign=top align=left><i>Zero-shot Temporal Relation Extraction with ChatGPT</i><br>
+Chenhan Yuan, Qianqian Xie, Sophia Ananiadou, <i>University of Manchester</i></td></tr>
+<tr><td valign=top width=100>&nbsp;</td><td valign=top align=left><i>Good Data, Large Data, or No Data? Comparing Three Approaches in Developing Research Aspect Classifiers for Biomedical Papers</i><br>
+Shreya Chandrasekhar, Chieh-Yang Huang, Ting-Hao Huang, <i> Penn State University</i></td></tr>
+<tr><td valign=top width=100>&nbsp;</td><td valign=top align=left><i>Sentiment-guided Transformer with Severity-aware Contrastive Learning for Depression Detection on Social Media</i><br>
+Tianlin Zhang, Kailai Yang, Sophia Ananiadou, <i>University of Manchester</i></td></tr>
+<tr><td valign=top width=100>&nbsp;</td><td valign=top align=left><i>Exploring Drug Switching in Patients: A Deep Learning-based Approach to Extract Drug Changes and Reasons from Social Media</i><br>
+Mourad Sarrouti, Carson Tao, Yoann Mamy Randriamihaja, <i>Sumitovant Biopharma</i></td></tr>
+<tr><td valign=top width=100>&nbsp;</td><td valign=top align=left><i>An end-to-end neural model based on cliques and scopes for frame extraction in long breast radiology reports</i><br>
+Perceval Wajsburt<sup>1</sup> and Xavier Tannier<sup>2</sup>, <i>
+<sup>1</sup>Sorbonne Université, <sup>2</sup>Sorbonne Université, Inserm, LIMICS</i></td></tr>
+<tr><td valign=top width=100>&nbsp;</td><td valign=top align=left><i>Large Language Models as Instructors: A Study on Multilingual Clinical Entity Extraction</i><br>
+Simon Meoni<sup>1</sup>, Éric De la Clergerie<sup>2</sup>, Théo Ryffel<sup>3</sup>,<i>
+<sup>1</sup>Arkhn/INRIA, <sup>2</sup>Iniria, <sup>3</sup>Arkhn</i></td></tr>
+<tr><td valign=top width=100>&nbsp;</td><td valign=top align=left><i>ADEQA: A Question Answer based approach for joint ADE-Suspect Extraction using Sequence-To-Sequence Transformers</i><br>
+Vinayak Arannil, Tomal Deb, Atanu Roy, <i>Amazon</i></td></tr>
+<tr><td valign=top width=100>&nbsp;</td><td valign=top align=left><i>Privacy Aware Question-Answering System for Online Mental Health Risk Assessment</i><br>
+Prateek Chhikara, Ujjwal Pasupulety, John Marshall, Dhiraj Chaurasia, Shweta Kumari, <i>University of Southern California</i></td></tr>
+<tr><td valign=top width=100>&nbsp;</td><td valign=top align=left><i>Multiple Evidence Combination for Fact-Checking of Health-Related Information</i><br>
+Pritam Deka, Anna Jurek-Loughrey, Deepak P, <i>Queen's University Belfast</i></td></tr>
+<tr><td valign=top width=100>&nbsp;</td><td valign=top align=left><i>Comparing and combining some popular NER approaches on Biomedical tasks</i><br>
+Harsh Verma, Sabine Bergler, Narjesossadat Tahaei, <i>Concordia University</i></td></tr>
+<tr><td valign=top width=100>&nbsp;</td><td valign=top align=left><i>Extracting Drug-Drug and Protein-Protein Interactions from Text using a Continuous Update of Tree-Transformers</i><br>
+Sudipta Singha Roy and Robert E. Mercer, <i>The University of Western Ontario</i></td></tr>
+<tr><td valign=top width=100>&nbsp;</td><td valign=top align=left><i>Augmenting Reddit Posts to Determine Wellness Dimensions impacting Mental Health</i><br>
+Chandreen Liyanage<sup>1</sup>, Muskan Garg<sup>2</sup>, Vijay Mago<sup>1</sup>, Sunghwan Sohn<sup>2</sup>, <i>
+<sup>1</sup>Lakehead University, <sup>2</sup>Mayo Clinic</i></td></tr>
+<tr><td valign=top width=100>&nbsp;</td><td valign=top align=left><i>Evaluation of ChatGPT on Biomedical Tasks: A Zero-Shot Comparison with Fine-Tuned Generative Transformers</i><br>
+Israt Jahan<sup>1</sup>, Md Tahmid Rahman Laskar<sup>2</sup>, Chun Peng<sup>1</sup>, Jimmy Huang<sup>1</sup>, <i>
+<sup>1</sup>York University, <sup>2</sup>Dialpad Inc.</i></td></tr>
+<tr><td valign=top width=100>&nbsp;</td><td valign=top align=left><i>Distantly Supervised Document-Level Biomedical Relation Extraction with Neighborhood Knowledge Graphs</i><br>
+Takuma Matsubara, Makoto Miwa, Yutaka Sasaki, <i>Toyota Technological Institute</i></td></tr>
+<tr><td valign=top width=100>&nbsp;</td><td valign=top align=left><i>Biomedical Relation Extraction with Entity Type Markers and Relation-specific Question Answering</i><br>
+Koshi Yamada, Makoto Miwa, Yutaka Sasaki, <i>Toyota Technological Institute</i></td></tr>
+<tr><td valign=top width=100>&nbsp;</td><td valign=top align=left><i>Biomedical Document Classification with Literature Graph Representations of Bibliographies and Entities</i><br>
+Ryuki Ida, Makoto Miwa, Yutaka Sasaki, <i>Toyota Technological Institute</i></td></tr>
+<tr><td valign=top width=100>&nbsp;</td><td valign=top align=left><i>Zero-Shot Information Extraction for Clinical Meta-Analysis using Large Language Models</i><br>
+David Kartchner<sup>1,3</sup>, Selvi Ramalingam<sup>2</sup>, Irfan Al-Hussaini<sup>3</sup>, Olivia Kronick<sup>3</sup>, Cassie Mitchell<sup>3</sup>, <i><sup>1</sup>Enveda Biosciences, <sup>2</sup>Emory University, <sup>3</sup>Georgia Institute of Technology</i></td></tr>
+<tr><td valign=top width=100>&nbsp;</td><td valign=top align=left><i>WeLT: Improving Biomedical Fine-tuned Pre-trained Language Models with Cost-sensitive Learning</i><br>
+Ghadeer Mobasher<sup>1,2</sup>, Wolfgang Müller<sup>2</sup>, Olga Krebs<sup>2</sup>, Michael Gertz<sup>1</sup>
+<sup>1</sup>Heidelberg University, <sup>2</sup>Heidelberg Institute for Theoretical Studies – HITS gGmbH</i></td></tr>
+ <tr><td valign=top width=100>'''17:45-18:00'''</td> <td><b>Closing remarks</b></td></tr>
+</table>
 ===WORKSHOP OVERVIEW AND SCOPE===