2nd Open Language Data Initiative shared task at WMT25

Event Notification Type: 
Call for Participation
Abbreviated Title: 
OLDI at WMT25
Wednesday, 5 November 2025 to Sunday, 9 November 2025
Country: 
China
Contact Email: 
City: 
Suzhou
Contact: 
Open Language Data Initiative (OLDI) organisers
Submission Deadline: 
Thursday, 14 August 2025

We are excited to announce the 2nd edition of the Open Language Data Initiative shared task at WMT25, co-located with EMNLP 2025.

Task Description

The primary goal of this shared task is to expand OLDI’s open datasets to more languages. We are soliciting contributions to the following:

  • The MT evaluation dataset FLORES+.
  • The MT Seed dataset.
  • Other high-quality, massively-parallel and open-source datasets.

Contributions may consist of either the addition of entirely new languages, varieties or dialects to the above datasets, or substantial improvements to existing datasets. To describe and publicise their contributions, task participants will be asked to submit a 4-6 page paper to be presented at the WMT 2025 conference.

Important Dates

All dates follow WMT/EMNLP.

  • Paper and data submission deadline: 14 August
  • Notification of acceptance: 13 September

More Information

For more information: