An automatic discourse relation alignment experiment on TED-MDB

Sibel Ozer, Deniz Zeyrek


Abstract
This paper describes an automatic discourse relation alignment experiment as an empirical justification of the planned annotation projection approach to enlarge the 3600-word multilingual corpus of TED Multilingual Discourse Bank (TED-MDB). The experiment is carried out on a single language pair (English-Turkish) included in TED-MDB. The paper first describes the creation of a large corpus of English-Turkish bi-sentences, then it presents a sense-based experiment that automatically aligns the relations in the English sentences of TED-MDB with the Turkish sentences. The results are very close to the results obtained from an earlier semi-automatic post-annotation alignment experiment validated by human annotators and are encouraging for future annotation projection tasks.
Anthology ID:
W19-3612
Volume:
Proceedings of the 2019 Workshop on Widening NLP
Month:
August
Year:
2019
Address:
Florence, Italy
Editors:
Amittai Axelrod, Diyi Yang, Rossana Cunha, Samira Shaikh, Zeerak Waseem
Venue:
WiNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
31–34
Language:
URL:
https://aclanthology.org/W19-3612
DOI:
Bibkey:
Cite (ACL):
Sibel Ozer and Deniz Zeyrek. 2019. An automatic discourse relation alignment experiment on TED-MDB. In Proceedings of the 2019 Workshop on Widening NLP, pages 31–34, Florence, Italy. Association for Computational Linguistics.
Cite (Informal):
An automatic discourse relation alignment experiment on TED-MDB (Ozer & Zeyrek, WiNLP 2019)
Copy Citation: