Towards Grounding of Formulae

Takuto Asakura, André Greiner-Petter, Akiko Aizawa, Yusuke Miyao


Abstract
A large amount of scientific knowledge is represented within mixed forms of natural language texts and mathematical formulae. Therefore, a collaboration of natural language processing and formula analyses, so-called mathematical language processing, is necessary to enable computers to understand and retrieve information from the documents. However, as we will show in this project, a mathematical notation can change its meaning even within the scope of a single paragraph. This flexibility makes it difficult to extract the exact meaning of a mathematical formula. In this project, we will propose a new task direction for grounding mathematical formulae. Particularly, we are addressing the widespread misconception of various research projects in mathematical information retrieval, which presume that mathematical notations have a fixed meaning within a single document. We manually annotated a long scientific paper to illustrate the task concept. Our high inter-annotator agreement shows that the task is well understood for humans. Our results indicate that it is worthwhile to grow the techniques for the proposed task to contribute to the further progress of mathematical language processing.
Anthology ID:
2020.sdp-1.16
Volume:
Proceedings of the First Workshop on Scholarly Document Processing
Month:
November
Year:
2020
Address:
Online
Editors:
Muthu Kumar Chandrasekaran, Anita de Waard, Guy Feigenblat, Dayne Freitag, Tirthankar Ghosal, Eduard Hovy, Petr Knoth, David Konopnicki, Philipp Mayr, Robert M. Patton, Michal Shmueli-Scheuer
Venue:
sdp
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
138–147
Language:
URL:
https://aclanthology.org/2020.sdp-1.16
DOI:
10.18653/v1/2020.sdp-1.16
Bibkey:
Cite (ACL):
Takuto Asakura, André Greiner-Petter, Akiko Aizawa, and Yusuke Miyao. 2020. Towards Grounding of Formulae. In Proceedings of the First Workshop on Scholarly Document Processing, pages 138–147, Online. Association for Computational Linguistics.
Cite (Informal):
Towards Grounding of Formulae (Asakura et al., sdp 2020)
Copy Citation:
PDF:
https://aclanthology.org/2020.sdp-1.16.pdf
Video:
 https://slideslive.com/38940733