Enabling Language Models to Fill in the Blanks

Chris Donahue, Mina Lee, Percy Liang


Abstract
We present a simple approach for text infilling, the task of predicting missing spans of text at any position in a document. While infilling could enable rich functionality especially for writing assistance tools, more attention has been devoted to language modeling—a special case of infilling where text is predicted at the end of a document. In this paper, we aim to extend the capabilities of language models (LMs) to the more general task of infilling. To this end, we train (or fine-tune) off-the-shelf LMs on sequences containing the concatenation of artificially-masked text and the text which was masked. We show that this approach, which we call infilling by language modeling, can enable LMs to infill entire sentences effectively on three different domains: short stories, scientific abstracts, and lyrics. Furthermore, we show that humans have difficulty identifying sentences infilled by our approach as machine-generated in the domain of short stories.
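To make the training format concrete, below is a minimal sketch of how such a concatenated sequence might be constructed. The special tokens [blank], [sep], and [answer] and the helper make_ilm_example are illustrative assumptions based on the description in the abstract, not the exact tokens or API of the released chrisdonahue/ilm code.

import random

# Illustrative special tokens (an assumption; the released implementation
# may use different token strings).
BLANK, SEP, ANSWER = "[blank]", "[sep]", "[answer]"

def make_ilm_example(spans, n_masks=1, seed=None):
    """Concatenate an artificially-masked document with the text that
    was masked: each masked span is replaced by [blank], the masked
    document is followed by [sep], and each answer is terminated by
    [answer]."""
    rng = random.Random(seed)
    masked_idxs = sorted(rng.sample(range(len(spans)), n_masks))
    masked_doc = " ".join(
        BLANK if i in masked_idxs else s for i, s in enumerate(spans)
    )
    answers = " ".join(f"{spans[i]} {ANSWER}" for i in masked_idxs)
    return f"{masked_doc} {SEP} {answers}"

# Example with sentence-level masking in a two-sentence story:
story = ["She ate leftover pasta for lunch.", "It was delicious."]
print(make_ilm_example(story, n_masks=1, seed=0))
# e.g. "She ate leftover pasta for lunch. [blank] [sep] It was delicious. [answer]"

An off-the-shelf LM trained on such sequences with its ordinary next-token objective can then infill at test time: it is fed the masked document followed by [sep] and decodes the missing spans.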
Anthology ID:
2020.acl-main.225
Volume:
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
Month:
July
Year:
2020
Address:
Online
Editors:
Dan Jurafsky, Joyce Chai, Natalie Schluter, Joel Tetreault
Venue:
ACL
Publisher:
Association for Computational Linguistics
Pages:
2492–2501
URL:
https://aclanthology.org/2020.acl-main.225
DOI:
10.18653/v1/2020.acl-main.225
Bibkey:
donahue-etal-2020-enabling
Cite (ACL):
Chris Donahue, Mina Lee, and Percy Liang. 2020. Enabling Language Models to Fill in the Blanks. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 2492–2501, Online. Association for Computational Linguistics.
Cite (Informal):
Enabling Language Models to Fill in the Blanks (Donahue et al., ACL 2020)
PDF:
https://aclanthology.org/2020.acl-main.225.pdf
Video:
http://slideslive.com/38929175
Code:
chrisdonahue/ilm