SCAR: Sentence Compression using Autoencoders for Reconstruction

Chanakya Malireddy, Tirth Maniar, Manish Shrivastava


Abstract
Sentence compression is the task of shortening a sentence while retaining its meaning. Most methods proposed for this task rely on labeled or paired corpora (containing pairs of verbose and compressed sentences), which are often expensive to collect. To overcome this limitation, we present a novel unsupervised deep learning framework (SCAR) for deletion-based sentence compression. SCAR is primarily composed of two encoder-decoder pairs: a compressor and a reconstructor. The compressor masks the input, and the reconstructor tries to regenerate it. The model is trained entirely on unlabeled data and does not require additional inputs such as explicit syntactic information or optimal compression length. SCAR’s merit lies in the novel Linkage Loss function, which correlates the compressor and its effect on reconstruction, guiding it to drop inferable tokens. SCAR achieves higher ROUGE scores on benchmark datasets than the existing state-of-the-art methods and baselines. We also conduct a user study to demonstrate the application of our model as a text highlighting system. Using our model to underscore salient information facilitates speed-reading and reduces the time required to skim a document.
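To make "deletion-based" concrete, the sketch below shows how a binary keep/drop mask over tokens yields a compressed sentence. It is only an illustration of the output format described in the abstract, not the SCAR model itself: the function names, the example sentence, and the hand-written mask are all assumptions, and the learned compressor, the reconstructor, and the Linkage Loss are not modeled here.

```python
from typing import List, Sequence


def apply_deletion_mask(tokens: Sequence[str], keep: Sequence[int]) -> List[str]:
    """Keep the tokens whose mask entry is 1, preserving their order.

    Hypothetical helper: in a trained system the mask would come from
    the compressor; here it is supplied by hand.
    """
    if len(tokens) != len(keep):
        raise ValueError("tokens and keep mask must have the same length")
    return [tok for tok, k in zip(tokens, keep) if k == 1]


def compression_ratio(keep: Sequence[int]) -> float:
    """Fraction of tokens retained (0.0 for an empty mask)."""
    return sum(keep) / len(keep) if keep else 0.0


# Hand-written example; the mask is an assumption, not model output.
tokens = "the cat that I saw yesterday sat on the mat".split()
keep = [1, 1, 0, 0, 0, 0, 1, 1, 1, 1]

compressed = apply_deletion_mask(tokens, keep)
print(" ".join(compressed))
print(compression_ratio(keep))
```

Because tokens are only dropped and never reordered or rewritten, the same mask can also drive a highlighting view, where kept tokens are emphasized and dropped tokens are left in place.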
Anthology ID:
2020.acl-srw.13
Volume:
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop
Month:
July
Year:
2020
Address:
Online
Editors:
Shruti Rijhwani, Jiangming Liu, Yizhong Wang, Rotem Dror
Venue:
ACL
Publisher:
Association for Computational Linguistics
Pages:
88–94
URL:
https://aclanthology.org/2020.acl-srw.13
DOI:
10.18653/v1/2020.acl-srw.13
Cite (ACL):
Chanakya Malireddy, Tirth Maniar, and Manish Shrivastava. 2020. SCAR: Sentence Compression using Autoencoders for Reconstruction. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop, pages 88–94, Online. Association for Computational Linguistics.
Cite (Informal):
SCAR: Sentence Compression using Autoencoders for Reconstruction (Malireddy et al., ACL 2020)
PDF:
https://aclanthology.org/2020.acl-srw.13.pdf
Video:
http://slideslive.com/38928649
Code:
m-chanakya/scar