Pointing to Select: A Fast Pointer-LSTM for Long Text Classification

Jinhua Du, Yan Huang, Karo Moilanen


Abstract
Recurrent neural networks (RNNs) suffer from well-known limitations and complications which include slow inference and vanishing gradients when processing long sequences in text classification. Recent studies have attempted to accelerate RNNs via various ad hoc mechanisms to skip irrelevant words in the input. However, word skipping approaches proposed to date effectively stop at each or a given time step to decide whether or not a given input word should be skipped, breaking the coherence of input processing in RNNs. Furthermore, current methods cannot change skip rates during inference and are consequently unable to support different skip rates in demanding real-world conditions. To overcome these limitations, we propose Pointer- LSTM, a novel LSTM framework which relies on a pointer network to select important words for target prediction. The model maintains a coherent input process for the LSTM modules and makes it possible to change the skip rate during inference. Our evaluation on four public data sets demonstrates that Pointer-LSTM (a) is 1.1x∼3.5x faster than the standard LSTM architecture; (b) is more accurate than Leap-LSTM (the state-of-the-art LSTM skipping model) at high skip rates; and (c) reaches robust accuracy levels even when the skip rate is changed during inference.
Anthology ID:
2020.coling-main.544
Volume:
Proceedings of the 28th International Conference on Computational Linguistics
Month:
December
Year:
2020
Address:
Barcelona, Spain (Online)
Editors:
Donia Scott, Nuria Bel, Chengqing Zong
Venue:
COLING
SIG:
Publisher:
International Committee on Computational Linguistics
Note:
Pages:
6184–6193
Language:
URL:
https://aclanthology.org/2020.coling-main.544
DOI:
10.18653/v1/2020.coling-main.544
Bibkey:
Cite (ACL):
Jinhua Du, Yan Huang, and Karo Moilanen. 2020. Pointing to Select: A Fast Pointer-LSTM for Long Text Classification. In Proceedings of the 28th International Conference on Computational Linguistics, pages 6184–6193, Barcelona, Spain (Online). International Committee on Computational Linguistics.
Cite (Informal):
Pointing to Select: A Fast Pointer-LSTM for Long Text Classification (Du et al., COLING 2020)
Copy Citation:
PDF:
https://aclanthology.org/2020.coling-main.544.pdf
Data
AG News