Information Aggregation via Dynamic Routing for Sequence Encoding

Jingjing Gong; Xipeng Qiu; Shaojing Wang; Xuan-Jing Huang

Abstract

While much progress has been made in how to encode a text sequence into a sequence of vectors, less attention has been paid to how to aggregate these preceding vectors (outputs of RNN/CNN) into a fixed-size encoding vector. Usually, simple max or average pooling is used, which is a bottom-up, passive way of aggregating and lacks guidance from task information. In this paper, we propose an aggregation mechanism to obtain a fixed-size encoding with a dynamic routing policy. The dynamic routing policy dynamically decides what and how much information needs to be transferred from each word to the final encoding of the text sequence. Following the work on Capsule Networks, we design two dynamic routing policies to aggregate the outputs of the RNN/CNN encoding layer into a final encoding vector. Compared to other aggregation methods, dynamic routing can refine the messages according to the state of the final encoding vector. Experimental results on five text classification tasks show that our method outperforms other aggregating models by a significant margin. Related source code is released on our GitHub page.
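The routing idea described in the abstract can be illustrated with a minimal sketch of generic routing-by-agreement, as used in capsule networks. This is not the authors' released implementation, nor either of the two routing policies the paper proposes. The function names (`squash`, `softmax`, `dynamic_routing`), the toy message vectors, and the choice of three iterations are all assumptions made for illustration. Each input word is assumed to have already produced one message vector per output capsule.

```python
import math


def squash(s):
    """Shrink a vector so its length lies in [0, 1) while keeping its direction."""
    norm_sq = sum(x * x for x in s)
    if norm_sq == 0.0:
        return [0.0 for _ in s]
    scale = norm_sq / (1.0 + norm_sq) / math.sqrt(norm_sq)
    return [scale * x for x in s]


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def dynamic_routing(messages, iterations=3):
    """Aggregate per-word messages into output capsules by routing-by-agreement.

    messages[i][j] is the (hypothetical, precomputed) message vector that
    input word i sends to output capsule j.
    Returns (capsules, coupling), where coupling[i][j] is the share of word i
    routed to capsule j in the final iteration.
    """
    if iterations < 1:
        raise ValueError("iterations must be at least 1")
    n_in = len(messages)
    n_out = len(messages[0])
    dim = len(messages[0][0])
    # Routing logits start at zero, so the first pass is a plain average.
    b = [[0.0] * n_out for _ in range(n_in)]
    for _ in range(iterations):
        # Each word distributes its information across the output capsules.
        c = [softmax(row) for row in b]
        v = []
        for j in range(n_out):
            s_j = [
                sum(c[i][j] * messages[i][j][k] for i in range(n_in))
                for k in range(dim)
            ]
            v.append(squash(s_j))
        # Raise the logit where a message agrees with the current capsule state.
        for i in range(n_in):
            for j in range(n_out):
                b[i][j] += sum(m * o for m, o in zip(messages[i][j], v[j]))
    return v, c


# Toy input: 3 words, 2 output capsules, 2-dimensional messages (made-up values).
toy_messages = [
    [[1.0, 0.0], [0.0, 0.1]],
    [[0.9, 0.1], [0.1, 0.0]],
    [[0.0, 0.1], [0.0, 1.0]],
]
capsules, coupling = dynamic_routing(toy_messages, iterations=3)
```

Unlike max or average pooling, the coupling weights here are recomputed from the current state of the output vectors at every iteration, which is the sense in which the aggregation is refined rather than fixed in advance.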

Anthology ID: C18-1232
Volume: Proceedings of the 27th International Conference on Computational Linguistics
Month: August
Year: 2018
Address: Santa Fe, New Mexico, USA
Editors: Emily M. Bender, Leon Derczynski, Pierre Isabelle
Venue: COLING
Publisher: Association for Computational Linguistics
Pages: 2742–2752
URL: https://aclanthology.org/C18-1232
Cite (ACL): Jingjing Gong, Xipeng Qiu, Shaojing Wang, and Xuanjing Huang. 2018. Information Aggregation via Dynamic Routing for Sequence Encoding. In Proceedings of the 27th International Conference on Computational Linguistics, pages 2742–2752, Santa Fe, New Mexico, USA. Association for Computational Linguistics.
Cite (Informal): Information Aggregation via Dynamic Routing for Sequence Encoding (Gong et al., COLING 2018)
PDF: https://aclanthology.org/C18-1232.pdf
Code: FudanNLP/Capsule4TextClassification + additional community code
Data: IMDb Movie Reviews, SST, SST-2
