A General Optimization Framework for Multi-Document Summarization Using Genetic Algorithms and Swarm Intelligence

Maxime Peyrard, Judith Eckle-Kohler


Abstract
Extracting summaries via integer linear programming and submodularity are popular and successful techniques in extractive multi-document summarization. However, many interesting optimization objectives are neither submodular nor factorizable into an integer linear program. We address this issue and present a general optimization framework where any function of input documents and a system summary can be plugged in. Our framework includes two kinds of summarizers – one based on genetic algorithms, the other using a swarm intelligence approach. In our experimental evaluation, we investigate the optimization of two information-theoretic summary evaluation metrics and find that our framework yields competitive results compared to several strong summarization baselines. Our comparative analysis of the genetic and swarm summarizers reveals interesting complementary properties.
Anthology ID:
C16-1024
Volume:
Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers
Month:
December
Year:
2016
Address:
Osaka, Japan
Editors:
Yuji Matsumoto, Rashmi Prasad
Venue:
COLING
SIG:
Publisher:
The COLING 2016 Organizing Committee
Note:
Pages:
247–257
Language:
URL:
https://aclanthology.org/C16-1024
DOI:
Bibkey:
Cite (ACL):
Maxime Peyrard and Judith Eckle-Kohler. 2016. A General Optimization Framework for Multi-Document Summarization Using Genetic Algorithms and Swarm Intelligence. In Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, pages 247–257, Osaka, Japan. The COLING 2016 Organizing Committee.
Cite (Informal):
A General Optimization Framework for Multi-Document Summarization Using Genetic Algorithms and Swarm Intelligence (Peyrard & Eckle-Kohler, COLING 2016)
Copy Citation:
PDF:
https://aclanthology.org/C16-1024.pdf
Code
 UKPLab/coling2016-genetic-swarm-MDS