Extractive Text Summarization using Recurrent Neural Networks with Attention Mechanism

Authors

Shimirwa Aline Valerie and Jian Xu, Nanjing University of Science and Technology, China

Abstract

Extractive summarization aims to select the most important sentences or words from a document to generate a summary. Traditional summarization approaches have relied extensively on features manually designed by humans. In this paper, we propose a data-driven technique based on a recurrent neural network equipped with an attention mechanism. We set up a general framework that consists of a hierarchical sentence encoder and an attention-based sentence extractor. The framework allows us to build and explore various extractive summarization models. Comprehensive experiments are conducted on two benchmark datasets, and the results show that training extractive models with Reward Augmented Maximum Likelihood (RAML) can improve the model’s generalization capability. We also find that the complicated components of state-of-the-art extractive models do not yield better performance than simpler ones. We hope that our work offers useful hints for future research on extractive text summarization.

Keywords

Extractive summarization, Recurrent neural networks, Attention mechanism, Maximum Likelihood Estimation, Reward Augmented Maximum Likelihood

CS&IT Conference Proceedings
