Silla Jr, Carlos N. and Pappa, Gisele L. and Freitas, Alex A. and Kaestner, Celso A.A.
(2004)
Automatic Text Summarization with Genetic Algorithm-Based Attribute Selection.
In: Lemaitre, Christian and Reyes, Carlos A. and Gonzalez, Jesus A., eds.
Lecture Notes in Computer Science, 3315.
Springer
pp. 305-314.
ISBN 3-540-23806-9.
(doi:https://doi.org/10.1007/b102591)
(The full text of this publication is not currently available from this repository. You may be able to access a copy if URLs are provided)


Abstract

The task of automatic text summarization consists of generating a summary of the original text that allows the user to obtain the main pieces of information available in that text, but with a much shorter reading time. This is an increasingly important task in the current era of information overload, given the huge amount of text available in documents. In this paper, automatic text summarization is cast as a classification (supervised learning) problem, so that machine learning-oriented classification methods are used to produce summaries for documents based on a set of attributes describing those documents. The goal of the paper is to investigate the effectiveness of Genetic Algorithm (GA)-based attribute selection in improving the performance of classification algorithms solving the automatic text summarization task. Computational results are reported for experiments with a document base formed by news extracted from The Wall Street Journal of the TIPSTER collection, a collection that is often used as a benchmark in the text summarization literature.
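
The general idea of GA-based attribute selection for a classifier can be illustrated with a minimal sketch. Nothing below is taken from the paper: the synthetic data, the nearest-centroid classifier, the function names (`make_data`, `accuracy`, `select_attributes`) and all GA parameters are assumptions chosen only to keep the example short and dependency-free. Each individual is a bit mask over the attributes, and its fitness is the accuracy of a classifier that sees only the selected attributes.

```python
import random

def make_data(n, rng):
    """Synthetic stand-in for sentence data (assumption, not the paper's corpus):
    2 informative attributes followed by 4 pure-noise attributes."""
    rows = []
    for k in range(n):
        label = k % 2  # 1 = sentence belongs in the summary, 0 = it does not
        informative = [rng.gauss(2.0 * label, 1.0) for _ in range(2)]
        noise = [rng.gauss(0.0, 3.0) for _ in range(4)]
        rows.append((informative + noise, label))
    return rows

def accuracy(mask, train, valid):
    """Nearest-centroid accuracy on valid, using only attributes where mask is 1."""
    idx = [i for i, bit in enumerate(mask) if bit]
    if not idx:
        return 0.0  # an empty attribute subset cannot classify anything
    centroids = {}
    for label in (0, 1):
        members = [x for x, y in train if y == label]
        centroids[label] = [sum(x[i] for x in members) / len(members) for i in idx]
    hits = 0
    for x, y in valid:
        dist = {c: sum((x[i] - m) ** 2 for i, m in zip(idx, mu))
                for c, mu in centroids.items()}
        hits += min(dist, key=dist.get) == y
    return hits / len(valid)

def select_attributes(train, valid, n_attrs, rng,
                      pop_size=12, generations=15, p_mut=0.1):
    """Toy GA: elitism, binary tournament selection, one-point crossover,
    bit-flip mutation. Returns the best mask found and its fitness."""
    def fitness(mask):
        return accuracy(mask, train, valid)

    # Seed the population with the all-attributes mask plus random masks.
    pop = [[1] * n_attrs]
    pop += [[rng.randint(0, 1) for _ in range(n_attrs)] for _ in range(pop_size - 1)]
    for _ in range(generations):
        pop.sort(key=fitness, reverse=True)
        nxt = [pop[0][:]]  # elitism: the best individual always survives
        while len(nxt) < pop_size:
            a = max(rng.sample(pop, 2), key=fitness)
            b = max(rng.sample(pop, 2), key=fitness)
            cut = rng.randint(1, n_attrs - 1)
            child = a[:cut] + b[cut:]
            child = [1 - bit if rng.random() < p_mut else bit for bit in child]
            nxt.append(child)
        pop = nxt
    best = max(pop, key=fitness)
    return best, fitness(best)

rng = random.Random(0)
data = make_data(200, rng)
train, valid = data[:120], data[120:]
best_mask, best_acc = select_attributes(train, valid, 6, rng)
baseline_acc = accuracy([1] * 6, train, valid)
print("selected mask:", best_mask)
print("accuracy with selection:", best_acc, "with all attributes:", baseline_acc)
```

Because the all-attributes mask is in the initial population and the best individual is always carried over, the selected subset can never score below the all-attributes baseline on the validation split. That score is optimistic, since the same split guides the search; a fair comparison would need a separate test set that the GA never sees.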