<p>problem is automatic domain-specific summarization tailored to user's needs, which is a kind of high-level data cleaning. This requires some method of discovering classes of similar items that may be grouped into predetermined domains. We explore whether there exists a synergic relation between systems for classification and those for summarization by way of composing those subsystems. In other words, we examine whether prior summarization will increase the performance of the classifier system and vice versa. In both cases, the answer is affirmative, as we show in this paper. We propose a text-mining framework in which these subsystems are treated as constituents of a knowledge discovery process for text corpora.
M. Saravanan, Raj, P. C. Reghu, Raman, S., and Dr. P. P. Nikhil Raj, “Summarization and categorization of text data in high-level data cleaning for information retrieval”, Applied Artificial Intelligence, vol. 17, pp. 461-474, 2003.