ABSTRACT
In order to produce a good summary, one has to identify the most relevant portions of a given text. We describe in this paper a method for automatically training topic signatures-sets of related words, with associated weights, organized around head topics and illustrate with signatures we created with 6,194 TREC collection texts over 4 selected topics. We describe the possible integration of topic signatures with outologies and its evaluaton on an automated text summarization system.
- Eneko Agirre, Olatz Ansa, Eduard Hovy, and David Martínez. 2000. Enriching very large outologies using the www. In Proceedings of the Workshop on Ontology Construction of the European Conference of AI (ECAI).Google Scholar
- Kenneth Church and Patrick Hanks. 1990. Word association norms, mutual information and lexicography. In Proceedings of the 28th Annual Meeting of the Association for Computational Linguistics (ACL-90), pages 76--83. Google ScholarDigital Library
- Thomas Cover and Joy A. Thomas. 1991. Elements of Information Theory. John Wiley & Sons. Google ScholarDigital Library
- Gerald DeJong. 1982. An overview of the FRUMP system. In Wendy G. Lehnert and Martin H. Ringle, editors, Strategies for natural language processing, pages 149--76. Lawrence Erlbaum Associates.Google Scholar
- Ted Dunning. 1993. Accurate methods for the statistics of surprise and coincidence. Computational Linguistics, 19:61--74. Google ScholarDigital Library
- Marti Hearst. 1997. Text Tiling: Segmenting text into multi-paragraph subtopic passages. Computational Linguistics, 23:33--64. Google ScholarDigital Library
- Eduard Hovy and Chin-Yew Lin. 1999. Automated text summarization in SUMMARIST. In Inderjeet Mani and Mark T. Maybury, editors, Advances in Automatic Text Summarization, chapter 8, pages 81--94. MIT Press.Google Scholar
- Kevin Knight and Steve K. Luk. 1994. Building a large knowledge base for machine translation. In Proceedings of the Eleventh National Conference on Artificial Intelligence (AAAI-94). Google ScholarDigital Library
- Inderjeet Mani, David House, Gary Klein, Lynette Hirschman, Leo Obrst, Thérèse Firmin, Michael Chrzanowski, and Beth Sundheim. 1998. The TIPSTER SUMMAC text summarization evaluation final report. Technical Report MTR98W0000138, The MITRE Corporation.Google Scholar
- Christopher Manning and Hinrich Schütze. 1999. Foudations of Statistical Natural Language Processing. MIT Press. Google ScholarDigital Library
- Kathleen McKeown and Dragomir R. Radev. 1999. Generating, summaries of multiple news articles. In Inderjeet Mani and Mark T. Maybury, editors, Advances in Automatic Text Summarization, chapter 24, pages 381--389. MIT Press.Google Scholar
- Ellen Riloff and Jeffrey Lorenzen. 1999. Extraction-based text categorization: Generating domain specific role relationships automatically. In Tomek Strzalkowski, editor, Natural Language Information Retrieval. Kluwer Academic Publishers.Google Scholar
- Ellen Riloff. 1996. An empirical study of automated dictionary construction for information extraction in three domains. Artificial Intelligence. Journal, 85, August. Google ScholarDigital Library
- SAIC. 1998. Introduction to information extraction. http://www.muc.saic.com.Google Scholar
- Jinxi Xu and W. Bruce Croft. 1996. Query expansion using local and global document analysis. In Proceedings of the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 4--11. Google ScholarDigital Library
- The automated acquisition of topic signatures for text summarization
Recommendations
Topic-driven reader comments summarization
CIKM '12: Proceedings of the 21st ACM international conference on Information and knowledge managementReaders of a news article often read its comments contributed by other readers. By reading comments, readers obtain not only complementary information about this news article but also the opinions from other readers. However, the existing ranking ...
Extractive text summarization using clustering-based topic modeling
AbstractText summarization is the process of converting the input document into a short form, provided that it preserves the overall meaning associated with it. Primarily, text summarization is achieved in two ways, i.e., abstractive and extractive. ...
Topic analysis for topic-focused multi-document summarization
CIKM '09: Proceedings of the 18th ACM conference on Information and knowledge managementTopic-focused multi-document summarization has been a challenging task because the created summary is required to be biased to the given topic or query. Existing methods consider the given topic as a single coarse unit and then directly incorporate the ...
Comments