ABSTRACT
Automatic summarization of open domain spoken dialogues is a new research area. This paper introduces the task, the challenges involved, and presents an approach to obtain automatic extract summaries for multi-party dialogues of four different genres, without any restriction on domain. We address the following issues which are intrinsic to spoken dialogue summarization and typically can be ignored when summarizing written text such as newswire data: (i) detection and removal of speech disfluencies; (ii) detection and insertion of sentence boundaries; (iii) detection and linking of cross-speaker information units (question-answer pairs). A global system evaluation using a corpus of 23 relevance annotated dialogues containing 80 topical segments shows that for the two more informal genres, our summarization system using dialogue specific components significantly outperforms a baseline using TFIDF term weighting with maximum marginal relevance ranking (MMR).
- 1.E. Brill. Some advances in transformation-based part of speech tagging. In Proceeedings of AAAI-94, 1994. Google ScholarDigital Library
- 2.J. Carbonell and J. Goldstein. The use of MMR, diversity-based reranking for reordering documents and producing summaries. In Proceedings of the 21st ACM-SIGIR International Conference onResearch and Development in Information Retrieval, Melbourne, Australia, 1998. Google ScholarDigital Library
- 3.J. S. Garofolo, E. M. Voorhees, C. G. P. Auzanne, and V. M. Stanford. Spoken document retrieval: 1998 evaluation and investigation of new metrics. In Proceedings of the ESCA workshop: Accessing information in spoken audio, pages 1-7. Cambridge, UK, Apr. 1999.Google Scholar
- 4.M. Gavalda a, K. Zechner, and G. Aist. High performance segmentation of spontaneous speech using part of speech and trigger word information. In Proceedings of the 5th ANLP Conference, Washington DC, pages 12-15, 1997. Google ScholarDigital Library
- 5.M. A. Hearst. TextTiling: Segmenting text into multi-paragraph subtopic passages. Computational Linguistics, 23(1):33-64, March 1997. Google ScholarDigital Library
- 6.P. A. Heeman and J. F. Allen. Intonational boundaries, speech repairs and discourse markers: Modeling spoken dialog. In Proceedings of the ACL/EACL-97, Madrid, Spain, pages 254-261, 1997. Google ScholarDigital Library
- 7.C. Hori and S. Furui. Improvements in automatic speech summarization and evaluation methods. In Proceedings of ICSLP-00, Beijing, China, October, pages 326-329, 2000.Google Scholar
- 8.D. Jurafsky, R. Bates, N. Coccaro, R. Martin, M. Meteer, K. Ries, E. Shriberg, A. Stolcke, P. Taylor, and C. V. Ess-Dykema. SwitchBoard discourse language modeling project, final report. Research Note 30, Center for Language and Speech Processing, Johns Hopkins University, Baltimore, MD, 1998.Google Scholar
- 9.M. Kameyama, G. Kawai, and I. Arima. A real-time system for summarizing human-human spontaneous spoken dialogues. In Proceedings of the ICSLP-96, pages 681-684, 1996.Google ScholarCross Ref
- 10.K. Koumpis and S. Renals. Transcription and summarization of voicemail speech. In Proceedings of ICSLP-00, Beijing, China, October, pages 688-91, 2000.Google Scholar
- 11.J. Kupiec, J. Pedersen, and F. Chen. A trainable document summarizer. In Proceedings of the 18th ACM-SIGIR Conference, pages 68-73, 1995. Google ScholarDigital Library
- 12.Linguistic Data Consortium. LDC. CallHome and CallFriend LVCSR databases, 1996.Google Scholar
- 13.Linguistic Data Consortium. LDC. Treebank-3: CD-ROM containing databases of dis uency annotated Switchboard transcripts (LDC99T42), 1999.Google Scholar
- 14.I. Mani, D. House, G. Klein, L. Hirschman, L. Obrst, T. Firmin, M. Chrzanowski, and B. Sundheim. The TIPSTER SUMMAC text summarization evaluation. Mitre Technical Report MTR 98W0000138, October 1998, 1998.Google Scholar
- 15.I. Mani and M. T. Maybury, editors. Advances in automatic text summarization. MIT Press, Cambridge, MA, 1999. Google ScholarDigital Library
- 16.D. Marcu. Discourse trees are good indicators of importance in text. In Mani and Maybury {15}, pages 123-136.Google Scholar
- 17.M. Meteer, A. Taylor, R. MacIntyre, and R. Iyer. Dys uency annotation stylebook for the Switchboard corpus. Revised by Ann Taylor, June 1995, available on the LDC99T42 CD-ROM, published by LDC, 1995.Google Scholar
- 18.J. R. Quinlan. C4.5: Programs for Machine Learning. Morgan Kaufmann, San Mateo, CA, 1992. Google ScholarDigital Library
- 19.G. J. Rath, A. Resnick, and T. R. Savage. The formation of abstracts by the selection of sentences. American Documentation, 12(2):139-143, 1961.Google ScholarCross Ref
- 20.N. Reithinger, M. Kipp, R. Engel, and J. Alexandersson. Summarizing multilingual spoken negotiation dialogues. In Proceedings of the 38th Conference of the Association for Computational Linguistics, Hongkong, China, October, pages 310-317, 2000. Google ScholarDigital Library
- 21.R. L. Rose. The communicative value of filled pauses in spontaneous speech. PhD thesis, University of Birmingham, Birmingham, UK, 1998.Google Scholar
- 22.E. E. Shriberg. Preliminaries to a Theory of Speech Dis uencies. PhD thesis, University ofBerkeley, Berkeley, CA, 1994.Google Scholar
- 23.A. Stolcke, K. Ries, N. Coccaro, E. Shriberg, R. Bates, D. Jurafsky, P. Taylor, R. Martin, C. V. Ess-Dykema, and M. Meteer. Dialogue act modeling for automatic tagging and recognition of conversational speech. Computational Linguistics, 26(3):339-373, September 2000. Google ScholarDigital Library
- 24.A. Stolcke, E. Shriberg, R. Bates, M. Ostendorf, D. Hakkani, M. Plauche, G. T. ur, and Y. Lu. Automatic detection of sentence boundaries and dis uencies based on recognized words. In Proceedings of the ICSLP-98, Sydney, Australia, December, volume 5, pages 2247-2250, 1998.Google Scholar
- 25.S. Teufel and M. Moens. Sentence extraction as a classification task. In ACL/EACL-97 Workshop on Intelligent and Scalable Text Summarization, Madrid, Spain, 1997.Google Scholar
- 26.R. Valenza, T. Robinson, M. Hickey, and R. Tucker. Summarisation of spoken audio through information extraction. In Proceedings of the ESCA workshop: Accessing information in spoken audio, pages 111-116. Cambridge, UK, Apr. 1999.Google Scholar
- 27.W. Wahlster. Verbmobil | translation of face-to-face dialogs. In Proceedings of MT Summit IV, Kobe, Japan, 1993. Google ScholarDigital Library
- 28.A. Waibel, M. Bett, and M. Finke. Meeting browser: Tracking and summarizing meetings. In Proceedings of the DARPA Broadcast News Workshop, 1998.Google Scholar
- 29.A. Waibel, M. Bett, F. Metze, K. Ries, T. Schaaf, T. Schultz, H. Soltau, H. Yu,and K. Zechner. Advances in automatic meeting record creation and access. In Proceedings of ICASSP-2001, Salt Lake City, UT, May, 2001.Google ScholarCross Ref
- 30.S. Whittaker, J. Hirschberg, J. Choi, D. Hindle, F. Pereira, and A. Singhal. SCAN: Designing and evaluating user interfaces to support retrieval from speech archives. In Proceedings of the 22nd ACM-SIGIR International Conference onResearch and Development in Information Retrieval, Berkeley, CA, August, pages 26-33, 1999. Google ScholarDigital Library
- 31.K. Zechner. Automatic Summarization of Spoken Dialogues in Unrestricted Domains. PhD thesis, Language Technologies Institute, School of Computer Science, Carnegie Mellon University, forthcoming.Google Scholar
- 32.K. Zechner and A. Lavie. Increasing the coherence of spoken dialogue summaries by cross-speaker information linking. In Proceedings of the NAACL-01 Workshop on Automatic Summarization, Pittsburgh, PA, June, 2001.Google Scholar
- 33.K. Zechner and A. Waibel. DiaSumm: Flexible summarization of spontaneous dialogues in unrestricted domains. In Proceedings of COLING-2000, Saarbr. ucken, Germany, July/August, pages 968-974, 2000. Google ScholarDigital Library
- 34.K. Zechner and A. Waibel. Minimizing word error rate in textual summaries of spoken language. In Proceedings of the First Meeting of the North American Chapter of the Association for Computational Linguistics, NAACL-2000, Seattle, WA, April/May, pages 186-193, 2000. Google ScholarDigital Library
Index Terms
- Automatic generation of concise summaries of spoken dialogues in unrestricted domains
Recommendations
Automatic generation of short informative sentiment summaries
EACL '12: Proceedings of the 13th Conference of the European Chapter of the Association for Computational LinguisticsIn this paper, we define a new type of summary for sentiment analysis: a single-sentence summary that consists of a supporting sentence that conveys the overall sentiment of a review as well as a convincing reason for this sentiment. We present a system ...
(Semi-)Automatic Analysis of Dialogues
ICAART 2014: Proceedings of the 6th International Conference on Agents and Artificial Intelligence - Volume 1We study human-human dialogues and human-computer dialogues with the aim to determine which dialogue acts and communicative strategies do the participants of interaction use, and which structural parts does a dialogue include. In order to simplify the ...
Applying Topic Recognition to Spoken Language in Human-Robot Interaction Dialogues
MMRWHRI '14: Proceedings of the 2014 Workshop on Multimodal, Multi-Party, Real-World Human-Robot InteractionHuman-robot interaction systems that work in everyday situations need to be able to talk about different topics, for example when the robot is a bartender that serves drinks to human customers. We applied a topic recognition approach that is based on ...
Comments