Article

Automatic generation of concise summaries of spoken dialogues in unrestricted domains

Author:
Klaus Zechner

Carnegie Mellon Univ., Pittsburgh, PA

Carnegie Mellon Univ., Pittsburgh, PA
View Profile

SIGIR '01: Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrievalSeptember 2001Pages 199–207https://doi.org/10.1145/383952.383989

Published:01 September 2001Publication History

SIGIR '01: Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval

Pages 199–207

ABSTRACT

Automatic summarization of open domain spoken dialogues is a new research area. This paper introduces the task, the challenges involved, and presents an approach to obtain automatic extract summaries for multi-party dialogues of four different genres, without any restriction on domain. We address the following issues which are intrinsic to spoken dialogue summarization and typically can be ignored when summarizing written text such as newswire data: (i) detection and removal of speech disfluencies; (ii) detection and insertion of sentence boundaries; (iii) detection and linking of cross-speaker information units (question-answer pairs). A global system evaluation using a corpus of 23 relevance annotated dialogues containing 80 topical segments shows that for the two more informal genres, our summarization system using dialogue specific components significantly outperforms a baseline using TFIDF term weighting with maximum marginal relevance ranking (MMR).

References

1.E. Brill. Some advances in transformation-based part of speech tagging. In Proceeedings of AAAI-94, 1994. Google ScholarDigital Library
2.J. Carbonell and J. Goldstein. The use of MMR, diversity-based reranking for reordering documents and producing summaries. In Proceedings of the 21st ACM-SIGIR International Conference onResearch and Development in Information Retrieval, Melbourne, Australia, 1998. Google ScholarDigital Library
3.J. S. Garofolo, E. M. Voorhees, C. G. P. Auzanne, and V. M. Stanford. Spoken document retrieval: 1998 evaluation and investigation of new metrics. In Proceedings of the ESCA workshop: Accessing information in spoken audio, pages 1-7. Cambridge, UK, Apr. 1999.Google Scholar
4.M. Gavalda a, K. Zechner, and G. Aist. High performance segmentation of spontaneous speech using part of speech and trigger word information. In Proceedings of the 5th ANLP Conference, Washington DC, pages 12-15, 1997. Google ScholarDigital Library
5.M. A. Hearst. TextTiling: Segmenting text into multi-paragraph subtopic passages. Computational Linguistics, 23(1):33-64, March 1997. Google ScholarDigital Library
6.P. A. Heeman and J. F. Allen. Intonational boundaries, speech repairs and discourse markers: Modeling spoken dialog. In Proceedings of the ACL/EACL-97, Madrid, Spain, pages 254-261, 1997. Google ScholarDigital Library
7.C. Hori and S. Furui. Improvements in automatic speech summarization and evaluation methods. In Proceedings of ICSLP-00, Beijing, China, October, pages 326-329, 2000.Google Scholar
8.D. Jurafsky, R. Bates, N. Coccaro, R. Martin, M. Meteer, K. Ries, E. Shriberg, A. Stolcke, P. Taylor, and C. V. Ess-Dykema. SwitchBoard discourse language modeling project, final report. Research Note 30, Center for Language and Speech Processing, Johns Hopkins University, Baltimore, MD, 1998.Google Scholar
9.M. Kameyama, G. Kawai, and I. Arima. A real-time system for summarizing human-human spontaneous spoken dialogues. In Proceedings of the ICSLP-96, pages 681-684, 1996.Google ScholarCross Ref
10.K. Koumpis and S. Renals. Transcription and summarization of voicemail speech. In Proceedings of ICSLP-00, Beijing, China, October, pages 688-91, 2000.Google Scholar
11.J. Kupiec, J. Pedersen, and F. Chen. A trainable document summarizer. In Proceedings of the 18th ACM-SIGIR Conference, pages 68-73, 1995. Google ScholarDigital Library
12.Linguistic Data Consortium. LDC. CallHome and CallFriend LVCSR databases, 1996.Google Scholar
13.Linguistic Data Consortium. LDC. Treebank-3: CD-ROM containing databases of dis uency annotated Switchboard transcripts (LDC99T42), 1999.Google Scholar
14.I. Mani, D. House, G. Klein, L. Hirschman, L. Obrst, T. Firmin, M. Chrzanowski, and B. Sundheim. The TIPSTER SUMMAC text summarization evaluation. Mitre Technical Report MTR 98W0000138, October 1998, 1998.Google Scholar
15.I. Mani and M. T. Maybury, editors. Advances in automatic text summarization. MIT Press, Cambridge, MA, 1999. Google ScholarDigital Library
16.D. Marcu. Discourse trees are good indicators of importance in text. In Mani and Maybury {15}, pages 123-136.Google Scholar
17.M. Meteer, A. Taylor, R. MacIntyre, and R. Iyer. Dys uency annotation stylebook for the Switchboard corpus. Revised by Ann Taylor, June 1995, available on the LDC99T42 CD-ROM, published by LDC, 1995.Google Scholar
18.J. R. Quinlan. C4.5: Programs for Machine Learning. Morgan Kaufmann, San Mateo, CA, 1992. Google ScholarDigital Library
19.G. J. Rath, A. Resnick, and T. R. Savage. The formation of abstracts by the selection of sentences. American Documentation, 12(2):139-143, 1961.Google ScholarCross Ref
20.N. Reithinger, M. Kipp, R. Engel, and J. Alexandersson. Summarizing multilingual spoken negotiation dialogues. In Proceedings of the 38th Conference of the Association for Computational Linguistics, Hongkong, China, October, pages 310-317, 2000. Google ScholarDigital Library
21.R. L. Rose. The communicative value of filled pauses in spontaneous speech. PhD thesis, University of Birmingham, Birmingham, UK, 1998.Google Scholar
22.E. E. Shriberg. Preliminaries to a Theory of Speech Dis uencies. PhD thesis, University ofBerkeley, Berkeley, CA, 1994.Google Scholar
23.A. Stolcke, K. Ries, N. Coccaro, E. Shriberg, R. Bates, D. Jurafsky, P. Taylor, R. Martin, C. V. Ess-Dykema, and M. Meteer. Dialogue act modeling for automatic tagging and recognition of conversational speech. Computational Linguistics, 26(3):339-373, September 2000. Google ScholarDigital Library
24.A. Stolcke, E. Shriberg, R. Bates, M. Ostendorf, D. Hakkani, M. Plauche, G. T. ur, and Y. Lu. Automatic detection of sentence boundaries and dis uencies based on recognized words. In Proceedings of the ICSLP-98, Sydney, Australia, December, volume 5, pages 2247-2250, 1998.Google Scholar
25.S. Teufel and M. Moens. Sentence extraction as a classification task. In ACL/EACL-97 Workshop on Intelligent and Scalable Text Summarization, Madrid, Spain, 1997.Google Scholar
26.R. Valenza, T. Robinson, M. Hickey, and R. Tucker. Summarisation of spoken audio through information extraction. In Proceedings of the ESCA workshop: Accessing information in spoken audio, pages 111-116. Cambridge, UK, Apr. 1999.Google Scholar
27.W. Wahlster. Verbmobil | translation of face-to-face dialogs. In Proceedings of MT Summit IV, Kobe, Japan, 1993. Google ScholarDigital Library
28.A. Waibel, M. Bett, and M. Finke. Meeting browser: Tracking and summarizing meetings. In Proceedings of the DARPA Broadcast News Workshop, 1998.Google Scholar
29.A. Waibel, M. Bett, F. Metze, K. Ries, T. Schaaf, T. Schultz, H. Soltau, H. Yu,and K. Zechner. Advances in automatic meeting record creation and access. In Proceedings of ICASSP-2001, Salt Lake City, UT, May, 2001.Google ScholarCross Ref
30.S. Whittaker, J. Hirschberg, J. Choi, D. Hindle, F. Pereira, and A. Singhal. SCAN: Designing and evaluating user interfaces to support retrieval from speech archives. In Proceedings of the 22nd ACM-SIGIR International Conference onResearch and Development in Information Retrieval, Berkeley, CA, August, pages 26-33, 1999. Google ScholarDigital Library
31.K. Zechner. Automatic Summarization of Spoken Dialogues in Unrestricted Domains. PhD thesis, Language Technologies Institute, School of Computer Science, Carnegie Mellon University, forthcoming.Google Scholar
32.K. Zechner and A. Lavie. Increasing the coherence of spoken dialogue summaries by cross-speaker information linking. In Proceedings of the NAACL-01 Workshop on Automatic Summarization, Pittsburgh, PA, June, 2001.Google Scholar
33.K. Zechner and A. Waibel. DiaSumm: Flexible summarization of spontaneous dialogues in unrestricted domains. In Proceedings of COLING-2000, Saarbr. ucken, Germany, July/August, pages 968-974, 2000. Google ScholarDigital Library
34.K. Zechner and A. Waibel. Minimizing word error rate in textual summaries of spoken language. In Proceedings of the First Meeting of the North American Chapter of the Association for Computational Linguistics, NAACL-2000, Seattle, WA, April/May, pages 186-193, 2000. Google ScholarDigital Library

Index Terms

Automatic generation of concise summaries of spoken dialogues in unrestricted domains
1. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing
2. Information systems
  1. Information retrieval

Recommendations

Automatic generation of short informative sentiment summaries
EACL '12: Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics

In this paper, we define a new type of summary for sentiment analysis: a single-sentence summary that consists of a supporting sentence that conveys the overall sentiment of a review as well as a convincing reason for this sentiment. We present a system ...
Read More
(Semi-)Automatic Analysis of Dialogues
ICAART 2014: Proceedings of the 6th International Conference on Agents and Artificial Intelligence - Volume 1

We study human-human dialogues and human-computer dialogues with the aim to determine which dialogue acts and communicative strategies do the participants of interaction use, and which structural parts does a dialogue include. In order to simplify the ...
Read More
Applying Topic Recognition to Spoken Language in Human-Robot Interaction Dialogues
MMRWHRI '14: Proceedings of the 2014 Workshop on Multimodal, Multi-Party, Real-World Human-Robot Interaction

Human-robot interaction systems that work in everyday situations need to be able to talk about different topics, for example when the robot is a bartender that serves drinks to human customers. We applied a topic recognition approach that is based on ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
SIGIR '01: Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
September 2001
454 pages
ISBN:1581133316
DOI:10.1145/383952
Chairmen:
Donald H. Kraft
Louisiana State Univ.
,
W. Bruce Croft
University of Massachusetts, (For the Americas)
,
David J. Harper
The Robert Gordon University, (For Europe and Africa)
,
Justin Zobel
RMIT University, (For Asia and Australasia)
Copyright © 2001 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 1 September 2001
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Qualifiers
- Article
Conference

Acceptance Rates
SIGIR '01 Paper Acceptance Rate47of201submissions,23%Overall Acceptance Rate792of3,983submissions,20%
More
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 32
  Total Citations
  View Citations
- 575
  Total Downloads
- Downloads (Last 12 months)9
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Automatic generation of concise summaries of spoken dialogues in unrestricted domains

SIGIR '01: Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval

ABSTRACT

References

Cited By

Index Terms

Recommendations

Automatic generation of short informative sentiment summaries

(Semi-)Automatic Analysis of Dialogues

Applying Topic Recognition to Spoken Language in Human-Robot Interaction Dialogues