skip to main content
10.1145/383952.383989acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
Article

Automatic generation of concise summaries of spoken dialogues in unrestricted domains

Published:01 September 2001Publication History

ABSTRACT

Automatic summarization of open domain spoken dialogues is a new research area. This paper introduces the task, the challenges involved, and presents an approach to obtain automatic extract summaries for multi-party dialogues of four different genres, without any restriction on domain. We address the following issues which are intrinsic to spoken dialogue summarization and typically can be ignored when summarizing written text such as newswire data: (i) detection and removal of speech disfluencies; (ii) detection and insertion of sentence boundaries; (iii) detection and linking of cross-speaker information units (question-answer pairs). A global system evaluation using a corpus of 23 relevance annotated dialogues containing 80 topical segments shows that for the two more informal genres, our summarization system using dialogue specific components significantly outperforms a baseline using TFIDF term weighting with maximum marginal relevance ranking (MMR).

References

  1. 1.E. Brill. Some advances in transformation-based part of speech tagging. In Proceeedings of AAAI-94, 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. 2.J. Carbonell and J. Goldstein. The use of MMR, diversity-based reranking for reordering documents and producing summaries. In Proceedings of the 21st ACM-SIGIR International Conference onResearch and Development in Information Retrieval, Melbourne, Australia, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. 3.J. S. Garofolo, E. M. Voorhees, C. G. P. Auzanne, and V. M. Stanford. Spoken document retrieval: 1998 evaluation and investigation of new metrics. In Proceedings of the ESCA workshop: Accessing information in spoken audio, pages 1-7. Cambridge, UK, Apr. 1999.Google ScholarGoogle Scholar
  4. 4.M. Gavalda a, K. Zechner, and G. Aist. High performance segmentation of spontaneous speech using part of speech and trigger word information. In Proceedings of the 5th ANLP Conference, Washington DC, pages 12-15, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. 5.M. A. Hearst. TextTiling: Segmenting text into multi-paragraph subtopic passages. Computational Linguistics, 23(1):33-64, March 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. 6.P. A. Heeman and J. F. Allen. Intonational boundaries, speech repairs and discourse markers: Modeling spoken dialog. In Proceedings of the ACL/EACL-97, Madrid, Spain, pages 254-261, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. 7.C. Hori and S. Furui. Improvements in automatic speech summarization and evaluation methods. In Proceedings of ICSLP-00, Beijing, China, October, pages 326-329, 2000.Google ScholarGoogle Scholar
  8. 8.D. Jurafsky, R. Bates, N. Coccaro, R. Martin, M. Meteer, K. Ries, E. Shriberg, A. Stolcke, P. Taylor, and C. V. Ess-Dykema. SwitchBoard discourse language modeling project, final report. Research Note 30, Center for Language and Speech Processing, Johns Hopkins University, Baltimore, MD, 1998.Google ScholarGoogle Scholar
  9. 9.M. Kameyama, G. Kawai, and I. Arima. A real-time system for summarizing human-human spontaneous spoken dialogues. In Proceedings of the ICSLP-96, pages 681-684, 1996.Google ScholarGoogle ScholarCross RefCross Ref
  10. 10.K. Koumpis and S. Renals. Transcription and summarization of voicemail speech. In Proceedings of ICSLP-00, Beijing, China, October, pages 688-91, 2000.Google ScholarGoogle Scholar
  11. 11.J. Kupiec, J. Pedersen, and F. Chen. A trainable document summarizer. In Proceedings of the 18th ACM-SIGIR Conference, pages 68-73, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. 12.Linguistic Data Consortium. LDC. CallHome and CallFriend LVCSR databases, 1996.Google ScholarGoogle Scholar
  13. 13.Linguistic Data Consortium. LDC. Treebank-3: CD-ROM containing databases of dis uency annotated Switchboard transcripts (LDC99T42), 1999.Google ScholarGoogle Scholar
  14. 14.I. Mani, D. House, G. Klein, L. Hirschman, L. Obrst, T. Firmin, M. Chrzanowski, and B. Sundheim. The TIPSTER SUMMAC text summarization evaluation. Mitre Technical Report MTR 98W0000138, October 1998, 1998.Google ScholarGoogle Scholar
  15. 15.I. Mani and M. T. Maybury, editors. Advances in automatic text summarization. MIT Press, Cambridge, MA, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. 16.D. Marcu. Discourse trees are good indicators of importance in text. In Mani and Maybury {15}, pages 123-136.Google ScholarGoogle Scholar
  17. 17.M. Meteer, A. Taylor, R. MacIntyre, and R. Iyer. Dys uency annotation stylebook for the Switchboard corpus. Revised by Ann Taylor, June 1995, available on the LDC99T42 CD-ROM, published by LDC, 1995.Google ScholarGoogle Scholar
  18. 18.J. R. Quinlan. C4.5: Programs for Machine Learning. Morgan Kaufmann, San Mateo, CA, 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. 19.G. J. Rath, A. Resnick, and T. R. Savage. The formation of abstracts by the selection of sentences. American Documentation, 12(2):139-143, 1961.Google ScholarGoogle ScholarCross RefCross Ref
  20. 20.N. Reithinger, M. Kipp, R. Engel, and J. Alexandersson. Summarizing multilingual spoken negotiation dialogues. In Proceedings of the 38th Conference of the Association for Computational Linguistics, Hongkong, China, October, pages 310-317, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. 21.R. L. Rose. The communicative value of filled pauses in spontaneous speech. PhD thesis, University of Birmingham, Birmingham, UK, 1998.Google ScholarGoogle Scholar
  22. 22.E. E. Shriberg. Preliminaries to a Theory of Speech Dis uencies. PhD thesis, University ofBerkeley, Berkeley, CA, 1994.Google ScholarGoogle Scholar
  23. 23.A. Stolcke, K. Ries, N. Coccaro, E. Shriberg, R. Bates, D. Jurafsky, P. Taylor, R. Martin, C. V. Ess-Dykema, and M. Meteer. Dialogue act modeling for automatic tagging and recognition of conversational speech. Computational Linguistics, 26(3):339-373, September 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. 24.A. Stolcke, E. Shriberg, R. Bates, M. Ostendorf, D. Hakkani, M. Plauche, G. T. ur, and Y. Lu. Automatic detection of sentence boundaries and dis uencies based on recognized words. In Proceedings of the ICSLP-98, Sydney, Australia, December, volume 5, pages 2247-2250, 1998.Google ScholarGoogle Scholar
  25. 25.S. Teufel and M. Moens. Sentence extraction as a classification task. In ACL/EACL-97 Workshop on Intelligent and Scalable Text Summarization, Madrid, Spain, 1997.Google ScholarGoogle Scholar
  26. 26.R. Valenza, T. Robinson, M. Hickey, and R. Tucker. Summarisation of spoken audio through information extraction. In Proceedings of the ESCA workshop: Accessing information in spoken audio, pages 111-116. Cambridge, UK, Apr. 1999.Google ScholarGoogle Scholar
  27. 27.W. Wahlster. Verbmobil | translation of face-to-face dialogs. In Proceedings of MT Summit IV, Kobe, Japan, 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. 28.A. Waibel, M. Bett, and M. Finke. Meeting browser: Tracking and summarizing meetings. In Proceedings of the DARPA Broadcast News Workshop, 1998.Google ScholarGoogle Scholar
  29. 29.A. Waibel, M. Bett, F. Metze, K. Ries, T. Schaaf, T. Schultz, H. Soltau, H. Yu,and K. Zechner. Advances in automatic meeting record creation and access. In Proceedings of ICASSP-2001, Salt Lake City, UT, May, 2001.Google ScholarGoogle ScholarCross RefCross Ref
  30. 30.S. Whittaker, J. Hirschberg, J. Choi, D. Hindle, F. Pereira, and A. Singhal. SCAN: Designing and evaluating user interfaces to support retrieval from speech archives. In Proceedings of the 22nd ACM-SIGIR International Conference onResearch and Development in Information Retrieval, Berkeley, CA, August, pages 26-33, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. 31.K. Zechner. Automatic Summarization of Spoken Dialogues in Unrestricted Domains. PhD thesis, Language Technologies Institute, School of Computer Science, Carnegie Mellon University, forthcoming.Google ScholarGoogle Scholar
  32. 32.K. Zechner and A. Lavie. Increasing the coherence of spoken dialogue summaries by cross-speaker information linking. In Proceedings of the NAACL-01 Workshop on Automatic Summarization, Pittsburgh, PA, June, 2001.Google ScholarGoogle Scholar
  33. 33.K. Zechner and A. Waibel. DiaSumm: Flexible summarization of spontaneous dialogues in unrestricted domains. In Proceedings of COLING-2000, Saarbr. ucken, Germany, July/August, pages 968-974, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. 34.K. Zechner and A. Waibel. Minimizing word error rate in textual summaries of spoken language. In Proceedings of the First Meeting of the North American Chapter of the Association for Computational Linguistics, NAACL-2000, Seattle, WA, April/May, pages 186-193, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Automatic generation of concise summaries of spoken dialogues in unrestricted domains

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Conferences
            SIGIR '01: Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
            September 2001
            454 pages
            ISBN:1581133316
            DOI:10.1145/383952

            Copyright © 2001 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 1 September 2001

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • Article

            Acceptance Rates

            SIGIR '01 Paper Acceptance Rate47of201submissions,23%Overall Acceptance Rate792of3,983submissions,20%

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader