skip to main content
10.1145/1321440.1321459acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

Automatic call section segmentation for contact-center calls

Published:06 November 2007Publication History

ABSTRACT

This paper presents a SVM (Support Vector Machine) classification system which divides contact-center call transcripts into "Greeting", "Question", "Refine", "Research", "Resolution", "Closing" and "Out-of-topic" sections. This call section segmentation is useful to improve search and retrieval functions and to provide more detailed statistics on calls. We use an off-the-shelf automatic speech recognition (ASR) system to generate call transcripts from recorded calls between customers and service representatives.

We first classify an individual utterance into a call section by applying the SVM classifier and then merge adjacent utterances classified into a same call section. We experiment with the proposed system on 100 automatically transcribed calls. The 10-fold cross validation shows 87.2% classification accuracy. We also compare the proposed algorithm with two other approaches - the most frequent section only method and a maximum entropy-based segmentation. The evaluation shows that our system's accuracy is 12% higher than the first baseline system and 6% higher than the second baseline system respectively.

References

  1. J. Baldridge. The opennlp project. http://opennlp.sourceforge.net/, 2002.Google ScholarGoogle Scholar
  2. C. Burges. A tutorial on support vector machines for pattern recognition. Data Mining and Knowledge Discovery, (2):121--167, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. S. Busemann, S. Schmeier, and R. G. Arens. Message classification in the call center. In Proceedings of the sixth conference on Applied natural language processing, pages 158--165, San Francisco, CA, USA, 2000. Morgan Kaufmann Publishers Inc. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. C. Chang and C. Lin. Libsvm: a library for support vector machines. Software available at http://www.csie.ntu.edu.tw/ cjlin/libsvm, 2001.Google ScholarGoogle Scholar
  5. H. Christensen, B. Kolluru, Y. Gotoh, and S. Renals. Maximum entropy segmentation of broadcast news. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005.Google ScholarGoogle ScholarCross RefCross Ref
  6. J. Chu-Carroll and B. Carpenter. Vector-based natural language call routing. Computational Linguistics, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. A. S. G. Tur, D. Hakkani-Tur and E. Shriberg. Integrating prosodic and lexical cues for automatic topic segmentation. Computational Linguistics,(27):31--57, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. J. Garofolo, G. Auzanne, and E. Voorhees. The trec spoken document retrieval track: A success story. In Proceedings of the Ninth Text Tretrieval Conference (TREC-9). National Institute of Standards and Technology (NIST), 2000.Google ScholarGoogle Scholar
  9. Y. Gotoh and S. Renals. Sentence boundary detection in broadcast speech transcripts. Proceedings of ISCA Workshop: Automatic Speech Recognition: Challenges for the New millennium ASR-2000, 2000.Google ScholarGoogle Scholar
  10. L. M. D. P. G. S. H. Soltau, B. Kingsbury and G. Zweig. The ibm 2004 conversational telephony system for rich transcription. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2004.Google ScholarGoogle Scholar
  11. T. Joachims. Text categorization with support vector machines: learning with many relevant features. In Proceedings of ECML-98, 10th European Conference on Machine Learning, pages 137--142. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Y. Liu, M. Harper, E. Shriberg, and A. Stolcke. Using conditional random fields for sentence boundary detection in speech. In Proceedings of the Association of Computational Linguistics (ACL), 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. J. Mamou, D. Carmel, and R. Hoory. Spoken document retrieval from call-center conversations. In Proceedings of SIGIR'06. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. G. Mishne, D. Carmel, and R. Hoory. Automatic analysis of call-center conversations. In Proceedings of the 14th ACM International Conference on Information and Knowledge Management (CIKM), 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. G. T. P. Haffner and J. Wright. Optimizing svms for complex call classification. In Proceedings of International Conference on Acoustics, Speech, and Signal Processing (ICASSP'03), 2003.Google ScholarGoogle ScholarCross RefCross Ref
  16. Y. Park, R. J. Byrd, and B. K. Boguraev. Automatic glossary extraction: Beyond terminology identification. In Proceedings of the Nineteenth International Conference on Computational Linguistics (COLING02), 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. A. Ratnaparkhi. Maximum entropy models for natural language ambiguity resolution. Ph.D. thesis, University of Pennsylvania, Philadelphia, PA., 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. G. Riccardi, A. Gorin, A. Ljolje, and M. Riley. A spoken language system for automated call routing. In Proceedings of Proceedings of International Conference on Acoustics, Speech, and Signal Processing(ICASSP'97), 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. S. Roy and L. Subramaniam. Automatic generation of domain models for call-centers from noisy transcriptions. In Proceedings of COLING-ACL 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. M. Tang, B. Pellom, and K. Hacioglu. Call-type classification and unsupervised training for the call center domain. In IEEE Automatic Speech Recognition and Understanding (ASRU) Workshop, 2003.Google ScholarGoogle Scholar
  21. V. Vapnik. The Nature of Statistical Learning Theory. Springer-Verlag, New York, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Automatic call section segmentation for contact-center calls

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader