Skip to main content

Automated Text Classification for Fast Feedback – Investigating the Effects of Document Representation

  • Conference paper
Knowledge-Based Intelligent Information and Engineering Systems (KES 2003)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2774))

  • 1280 Accesses

Abstract

New trends such as increased product complexity, changing customer requirements and shortening development time, have given rise to an increase in the number of unexpected events within the Product Development Process (PDP). Traditional tools are only partially adequate (either insufficient coverage or simply too late) to cover these unexpected events. As such, new tools are being sought to complement traditional ones. This paper investigates the use of one such tool, textual data mining, for the purpose of facilitating fast feedback. The motivation for this paper stems from the need to handle widely ignored and “loosely structured textual data” within the PDP. In particular this study would focus on the automated classification of call center records from a Multi National Company (MNC). Different document representation schemes are studied in view of determining the most appropriate scheme that maximizes classification accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Cheng, P.S., Chang, P.: Transforming Corporate Information into Value through Data Warehousing and Data Mining. Aslib Proceedings 50(5), 109–113 (1998)

    Article  Google Scholar 

  2. Grigori, D., Casati, F., Dayal, H., Shan, M.-C.: Improving Business process quality through exception understanding, prediction and presentation. Proceedings of the (VLDB) journal, 159–168 (2001)

    Google Scholar 

  3. Dagli, C.H., Lee, H.-C.: Impacts of Data Mining Technology on Product Design and Planning. In: Plonka, F., Olling, G. (eds.) Computer Applications in Production and Engineering, pp. 58–70. Chapman & Hall, Boca Raton (1997)

    Google Scholar 

  4. Wang, X.Z., Chen, B.H., McGreavy, C.: Data Mining for Failure Diagnosis of Process Units by Learning Probabilistic Network. Transactions of Institute of Chemical Engineers 75(Part B), 210–216 (1997)

    Google Scholar 

  5. Koonce, D.A., Fang, C.-H., Tsai, S.-C.: A Data Mining Tool for Learning from Manufacturing Systems. Computers and Industrial Engineering 33(1-2), 27–30 (1997)

    Article  Google Scholar 

  6. Yoon, Y.-H., Kim, Y.-S., Kim, S.-J., Yum, B.-J.: Development of a Framework for Analyzing Process Monitoring Data with Applications, Web-site http://citeseer.nj.nec.com/546744.html

  7. Lee, J.-H., Pork, S.-C.: Data mining for High Quality and quick response manufacturing. In: Braha, D. (ed.) Data Mining for Design and Manufacturing, pp. 179–205. Kluwer Academic Publishers, Dordrecht (2001)

    Google Scholar 

  8. Lu, J.-C.: Methodology for Mining Massive Data sets for Improving Manufacturing quality/efficiency. In: Braha, D. (ed.) Data Mining for Design and Manufacturing, pp. 255–289. Kluwer Academic Publishers, Dordrecht (2001)

    Google Scholar 

  9. Busemann, S., Schmeier, S., Arens, R.: Message classification in the call center. In: Proceedings of the 6th conference on applied natural language processing Seattle, WA (2000)

    Google Scholar 

  10. Tan, P.-N., Blau, H., Harp, S.: Goldman R. Textual Data Mining of Service Centre Call Records. In: The Sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Boston, USA (2000)

    Google Scholar 

  11. Sullivan, D.: Document Warehousing and Text Mining: Techniques for Improving Business Operations, Marketing and Sales. Wiley and Sons, Chichester (2001)

    Google Scholar 

  12. Furnas, G.W., Landauer, T.K., Gomez, L.M., Dumais, S.T.: The Vocabulary Problem in Human-System Communications. Communications of the ACM 30, 964–971 (1987)

    Article  Google Scholar 

  13. Lovins, J.: Development of a Stemming Algorithm. Mechanical Translation and Computational Linguistics 11, 22–31 (1968)

    Google Scholar 

  14. Porter, M.: An Algorithm for Suffix Stripping. Program 14(3), 130–137 (1980)

    Google Scholar 

  15. Van Rijsbergen, C.J.: Information Retrieval. Butter Worths, Butterworths (1979)

    Google Scholar 

  16. Salton, G., McGill, M.J.: Introduction to Modern Information Retrieval, 1st edn. McGraw-Hill International Book Company, New York (1983)

    MATH  Google Scholar 

  17. Salton, G., Buckley, C.: Term Weighting Approaches in Automatic Text Retrieval. Information Processing and Management 24(5), 513–523 (1988)

    Article  Google Scholar 

  18. Pyle, D.: Data Preparation for Data Mining, pp. 257–258. Morgan Kaufmann Publishers, Incorporation, San Francisco (1999)

    Google Scholar 

  19. Dumais, S.T.: Enhancing Performance in Latent Semantic Indexing (LSI) Retrieval. TM-ARH-017527 Technical Report. Bellcore (1990)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Menon, R., Tong, L.H., Sathiyakeerthi, S., Brombacher, A. (2003). Automated Text Classification for Fast Feedback – Investigating the Effects of Document Representation. In: Palade, V., Howlett, R.J., Jain, L. (eds) Knowledge-Based Intelligent Information and Engineering Systems. KES 2003. Lecture Notes in Computer Science(), vol 2774. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45226-3_138

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-45226-3_138

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-40804-8

  • Online ISBN: 978-3-540-45226-3

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics