A domain-specific decision support system for knowledge discovery using association and text mining

Rajpathak, Dnyanesh; Chougule, Rahul; Bandyopadhyay, Pulak

doi:10.1007/s10115-011-0409-1

A domain-specific decision support system for knowledge discovery using association and text mining

Regular Paper
Published: 18 May 2011

Volume 31, pages 405–432, (2012)
Cite this article

Knowledge and Information Systems Aims and scope Submit manuscript

Dnyanesh Rajpathak¹,
Rahul Chougule¹ &
Pulak Bandyopadhyay¹

705 Accesses
29 Citations
Explore all metrics

Abstract

We propose a novel association and text mining system for knowledge discovery (ASTEK) from the warranty and service data in the automotive domain. The complex architecture of modern vehicles makes fault diagnosis and isolation a non-trivial task. The association mining isolates anomaly cases from the millions of service and claims records. ASTEK has shown 86% accuracy in correctly identifying the anomaly cases. The text mining subscribes to the diagnosis and prognosis (D&P) ontology, which provides the necessary domain-specific knowledge. The root causes associated with the anomaly cases are identified by discovering frequent symptoms associated with the part failures along with the repair actions used to fix the part failures. The best-practice knowledge is disseminated to the dealers involved in the anomaly cases. ASTEK has been implemented as a prototype in the service and quality department of GM and its performance has been validated in the real life set up. On an average, the analysis time is reduced from few weeks to few minutes, which in real life industry are significant improvements.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Big Data in Asset Management: Knowledge Discovery in Asset Data by the Means of Data Mining

Association Mining for Operation and Maintenance Safety Risks of EMUs Based on Unstructured Event Data

Article Open access 08 March 2025

An Interpretable Fault Prediction Method Based on Machine Learning and Knowledge Graphs

References

Agarwal R, Imielinski T, Swami A (1993) Mining association rules between sets of items in large databases, In: Proceedings of the 1993 ACM SIGMOD conference. Washington DC, USA, pp 207–216
Agosti M, Ferro N (2005) Annotations as context for searching documents, In: Crestani F, Ruthven I (eds) Proceedings of the 5th international conference on conceptions of library and information science—context: nature, impact and role, Lecture Notes in Computer Science, Springer, Heidelberg, Germany, pp 155–170
Beckett D (ed). RDF/XML Syntax Specification (Revised), W3C Recommendation, 2004. http://www.w3.org/TR/rdf-syntax-grammar/
Benedittini O, Baines TS, Lightfoot HW, Greenough RM (2009) State-of-the-art in integrated vehicle health management. J Aer Eng 223(2): 157–170
Article Google Scholar
Bloehdorn S, Cimiano P, Hotho A, Staab S (2005) An ontology-based framework for text mining. LDV Forum 20(1): 87–112
Google Scholar
Buddhakulsomsiri J, Zakarian A (2009) Sequential pattern mining algorithm for automotive warranty data. Comput Ind Eng 57(1): 137–147
Article Google Scholar
Chougule R, Chakrabarty S (2009) Application of ontology guided search for improved equipment diagnosis in a vehicle assembly plant. In: Proceedings of fifth annual IEEE conference on automation science and engineering (IEEE CASE 2009). IEEE Press, Bangalore, India, pp 90–95
Cios KJ, Pedrycz W, Świniarski RW (1998) Data mining methods for knowledge discovery. Kluwer, Norwell
Book MATH Google Scholar
Corcho O (2006) Ontology based document annotation: trends and open research problems. Int J Metadata Semant Ontol 1(1): 47–57
Article MathSciNet Google Scholar
Cunningham H (2002) GATE, a general architecture for text engineering. Comput Humanit 36: 223–254
Article Google Scholar
Davi A, Haughton D, Nasr N, Shah G, Skaletsky M, Spack R (2005) A review of two text-mining packages: SAS TextMining and WordStat. (Product/Service Evaluation). Am Stat 59(1): 89–103
Article Google Scholar
Dean, PM (eds) (1995) Molecular similarity in drug design. Blackie Academic & Professional, London, pp 111–137
Google Scholar
Fensel D, Straatman R (1998) The essence of problem-solving methods: making assumptions to gain efficiency. Int J Human-Comput Stud 48: 181–215
Article Google Scholar
Francisco V, Gervas P, Peinado F (2010) Ontological reasoning for improving the treatment of emotions in text. Knowl Inf Syst 25: 421–443
Article Google Scholar
Gruber TR (1993) A translation approach to portable ontology specifications. Knowl Acq 5(2): 199–220
Article Google Scholar
Gusikhin O, Rychtyckyj N, Filev D (2007) Intelligent systems in the automotive industry: applications and trends. Knowl Inf Syst 12(2): 147–168
Article Google Scholar
Hearst T (1999) Untangling text data mining. University of Maryland, College Park, pp 3–10
Google Scholar
Janasak KM, Beshears RR (2007) Diagnostics to prognostics—a product technology evolution, In: Proceedgins of the 2007 reliability and maintainability symposium—RAMS’07. Orlando, Florida, USA
Jain AK, Murty MN, Flynn PJ (1999) Data clustering: a review. ACM Comput Surv 31(3): 264–323
Article Google Scholar
Jing Y, Choi Y, Xiong Y, Han K, Shin S, Lee Y (2007) A knowledge acquisition and management system for fault diagnosis and maintenance of equipments, In: Proceedings of the 6th WSEAS international conference on applied computer science. Hangzhou, China, pp 296–300
Jing L, Ng KM, Huang JZ (2009) Knowledge-based vector space model for text clustering. Knowl Inf Syst 25(1): 35–55
Article Google Scholar
Kotsiantis S, Kanellopoulos D (2006) Association rules mining: a recent overview. Int Trans Comput Sci Eng 32(1): 71–82
Google Scholar
Kuehnast J, Hengeveld W (2009) Enterprise application integration (white paper). T-systems enterprise services. GmbH, Berlin
Google Scholar
Luhn HP (1960) Keyword in context index for technical literature (KWIC Index). Am Docu 11: 288–295
Article Google Scholar
Li J-q, Niu C-l, Liu J-z, Zhang L-y (2006) Research and application of data mining in power plant process control and optimization. Lec Notes Comp Sci 3930: 149–158
Article Google Scholar
Ovsiannikov IA, Arbib MA, Mcneill TH (1999) Annotation technology. Int J Human-Comput Stud 50(4): 329–362
Article Google Scholar
Palmer DD, Hearst MA (1994) Adaptive sentence boundary disambiguation. Report No. UCB/CSD 94/797
Quan X, Liu G, Lu Z, Ni X, Wenyin L (2010) Short text similarity based on probabilistic topics. Knowl Inf Syst 25: 473–491
Article Google Scholar
Rajpathak D, Motta E, Zrahal Z, Roy R (2006) A generic library of problem solving methods for scheduling applications. IEEE Trans Knowl Data Eng 18(6): 815–828
Article Google Scholar
Salton G, McGill MJ (1983) Introduction to modern information retrieval. McGraw-Hill, New York
MATH Google Scholar
Saxena A, Wu B, Vachtsevanos G (2005) Integrated diagnosis and prognosis architecture for fleet vehicles using dynamic case based reasoning. In: Proceedings of the IEEE Autotestcon, pp 96–102
Stevenson M, Gaizauskas R (2000) Experiments on sentence boundary detection. In: Proceedings of the 6th conference on applied natural language processing. Seattle, USA, pp 84–89
Tan P, Kumar V, Srivastava J (2002) Selecting the right interestingness measure for association patterns. In: Proceedings of the SIGKDD’02 conference. Edmonton, Alberta, Canada, pp 32–41
Venkatasubramanian V, Rengaswamy R, Yin K, Kavuri S (2003) A review of process fault detection and diagnosis part I: quantitative model-based methods. Comput Chem Eng 27: 293–311
Article Google Scholar
Venkatasubramanian V, Rengaswamy R, Kavuri S (2003) A review of process fault detection and diagnosis part II: qualitative models and search strategies. Comput Chem Eng 27: 313–326
Article Google Scholar
Venkatasubramanian V, Rengaswamy R, Yin K, Kavuri S (2003) A review of process fault detection and diagnosis part III: process history based methods. Comput Chem Eng 27: 327–346
Article Google Scholar
Wang S, Hsu S (2004) A Web-based CBR knowledge management system for PC troubleshooting. Int J Adv Manuf Tech 23(7–8): 532–540
Article Google Scholar
Williams Z (2006) Benefits of IVHM: an analytical approach. In: Proceedings of the Aerospace Conference. Big Sky, Montana, USA

Download references

Author information

Authors and Affiliations

Diagnosis and Prognosis Group, India Science Lab, General Motors Global Research and Development, GM Technical Centre India Pvt. Ltd., Creator Building, International Technology Park, Whitefiled, Bangalore, 560066, Karnataka, India
Dnyanesh Rajpathak, Rahul Chougule & Pulak Bandyopadhyay

Authors

Dnyanesh Rajpathak
View author publications
You can also search for this author inPubMed Google Scholar
Rahul Chougule
View author publications
You can also search for this author inPubMed Google Scholar
Pulak Bandyopadhyay
View author publications
You can also search for this author inPubMed Google Scholar

Corresponding author

Correspondence to Dnyanesh Rajpathak.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Rajpathak, D., Chougule, R. & Bandyopadhyay, P. A domain-specific decision support system for knowledge discovery using association and text mining. Knowl Inf Syst 31, 405–432 (2012). https://doi.org/10.1007/s10115-011-0409-1

Download citation

Received: 13 January 2010
Revised: 17 February 2011
Accepted: 16 April 2011
Published: 18 May 2011
Issue Date: June 2012
DOI: https://doi.org/10.1007/s10115-011-0409-1

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A domain-specific decision support system for knowledge discovery using association and text mining

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Big Data in Asset Management: Knowledge Discovery in Asset Data by the Means of Data Mining

Association Mining for Operation and Maintenance Safety Risks of EMUs Based on Unstructured Event Data

An Interpretable Fault Prediction Method Based on Machine Learning and Knowledge Graphs

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now