Skip to main content
Log in

KISTCM: knowledge discovery system for traditional Chinese medicine

  • Published:
Applied Intelligence Aims and scope Submit manuscript

Abstract

Objective: Traditional Chinese Medicine (TCM) provides an alternative method for achieving and maintaining good health. Due to the increasing prevalence of TCM and the large volume of TCM data accumulated though thousands of years, there is an urgent need to efficiently and effectively explore this information and its hidden rules with knowledge discovery in database (KDD) techniques. This paper describes the design and development of a knowledge discovery system for TCM as well as the newly proposed KDD techniques integrated in this system.

Methods: A novel Knowledge dIscovery System for TCM (KISTCM) is developed by incorporating several data mining techniques, primarily including a medicine dependency relationship discovery algorithm, an efficacy dimension reduction algorithm based on neural networks, a method for exploring the relationships between formulae and syndromes using gene expression programming (GEP), and an approach for discovering the properties in terms of nature, taste and meridian based on the herbal dosage by employing the effect degree function to calculate the effect of each property.

Results: Representative experimental cases are used to evaluate the system performance. Encouraging results are obtained, including rules previously unknown to algorithm designers and experiment runners. Experiments demonstrate that KISTCM has powerful knowledge discovery and data analysis capabilities, and is a useful tool for discovering the underlying rules in formulae. Our proposed techniques successfully discover hidden knowledge from TCM data, which is a new direction in knowledge discovery. From TCM experts’ perspective, the accuracy of data analysis for KISTCM is an improvement, and these results compare favorably to other existing TCM data mining techniques. The system could be expected to be useful in the practice of TCM, e.g., assisting TCM physicians in prescribing formulae or automatically distinguishing between minister and assistant herbs in a formula.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Feng Y, Wu ZH, Zhou XZ, Zhou ZM, Fan WY (2006) Knowledge discovery in traditional Chinese medicine: state of the art and perspectives. Artif Intell Med 38(3):219–236

    Article  Google Scholar 

  2. Xiang ZG (2003) A 3-stage voting algorithm for mining optimal ingredient pattern of traditional Chinese medicine. J Softw 14(11):1882–1890

    MATH  Google Scholar 

  3. Liu ZW, Jiang YN (1998) Formulas of traditional Chinese medicine. Academy Press, Beijing (in Chinese)

    Google Scholar 

  4. Frawley WJ, Piatetsky-Shapiro G, Matheus C (1991) In: Piatetsky-Shapiro G, Frawley WJ (eds) Knowledge discovery in databases: an overview. AAAI Press/MIT Press, Cambridge, pp 1–30

    Google Scholar 

  5. Bath PA (2004) Data mining in health and medical information. Annual Rev Inf Sci Technol 38(1):331–369

    Article  Google Scholar 

  6. Yong XJ, Jiang YG, Cao L, Song YP (2005) Analysis on factors of affecting medicine effects in formula compatibility. Liaoning J Tradit Chi Med 32(9):867–868. (in Chinese)

    Google Scholar 

  7. Altman RB (1999) AI in medicine: the spectrum of challenges from managed care to molecular medicine. AI Mag 20(3):67–77

    Google Scholar 

  8. Zupan B, Lavrac N, Keravnou E (1999) Data mining techniques and applications in medicine. Artif Intell Med 16(1):1–2

    Article  Google Scholar 

  9. Zhou XZ, Liu BY, Wu ZH, Feng Y (2007) Integrative mining of traditional Chinese medicine literature and MEDLINE for functional gene networks. Artif Intell Med 41(2):87–104

    Article  Google Scholar 

  10. Cios KJ, Moore GW (2002) Uniqueness of medical data mining. Artif Intell Med 26(1–2):1–24

    Article  Google Scholar 

  11. Cao CG, Wang HT, Sui YF (2004) Knowledge modeling and acquisition of traditional Chinese herbal drugs and formulae from text. Artif Intell Med 32(1):3–13

    Article  Google Scholar 

  12. Lavrac N (1999) Selected techniques for data mining in medicine. Artif Intell Med 16(1):3–23

    Article  Google Scholar 

  13. Yao MC, Yuan YM, Ai L, Qiao YJ (2002) Data mining and its application in the modernization of traditional Chinese medicine and traditional Chinese pharmacy. J Beijing Univ TCM 25(5):20–23 (in Chinese)

    Google Scholar 

  14. Feng Y, Wu ZH, Zhou ZM (2005) Combining an order-semisensitive text similarity and closest fit approach to textual missing values in knowledge discovery. In: Khosla R, Howlett RJ, Jain LC (eds) Proceedings of KES 2005. Lecture notes in computer science, vol 3682. Springer, Berlin, pp 943–949

    Google Scholar 

  15. Zhou ZM, Wu CH (2004) Mining frequent maximum patterns with constraint. J Fudan Univ (Nat Sci Ed) 5:746–749 (in Chinese)

    Google Scholar 

  16. He QF, Zhou XZ, Zhou ZM, Cui M, Wu ZH (2004) Efficacy-based clustering analysis of traditional Chinese medicinal herbs. Chin J Inf TCM 11(7):561–562 (in Chinese)

    Google Scholar 

  17. Li C, Tang CJ, Peng J, Hu JJ, Zeng LM, Yin XX, Jiang YG, Liu J (2004) TCMiner: a high performance data mining system for multi-dimensional data analysis of traditional Chinese medicine prescriptions. In: Wang S, Yang DQ, Tanaka K, Grandi F, Zhou SG, Mangina EE et al. (eds) Proceedings of ER workshops 2004. Lecture notes in computer science, vol 3289. Springer, Berlin, pp 246–257

    Google Scholar 

  18. Yao MC, Zhang YL, Yuan YM, Ai L, Qiao YJ (2004) Study on the prediction of the effect attribution of the deficiency-nourishing drugs based on the quantification of TCM drug properties. J Beijing Univ Tradit Chin Med 27(4):7–9 (in Chinese)

    Google Scholar 

  19. Zhou L, Tang XY, Fu C, Peng SH (2004) Fuzzy clustering analysis of Chinese herbs for relieving exterior syndrome. West China J Pharm Sci 19(5):339–341 (in Chinese)

    Google Scholar 

  20. Wu ZH, Zhou XZ, Liu BY, Chen JL (2004) Text mining for finding functional community of related genes using TCM knowledge. In: Boulicaut JF, Esposito F, Giannotti F, Pedreschi D (eds) Proceedings of the 8th European conference on principles and practice of knowledge discovery in databases. Springer, Berlin, pp 459–470

    Google Scholar 

  21. Qiao SJ, Tang CJ, Peng J, Yin XX, Han N (2007) Mining the compatibility law of multidimensional medicines based on dependence mode sets. J Sichuan Univ (Eng Sci Ed) 39(4):134–138 (in Chinese)

    Google Scholar 

  22. Qiao SJ, Tang CJ, Peng J, Yu ZH, Jiang YG, Han N (2006) A novel prescription function reduction algorithm based on neural network. In: Chen GR, Liu XZ (eds) Proceedings of the ICSCA 2006, DCDIS series B: application and algorithm. Watam, Canada, pp 939–944

    Google Scholar 

  23. Peng J, Tang CJ, Zeng T, Qiao SJ, Yong XJ (2006) A Chinese traditional medicine prescription effect reduction algorithm based on artificial neural network and property distance matrix. J Sichuan Univ (Eng Sci Ed ) 38(1):92–97 (in Chinese)

    Google Scholar 

  24. Qiao SJ, Tang CJ, Peng J, Hu JJ, Zhang H (2006) BPGEP: robot path planning based on backtracking parallel-chromosome GEP. In: Chen GR, Liu XZ (eds) Proceedings of the ICSCA 2006, DCDIS series B: application and algorithm. Watam, Canada, pp 439–444

    Google Scholar 

  25. Ferreira C (2002) Gene expression programming: mathematical modeling by an artificial intelligence. Angra do Heroismo, Portugal

    Google Scholar 

  26. Yu X, Tang CJ, Zhang H, Qiao SJ, Jiang YG, Liu J, Han PY (2005) Mining formula-syndrome relationship in traditional Chinese medicine with gene expression programming. Comput Appl 25(11):2679–2880 (in Chinese)

    Google Scholar 

  27. Hu JJ (2006) The research of key-techniques in knowledge discovery system for TCM pharmacology. Thesis, Chengdu, Sichuan University (in Chinese)

  28. Peng HR (2002) Chinese medical formula dictionary. People’s Medical Publishing House, Beijing (in Chinese)

    Google Scholar 

  29. Li C, Fan M (2004) Generating association rules based on threaded frequent pattern tree. Comput Eng Appl 4:188–192 (in Chinese)

    MathSciNet  Google Scholar 

  30. Agrawal R, Imielinski T, Swami AN (1993) Mining Association Rules between Sets of Items in Large Databases. In: Buneman P, Jajodia S (eds) Proceedings of the ACM SIGMOD conference on management of data 1993. ACM Press, New York, pp 207–216

    Chapter  Google Scholar 

  31. Yin XX (2005) A mining model for medicine paring correlation of traditional Chinese medicine prescriptions. Thesis, Chengdu, Sichuan University (in Chinese)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shaojie Qiao.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Qiao, S., Tang, C., Jin, H. et al. KISTCM: knowledge discovery system for traditional Chinese medicine. Appl Intell 32, 346–363 (2010). https://doi.org/10.1007/s10489-008-0149-4

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10489-008-0149-4

Keywords

Navigation