Abstract
Objective: Traditional Chinese Medicine (TCM) provides an alternative method for achieving and maintaining good health. Due to the increasing prevalence of TCM and the large volume of TCM data accumulated though thousands of years, there is an urgent need to efficiently and effectively explore this information and its hidden rules with knowledge discovery in database (KDD) techniques. This paper describes the design and development of a knowledge discovery system for TCM as well as the newly proposed KDD techniques integrated in this system.
Methods: A novel Knowledge dIscovery System for TCM (KISTCM) is developed by incorporating several data mining techniques, primarily including a medicine dependency relationship discovery algorithm, an efficacy dimension reduction algorithm based on neural networks, a method for exploring the relationships between formulae and syndromes using gene expression programming (GEP), and an approach for discovering the properties in terms of nature, taste and meridian based on the herbal dosage by employing the effect degree function to calculate the effect of each property.
Results: Representative experimental cases are used to evaluate the system performance. Encouraging results are obtained, including rules previously unknown to algorithm designers and experiment runners. Experiments demonstrate that KISTCM has powerful knowledge discovery and data analysis capabilities, and is a useful tool for discovering the underlying rules in formulae. Our proposed techniques successfully discover hidden knowledge from TCM data, which is a new direction in knowledge discovery. From TCM experts’ perspective, the accuracy of data analysis for KISTCM is an improvement, and these results compare favorably to other existing TCM data mining techniques. The system could be expected to be useful in the practice of TCM, e.g., assisting TCM physicians in prescribing formulae or automatically distinguishing between minister and assistant herbs in a formula.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Feng Y, Wu ZH, Zhou XZ, Zhou ZM, Fan WY (2006) Knowledge discovery in traditional Chinese medicine: state of the art and perspectives. Artif Intell Med 38(3):219–236
Xiang ZG (2003) A 3-stage voting algorithm for mining optimal ingredient pattern of traditional Chinese medicine. J Softw 14(11):1882–1890
Liu ZW, Jiang YN (1998) Formulas of traditional Chinese medicine. Academy Press, Beijing (in Chinese)
Frawley WJ, Piatetsky-Shapiro G, Matheus C (1991) In: Piatetsky-Shapiro G, Frawley WJ (eds) Knowledge discovery in databases: an overview. AAAI Press/MIT Press, Cambridge, pp 1–30
Bath PA (2004) Data mining in health and medical information. Annual Rev Inf Sci Technol 38(1):331–369
Yong XJ, Jiang YG, Cao L, Song YP (2005) Analysis on factors of affecting medicine effects in formula compatibility. Liaoning J Tradit Chi Med 32(9):867–868. (in Chinese)
Altman RB (1999) AI in medicine: the spectrum of challenges from managed care to molecular medicine. AI Mag 20(3):67–77
Zupan B, Lavrac N, Keravnou E (1999) Data mining techniques and applications in medicine. Artif Intell Med 16(1):1–2
Zhou XZ, Liu BY, Wu ZH, Feng Y (2007) Integrative mining of traditional Chinese medicine literature and MEDLINE for functional gene networks. Artif Intell Med 41(2):87–104
Cios KJ, Moore GW (2002) Uniqueness of medical data mining. Artif Intell Med 26(1–2):1–24
Cao CG, Wang HT, Sui YF (2004) Knowledge modeling and acquisition of traditional Chinese herbal drugs and formulae from text. Artif Intell Med 32(1):3–13
Lavrac N (1999) Selected techniques for data mining in medicine. Artif Intell Med 16(1):3–23
Yao MC, Yuan YM, Ai L, Qiao YJ (2002) Data mining and its application in the modernization of traditional Chinese medicine and traditional Chinese pharmacy. J Beijing Univ TCM 25(5):20–23 (in Chinese)
Feng Y, Wu ZH, Zhou ZM (2005) Combining an order-semisensitive text similarity and closest fit approach to textual missing values in knowledge discovery. In: Khosla R, Howlett RJ, Jain LC (eds) Proceedings of KES 2005. Lecture notes in computer science, vol 3682. Springer, Berlin, pp 943–949
Zhou ZM, Wu CH (2004) Mining frequent maximum patterns with constraint. J Fudan Univ (Nat Sci Ed) 5:746–749 (in Chinese)
He QF, Zhou XZ, Zhou ZM, Cui M, Wu ZH (2004) Efficacy-based clustering analysis of traditional Chinese medicinal herbs. Chin J Inf TCM 11(7):561–562 (in Chinese)
Li C, Tang CJ, Peng J, Hu JJ, Zeng LM, Yin XX, Jiang YG, Liu J (2004) TCMiner: a high performance data mining system for multi-dimensional data analysis of traditional Chinese medicine prescriptions. In: Wang S, Yang DQ, Tanaka K, Grandi F, Zhou SG, Mangina EE et al. (eds) Proceedings of ER workshops 2004. Lecture notes in computer science, vol 3289. Springer, Berlin, pp 246–257
Yao MC, Zhang YL, Yuan YM, Ai L, Qiao YJ (2004) Study on the prediction of the effect attribution of the deficiency-nourishing drugs based on the quantification of TCM drug properties. J Beijing Univ Tradit Chin Med 27(4):7–9 (in Chinese)
Zhou L, Tang XY, Fu C, Peng SH (2004) Fuzzy clustering analysis of Chinese herbs for relieving exterior syndrome. West China J Pharm Sci 19(5):339–341 (in Chinese)
Wu ZH, Zhou XZ, Liu BY, Chen JL (2004) Text mining for finding functional community of related genes using TCM knowledge. In: Boulicaut JF, Esposito F, Giannotti F, Pedreschi D (eds) Proceedings of the 8th European conference on principles and practice of knowledge discovery in databases. Springer, Berlin, pp 459–470
Qiao SJ, Tang CJ, Peng J, Yin XX, Han N (2007) Mining the compatibility law of multidimensional medicines based on dependence mode sets. J Sichuan Univ (Eng Sci Ed) 39(4):134–138 (in Chinese)
Qiao SJ, Tang CJ, Peng J, Yu ZH, Jiang YG, Han N (2006) A novel prescription function reduction algorithm based on neural network. In: Chen GR, Liu XZ (eds) Proceedings of the ICSCA 2006, DCDIS series B: application and algorithm. Watam, Canada, pp 939–944
Peng J, Tang CJ, Zeng T, Qiao SJ, Yong XJ (2006) A Chinese traditional medicine prescription effect reduction algorithm based on artificial neural network and property distance matrix. J Sichuan Univ (Eng Sci Ed ) 38(1):92–97 (in Chinese)
Qiao SJ, Tang CJ, Peng J, Hu JJ, Zhang H (2006) BPGEP: robot path planning based on backtracking parallel-chromosome GEP. In: Chen GR, Liu XZ (eds) Proceedings of the ICSCA 2006, DCDIS series B: application and algorithm. Watam, Canada, pp 439–444
Ferreira C (2002) Gene expression programming: mathematical modeling by an artificial intelligence. Angra do Heroismo, Portugal
Yu X, Tang CJ, Zhang H, Qiao SJ, Jiang YG, Liu J, Han PY (2005) Mining formula-syndrome relationship in traditional Chinese medicine with gene expression programming. Comput Appl 25(11):2679–2880 (in Chinese)
Hu JJ (2006) The research of key-techniques in knowledge discovery system for TCM pharmacology. Thesis, Chengdu, Sichuan University (in Chinese)
Peng HR (2002) Chinese medical formula dictionary. People’s Medical Publishing House, Beijing (in Chinese)
Li C, Fan M (2004) Generating association rules based on threaded frequent pattern tree. Comput Eng Appl 4:188–192 (in Chinese)
Agrawal R, Imielinski T, Swami AN (1993) Mining Association Rules between Sets of Items in Large Databases. In: Buneman P, Jajodia S (eds) Proceedings of the ACM SIGMOD conference on management of data 1993. ACM Press, New York, pp 207–216
Yin XX (2005) A mining model for medicine paring correlation of traditional Chinese medicine prescriptions. Thesis, Chengdu, Sichuan University (in Chinese)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Qiao, S., Tang, C., Jin, H. et al. KISTCM: knowledge discovery system for traditional Chinese medicine. Appl Intell 32, 346–363 (2010). https://doi.org/10.1007/s10489-008-0149-4
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-008-0149-4