Abstract
Traditional Chinese Medicine (TCM) is the main route of disease control for ancient Chinese. Through thousands of years’ development and inheriting, TCM is the most influential traditional medical system which lasts the longest time and used by the largest population. However, the theory of TCM lacks objective and quantitative standards. In this paper, we propose a statistical diagnosis approach to find out the pathogenesises based on the latent semantic analysis of symptoms and the corresponding herbs. We assume that the latent pathogenesis is the inherent connection between symptoms and herbs within a medical case. Previous topic models mostly focus on single content documents, but medical cases have two different contents: symptoms and herbs. We therefore develop a novel muti-content model based on LDA. We used the proposed model to analysis two TCM domains amenorrhea and lung cancer. Experiment results illustrate that the pathogenesises found by our model correspond well with the theory of TCM and it provides a theoretical data-driven approach to establish diagnosis standards.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Brown, P.F., et al.: An estimate of an upper bound for the entropy of English. Comput. Linguist. 18(1), 31–40 (1992)
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)
Murphy, K.P.: Machine Learning: A Probabilistic Perspective. MIT Press, Cambridge (2012)
Zhou, X., Liu, B., Wu, Z.: Text mining for clinical chinese herbal medical knowledge discovery. In: Hoffmann, A., Motoda, H., Scheffer, T. (eds.) DS 2005. LNCS (LNAI), vol. 3735, pp. 396–398. Springer, Heidelberg (2005)
Church, K.W., Hanks, P.: Word association norms, mutual information, and lexicography. Comput. Linguist. 16(1), 22–29 (1990)
Li, S.: Compendium of Materia Medica: (Bencao Gangmu). Foreign Languages Press, Beijing (2003)
Zhang, N.L., et al.: Latent tree models and diagnosis in traditional Chinese medicine. Artif. Intell. Med. 42(3), 229–245 (2008)
Ying, J., et al.: Collection and analysis of characteristics of tongue manifestations in patients with cerebrovascular diseases. J. Beijing Univ. Tradit. Chin. Med. 28(4), 62 (2005)
Lei, L., et al.: Study on application of probability latent semantic analysis (PLSA) in herbal prescription development. World Sci. Technol. (Modernization Tradit. Chin. Med. Mater. Med.) 5, 1976–1980 (2012)
Heinrich, G.: Parameter estimation for text analysis. Technical report (2005)
Hofmann, T.: Probabilistic latent semantic indexing. In: Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM (1999)
Zhou, X., Peng, Y., Liu, B.: Text mining for traditional Chinese medical knowledge discovery: a survey. J. Biomed. Inform. 43(4), 650–660 (2010)
Acknowledgements
This work was supported by NSFC grants (No. 61532021, 61472141 and 61021004), Shanghai Knowledge Service Platform Project (No. ZF1213)and Shanghai Leading Academic Discipline Project(Project NumberB412).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Ji, W., Zhang, Y., Wang, X., Zhou, Y. (2016). Latent Semantic Diagnosis in Traditional Chinese Medicine. In: Li, F., Shim, K., Zheng, K., Liu, G. (eds) Web Technologies and Applications. APWeb 2016. Lecture Notes in Computer Science(), vol 9931. Springer, Cham. https://doi.org/10.1007/978-3-319-45814-4_32
Download citation
DOI: https://doi.org/10.1007/978-3-319-45814-4_32
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-45813-7
Online ISBN: 978-3-319-45814-4
eBook Packages: Computer ScienceComputer Science (R0)