Abstract
In diagnosing diseases in clinical practice, a combination of three clinical findings is often used to represent each disease. This is largely because it is often difficult or impractical to assess for all possible combinations of symptoms and abnormal exam findings that occur in any particular disease. For most diseases, diagnostic triads are based on empirical observations. In this study, we determined diagnostic triads for chronic diseases using data mining procedures. We also verified the combinations’ validity as well as our procedure for determining them. We used symptoms and examination findings from 477 patients with chronic diseases, collected as part of a 35-year longitudinal study begun in 1968. For each patient there were 295 items from examinations in internal medicine, dermatology, ophthalmology, dentistry and blood tests. We judged each item to be either normal or abnormal, and restricted the analysis to the abnormal findings. To analyze such an exhaustive assortment, we used the data mining technique of association analysis. The analysis generated three clinical findings for each disease. Diseases were defined based on blood tests. Searching through all 295 items to find the three most useful clinical findings would be impractical on a commodity PC. However, by excluding normal items, we were able to sufficiently reduce the total number of combinations so as to make combinatorial analysis on a PC feasible. In addition to more accurate diagnoses, we believe our technique can identify those diagnostic data that are more cost effective in terms of time and other resources required for their collection.
Similar content being viewed by others
References
Agrawal R, Imielinski T, Swami AN (1993) Mining association rules between sets of items in large databases. In: ACM SIGMOD international conference on management of data, pp 207–216
Agrawal R, Srikant R (1994) Fast algorithms for mining association rules in large databases. In: Proceedings of the 20th international conference on very large databases, pp 487–499
Doddi S, Marathe A, Ravi SS, Torney DC (2001) Discovery of association rules in medical data. Med Inform Int Med 26(1):25–33
Furue M, Uenotsuchi T, Urabe K, Ishikawa T, Kuwabara (2005) M overview of Yusho. J Dermatol Sci Suppl 1:S3–S10
Kanagawa Y, Imamuara T (2005) Relationship of clinical symptoms and laboratory findings with the blood levels of PCDFs in patients with Yusho. J Dermatol Sci Suppl 1:S85–S93
Karlsson J, Eklund P (2000) Data mining and structuring of executable data analysis reports: guideline development and implementation in a narrow sense. Stud Health Technol Inform 77:790–794
Ordonez C, Omiecinski E, de Braal L (2001) Mining constrained association rules to predict heart disease. In: Proceedings of the 2001 IEEE international conference on data mining, pp 433–440
Prather JC, Lobach DF, Goodwin LK, Hales JW, Hage ML, Hammond WE (1997) Medical data mining: knowledge discovery in a clinical data warehouse. Proc AMIA Annu Fall Symp, pp 101–105
Roddick JF, Fule P, Graco WJ (2003) Exploratory medical knowledge discovery: experiences and issues. SIGKDD Explorations 5(1):94–99
Stilou S, Bamidis PD, Maglaveras N, Pappas C (2001) Mining association rules from clinical databases: an intelligent diagnostic process in healthcare. Medinfo 10(1pt.2):1399–1403
Tan PN, Kumar V, Srivastava J (2002) Selecting the right interestingness measure for association patterns. In: Proceedings of the ACM SIGKDD international conference knowledge discovery in databases, pp 32–41
Vinterbo S, Ohno-Machado L (2000) A genetic algorithm approach to multi-order diagnosis. Art Intell Med 18:117–132
Acknowledgments
We would like to offer our most sincere thanks to the Health Sciences Division of the Ministry of Health, Labor and Welfare for its support in providing a research grant that allowed us to carry out this study in 2004.
Author information
Authors and Affiliations
Corresponding authors
Rights and permissions
About this article
Cite this article
Imamura, T., Matsumoto, S., Kanagawa, Y. et al. A technique for identifying three diagnostic findings using association analysis. Med Bio Eng Comput 45, 51–59 (2007). https://doi.org/10.1007/s11517-006-0121-6
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11517-006-0121-6