Design of Query-Driven System for Time-Utility Based Data Mining on Medical Data

Arumugam, G.; Vijayakumar, V. K.

doi:10.1007/978-3-319-21009-4_49

G. Arumugam⁹ &
V. K. Vijayakumar¹⁰

Part of the book series: Lecture Notes in Business Information Processing ((LNBIP,volume 224))

Included in the following conference series:

International Conference on Knowledge Management in Organizations

2635 Accesses
1 Citations

Abstract

Association rule mining(ARM) techniques search for groups of frequently co-occurring items (i.e., frequent itemset) in a market-basket transaction database and convert these groups into business-oriented rules. The problem of ARM will gain momentum when it is attached with the time of transaction. High utility itemset mining is a research area of utility based data mining, aimed at finding itemsets that contribute most to the total utility. The association of time and utility on frequent itemsets gives a novel approach to efficiently capture the transactions for getting better predictions and planning for an enterprise. Previous research has focused mainly on how to obtain exhaustive lists of association rules. However, users often prefer a quick response to targeted queries. To accelerate the processing of such queries, a query-driven system called TD-FVAUFM (Time-Dependent Fast Value Added Utility Frequent Mining) is proposed in this paper. It performs data preprocessing steps on the given database and the resultant database is converted in the form of an itemset tree, a compact data structure suitable for query response. The proposed system is applied on a medical database containing patient’s records. It generates association rules that predict possible diseases with risk factor and frequency with respect to time. Experiments indicate that the targeted queries are answered in a time that is roughly linear in the number of transactions.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Agrawal, R., Imielinski, T., Swami, A.: Mining association rules between sets of items in large databases. In: Proceedings of the ACM SIGMOD International Conference on Management of Data, pp. 207–216. Washington, May 1993
Google Scholar
Arumugam, G., Vijayakumar, V.K.: Discovery of time-dependent association rules for targeted queries on medical data. In: Proceedings of the International Conference on Recent trends in Information Systems (IRIS 2006) National Engineering College, Kovilpatti, Tamil Nadu, India, pp. 443–453, 6–8 January 2006
Google Scholar
Borgelt, C.: Keeping things simple: finding frequent itemsets by recursive elimination. In: Workshop Open Source Data Mining Software, pp. 66–70. ACM Press, NewYork (2005)
Google Scholar
Han, J., Pei, J., Yin, Y.: Mining frequent patterns without candidate generation. In: Proceedings of the International Conference on Management of Data, pp. 1–12, 2000
Google Scholar
Han, J., Kamber, M.: Data Mining – Concepts and Techniques. Morgan Kaufmann Publishers, San Francisco (2001)
Google Scholar
Kubat, M., Hafez, A., Raghavan, V.V., Lekkala, J.R., Kian Chen, W.: Itemset Trees for Targeted Association Querying. IEEE Trans. Knowl. Data Eng. 15(6), 1522–1534 (2003)
Article Google Scholar
Adnan, M.H.M., Husain, W., Rashid, N.A.: Data mining for medical systems: a review. In: Proceeding of the International Conference on Advances in Computer and Information Technology – ACIT, pp. 17–22 (2012)
Google Scholar
Podpe.can V.: Utility-based Data Mining. BSc Paper (in Slovene), University of Ljubljana (2007)
Google Scholar
Podpecan, V., Lavrac, N., Kononenko, I.: A Fast Algorithm for Mining Utility-Frequent Itemsets (2007)
Google Scholar
Ruijuan, Hu: Medical data mining based on association rules. J. Comput. Inform. Sci. 3(4), 104–108 (2010)
Google Scholar
Doddi, S., Marathe, A., Ravi, S.S.: Discovery of association rules in medical data. Med Inform Internet Med. 26(1), 25–33 (2001)
Article Google Scholar
ACM SIGKDD Workshop on utility-based data mining, (2005). http://storm.cis.fordham.edu/˜gweiss/ubdm-kdd05.html
ACM SIGKDD Workshop on utility-based data mining (2006). http://www.ic.uff.br/bianca/ubdm-kdd06.html
Weiss, G., Zadrozny, B., Saar-Tsechansky, M.: Utility-baseddatamining 2006 workshop report. SIGKDD Explorations 8(2), 98–101 (2006)
Article Google Scholar
Yao, H., Hamilton, H.J., Butz C.J.: A foundational approach to mining itemset utilities from databases. In: The Fourth SIAM International Conference od Data Mining SDM, pp. 428–486 (2004)
Google Scholar
Yao, H., Hamilton, H.J., Geng, L.: A unified framework for utility based measures for mining itemsets. In: Second International Workshop on Utility-Based Data Mining, Philadelphia (2006)
Google Scholar

Download references

Acknowledgments

The encouragement and financial support provided by the Sourashtra College authorities during the preparation of this research work is highly acknowledged.

Author information

Authors and Affiliations

Department of Computer Science, Madurai Kamaraj University, Madurai, Tamil Nadu, India
G. Arumugam
Department of Computer Science, Sourashtra College, Madurai, Tamil Nadu, India
V. K. Vijayakumar

Authors

G. Arumugam
View author publications
You can also search for this author in PubMed Google Scholar
V. K. Vijayakumar
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to V. K. Vijayakumar .

Editor information

Editors and Affiliations

Staffordshire University, Stoke-on-Trent, United Kingdom
Lorna Uden
University of Maribor, Maribor, Slovenia
Marjan Heričko
National University of Kaohsiung, Kaohsiung City, Taiwan
I-Hsien Ting

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Arumugam, G., Vijayakumar, V.K. (2015). Design of Query-Driven System for Time-Utility Based Data Mining on Medical Data. In: Uden, L., Heričko, M., Ting, IH. (eds) Knowledge Management in Organizations. KMO 2015. Lecture Notes in Business Information Processing, vol 224. Springer, Cham. https://doi.org/10.1007/978-3-319-21009-4_49

Download citation

DOI: https://doi.org/10.1007/978-3-319-21009-4_49
Published: 04 August 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-21008-7
Online ISBN: 978-3-319-21009-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics