Design of Query-Driven System for Time-Utility Based Data Mining on Medical Data

  • Conference paper
Knowledge Management in Organizations (KMO 2015)

Part of the book series: Lecture Notes in Business Information Processing (LNBIP, volume 224)

Abstract

Association rule mining (ARM) techniques search for groups of frequently co-occurring items (i.e., frequent itemsets) in a market-basket transaction database and convert these groups into business-oriented rules. ARM becomes more informative when each transaction is associated with the time at which it occurred. High utility itemset mining is a research area of utility-based data mining that aims to find the itemsets contributing most to the total utility. Associating time and utility with frequent itemsets offers a novel way to capture transactions efficiently, supporting better prediction and planning for an enterprise. Previous research has focused mainly on how to obtain exhaustive lists of association rules. However, users often prefer a quick response to targeted queries. To accelerate the processing of such queries, this paper proposes a query-driven system called TD-FVAUFM (Time-Dependent Fast Value Added Utility Frequent Mining). It preprocesses the given database and converts the resulting database into an itemset tree, a compact data structure suitable for query response. The proposed system is applied to a medical database containing patient records. It generates association rules that predict possible diseases, together with their risk factor and frequency with respect to time. Experiments indicate that targeted queries are answered in a time that is roughly linear in the number of transactions.
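To make the notion of a targeted query concrete, the following minimal Python sketch computes the support and total utility of one chosen itemset within a time window. It is not the TD-FVAUFM algorithm and does not build an itemset tree; it is a plain linear scan under stated assumptions. The names `Transaction` and `targeted_query`, the integer timestamps, the per-item utility weights, and the sample records are all hypothetical and do not come from the paper.

```python
from dataclasses import dataclass


@dataclass
class Transaction:
    time: int            # hypothetical timestamp, e.g. the year of a patient visit
    items: frozenset     # items recorded in the transaction
    utility: dict        # hypothetical per-item utility, e.g. a risk weight


def targeted_query(transactions, target, start, end):
    """Return (support, total_utility) of `target` for start <= time <= end."""
    target = frozenset(target)
    # Keep only the transactions that fall inside the time window.
    in_window = [t for t in transactions if start <= t.time <= end]
    # A transaction matches when it contains every item of the target itemset.
    matches = [t for t in in_window if target <= t.items]
    support = len(matches) / len(in_window) if in_window else 0.0
    # Utility of the itemset: sum of its items' utilities over matching transactions.
    total_utility = sum(t.utility.get(i, 0) for t in matches for i in target)
    return support, total_utility


# Invented sample data, for illustration only.
records = [
    Transaction(2013, frozenset({"smoking", "hypertension"}),
                {"smoking": 3, "hypertension": 5}),
    Transaction(2014, frozenset({"smoking", "hypertension", "diabetes"}),
                {"smoking": 3, "hypertension": 5, "diabetes": 4}),
    Transaction(2014, frozenset({"diabetes"}), {"diabetes": 4}),
    Transaction(2015, frozenset({"smoking"}), {"smoking": 3}),
]

support, utility = targeted_query(records, {"smoking", "hypertension"}, 2013, 2014)
```

A scan of this kind visits each transaction once, so its cost grows linearly with the number of transactions. A compact structure such as the itemset tree mentioned in the abstract is intended to answer the same kind of query without re-reading the raw database each time.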

Acknowledgments

The encouragement and financial support provided by the Sourashtra College authorities during the preparation of this research work are gratefully acknowledged.

Author information

Corresponding author

Correspondence to V. K. Vijayakumar.

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Arumugam, G., Vijayakumar, V.K. (2015). Design of Query-Driven System for Time-Utility Based Data Mining on Medical Data. In: Uden, L., Heričko, M., Ting, IH. (eds) Knowledge Management in Organizations. KMO 2015. Lecture Notes in Business Information Processing, vol 224. Springer, Cham. https://doi.org/10.1007/978-3-319-21009-4_49

  • DOI: https://doi.org/10.1007/978-3-319-21009-4_49

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-21008-7

  • Online ISBN: 978-3-319-21009-4

  • eBook Packages: Computer Science, Computer Science (R0)
