Skip to main content
Log in

Study of select items in different data sources by grouping

  • Regular Paper
  • Published:
Knowledge and Information Systems Aims and scope Submit manuscript

Abstract

Many large organizations have multiple large databases as they transact from multiple branches. Many important decisions are based on a set of specific items called the select items. Thus, the analysis of select items in multiple databases is an important issue. For the purpose of studying select items in multiple databases, one might need true global patterns of select items. Thus, we propose a model of mining global patterns of select items from multiple databases. A measure of overall association between two items in a database is proposed. We have extended the proposed measure for a database whose transactions contain items along with the quantities purchased. We have designed an algorithm based on proposed measure for the purpose of grouping the frequent items in multiple databases. In addition, we have studied properties of different measures proposed in this paper. Experimental results are presented for both real and synthetic databases.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Adhikari A, Rao PR, Adhikari J (2007) Mining multiple large databases, In: Proceedings of 10th international conference on information technology, pp 80–84

  2. Adhikari A, Rao PR (2008) Efficient clustering of databases induced by local patterns. Decis Support Syst 44(4): 925–943

    Article  Google Scholar 

  3. Adhikari A, Rao PR (2008) Association rules induced by item and quantity purchased. In: Proceedings of international conference on database systems for advance applications, LNCS 4947, pp 478–485

  4. Aggarwal C, Yu P (1998) A new framework for itemset generation. In: Proceedings of the 17th symposium on principles of database systems, pp 18–24

  5. Agrawal R, Imielinski T, Swami A (1993) Mining association rules between sets of items in large databases. In: Proceedings of ACM SIGMOD conference on management of data, pp 207–216

  6. Agrawal R, Shafer J (1999) Parallel mining of association rules. IEEE Trans Knowl Data Eng 8(6): 962–969

    Article  Google Scholar 

  7. Anagnostopoulos A, Broder A, Punera K (2008) Effective and efficient classification on a search-engine model. Knowl Inf Syst 16(2): 129–154

    Article  Google Scholar 

  8. Barte RG (1976) The elements of real analysis, 2nd edn. Wiley, New York

    Google Scholar 

  9. Denton AM, Besemann CA, Dorr DH (2009) Pattern-based time-series subsequence clustering using radial distribution functions. Knowl Inf Syst 18(1): 129–154

    Article  Google Scholar 

  10. Estivill-Castro V, Yang J (2004) Fast and robust general purpose clustering algorithms. Data Min Knowl Discov 8(2): 127–150

    Article  MathSciNet  Google Scholar 

  11. Frequent itemset mining dataset repository. http://fimi.cs.helsinki.fi/data

  12. Kandylas V, Upham , Ungar LH (2008) Finding cohesive clusters for analyzing knowledge communities. Knowl Inf Syst 17(3): 335–354

    Article  Google Scholar 

  13. Klemettinen M, Mannila H, Ronkainen P, Toivonen T, Verkamo A (1994) Finding interesting rules from large sets of discovered association rules. In: Proceedings of the 3rd international conference on information and knowledge management, pp 401–407

  14. Li T (2008) Clustering based on matrix approximation: a unifying view. Knowl Inf Syst 17(1): 1–15

    Article  MATH  Google Scholar 

  15. Liu B, Hsu W, Ma Y (1999) Pruning and summarizing the discovered associations. In: Proceedings of the 5th international conference on knowledge discovery and data mining, pp 125–134

  16. Pyle D (1999) Data preparation for data mining. Morgan Kufmann, San Francisco

    Google Scholar 

  17. Ramkumar T, Srinivasan R (2008) Modified algorithms for synthesizing high-frequency rules from different data sources. Knowl Inf Syst 17(3): 313–334

    Article  Google Scholar 

  18. Silberschatz A, Tuzhilin A (1996) What makes patterns interesting in knowledge discovery systems. IEEE Trans Knowl Data Eng 8(6): 970–974

    Article  Google Scholar 

  19. Silverstein C, Brin S, Motwani R (1998) Beyond market baskets: generalizing association rules to dependence rules. Data Min Knowl Discov 2(1): 39–68

    Article  Google Scholar 

  20. Tan P-N, Kumar V, Srivastava J (2002) Selecting the right interestingness measure for association patterns. In: Proceedings of SIGKDD conference, pp 32–41

  21. Wu X, Zhang S (2003) Synthesizing high-frequency rules from different data sources. IEEE Trans Knowl Data Eng 14(2): 353–367

    Google Scholar 

  22. Wu X, Zhang C, Zhang S (2005) Database classification for multi-database mining. Inf Syst 30(1): 71–88

    Article  Google Scholar 

  23. Zhang C, Liu M, Nie W, Zhang S (2004) Identifying global exceptional patterns in multi-database mining. IEEE Comput Intell Bull 3(1): 19–24

    Google Scholar 

  24. Zhang S, Wu X, Zhang C (2003) Multi-database mining. IEEE Comput Intell Bull 2(1): 5–13

    Google Scholar 

  25. Zhang S, Zhang C, Wu X (2004) Knowledge discovery in multiple databases. Springer, London

  26. Zhang T, Ramakrishnan R, Livny M (1997) BIRCH: a new data clustering algorithm and its applications. Data Min Knowl Discov 1(2): 141–182

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Animesh Adhikari.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Adhikari, A., Ramachandrarao, P. & Pedrycz, W. Study of select items in different data sources by grouping. Knowl Inf Syst 27, 23–43 (2011). https://doi.org/10.1007/s10115-010-0290-3

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10115-010-0290-3

Keywords

Navigation