Modified algorithms for synthesizing high-frequency rules from different data sources

Ramkumar, Thirunavukkarasu; Srinivasan, Rengaramanujam

doi:10.1007/s10115-008-0126-6

Regular Paper
Published: 11 March 2008

Knowledge and Information Systems, volume 17, pages 313–334 (2008)

Thirunavukkarasu Ramkumar¹ &
Rengaramanujam Srinivasan²

Abstract

Because of the rapid growth in information and communication technologies, a company’s data may be spread over several continents. For effective decision making, knowledge workers need access to data that may be geographically distributed across different locations. In such circumstances, multi-database mining plays a major role in extracting knowledge from different data sources. In this paper, we propose a new methodology for synthesizing high-frequency rules from different data sources, in which the weight of each data source is calculated on the basis of its transaction population. We also propose a new method for calculating global confidence. Our goal in synthesizing local patterns to obtain global patterns is that the support and confidence of the synthesized patterns should be very nearly the same as those that would be obtained if all the databases were integrated and mono-mining were performed. The experiments conducted establish that the proposed method of synthesizing high-frequency rules meets this stipulation fairly well.
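The abstract does not state the formulas, so the following is only a minimal sketch of the idea under stated assumptions, not the authors' algorithm. It assumes that each source's weight is its share of the total transaction count, that the synthesized support of an itemset is the weighted sum of its local supports, and that global confidence is the ratio of the synthesized support of the whole rule to the synthesized support of its antecedent. The function names and the sample figures are hypothetical.

```python
from typing import List


def source_weights(transaction_counts: List[int]) -> List[float]:
    """Assumed weighting: each source's share of all transactions."""
    total = sum(transaction_counts)
    if total <= 0:
        raise ValueError("at least one source must contain transactions")
    return [n / total for n in transaction_counts]


def synthesized_support(local_supports: List[float], weights: List[float]) -> float:
    """Assumed synthesis: weighted sum of the local supports of one itemset."""
    if len(local_supports) != len(weights):
        raise ValueError("one local support is needed per source")
    return sum(w * s for w, s in zip(weights, local_supports))


def global_confidence(rule_support: float, antecedent_support: float) -> float:
    """Assumed definition: supp(A and B) / supp(A), both already synthesized."""
    if antecedent_support <= 0:
        raise ValueError("antecedent support must be positive")
    return rule_support / antecedent_support


# Hypothetical example: two sources with 1000 and 3000 transactions.
counts = [1000, 3000]
weights = source_weights(counts)

# Hypothetical local supports reported by each source for the rule A -> B.
supp_ab_local = [0.20, 0.40]  # support of the itemset A and B together
supp_a_local = [0.50, 0.50]   # support of the antecedent A

supp_ab = synthesized_support(supp_ab_local, weights)
supp_a = synthesized_support(supp_a_local, weights)
conf_ab = global_confidence(supp_ab, supp_a)

# Mono-mining check: pool the raw counts as if the databases were integrated.
pooled_ab = sum(s * n for s, n in zip(supp_ab_local, counts)) / sum(counts)
```

In this toy case the population-weighted support coincides with the support computed on the pooled data, which is the property the abstract sets as its goal. The sketch assumes every source reports the support of every itemset of interest; how to handle itemsets that fall below a local minimum-support threshold is not covered here.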

Author information

Authors and Affiliations

Department of Computer Applications, A.V.C. College of Engineering, Mayiladuthurai, Tamilnadu, India
Thirunavukkarasu Ramkumar
Department of Computer Science and Engineering, B.S.A. Crescent Engineering College, Chennai, Tamilnadu, India
Rengaramanujam Srinivasan

Corresponding author

Correspondence to Rengaramanujam Srinivasan.

About this article

Cite this article

Ramkumar, T., Srinivasan, R. Modified algorithms for synthesizing high-frequency rules from different data sources. Knowl Inf Syst 17, 313–334 (2008). https://doi.org/10.1007/s10115-008-0126-6

Received: 16 March 2007
Revised: 28 December 2007
Accepted: 06 January 2008
Published: 11 March 2008
Issue Date: December 2008
DOI: https://doi.org/10.1007/s10115-008-0126-6
