Frequent Itemset Mining for a Combination of Certain and Uncertain Databases

Part of the book series: Studies in Fuzziness and Soft Computing ((STUDFUZZ,volume 361))

Abstract

Modern industries and business firms widely use data mining applications in which Frequent Itemset Mining (FIM) plays a major role. In a certain transactional database, the FIM problem can be solved by standard algorithms such as Apriori. In an uncertain transactional database (one in which each item carries an existential probability), it can be solved by exact probabilistic frequent itemset mining algorithms (UApriori, UFP Growth) or approximate ones (Poisson distribution based UApriori, Normal distribution based UApriori). Our algorithm assumes that the database is distributed across different locations around the globe: one location, which we call the main site, holds a certain transactional database, and all other locations, which we call remote sites, hold uncertain transactional databases. To the best of our knowledge, no algorithm has yet been developed that can compute frequent itemsets over a combination of certain and uncertain transactional databases. We introduce a novel approach for finding itemsets that are globally frequent across the combination of all uncertain transactional databases at the remote sites and the certain database at the main site.
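The setting described in the abstract can be illustrated with a minimal sketch. It is not the chapter's algorithm, whose details are not given in this preview. It assumes the expected-support notion used by UApriori-style methods (the expected support of an itemset is the sum, over transactions, of the product of its items' existential probabilities, with items treated as independent) and further assumes that a certain transaction can be treated as one whose items have probability 1, so that main-site counts and remote-site expected supports can simply be added. The function names, the `min_sup_ratio` parameter, and the toy data are hypothetical.

```python
from math import prod

def certain_support(transactions, itemset):
    """Count certain transactions (sets of items) containing every item of itemset."""
    return sum(1 for t in transactions if itemset <= t)

def expected_support(transactions, itemset):
    """Expected support over uncertain transactions (dicts: item -> existential
    probability), assuming items occur independently. Missing items count as 0."""
    return sum(prod(t.get(item, 0.0) for item in itemset) for t in transactions)

def globally_frequent(main_site, remote_sites, itemset, min_sup_ratio):
    """Assumed combination rule: add the main-site count to the remote-site
    expected supports and compare with a threshold over all transactions."""
    total = len(main_site) + sum(len(db) for db in remote_sites)
    support = certain_support(main_site, itemset) + sum(
        expected_support(db, itemset) for db in remote_sites
    )
    return support >= min_sup_ratio * total, support

# Hypothetical toy data: one certain main site and two uncertain remote sites.
main_site = [{"a", "b"}, {"a"}, {"b", "c"}]
remote_sites = [
    [{"a": 0.9, "b": 0.5}, {"a": 0.4}],
    [{"a": 1.0, "b": 0.8}],
]

is_frequent, support = globally_frequent(
    main_site, remote_sites, frozenset({"a", "b"}), min_sup_ratio=0.3
)
print(is_frequent, round(support, 2))
```

In this sketch only per-site totals would need to travel between sites, which is one plausible reason to aggregate supports rather than raw transactions; whether the chapter takes this route is not stated in the abstract.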

Correspondence to Samar Wazir.

Copyright information

© 2018 Springer International Publishing AG, part of Springer Nature

About this chapter

Cite this chapter

Wazir, S., Ahmad, T., Sufyan Beg, M.M. (2018). Frequent Itemset Mining for a Combination of Certain and Uncertain Databases. In: Zadeh, L., Yager, R., Shahbazova, S., Reformat, M., Kreinovich, V. (eds) Recent Developments and the New Direction in Soft-Computing Foundations and Applications. Studies in Fuzziness and Soft Computing, vol 361. Springer, Cham. https://doi.org/10.1007/978-3-319-75408-6_3

  • DOI: https://doi.org/10.1007/978-3-319-75408-6_3

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-75407-9

  • Online ISBN: 978-3-319-75408-6

  • eBook Packages: Engineering, Engineering (R0)
