Skip to main content

Parallel and Distributed Mining of Probabilistic Frequent Itemsets Using Multiple GPUs

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8055))

Abstract

Probabilistic frequent itemset mining, which discovers frequent itemsets from uncertain data, has attracted much attention due to inherent uncertainty in the real world. Many algorithms have been proposed to tackle this problem, but their performance is not satisfactory because handling uncertainty incurs high processing cost. To accelerate such computation, we utilize GPUs (Graphics Processing Units). Our previous work accelerated an existing algorithm with a single GPU. In this paper, we extend the work to employ multiple GPUs. Proposed methods minimize the amount of data that need to be communicated among GPUs, and achieve load balancing as well. Based on the methods, we also present algorithms on a GPU cluster. Experiments show that the single-node methods realize near-linear speedups.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Agrawal, R., Imieliński, T., Swami, A.: Mining Association Rules between Sets of Items in Large Databases. In: SIGMOD, pp. 207–216 (1993)

    Google Scholar 

  2. Agrawal, R., Srikant, R.: Fast Algorithms for Mining Association Rules. In: SIGMOD, pp. 207–216 (1993)

    Google Scholar 

  3. Amossen, R.R., Pagh, R.: A New Data Layout for Set Intersection on GPUs. In: IPDPS, pp. 698–708 (2011)

    Google Scholar 

  4. Bernecker, T., Kriegel, H.-P., Renz, M., Verhein, F., Zuefle, A.: Probabilistic Frequent Itemset Mining in Uncertain Databases. In: KDD, pp. 119–128 (2009)

    Google Scholar 

  5. Fang, W., Lu, M., Xiao, X., He, B., Luo, Q.: Frequent Itemset Mining on Graphics Processors. In: DaMoN, pp. 34–42 (2009)

    Google Scholar 

  6. Kozawa, Y., Amagasa, T., Kitagawa, H.: GPU Acceleration of Probabilistic Frequent Itemset Mining from Uncertain Databases. In: CIKM, pp. 892–901 (2012)

    Google Scholar 

  7. Orlando, S., Palmerini, P., Perego, R., Silvestri, F.: Adaptive and Resource-Aware Mining of Frequent Sets. In: ICDM, pp. 338–345 (2002)

    Google Scholar 

  8. Ozkural, E., Ucar, B., Aykanat, C.: Parallel Frequent Item Set Mining with Selective Item Replication. IEEE TPDS 22(10), 1632–1640 (2011)

    Google Scholar 

  9. Silvestri, C., Orlando, S.: gpuDCI: Exploiting GPUs in Frequent Itemset Mining. In: PDP, pp. 416–425 (2012)

    Google Scholar 

  10. Sun, L., Cheng, R., Cheung, D.W., Cheng, J.: Mining Uncertain Data with Probabilistic Guarantees. In: KDD, pp. 273–282 (2010)

    Google Scholar 

  11. Wang, L., Cheung, D.W., Cheng, R., Lee, S.D., Yang, X.S.: Efficient Mining of Frequent Item Sets on Large Uncertain Databases. IEEE TKDE 24(12), 2170–2183 (2012)

    Google Scholar 

  12. Zaki, M.J.: Parallel and Distributed Association Mining: A Survey. IEEE Concurrency 7(4), 14–25 (1999)

    Article  Google Scholar 

  13. NVIDIA, CUDA C Programming Guide (October 2012)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Kozawa, Y., Amagasa, T., Kitagawa, H. (2013). Parallel and Distributed Mining of Probabilistic Frequent Itemsets Using Multiple GPUs. In: Decker, H., Lhotská, L., Link, S., Basl, J., Tjoa, A.M. (eds) Database and Expert Systems Applications. DEXA 2013. Lecture Notes in Computer Science, vol 8055. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40285-2_14

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-40285-2_14

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-40284-5

  • Online ISBN: 978-3-642-40285-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics