Skip to main content

Parallel Method for Mining High Utility Itemsets from Vertically Partitioned Distributed Databases

  • Conference paper
Knowledge-Based and Intelligent Information and Engineering Systems (KES 2009)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5711))

Abstract

Mining high utility itemsets (HUIs) has been developing in recent years. However, the methods of mining from distributed databases have not mentioned yet. In this paper, we present a parallel method for mining HUIs in vertically partitioned distributed databases. We use WIT-tree structure to store local database on each site for parallel mining HUIs. The item ith in each SlaverSite is only sent to MasterSite if its Transaction-Weighted Utilization (TWU) satisfies minutility (minutil), and MasterSite only mines HUIs which exist at least on 2 sites. Besides, the parallel performance is also interesting because it reduces the waiting time of attended sites. Thus, the mining time is reduced more significant than that in mining from centralized database.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Agrawal, R., Srikant, R.: Fast algorithms for mining association rules. In: VLDB 1994, pp. 487–499 (1994)

    Google Scholar 

  2. Agrawal, R., Shafer, J.: Parallel Mining of Association Rules. IEEE Trans. Knowledge and Data Eng. 8(6), 962–969 (1996)

    Article  Google Scholar 

  3. Erwin, A., Gopalan, R.P., Achuthan, N.R.: CTU-Mine: An Efficient High Utility Itemset Mining Algorithm Using the Pattern Growth Approach. In: IEEE 7th International Conferences on Computer and Information Technology, Aizu Wakamatsu, Japan, pp. 71–76 (2007)

    Google Scholar 

  4. Erwin, A., Gopalan, R.P., Achuthan, N.R.: A Bottom-Up Projection Based Algorithm for Mining High Utility Itemsets. In: The 2nd International Workshop on Integrating Artificial Intelligence and Data Mining, Gold Coast, Australia, vol. 84, pp. 3–11 (2007)

    Google Scholar 

  5. Khan, M.S., Muyeba, M., Coenen, F.: A Weighted Utility Framework for Mining Association Rule. In: Proc. IEEE European Modelling Symposium (EMS 2008), pp. 87–92 (2008)

    Google Scholar 

  6. Le, B., Nguyen, H., Cao, T.A., Vo, B.: A Novel Algorithm for Mining High Utility Itemsets. In: IEEE 1st Asian Conference on Intelligent Information and Database Systems, Quang Binh, Vietnam, April 1-3, pp. 13–17 (2009)

    Google Scholar 

  7. Liu, Y., Liao, W., Choudhary, A.: A Fast High Utility Itemsets Mining Algorithm. In: UBDM 2005, Chicago, Illinois, USA, August 21, pp. 90–99 (2005)

    Google Scholar 

  8. Luo, P., Xuong, H., Lu, K., Shi, Z.: Distributed Classification in Peer-to-Peer Networks. In: KDD 2007, San Jose, California, USA, August 12–15 (2007)

    Google Scholar 

  9. Miller, D.J., Zhang, Y., Kesidis, G.: Decision Aggregation in Distributed Classification by a Transductive Extension of Maximum Entropy/Improved Iterative Scaling. EURASIP Journal on Advances in Signal Processing 2008 (2008)

    Google Scholar 

  10. Schuster, A., Wolff, R.: Communication-efficient distributed mining of association rules. In: Proc. of the 2001 ACM SIGMOD Int’l. Conference on Management of Data, Santa Barbara, California, pp. 473–484 (2001)

    Google Scholar 

  11. Serazi, M., Perera, A., Abidin, T., Hamer, G., Perrizo, W.: An API for Transparent Distributed Vertical Data Mining. In: Proceedings of the ISCA 14th International Conference on Intelligent and Adaptive Systems and Software Engineering, Novotel Toronto Centre, Toronto, Canada, July 20-22, pp. 151–156 (2005)

    Google Scholar 

  12. Sucahyo, Y.G., Gopalan, R.P.: CT-PRO: A Bottom-Up Non Recursive Frequent Itemset Mining Algorithm Using Compressed FP-Tree Data Structure. In: IEEE ICDM Workshop on Frequent Itemset Mining Implementation (FIMI), Brighton UK (2004)

    Google Scholar 

  13. Wolff, R., Schuster, A.: Association Rule Mining in Peer-to-Peer Systems. IEEE Trans. Systems, Man and Cybernetics, Part B 34(6), 2426–2438 (2004)

    Article  Google Scholar 

  14. Yao, H., Hamilton, H.J., Butz, C.J.: A Foundational Approach to Mining Itemset Utilities from Databases. In: Proceedings of 2004 SIAM International Conference on Data Mining, pp. 482–486 (2004)

    Google Scholar 

  15. Yao, H., Hamilton, H.J.: Mining Itemset Utilities from Transaction Databases. Data and Knowledge Engineering 59, 603–626 (2005)

    Article  Google Scholar 

  16. Yao, H., Hamilton, H.J., Geng, L.: A Unified Framework for Utility Based Measures for Mining Itemsets. In: UBDM 2006, Philadelphia, Pennsylvania, USA, pp. 28–37 (2006)

    Google Scholar 

  17. Yu, G., Shao, S., Sun, D., Luo, B.: Mining Long High Utility Itemsets in Transaction Databases. WSEAS Transactions on Information Science & Applications 5(2), 326–331 (2008)

    Google Scholar 

  18. Zaki, M.J., Hsiao, C.J.: Efficient Algorithms for Mining Closed Itemsets and Their Lattice Structure. IEEE Transactions on Knowledge and Data Engineering 17(4), 462–478 (2005)

    Article  Google Scholar 

  19. Zaki, M.J.: Parallel and Distributed Association Mining: A Survey. IEEE Concurrency, Special Issue on Parallel Mechanisms for Data Mining, 14–25 (December 1999)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Vo, B., Nguyen, H., Ho, T.B., Le, B. (2009). Parallel Method for Mining High Utility Itemsets from Vertically Partitioned Distributed Databases. In: Velásquez, J.D., Ríos, S.A., Howlett, R.J., Jain, L.C. (eds) Knowledge-Based and Intelligent Information and Engineering Systems. KES 2009. Lecture Notes in Computer Science(), vol 5711. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04595-0_31

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-04595-0_31

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-04594-3

  • Online ISBN: 978-3-642-04595-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics