Mining Subspace Clusters from Distributed Data

Bian, Haiyun; Bhatnagar, Raj

doi:10.1007/978-3-642-01209-9_7

Mining Subspace Clusters from Distributed Data

Haiyun Bian⁵ &
Raj Bhatnagar⁶

Chapter

588 Accesses

Part of the book series: Studies in Computational Intelligence ((SCI,volume 208))

Introduction

Many real world applications have datasets consisting of high dimensional feature spaces. For example, the gene expression data record the expression levels of a set of thousands of genes under hundreds of experimental conditions. Traditional clustering algorithms fail to efficiently find clusters of genes that demonstrate similar expression levels in all conditions due to such a high dimensional feature space. Subspace clustering addresses this problem by looking for patterns in subspaces [1] instead of in the full dimensional space. A lot of work has been done in developing efficient subspace clustering algorithms for datasets of various characteristics [1, 6].

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Agrawal, R., Gehrke, J., Gunopulos, D., Raghavan, P.: Automatic subspace clustering of high dimensional data for data mining applications. In: Proceedings of the 1998 ACM SIGMOD international conference on Management of data (SIGMOD 1998), pp. 94–105. ACM Press, New York (1998)
Chapter Google Scholar
Blake, C., Merz, C.: UCI repository of machine learning databases (1998), http://www.ics.uci.edu/~mlearn/MLRepository.html
Evfimievski, A., Srikanta, R., Agrawal, R., Gehrke, J.: Privacy preserving mining of association rules. In: Proceedings of ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2002) (2002)
Google Scholar
Ganter, B., Wille, R.: Formal Concept Analysis: Mathematical Foundations. Springer, Heidelberg (1999)
MATH Google Scholar
Kriegel, H.P., Kroger, P., Pryakhin, A., Schubert, M.: Effective and efficient distributed model-based clustering. In: Proceedings of Fifth International Conference on Data Mining (ICDM 2005), pp. 258–265 (November 2005)
Google Scholar
Kroger, P., Kriegel, H.P., Kailing, K.: Density-connected subspace clustering for highdimensional data. In: Jonker, W., Petković, M. (eds.) SDM 2004. LNCS, vol. 3178, pp. 246–257. Springer, Heidelberg (2004)
Google Scholar
Pasquier, N., Bastide, Y., Taouil, R., Lakhal, L.: Discovering frequent closed itemsets for association rules. In: Beeri, C., Bruneman, P. (eds.) ICDT 1999. LNCS, vol. 1540, pp. 398–416. Springer, Heidelberg (1999)
Chapter Google Scholar
Peeters, R.: The maximum edge biclique problem is np-complete. Discrete Applied Mathematics 131, 651–654 (2003)
Article MATH MathSciNet Google Scholar
Zaki, M.J., Hsiao, C.J.: Charm: an efficient algorithm for closed itemset mining. In: Proceedings of the Second SIAM International Conference on Data Mining (SDM 2004) (April 2002)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Math & CS, Metropolitan State College of Denver, Denver, CO, 80217,
Haiyun Bian
Department of CS, University of Cincinnati, Cincinnati, OH, 45221,
Raj Bhatnagar

Authors

Haiyun Bian
View author publications
You can also search for this author in PubMed Google Scholar
Raj Bhatnagar
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Software Engineering & Information, Technology Institute, Central Michigan University, MI 48859, Mt. Pleasant, U.S.A.
Roger Lee
Department of Computer Science, Central Michigan University, MI 48859, Mt. Pleasant, U.S.A.
Gongzu Hu
School of Computer Engineering and Science, Shanghai University, Shanghai, China
Huaikou Miao

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Bian, H., Bhatnagar, R. (2009). Mining Subspace Clusters from Distributed Data. In: Lee, R., Hu, G., Miao, H. (eds) Computer and Information Science 2009. Studies in Computational Intelligence, vol 208. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-01209-9_7

Download citation

DOI: https://doi.org/10.1007/978-3-642-01209-9_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-01208-2
Online ISBN: 978-3-642-01209-9
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics