OMVD: An Optimization of MVD

He, Zhi; Tian, Shengfeng; Huang, Houkuan

doi:10.1007/11811305_52

OMVD: An Optimization of MVD

Zhi He²²,
Shengfeng Tian²² &
Houkuan Huang²²

Conference paper

2808 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4093))

Abstract

Most discretization algorithms are univariate and consider only one attribute at a time. Stephen D. Bay presented a multivariate discretization(MVD) method that considers the affects of all the attributes in the procedure of data mining. But as the author mentioned, any test of differences has a limited amount of power. We present OMVD by improving MVD on the power of testing differences with a genetic algorithm. OMVD is more powerful than MVD because the former does not suffer from setting the difference threshold and from seriously depending on the basic intervals. In addition, the former simultaneously searches partitions for multiple attributes. Our experiments with some synthetic and real datasets suggest that OMVD could obtain more interesting discretizations than could MVD.

This work is funded by China National Natural Science Foundation grants 60442002 and 60443003.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Agarwal, R., Imielinski, T., Swami, A.: Mining association rules between sets of items in large databases. In: Proc. ACM SIGMOD International Conference on Management of Data, Washington, DC, pp. 207–216 (1993)
Google Scholar
Quinlan, J.: C4.5: Programs for Machine Learning. Morgan Kaufmann, San Francisco (1993)
Google Scholar
Bay, S.D.: Multivariate discretization for set mining. Knowledge and Information Systems 3, 491–512 (2001)
Article MATH Google Scholar
Bay, S.D., Pazzani, M.J.: Detecting group differences: Mining contrast sets. Data Mining and Knowledge Discovery 5, 213–246 (2001)
Article MATH Google Scholar
Kwedlo, W., Kretowski, M.: An evolutionary algorithm using multivariate discretization for decision rule induction. In: Principles of Data Mining and Knowledge Discovery, pp. 392–397 (1999)
Google Scholar
Srikant, R., Agrawal, R.: Mining quantitative association rules in large relational tables. In: Jagadish, H.V., Mumick, I.S. (eds.) Proceedings of the 1996 ACM SIGMOD International Conference on Management of Data, Montreal, Quebec, Canada, pp. 1–12 (1996)
Google Scholar
Miller, R.J., Yang, Y.: Association rules over interval data. In: Proceedings ACM SIGMOD International Conference on Management of Data, pp. 452–461 (1997)
Google Scholar
Monti, S., Cooper, G.F.: A latent variable model for multivariate discretization. In: The 7th Int. Workshop Artificial Intelligence and Statistics, Fort Lauderdale (1999)
Google Scholar
Ludl, M.C., Widmer, G.: Relative unsupervised discretization for association rule mining. In: Proceedings of the 4th European Conference on Principles and Practice of Knowledge Discovery in Databases, Springer, Berlin (2000)
Google Scholar
Mehta, S., Parthasarathy, S., Yang, H.: Toward unsupervised correlation preserving discretization. IEEE Transactions on Knowledge and Data Engineering 17, 1174–1185 (2005)
Article Google Scholar
Eiben, A., Smith, J.: Introduction to Evolutionary Computing. Springer, Heidelberg (2003)
Book MATH Google Scholar
Ruggles, S., Sobek, M., Alexander, T., et. al.: Integrated public use microdata series: Version 2.0 minneapolis: Historical census projects (1997)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Computer and Information Technology, Beijing Jiaotong University, Beijing, 100044, China
Zhi He, Shengfeng Tian & Houkuan Huang

Authors

Zhi He
View author publications
You can also search for this author in PubMed Google Scholar
Shengfeng Tian
View author publications
You can also search for this author in PubMed Google Scholar
Houkuan Huang
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Information Technology and Electronic Engineering, The University of Queensland, Queensland, Australia
Xue Li
University of Alberta, Canada
Osmar R. Zaïane
Northwest Polytechnical University, China
Zhanhuai Li

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

He, Z., Tian, S., Huang, H. (2006). OMVD: An Optimization of MVD. In: Li, X., Zaïane, O.R., Li, Z. (eds) Advanced Data Mining and Applications. ADMA 2006. Lecture Notes in Computer Science(), vol 4093. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11811305_52

Download citation

DOI: https://doi.org/10.1007/11811305_52
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-37025-3
Online ISBN: 978-3-540-37026-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics