Integrating Quantitative Attributes in Hierarchical Clustering of Transactional Data

Vranić, Mihaela; Pintar, Damir; Skočir, Zoran

doi:10.1007/978-3-642-30947-2_13

Integrating Quantitative Attributes in Hierarchical Clustering of Transactional Data

Mihaela Vranić²³,
Damir Pintar²³ &
Zoran Skočir²³

Conference paper

2050 Accesses
1 Citations

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 7327))

Abstract

Appropriate data mining exploration methods can reveal valuable but hidden information in today’s large quantities of transactional data. While association rules generation is commonly used for transactional data analysis, clustering is rather rarely used for analysis of this type of data. In this paper we provide adaptations of parameters related to association rules generation so they can be used to represent distance. Furthermore, we integrate goal-oriented quantitative attributes in distance measure formulation to increase the quality of gained results and streamline the decision making process. As a proof of concept, newly developed measures are tested and results are discussed both on a referent dataset as well as a large real-life retail dataset.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Han, J., Kamber, M.: Data mining: concepts and techniques. The Morgan Kaufmann series in data management systems. Elsevier (2006)
Google Scholar
Agrawal, R., Imieliński, T., Swami, A.: Mining association rules between sets of items in large databases. In: Proceedings of the 1993 ACM SIGMOD International Conference on Management of Data, SIGMOD 1993, New York, pp. 207–216 (1993)
Google Scholar
Padmanabhan, B.: The interestingness paradox in pattern discovery. Journal of Applied Statistics 31(8), 1019–1035 (2004)
Article MathSciNet MATH Google Scholar
Piatetsky-Shapiro, G.: Discovery, analysis and presentation of strong rules. In: Knowledge Discovery in Databases, pp. 229–248. AAAI Press (1991)
Google Scholar
Tan, P.N., Kumar, V., Srivastava, J.: Selecting the right objective measure for association analysis. Information Systems 29, 293–313 (2004)
Article Google Scholar
Webb, G.I.: Discovering significant rules. In: Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2006, New York, pp. 434–443 (2006)
Google Scholar
Webb, G.I.: Self-sufficient itemsets: An approach to screening potentially interesting associations between items. ACM Transactions on Knowledge Discovery From Data 4, 1–20 (2010)
Article Google Scholar
Pinjušić, S., Vranić, M., Pintar, D.: Improvement of hierarchical clustering results by refinement of variable types and distance measures. Automatika: Journal for Control, Measurement, Electronics, Computing and Communications 52(4), 353–364 (2011)
Google Scholar
Vranić, M.: Designing concise representation of correlations among elements in transactional data. PhD thesis, FER, Zagreb, Croatia (2011)
Google Scholar
Vranić, M., Pintar, D., Gamberger, D.: Adapting hierarchical clustering distance measures for improved presentation of relationships between transaction elements. Journal of Information and Organizational Sciences 36(1) (in press, 2012)
Google Scholar
Srikant, R., Agrawal, R.: Mining quantitative association rules in large relational tables. SIGMOD Rec. 25, 1–12 (1996)
Article Google Scholar
Ruckert, U., Richter, L., Kramer, S.: Quantitative association rules based on half-spaces: An optimization approach. In: Proceedings of the Fourth IEEE International Conference on Data Mining, ICDM 2004, pp. 507–510. IEEE Computer Society, Washington DC (2004)
Chapter Google Scholar
Aumann, Y., Lindel, Y.: A statistical theory for quantitative association rules. Journal of Intelligent Information Systems, 261–270 (1999)
Google Scholar
Demšar, J., Zupan, B., Leban, G., Curk, T.: Orange: From Experimental Machine Learning to Interactive Data Mining. In: Boulicaut, J.-F., Esposito, F., Giannotti, F., Pedreschi, D. (eds.) PKDD 2004. LNCS (LNAI), vol. 3202, pp. 537–539. Springer, Heidelberg (2004)
Chapter Google Scholar
Vranić, M., Pintar, D., Skočir, Z.: Generation and analysis of tree structures based on association rules and hierarchical clustering. In: Proceedings of the 2010 Fifth International Multi-conference on Computing in the Global Information Technology, ICCGI 2010, pp. 48–53. IEEE Computer Society, Washington DC (2010)
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Faculty of Electrical Engineering and Computing, University of Zagreb, Zagreb, Croatia
Mihaela Vranić, Damir Pintar & Zoran Skočir

Authors

Mihaela Vranić
View author publications
You can also search for this author in PubMed Google Scholar
Damir Pintar
View author publications
You can also search for this author in PubMed Google Scholar
Zoran Skočir
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Faculty of Electrical Engineering and Computing, University of Zagreb, Unska 3, 10000, Zagreb, Croatia
Gordan Jezic & Mario Kusek &
Institute of Informatics (I-32), Division of Knowledge Management Systems, Wroclaw University of Technology, Str. Wyb. Wyspianskiego 27, 50-370, Wroclaw, Poland
Ngoc-Thanh Nguyen
KES International, Shoreham-by-sea, P.O. Box 2115, BN43 9AF, UK
Robert J. Howlett
School of Electrical and Information Engineering, University of South Australia, Mawson Lakes Campus, 5095, Adelaide, SA, Australia
Lakhmi C. Jain

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Vranić, M., Pintar, D., Skočir, Z. (2012). Integrating Quantitative Attributes in Hierarchical Clustering of Transactional Data. In: Jezic, G., Kusek, M., Nguyen, NT., Howlett, R.J., Jain, L.C. (eds) Agent and Multi-Agent Systems. Technologies and Applications. KES-AMSTA 2012. Lecture Notes in Computer Science(), vol 7327. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-30947-2_13

Download citation

DOI: https://doi.org/10.1007/978-3-642-30947-2_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-30946-5
Online ISBN: 978-3-642-30947-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics