Core-Tag Clustering for Web 2.0 Based on Multi-similarity Measurements

Jiang, Yexi; Tang, Changjie; Xu, Kaikuo; Duan, Lei; Tang, Liang; Gong, Jie; Li, Chuan

doi:10.1007/978-3-642-03996-6_21

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 5731)

Abstract

With the development of Web 2.0, folksonomy has become a hot topic in data mining, information retrieval, and social networks. Tag semantics are the key to a deep understanding of the correlations among objects in a folksonomy. This paper proposes two methods that cluster tags for a core-tag by fusing multiple similarity measurements. The contributions of this paper are: (1) proposing the concept of the core-tag and the model of core-tag clusters; (2) designing a core-tag clustering algorithm, CETClustering, based on the clustering ensemble method; (3) designing a second core-tag clustering algorithm, SkyTagClustering, based on the skyline operator; and (4) comparing the two algorithms with a modified K-means. Experiments show that the two algorithms outperform the modified K-means by 20-30% in efficiency and by 20% in quality scores.
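As a rough illustration of how a skyline operator can fuse several similarity measurements, the sketch below keeps only the candidate tags whose similarity vectors are not dominated by any other candidate. It is not the paper's SkyTagClustering algorithm, which the abstract does not specify. The tag names, the three measures, the scores, and the helper names `dominates` and `skyline` are all hypothetical.

```python
from typing import Dict, List, Tuple

# Maps a candidate tag to its similarity to the core-tag, one value per measure.
Scores = Dict[str, Tuple[float, ...]]


def dominates(a: Tuple[float, ...], b: Tuple[float, ...]) -> bool:
    """True if a is at least as high as b on every measure and strictly higher on one."""
    return all(x >= y for x, y in zip(a, b)) and any(x > y for x, y in zip(a, b))


def skyline(candidates: Scores) -> List[str]:
    """Return, in sorted order, the tags that no other tag dominates."""
    return sorted(
        tag
        for tag, vec in candidates.items()
        if not any(
            dominates(other_vec, vec)
            for other_tag, other_vec in candidates.items()
            if other_tag != tag
        )
    )


# Hypothetical similarities to an assumed core-tag "python" under three
# made-up measures; higher means more similar.
candidates: Scores = {
    "programming": (0.9, 0.7, 0.6),
    "scripting": (0.8, 0.8, 0.5),
    "snake": (0.2, 0.1, 0.1),
    "code": (0.7, 0.6, 0.5),
}

result = skyline(candidates)
print(result)  # ['programming', 'scripting']
```

Here "programming" and "scripting" each win on at least one measure, so neither dominates the other and both stay in the skyline, while "snake" and "code" are dominated and dropped. This pairwise check is quadratic in the number of candidates, which is acceptable only for a small illustration.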

Supported by the 11th Five-Year Key Programs for Sci. & Tech. Development of China under grant No. 2006BAI05A01, the National Science Foundation under grant No. 60773169, and the Software Innovation Project of Sichuan Youth under grant No. 2007AA0155.

Author information

Authors and Affiliations

School of Computer Science, Sichuan University, Chengdu, 610065, China
Yexi Jiang, Changjie Tang, Kaikuo Xu, Lei Duan, Liang Tang, Jie Gong & Chuan Li

Editor information

Editors and Affiliations

Hong Kong University of Science and Technology, Hong Kong
Lei Chen
Swinburne University of Technology, Melbourne, Australia
Chengfei Liu
Renmin University of China, China
Xiao Zhang
Renmin University of China, China
Shan Wang
Dept. of Industrial Economics and Technology Management, NTNU, Norway
Darijus Strasunskas
NTNU, Norway
Stein L. Tomassen
AOL, China
Jinghai Rao
SAP Research China, China
Wen-Syan Li
Comp. Sci. and Eng. Dept., Arizona State University, 85287, Tempe, AZ
K. Selçuk Candan
Dickson Computer Systems, 7A Victory Avenue 4th floor, Homantin, Kln, P.O. Box, Hong Kong
Dickson K. W. Chiu
Zhejiang Gongshang University, China
Yi Zhuang
University of Colorado at Boulder, USA
Clarence A. Ellis
Kyonggi University, Korea
Kwang-Hoon Kim

About this paper

Cite this paper

Jiang, Y. et al. (2009). Core-Tag Clustering for Web 2.0 Based on Multi-similarity Measurements. In: Chen, L., et al. (eds) Advances in Web and Network Technologies, and Information Management. APWeb WAIM 2009. Lecture Notes in Computer Science, vol 5731. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03996-6_21

DOI: https://doi.org/10.1007/978-3-642-03996-6_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-03995-9
Online ISBN: 978-3-642-03996-6
eBook Packages: Computer Science, Computer Science (R0)
