Abstract
Similarity calculation is a key step in the process of clustering. Because most tagged resources on the Internet lack text information, traditional similarity measures cannot obtain good results. We propose the STAC measure to solve the problem of calculating the similarity between tagged resources. In the calculation of STAC, the similarity between tags is calculated using tag co-occurrence information, and the similarity between tagged resources is calculated based on tag comparison. Experiments show the clustering results of tagged resources using STAC is significantly better than using other traditional metrics such as the Euclidean distance and Jaccard coefficient.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Adam M.: Folksonomies - Cooperative Classification and Communication through Shared Metadata. In: Computer Mediated Communication - LIS590CMC (2004)
Daniel, R., Paul, H., Christopher, D.M., et al.: Clustering the Tagged Web. In: WSDM 2009, pp. 54–63 (2009)
Fernando, P., David, P., John, C., et al.: Improving the Clustering of Blogosphere with a Self-term Enriching Technique. In: Matoušek, V., Mautner, P. (eds.) TSD 2009. LNCS, vol. 5729, pp. 40–47. Springer, Heidelberg (2009)
Yin, Z., Kening, G., Bin, Z.: Clustering Blog Posts Using Tags and Relations in the Blogosphere. In: ICISE 2009, pp. 817–820 (2009)
Aixin, S., Maggy, A.S., Ying, L.: Blog Classification Using Tags: An Empirical Study. In: Goh, D.H.-L., Cao, T.H., Sølvberg, I.T., Rasmussen, E. (eds.) ICADL 2007. LNCS, vol. 4822, pp. 307–316. Springer, Heidelberg (2007)
Grigory, B., Philipp, K., Frank, S.: Automated Tag Clustering: Improving Search and Exploration in the Tag Space. In: WWW 2006, pp. 22–26 (2006)
Edwin, S.: Clustering Tags in Enterprise and Web Folksonomies. Technical report, HP Labs (2008)
Kaikuo, X., Yu, C., Yexi, J., et al.: A Comparative Study of Correlation Measurements for Searching Similar Tags. In: Tang, C., Ling, C.X., Zhou, X., Cercone, N.J., Li, X. (eds.) ADMA 2008. LNCS (LNAI), vol. 5139, pp. 709–716. Springer, Heidelberg (2008)
Ciro, C., Dominik, B., Andreas, H., et al.: Semantic Analysis of Tag Similarity Measures in Collaborative Tagging Systems. In: LWA 2008, pp. 18–26 (2008)
Ludovico, B., Salvatore, C., Eloisa, V.: RATC: A Robust Automated Tag Clustering Technique. In: Di Noia, T., Buccafurri, F. (eds.) E-Commerce and Web Technologies. LNCS, vol. 5692, pp. 324–335. Springer, Heidelberg (2009)
Aixin, S., Anwitaman, D.: On Stability, Clarity, and Co-occurrence of Self-Tagging. In: WSDM 2009 (2009)
Jianwei, C., Pei, L., Hongyan, L., et al.: A Neighborhood Search Method for Link-Based Tag Clustering. In: ADMA 2009, pp. 91–103 (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Gao, F., Gao, K., Zhang, B. (2010). Clustering the Tagged Resources Using STAC. In: Wang, F.L., Gong, Z., Luo, X., Lei, J. (eds) Web Information Systems and Mining. WISM 2010. Lecture Notes in Computer Science, vol 6318. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16515-3_41
Download citation
DOI: https://doi.org/10.1007/978-3-642-16515-3_41
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-16514-6
Online ISBN: 978-3-642-16515-3
eBook Packages: Computer ScienceComputer Science (R0)