A Comparative Study of Correlation Measurements for Searching Similar Tags

  • Conference paper
Advanced Data Mining and Applications (ADMA 2008)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 5139)

Abstract

In recent years, folksonomy has become a hot topic in many research fields, such as complex systems, information retrieval, and recommender systems. Studying the semantic relationships among tags is essential in folksonomy applications. The main contributions of this paper are as follows: (a) it proposes a general framework for analyzing the semantic relationships among tags based on their co-occurrence; (b) it investigates eight correlation measurements from various fields and applies them to searching for tags similar to a given tag on datasets from del.icio.us; (c) it conducts a comparative study of both the accuracy and the time performance of the eight measurements. From this comparison, the best overall correlation measurement for similar-tag search in folksonomy applications is identified.
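As a rough illustration of co-occurrence-based similar-tag search, the following minimal Python sketch counts how often tags appear together on the same bookmark and ranks candidate tags against a query tag. The abstract does not name the eight measurements studied, so cosine and Jaccard are used here only as assumed stand-ins; the function names (`build_counts`, `similar_tags`) and the toy `posts` data are hypothetical and are not taken from the paper.

```python
from collections import Counter
from itertools import combinations
from math import sqrt


def build_counts(posts):
    """Count how many posts carry each tag and each unordered tag pair."""
    tag_count = Counter()
    pair_count = Counter()
    for tags in posts:
        unique = sorted(set(tags))  # sorted so each pair has one canonical key
        tag_count.update(unique)
        pair_count.update(combinations(unique, 2))
    return tag_count, pair_count


def cosine(n_a, n_b, n_ab):
    """Cosine similarity of two tags viewed as binary vectors over posts."""
    return n_ab / sqrt(n_a * n_b)


def jaccard(n_a, n_b, n_ab):
    """Shared posts divided by posts carrying either tag."""
    return n_ab / (n_a + n_b - n_ab)


def similar_tags(posts, query, measure=cosine, k=3):
    """Return the k tags most correlated with `query` under `measure`."""
    tag_count, pair_count = build_counts(posts)
    scores = []
    for other in tag_count:
        if other == query:
            continue
        n_ab = pair_count[tuple(sorted((query, other)))]
        if n_ab:  # skip tags that never co-occur with the query
            score = measure(tag_count[query], tag_count[other], n_ab)
            scores.append((other, score))
    # Highest score first; ties broken alphabetically for determinism.
    return sorted(scores, key=lambda s: (-s[1], s[0]))[:k]


# Toy data (hypothetical): each set is the tags attached to one bookmark.
posts = [
    {"python", "programming", "tutorial"},
    {"python", "programming"},
    {"python", "snake"},
    {"programming", "java"},
]

print(similar_tags(posts, "python"))
print(similar_tags(posts, "python", measure=jaccard))
```

Passing the measure as a function keeps the counting step shared, so different correlation measurements can be compared on the same co-occurrence counts, in the spirit of the comparison the abstract describes.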

Supported by the 11th Five Years Key Programs for Sci. & Tech. Development of China under grant No. 2006BAI05A01, the National Science Foundation under grant No. 60773169, the Software Innovation Project of Sichuan Youth under grant No. 2007AA0155, and the Development Foundation of Chengdu University of Information Technology (KYTZ200811).

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Xu, K., Chen, Y., Jiang, Y., Tang, R., Liu, Y., Gong, J. (2008). A Comparative Study of Correlation Measurements for Searching Similar Tags. In: Tang, C., Ling, C.X., Zhou, X., Cercone, N.J., Li, X. (eds) Advanced Data Mining and Applications. ADMA 2008. Lecture Notes in Computer Science, vol 5139. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-88192-6_75

  • DOI: https://doi.org/10.1007/978-3-540-88192-6_75

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-88191-9

  • Online ISBN: 978-3-540-88192-6

  • eBook Packages: Computer Science, Computer Science (R0)
