Associated Topic Extraction for Consumer Generated Media Analysis

  • Conference paper
Service-Oriented Computing: Agents, Semantics, and Engineering (SOCASE 2007)

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 4504)

Abstract

This paper proposes a new algorithm for associated topic extraction, which detects related topics in a collection of blog entries that refer to a specified topic. It is one component of our product reputation information retrieval service, whose aim is to detect product names rather than general terms. The main feature of the algorithm is that it evaluates how important a topic is to the collection according to the popularity of the blog entries, as measured through Trackbacks and comments. A second feature is the use of a product ontology for topic filtering, which extracts products relevant or similar to a specified product. The paper also presents a brief evaluation of the algorithm in comparison with TF-IDF. Based on this evaluation, we conclude that the proposed algorithm can capture users' impressions of associated topics more accurately than TF-IDF.
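
The abstract does not give the scoring formula, so the following is only a minimal sketch of the general idea, not the authors' algorithm. It assumes each blog entry has already been reduced to a set of mentioned product names plus Trackback and comment counts, and it assumes a log-damped popularity weight. The names `popularity_weight`, `score_topics`, and `allowed`, as well as the sample data, are hypothetical. The optional `allowed` set stands in for the ontology-based filter, and a constant weight gives a plain mention count for comparison (a simple frequency baseline, not TF-IDF itself).

```python
import math
from collections import defaultdict


def popularity_weight(trackbacks, comments):
    # Assumed weighting (not from the paper): every entry counts once,
    # plus a log-damped bonus for reader responses.
    return 1.0 + math.log1p(trackbacks + comments)


def score_topics(entries, allowed=None, weight=popularity_weight):
    """Rank topics by the summed weight of the entries that mention them.

    entries: list of dicts with keys 'topics' (set of str),
             'trackbacks' (int) and 'comments' (int).
    allowed: optional set of topic names to keep; a stand-in for an
             ontology-based filter of relevant or similar products.
    """
    scores = defaultdict(float)
    for entry in entries:
        w = weight(entry["trackbacks"], entry["comments"])
        for topic in entry["topics"]:
            if allowed is None or topic in allowed:
                scores[topic] += w
    # Highest score first.
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Hypothetical sample data: CameraA is mentioned most often,
# but the single CameraC entry drew far more reader responses.
entries = [
    {"topics": {"CameraA", "CameraB"}, "trackbacks": 0, "comments": 0},
    {"topics": {"CameraA"}, "trackbacks": 0, "comments": 0},
    {"topics": {"CameraC"}, "trackbacks": 5, "comments": 20},
]

by_popularity = score_topics(entries)
by_count = score_topics(entries, weight=lambda tb, c: 1.0)
filtered = score_topics(entries, allowed={"CameraA", "CameraC"})
```

In this toy data the plain count ranks CameraA first, while the popularity-weighted score ranks CameraC first, which illustrates how reader responses can change which associated topic is treated as most important.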

Editor information

Jingshan Huang Ryszard Kowalczyk Zakaria Maamar David Martin Ingo Müller Suzette Stoutenburg Katia P. Sycara

Copyright information

© 2007 Springer Berlin Heidelberg

About this paper

Cite this paper

Nagano, S., Inaba, M., Mizoguchi, Y., Kawamura, T. (2007). Associated Topic Extraction for Consumer Generated Media Analysis. In: Huang, J., et al. Service-Oriented Computing: Agents, Semantics, and Engineering. SOCASE 2007. Lecture Notes in Computer Science, vol 4504. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-72619-7_8

  • DOI: https://doi.org/10.1007/978-3-540-72619-7_8

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-72618-0

  • Online ISBN: 978-3-540-72619-7

  • eBook Packages: Computer Science, Computer Science (R0)
