Abstract
By augmenting conventional techniques of topic modeling with unigram analysis and community detection, we establish an automated method that generates a comprehensive and meaningful summary of forum conversations over time that also sheds light on patterns of user behavior. We combine these methods to obtain a multiscale representation of what topics are being discussed, what the users are saying about each topic, how the conversation is evolving over time, and how friendships relate to content. As an example of our methodology, we examine discussion boards on Cafemom–an online hub for women to share their experiences and discuss their views on issues pertinent to child rearing. We apply the method with a focus on the issue of vaccination- a subject matter which has become controversial in recent years. We demonstrate how our methodology provides valuable insights into the evolution of conversations and highlights similarities in attitudes of socially connected users.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Ahmed, A., Ho, Q., Eisenstein, J., Xing, E., Smola, A.J., Teo, C.H.: Unified analysis of streaming news. In: Proceedings of the 20th International Conference on World Wide Web, WWW 2011, pp. 267–276. ACM, New York (2011)
AlSumait, L., Barbará, D., Domeniconi, C.: On-line lda: Adaptive topic models for mining text streams with applications to topic detection and tracking. In: Proceedings of the Eighth IEEE International Conference on Data Mining, ICDM 2008, pp. 3–12. IEEE Computer Society, Washington, DC (2008)
Anderson, A., Huttenlocher, D., Kleinberg, J., Leskovec, J.: Effects of user similarity in social media. In: Proceedings of the fifth ACM International Conference on Web Search and Data Mining, WSDM 2012, pp. 703–712. ACM, New York (2012)
Betsch, C., Renkewitz, F., Betsch, T., Ulshofer, C.: The influence of vaccine-critical websites on perceiving vaccination risks. J. Health Psychol. 15(3), 446–455 (2010)
Blei, D.M., Lafferty, J.D.: Dynamic topic models. In: Proceedings of the 23rd International Conference on Machine Learning, ICML 2006, pp. 113–120. ACM, New York (2006)
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)
Boscolo, R., Rezaei, B.A., Boykin, P.O., Roychowdhury, V.P.: Functionality Encoded In Topology? Discovering Macroscopic Regulatory Modules from Large-Scale Protein-DNA Interaction Networks. eprint arXiv:q-bio/0501039 (January 2005)
Clauset, A., Newman, M.E.J., Moore, C.: Finding community structure in very large networks. Physical Review EÂ 70(6), 066111 (2004)
Dave, K., Lawrence, S., Pennock, D.M.: Mining the peanut gallery: opinion extraction and semantic classification of product reviews. In: Proceedings of the 12th International Conference on World Wide Web, WWW 2003, pp. 519–528. ACM, New York (2003)
Fortunato, S.: Community detection in graphs. Physics Reports 486(3-5), 75–174 (2010)
Fountain, C., King, M.D., Bearman, P.S.: Age of diagnosis for autism: individual and community factors across 10 birth cohorts. J. Epidemiol. Community Health 65(6), 503–510 (2011)
Freed, G.L., Clark, S.J., Butchart, A.T., Singer, D.C., Davis, M.M.: Parental vaccine safety concerns in 2009. Pediatrics 125(4), 654–659 (2010)
Ghani, R., Probst, K., Liu, Y., Krema, M., Fano, A.: Text mining for product attribute extraction. SIGKDD Explor. Newsl. 8(1), 41–48 (2006)
Gohr, A., Hinneburg, A., Schult, R., Spiliopoulou, M.: Topic evolution in a stream of documents. In: SDM, pp. 859–872. SIAM (2009)
Griffiths, T.L., Steyvers, M.: Finding scientific topics. Proceedings of the National Academy of Sciences of the United States of America 101(suppl. 1), 5228–5235 (2004)
Jo, Y., Hopcroft, J.E., Lagoze, C.: The web of topics: discovering the topology of topic evolution in a corpus. In: Proceedings of the 20th International Conference on World Wide Web, WWW 2011, pp. 257–266. ACM, New York (2011)
Kennedy, A., LaVail, K., Nowak, G., Basket, M., Landry, S.: Confidence about vaccines in the united states: Understanding parents perceptions. Health Affairs 30(6), 1151–1159 (2011)
Liu, B.: Opinion observer: Analyzing and comparing opinions on the web. In: WWW 2005: Proceedings of the 14th International Conference on World Wide Web, pp. 342–351. ACM Press (2005)
McCallum, A.K.: Mallet: A machine learning for language toolkit (2002), http://mallet.cs.umass.edu
Mei, Q., Zhai, C.: Discovering evolutionary theme patterns from text: an exploration of temporal text mining. In: Proceedings of the Eleventh ACM SIGKDD International Conference on Knowledge Discovery in Data Mining, KDD 2005, pp. 198–207. ACM, New York (2005)
Wolfe, R.M., Sharp, L.K., Lipsky, M.S.: Content and design attributes of antivaccination web sites. JAMA 287(24), 3245–3248 (2002)
Zhou, D., Ji, X., Zha, H., Giles, C.L.: Topic evolution and social interactions: how authors effect research. In: Proceedings of the 15th ACM International Conference on Information and Knowledge Management, CIKM 2006, pp. 248–257. ACM, New York (2006)
Zimmerman, R.K., Wolfe, R.M., Fox, D.E., Fox, J.R., Nowalk, M.P., Troy, J.A., Sharp, L.K.: Vaccine criticism on the World Wide Web. J. Med. Internet Res. 7(2), e17 (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ahluwalia, A., Huang, A., Bandari, R., Roychowdhury, V. (2012). An Automated Multiscale Map of Conversations: Mothers and Matters. In: Aberer, K., Flache, A., Jager, W., Liu, L., Tang, J., Guéret, C. (eds) Social Informatics. SocInfo 2012. Lecture Notes in Computer Science, vol 7710. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35386-4_2
Download citation
DOI: https://doi.org/10.1007/978-3-642-35386-4_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35385-7
Online ISBN: 978-3-642-35386-4
eBook Packages: Computer ScienceComputer Science (R0)