ABSTRACT
Online social streams such as Twitter timelines, forum discussions and email threads have emerged as important channels for information propagation. Mining transient stories and their correlations implicit in social streams is a challenging task, since these streams are noisy and surge quickly. In this paper, we propose CAST, which is a context-aware story-teller that discovers new stories from social streams and tracks their structural context on the fly to build a vein of stories. More precisely, we model the social stream as a capillary network, and define stories by a new cohesive subgraph type called (k,d)-Core in the capillary network. We propose deterministic and randomized context search to support the iceberg query, which builds the story vein as social streams flow. We perform detailed experimental study on real Twitter streams and the results demonstrate the creativity and value of our approach.
- J. Abello, M. G. C. Resende, and S. Sudarsky. Massive quasi-clique detection. In LATIN, pages 598--612, 2002. Google ScholarDigital Library
- J. Allan, editor. Topic detection and tracking: event-based information organization. Kluwer Academic Publishers, 2002. Google ScholarDigital Library
- A. Angel, N. Koudas, N. Sarkas, and D. Srivastava. Dense subgraph maintenance under streaming edge weight updates for real-time story identification. PVLDB, 5(6):574--585, 2012. Google ScholarDigital Library
- B. Balasundaram, S. Butenko, and I. V. Hicks. Clique relaxations in social network analysis: The maximum k-plex problem. Operations Research, 59(1):133--142, 2011. Google ScholarDigital Library
- P. Berkhin. Survey: A survey on pagerank computing. Internet Mathematics, 2(1):73--120, 2005.Google ScholarCross Ref
- D. M. Blei, A. Y. Ng, and M. I. Jordan. Latent dirichlet allocation. Journal of Machine Learning Research, 3:993--1022, 2003. Google ScholarDigital Library
- M. Brunato, H. H. Hoos, and R. Battiti. On effectively finding maximal quasi-cliques in graphs. In LION, pages 41--55, 2007. Google ScholarDigital Library
- Z. Gao, Y. Song, S. Liu, H. Wang, H. Wei, Y. Chen, and W. Cui. Tracking and connecting topics via incremental hierarchical dirichlet processes. In ICDM, pages 1056--1061, 2011. Google ScholarDigital Library
- J. Hastad. Clique is hard to approximate within nxup1--∈. Acta Mathematica, 182:105--142, 1999.Google ScholarCross Ref
- Y. He, C. Lin, W. Gao, and K.-F. Wong. Tracking sentiment and topic dynamics from social media. In ICWSM, 2012.Google Scholar
- B. Kovach and T. Rosenstiel. Blur: How to Know What's True in the Age of Information Overload. Bloomsbury Publishing USA, 2010.Google Scholar
- P. Lee, L. V. S. Lakshmanan, and J. X. Yu. On top-k structural similarity search. In ICDE, pages 774--785, 2012. Google ScholarDigital Library
- V. E. Lee, N. Ruan, R. Jin, and C. C. Aggarwal. A survey of algorithms for dense subgraph discovery. In Managing and Mining Graph Data, pages 303--336. 2010.Google ScholarCross Ref
- J. Leskovec, L. Backstrom, and J. M. Kleinberg. Meme-tracking and the dynamics of the news cycle. In KDD, pages 497--506, 2009. Google ScholarDigital Library
- P. Li, J. X. Yu, H. Liu, J. He, and X. Du. Ranking individuals and groups by influence propagation. In PAKDD (2), pages 407--419, 2011. Google ScholarDigital Library
- N. Liu. Topic detection and tracking. In Encyclopedia of Database Systems, pages 3121--3124. 2009.Google Scholar
- C. D. Manning, P. Raghavan, and H. Schuetze. Introduction to Information Retrieval. Cambridge University Press, 2008. Google ScholarDigital Library
- A. Marcus, M. S. Bernstein, O. Badar, D. R. Karger, S. Madden, and R. C. Miller. Twitinfo: aggregating and visualizing microblogs for event exploration. In CHI, pages 227--236, 2011. Google ScholarDigital Library
- D. Nadeau and S. Sekine. A survey of named entity recognition and classification. Lingvisticae Investigationes, 30(1):3--26, 2007.Google ScholarCross Ref
- T. Nomoto. Two-tier similarity model for story link detection. In CIKM, pages 789--798, 2010. Google ScholarDigital Library
- K. Saito, T. Yamada, and K. Kazama. The k-dense method to extract communities from complex networks. In Mining Complex Data, pages 243--257. 2009.Google ScholarCross Ref
- T. Sakaki, M. Okazaki, and Y. Matsuo. Earthquake shakes twitter users: real-time event detection by social sensors. In WWW, pages 851--860, 2010. Google ScholarDigital Library
- A. D. Sarma, A. Jain, and C. Yu. Dynamic relationship and event discovery. In WSDM, pages 207--216, 2011. Google ScholarDigital Library
- C. Shah, W. B. Croft, and D. Jensen. Representing documents with named entities for story link detection (sld). In CIKM, pages 868--869, 2006. Google ScholarDigital Library
- D. Shahaf, J. Yang, C. Suen, J. Jacobs, H. Wang, and J. Leskovec. Information cartography: creating zoomable, large-scale maps of information. In KDD, pages 1097--1105, 2013. Google ScholarDigital Library
- D. Sontag and D. Roy. Complexity of inference in latent dirichlet allocation. In NIPS, pages 1008--1016, 2011.Google Scholar
- N. Wang, S. Parthasarathy, K.-L. Tan, and A. K. H. Tung. Csv: visualizing and mining cohesive subgraphs. In SIGMOD Conference, pages 445--458, 2008. Google ScholarDigital Library
- J. Weng and B.-S. Lee. Event detection in twitter. In ICWSM, 2011.Google Scholar
Index Terms
CAST: A Context-Aware Story-Teller for Streaming Social Content
Recommendations
CAT: a context aware teller for supporting tourist experiences
The aim of this paper is the introduction of a methodology for the dynamic creation of an adaptive generator of the stories related to a tourist context. The proposed approach selects the most suitable contents for the users and builds a context aware ...
Sharing social content from home: a measurement-driven feasibility study
NOSSDAV '11: Proceedings of the 21st international workshop on Network and operating systems support for digital audio and videoToday, OSN sites allow users to share data using a centrally controlled web infrastructure. However, if users shared data directly from home, they could potentially retain full control over the data (i.e., what to share, whom to share with).
This paper ...
Searching for Recent Celebrity Images in Microblog Platform
MM '14: Proceedings of the 22nd ACM international conference on MultimediaWith the explosive growth and widespread accessibility of image content in social media, many users are eagerly searching for most recent and relevant images on topics of their interests. However, most current microblog platforms merely make use of ...
Comments