ABSTRACT
Microblog data differs significantly from the traditional text data with respect to a variety of dimensions. Microblog data contains short documents, SMS kind of language, and is full of code mixing. Though a lot of it is mere social babble, it also contains fresh news coming from human sensors at a humungous rate. Given such interesting characteristics, the world wide web community has witnessed a large number of research tasks for microblogging platforms recently. Event detection on Twitter is one of the most popular such tasks with a large number of applications. The proposed tutorial on social analytics for Twitter will contain three parts. In the first part, we will discuss research efforts towards detection of events from Twitter using both the tweet content as well as other external sources. We will also discuss various applications for which event detection mechanisms have been put to use. Merely detecting events is not enough. Applications require that the detector must be able to provide a good description of the event as well. In the second part, we will focus on describing events using the best phrase, event type, event timespan, and credibility. In the third part, we will discuss user profiling for Twitter with a special focus on user location prediction. We will conclude with a summary and thoughts on future directions.
- F. Alvanaki, M. Sebastian, K. Ramamritham, and G. Weikum. EnBlogue: Emergent Topic Detection in Web 2.0 Streams. In Proc. of the 2011 ACM SIGMOD Intl. Conf. on Management of Data (SIGMOD), pages 1271--1274, 2011. Google ScholarDigital Library
- C. Castillo, M. Mendoza, and B. Poblete. Information Credibility on Twitter. In Proc. of the $20^th$ Intl. Conf. on World Wide Web (WWW), pages 675--684, 2011. Google ScholarDigital Library
- Z. Cheng, J. Caverlee, and K. Lee. You are Where you Tweet: A Content-based Approach to Geo-locating Twitter Users. In Proc. of the 19th ACM Intl. Conf. on Information and Knowledge Management (CIKM), pages 759--768, 2010. Google ScholarDigital Library
- A. Cui, M. Zhang, Y. Liu, S. Ma, and K. Zhang. Discover Breaking Events with Popular Hashtags in Twitter. In Proc. of the 21st ACM Intl. Conf. on Information and Knowledge Management (CIKM), pages 1794--1798, 2012. Google ScholarDigital Library
- N. Dalvi, R. Kumar, and B. Pang. Object Matching in Tweets with Spatial Models. In Proc. of the 5th ACM Intl. Conf. on Web Search and Data Mining (WSDM), pages 43--52, 2012. Google ScholarDigital Library
- C. A. Davis Jr, G. L. Pappa, D. R. R. de Oliveira, and F. de L Arcanjo. Inferring the Location of Twitter Messages based on User Relationships. Transactions in GIS, 15(6):735--751, 2011.Google ScholarCross Ref
- B. De Longueville, R. S. Smith, and G. Luraschi. OMG, from here, I can see the Flames!: A Use-case of Mining Location Based Social Networks to acquire Spatio-temporal Data on Forest Fires. In Proc. of the 2009 Intl. Workshop on Location Based Social Networks, pages 73--80, 2009. Google ScholarDigital Library
- J. Eisenstein, B. O'Connor, N. A. Smith, and E. P. Xing. A Latent Variable Model for Geographic Lexical Variation. In Proc. of the 2010 Conf. on Empirical Methods in Natural Language Processing (EMNLP), pages 1277--1287, 2010. Google ScholarDigital Library
- M. Gupta, P. Zhao, and J. Han. Evaluating Event Credibility on Twitter. In Proc. of the 2012 SIAM Intl. Conf. on Data Mining (SDM), pages 153--164, 2012.Google ScholarCross Ref
- B. Han, P. Cook, and T. Baldwin. Geolocation Prediction in Social Media Data by Finding Location Indicative Words. In Proc. of the 23rd Intl. Conf. on Computational Linguistics (COLING), pages 1045--1062, 2012.Google Scholar
- T. Hua, F. Chen, L. Zhao, C.-T. Lu, and N. Ramakrishnan. STED: Semi-supervised Targeted-interest Event Detection in Twitter. In Proc. of the 19th ACM SIGKDD Intl. Conf. on Knowledge Discovery and Data Mining (KDD), pages 1466--1469, 2013. Google ScholarDigital Library
- R. Lee and K. Sumiya. Measuring Geographical Regularities of Crowd Behaviors for Twitter-based Geo-social Event Detection. In Proc. of the 2nd ACM SIGSPATIAL Intl. Workshop on Location Based Social Networks, pages 1--10, 2010. Google ScholarDigital Library
- C. Li, A. Sun, and A. Datta. Twevent: Segment-based Event Detection from Tweets. In Proc. of the 21st ACM Intl. Conf. on Information and Knowledge Management (CIKM), pages 155--164, 2012. Google ScholarDigital Library
- R. Li, K. H. Lei, R. Khadiwala, and K.-C. Chang. TEDAS: A Twitter-based Event Detection and Analysis System. In Proc. of the 2012 IEEE 28th Intl. Conf. on Data Engineering (ICDE), pages 1273--1276, 2012. Google ScholarDigital Library
- R. Li, C. Wang, and K. Chang. User Profiling in Ego Network: An Attribute and Relationship Type Co-profiling Approach. In Proc. of the 23rd Intl. Conf. on World Wide Web (WWW), pages 675--684, 2011.Google Scholar
- R. Li, S. Wang, H. Deng, R. Wang, and K. C.-C. Chang. Towards Social User Profiling: Unified and Discriminative Influence Model for Inferring Home Locations. In Proc. of the 18th ACM Intl. Conf. on Knowledge Discovery and Data Mining (KDD), pages 1023--1031, 2012. Google ScholarDigital Library
- W. Li, P. Serdyukov, A. P. de Vries, C. Eickhoff, and M. Larson. The Where in the Tweet. In Proc. of the 20th ACM Intl. Conf. on Information and Knowledge Management (CIKM), pages 2473--2476, 2011. Google ScholarDigital Library
- M. Mathioudakis and N. Koudas. Twittermonitor: Trend Detection over the Twitter Stream. In Proc. of the 2010 ACM SIGMOD Intl. Conf. on Management of Data (SIGMOD), pages 1155--1158, 2010. Google ScholarDigital Library
- D. Metzler, C. Cai, and E. Hovy. Structured Event Retrieval over Microblog Archives. In Proc. of the 2012 Conf. of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT), pages 646--655, 2012. Google ScholarDigital Library
- A.-M. Popescu and M. Pennacchiotti. Detecting Controversial Events from Twitter. In Proc. of the 19th ACM Intl. Conf. on Information and Knowledge Management (CIKM), pages 1873--1876, 2010. Google ScholarDigital Library
- A. Ritter, O. Etzioni, S. Clark, et al. Open Domain Event Extraction from Twitter. In Proc. of the 18th ACM SIGKDD Intl. Conf. on Knowledge Discovery and Data Mining (KDD), pages 1104--1112, 2012. Google ScholarDigital Library
- A. Sadilek, H. Kautz, and J. P. Bigham. Finding your Friends and Following them to Where you are. In Proc. of the 5th ACM Intl. Conf. on Web Search and Data Mining (WSDM), pages 723--732, 2012. Google ScholarDigital Library
- T. Sakaki, M. Okazaki, and Y. Matsuo. Earthquake shakes Twitter Users: Real-time Event Detection by Social Sensors. In Proc. of the 19th Intl. Conf. on World Wide Web (WWW), pages 851--860, 2010. Google ScholarDigital Library
- B. Sharifi, M.-A. Hutton, and J. Kalita. Summarizing Microblogs Automatically. In Human Language Technologies: The 2010 Annual Conf. of the North American Chapter of the Association for Computational Linguistics (NAACL-HLT), pages 685--688, 2010. Google ScholarDigital Library
Index Terms
- Towards a social media analytics platform: event detection and user profiling for twitter
Recommendations
Extracting and Summarizing Situational Information from the Twitter Social Media during Disasters
Microblogging sites like Twitter have become important sources of real-time information during disaster events. A large amount of valuable situational information is posted in these sites during disasters; however, the information is dispersed among ...
Supporting temporal analytics for health-related events in microblogs
CIKM '12: Proceedings of the 21st ACM international conference on Information and knowledge managementMicroblogging services, such as Twitter, are gaining interests as a means of sharing information in social networks. Numerous works have shown the potential of using Twitter posts (or tweets) in order to infer the existence and magnitude of real-world ...
Dark retweets: investigating non-conventional retweeting patterns
SocInfo'12: Proceedings of the 4th international conference on Social InformaticsRetweets are an important mechanism for recognising propagation of information on the Twitter social media platform. However, many retweets do not use the official retweet mechanism, or even community established conventions, and these "dark retweets" ...
Comments