skip to main content
10.1145/2567948.2577264acmotherconferencesArticle/Chapter ViewAbstractPublication PageswwwConference Proceedingsconference-collections
tutorial

Towards a social media analytics platform: event detection and user profiling for twitter

Published:07 April 2014Publication History

ABSTRACT

Microblog data differs significantly from the traditional text data with respect to a variety of dimensions. Microblog data contains short documents, SMS kind of language, and is full of code mixing. Though a lot of it is mere social babble, it also contains fresh news coming from human sensors at a humungous rate. Given such interesting characteristics, the world wide web community has witnessed a large number of research tasks for microblogging platforms recently. Event detection on Twitter is one of the most popular such tasks with a large number of applications. The proposed tutorial on social analytics for Twitter will contain three parts. In the first part, we will discuss research efforts towards detection of events from Twitter using both the tweet content as well as other external sources. We will also discuss various applications for which event detection mechanisms have been put to use. Merely detecting events is not enough. Applications require that the detector must be able to provide a good description of the event as well. In the second part, we will focus on describing events using the best phrase, event type, event timespan, and credibility. In the third part, we will discuss user profiling for Twitter with a special focus on user location prediction. We will conclude with a summary and thoughts on future directions.

References

  1. F. Alvanaki, M. Sebastian, K. Ramamritham, and G. Weikum. EnBlogue: Emergent Topic Detection in Web 2.0 Streams. In Proc. of the 2011 ACM SIGMOD Intl. Conf. on Management of Data (SIGMOD), pages 1271--1274, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. C. Castillo, M. Mendoza, and B. Poblete. Information Credibility on Twitter. In Proc. of the $20^th$ Intl. Conf. on World Wide Web (WWW), pages 675--684, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Z. Cheng, J. Caverlee, and K. Lee. You are Where you Tweet: A Content-based Approach to Geo-locating Twitter Users. In Proc. of the 19th ACM Intl. Conf. on Information and Knowledge Management (CIKM), pages 759--768, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. A. Cui, M. Zhang, Y. Liu, S. Ma, and K. Zhang. Discover Breaking Events with Popular Hashtags in Twitter. In Proc. of the 21st ACM Intl. Conf. on Information and Knowledge Management (CIKM), pages 1794--1798, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. N. Dalvi, R. Kumar, and B. Pang. Object Matching in Tweets with Spatial Models. In Proc. of the 5th ACM Intl. Conf. on Web Search and Data Mining (WSDM), pages 43--52, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. C. A. Davis Jr, G. L. Pappa, D. R. R. de Oliveira, and F. de L Arcanjo. Inferring the Location of Twitter Messages based on User Relationships. Transactions in GIS, 15(6):735--751, 2011.Google ScholarGoogle ScholarCross RefCross Ref
  7. B. De Longueville, R. S. Smith, and G. Luraschi. OMG, from here, I can see the Flames!: A Use-case of Mining Location Based Social Networks to acquire Spatio-temporal Data on Forest Fires. In Proc. of the 2009 Intl. Workshop on Location Based Social Networks, pages 73--80, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. J. Eisenstein, B. O'Connor, N. A. Smith, and E. P. Xing. A Latent Variable Model for Geographic Lexical Variation. In Proc. of the 2010 Conf. on Empirical Methods in Natural Language Processing (EMNLP), pages 1277--1287, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. M. Gupta, P. Zhao, and J. Han. Evaluating Event Credibility on Twitter. In Proc. of the 2012 SIAM Intl. Conf. on Data Mining (SDM), pages 153--164, 2012.Google ScholarGoogle ScholarCross RefCross Ref
  10. B. Han, P. Cook, and T. Baldwin. Geolocation Prediction in Social Media Data by Finding Location Indicative Words. In Proc. of the 23rd Intl. Conf. on Computational Linguistics (COLING), pages 1045--1062, 2012.Google ScholarGoogle Scholar
  11. T. Hua, F. Chen, L. Zhao, C.-T. Lu, and N. Ramakrishnan. STED: Semi-supervised Targeted-interest Event Detection in Twitter. In Proc. of the 19th ACM SIGKDD Intl. Conf. on Knowledge Discovery and Data Mining (KDD), pages 1466--1469, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. R. Lee and K. Sumiya. Measuring Geographical Regularities of Crowd Behaviors for Twitter-based Geo-social Event Detection. In Proc. of the 2nd ACM SIGSPATIAL Intl. Workshop on Location Based Social Networks, pages 1--10, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. C. Li, A. Sun, and A. Datta. Twevent: Segment-based Event Detection from Tweets. In Proc. of the 21st ACM Intl. Conf. on Information and Knowledge Management (CIKM), pages 155--164, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. R. Li, K. H. Lei, R. Khadiwala, and K.-C. Chang. TEDAS: A Twitter-based Event Detection and Analysis System. In Proc. of the 2012 IEEE 28th Intl. Conf. on Data Engineering (ICDE), pages 1273--1276, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. R. Li, C. Wang, and K. Chang. User Profiling in Ego Network: An Attribute and Relationship Type Co-profiling Approach. In Proc. of the 23rd Intl. Conf. on World Wide Web (WWW), pages 675--684, 2011.Google ScholarGoogle Scholar
  16. R. Li, S. Wang, H. Deng, R. Wang, and K. C.-C. Chang. Towards Social User Profiling: Unified and Discriminative Influence Model for Inferring Home Locations. In Proc. of the 18th ACM Intl. Conf. on Knowledge Discovery and Data Mining (KDD), pages 1023--1031, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. W. Li, P. Serdyukov, A. P. de Vries, C. Eickhoff, and M. Larson. The Where in the Tweet. In Proc. of the 20th ACM Intl. Conf. on Information and Knowledge Management (CIKM), pages 2473--2476, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. M. Mathioudakis and N. Koudas. Twittermonitor: Trend Detection over the Twitter Stream. In Proc. of the 2010 ACM SIGMOD Intl. Conf. on Management of Data (SIGMOD), pages 1155--1158, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. D. Metzler, C. Cai, and E. Hovy. Structured Event Retrieval over Microblog Archives. In Proc. of the 2012 Conf. of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT), pages 646--655, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. A.-M. Popescu and M. Pennacchiotti. Detecting Controversial Events from Twitter. In Proc. of the 19th ACM Intl. Conf. on Information and Knowledge Management (CIKM), pages 1873--1876, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. A. Ritter, O. Etzioni, S. Clark, et al. Open Domain Event Extraction from Twitter. In Proc. of the 18th ACM SIGKDD Intl. Conf. on Knowledge Discovery and Data Mining (KDD), pages 1104--1112, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. A. Sadilek, H. Kautz, and J. P. Bigham. Finding your Friends and Following them to Where you are. In Proc. of the 5th ACM Intl. Conf. on Web Search and Data Mining (WSDM), pages 723--732, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. T. Sakaki, M. Okazaki, and Y. Matsuo. Earthquake shakes Twitter Users: Real-time Event Detection by Social Sensors. In Proc. of the 19th Intl. Conf. on World Wide Web (WWW), pages 851--860, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. B. Sharifi, M.-A. Hutton, and J. Kalita. Summarizing Microblogs Automatically. In Human Language Technologies: The 2010 Annual Conf. of the North American Chapter of the Association for Computational Linguistics (NAACL-HLT), pages 685--688, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Towards a social media analytics platform: event detection and user profiling for twitter

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      WWW '14 Companion: Proceedings of the 23rd International Conference on World Wide Web
      April 2014
      1396 pages
      ISBN:9781450327459
      DOI:10.1145/2567948

      Copyright © 2014 Copyright is held by the owner/author(s)

      Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 7 April 2014

      Check for updates

      Qualifiers

      • tutorial

      Acceptance Rates

      Overall Acceptance Rate1,899of8,196submissions,23%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader