skip to main content
research-article

Real-Time Traffic Event Detection From Social Media

Published: 04 November 2017 Publication History

Abstract

Smart communities are composed of groups, organizations, and individuals who share information and make use of that shared information for better decision making. Shared information can come from many sources, particularly, but not exclusively, from sensors and social media. Social media has become an important source of near-instantaneous user-generated information that can be shared and analyzed to support better decision making. One domain where social media data can add value is transportation and traffic management. This article looks at the exploitation of Twitter data in the traffic reporting domain. A key challenge is how to identify relevant information from a huge amount of user-generated data and then analyze the relevant data for automatic geocoded incident detection. The article proposes an instant traffic alert and warning system based on a novel latent Dirichlet allocation (LDA) approach (“tweet-LDA”). The system is evaluated and shown to perform better than related approaches.

References

[1]
Simon Beddus and Mike Fisher. 2011. Enabling smart communities. Journal of the Institute of Telecommunications Professionals 5, 4, 34--38.
[2]
Illinois Department of Transportation. 2017. Illinois Traffic Alert System. Retrieved September 23, 2017, from http://www.iltrafficalert.com/.
[3]
UK Traffic News. 2017. Ross Targett publishing. Retrieved July 8, 2017, from https://uk-traffic-news-twitraffic.soft112.com/.
[4]
Yiming Gu, Zhen (Sean) Qian, and Feng Chen. 2016. From Twitter to detector: Real-time traffic incident detection using social media data. Transportation Research Part C 67, 321--342.
[5]
Eleonora D'Andrea, Pietro Ducange, Beatrice Lazzerini, and Francesco Marcelloni. 2015. Real-time detection of traffic from Twitter stream analysis. IEEE Transactions on Intelligent Transportation Systems 16, 4, 2269--2283.
[6]
Jie Yin, Andrew Lampert, Bella Robinson, and Robert Power. 2012. Using social media to enhance emergency situation awareness. IEEE Intelligent Systems 27, 6, 52--59.
[7]
Takumi Ichimura and Shin Kamada. 2012. A generation method of filtering rules of Twitter via smartphone based Participatory Sensing system for tourist by interactive GHSOM and C4.5. In Proceedings of the 2012 IEEE International Conference on Systems, Man, and Cybernetics.
[8]
Takeshi Sakaki, Yutaka Matsuo, Tadashi Yanagihara, Naiwala P. Chandrasiri, and Kazunari Nawa. 2012. Real-time event extraction for driving information from social sensors. In Proceedings of the 2012 IEEE International Conference on Cyber Technology in Automation, Control, and Intelligent Systems.
[9]
M. A. H. Khan, D. Bollegala, G. Liu, and K. Sezaki. 2013. Multi-tweet summarization of real-time events. In Proceedings of the International Conference on Social Computing (SocialCom’13). 128--133.
[10]
Takeshi Sakaki, Makoto Okazaki, and Yutaka Matsuo. 2013. Tweet analysis for real-time event detection and earthquake reporting system development. IEEE Transactions on Knowledge and Data Engineering 25, 4, 919--931.
[11]
Kirill Kireyev, Leysia Palen, and Kenneth M. Anderson. 2009. Applications of topics models to analysis of disaster-related Twitter data. In Proceedings of the NIPS Workshop on Applications for Topic Models: Text and Beyond.
[12]
Michael J. Paul and Mark Dredze. 2011. You are what you tweet: Analyzing Twitter for public health. Proceedings of the 5th International AAAI Conference on Weblogs and Social Media.
[13]
Kyle W. Prier, Matthew S. Smith, Christophe Giraud-Carrier, and Carl L. Hanson. 2011. Identifying health-related topics on Twitter. Lecture Notes in Computer Science, Vol. 6589. Springer, 18--25.
[14]
Jiang Bian, Umit Topaloglu, and Fan Yu. 2012. Towards large-scale Twitter mining for drug-related adverse events. In Proceedings of the 2012 International Workshop on Smart Health and Wellbeing (SHB’12). ACM, New York, NY, 25--32.
[15]
Brendan O'Connor, Michel Krieger, and David Ahn. 2010. Tweetmotif: Exploratory search and topic summarization for Twitter. In Proceedings of the 4th International AAAI Conference on Weblogs and Social Media.
[16]
Guang Xiang, Bin Fan, Ling Wang, Jason I. Hong, and Carolyn P. Rose. 2012. Detecting offensive tweets via topical feature discovery over a large scale Twitter corpus. In Proceedings of the 21st ACM International Conference on Information and Knowledge Management (CIKM’12). 1980--1984.
[17]
Aron Culotta. 2010. Towards detecting influenza epidemics by analyzing Twitter messages. In Proceedings of the 1st Workshop on Social Media Analytics (SOMA’10). 115--122.
[18]
Bharath Sriram, David Fuhry, Engin Demir, Hakan Ferhatosmanoglu, and Murat Demirbas. 2010. Short text classification in Twitter to improve information filtering. In Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR’10). 841--842.
[19]
Alan Ritter, Sam Clark, Mausam, and Oren Etzioni. 2011. Named entity recognition in tweets: An experimental study. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing. 1524--1534.
[20]
Max Kaufmann. 2010. Syntactic normalization of Twitter messages. In Proceedings of the International Conference on Natural Language Processing.
[21]
D. M. Blei, A. Y. Ng, and M. I. Jordan. 2003. Latent Dirichlet allocation. Journal of Machine Learning Research 3, 993--1022.
[22]
X. Zhao Wayne, Jiang Jing, Wng Jianshu, He Jing, Lim Ee-Peng, Yan Hongfei, and Li Xiaoming, 2011. Comparing Twitter and traditional media using topic models. In Proceedings of the 33rd European Conference on Advances in Information Retrieval (ECIR’11). 338--349.
[23]
M. Steyvers, P. Smyth, M. Rosen-Zvi, and T. Griffiths. 2004. Probabilistic author-topic models for information discovery. In Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’04). 206--315.
[24]
Liangjie Hong and Brian D. Davison. 2010. Empirical study of topic modeling in Twitter. In Proceedings of the 1st Workshop on Social Media Analytics (SOMA’10).
[25]
Jeff Huang, Katherine M. Thornton, Efthimis N. Efthimiadis. 2010. Conversational tagging in Twitter. In Proceedings of the 21st ACM Conference on Hypertext and Hypermedia (HT’10). 173--178.
[26]
D. Blei and J. D. McAuliffe. 2007. Supervised topic models. In Proceedings of Advances in Neural Information Processing Systems (NIPS’07). 121--128.
[27]
D. Wang, M. Thint, and A. Al-Rubaie. 2012. Semi-supervised latent Dirichlet allocation and its application for document classification. In Proceedings of the IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology (WI-IAT’12). 306--310.
[28]
Matthew D. Hoffman, David M. Blei, and Francis R. Bach. 2010. Online learning for latent Dirichlet allocation. In Proceedings of Advances in Neural Information Processing Systems (NIPS’10). 856--864.
[29]
Kevin R. Canini, Lei Shi, and Thomas L. Griffiths. 2009. Online inference of topics with latent Dirichlet allocation. In Proceedings of the 12th International Conference on Artificial Intelligence and Statistics (AISTATS’09).
[30]
Issei Sato, Kenichi Kurihara, and Hiroshi Nakagawa. 2010. Deterministic single-pass algorithm for LDA. In Proceedings of the 23rd International Conference on Neural Information Processing Systems (NIPS’10). 2074--2082.
[31]
Zekai J. Gao, Yangqiu Song, Shixia Liu, Haixun Wang, Hao Weil, and Weiwei Cui. 2011. Tracking and connecting topics via incremental hierarchical Dirichlet processes. In Proceedings of the 2011 11th IEEE International Conference on Data Mining.
[32]
B. Han and T. Baldwin. Lexical normalisation of short text messages: Makn sens a #twitter. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics. 368--378.
[33]
Github. 2017. First20hours/google-10000-english. Retrieved September 23, 2017, from https://github.com/first20hours/google-10000-english/blob/master/google-10000-english.txt.
[34]
Jalal Mahmud, Jeffrey Nichols, and Clemens Drews. 2014. Home location identification of Twitter users. ACM Transactions on Intelligent Systems and Technology 5, 3, Article No. 47.
[35]
Clodoveu A. Davis Jr, Gisele L. Pappa, Diogo Rennó Rocha de Oliveira, and Filipe de L. Arcanjo. 2011. Inferring the location of Twitter messages based on user relationships. Transactions in GIS 15, 6, 735--751.
[36]
Ahmed Talal, Khaled Al Kaabi, Di Wang, Ahmad Al-Rubaie, Ahmed Al Dhanhani, John Davies, and Sandra Stincic Clarke. 2016. Event identification and assertion from social media using auto-extendable knowledge base. In Proceedings of the 2016 IEEE World Congress on Computational Intelligence (IEEE-WCCI’16).
[37]
Wikipedia. 2017. Dartford Crossing. Retrieved September 23, 2017, from https://en.wikipedia.org/wiki/Dartford_Crossing.

Cited By

View all
  • (2023)Traffic event detection as a slot filling problemEngineering Applications of Artificial Intelligence10.1016/j.engappai.2023.106202123(106202)Online publication date: Aug-2023
  • (2023)Accident event detection from Facebook posts written in Bengali and Banglish languagesInternational Journal of Communication Systems10.1002/dac.567138:1Online publication date: 26-Dec-2023
  • (2022)Deep Journalism and DeepJournal V1.0: A Data-Driven Deep Learning Approach to Discover Parameters for TransportationSustainability10.3390/su1409571114:9(5711)Online publication date: 9-May-2022
  • Show More Cited By

Index Terms

  1. Real-Time Traffic Event Detection From Social Media

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Internet Technology
    ACM Transactions on Internet Technology  Volume 18, Issue 1
    Special Issue on Connected Communities
    February 2018
    250 pages
    ISSN:1533-5399
    EISSN:1557-6051
    DOI:10.1145/3155100
    • Editor:
    • Munindar P. Singh
    Issue’s Table of Contents
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 04 November 2017
    Accepted: 01 July 2017
    Revised: 01 June 2017
    Received: 01 June 2016
    Published in TOIT Volume 18, Issue 1

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Traffic alert system
    2. incremental learning
    3. latent Dirichlet allocation (LDA)
    4. text mining
    5. tweet mining

    Qualifiers

    • Research-article
    • Research
    • Refereed

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)25
    • Downloads (Last 6 weeks)2
    Reflects downloads up to 25 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)Traffic event detection as a slot filling problemEngineering Applications of Artificial Intelligence10.1016/j.engappai.2023.106202123(106202)Online publication date: Aug-2023
    • (2023)Accident event detection from Facebook posts written in Bengali and Banglish languagesInternational Journal of Communication Systems10.1002/dac.567138:1Online publication date: 26-Dec-2023
    • (2022)Deep Journalism and DeepJournal V1.0: A Data-Driven Deep Learning Approach to Discover Parameters for TransportationSustainability10.3390/su1409571114:9(5711)Online publication date: 9-May-2022
    • (2022)Socially Enhanced Situation Awareness from Microblogs Using Artificial Intelligence: A SurveyACM Computing Surveys10.1145/352449855:4(1-38)Online publication date: 21-Nov-2022
    • (2022)A multi-scale semantic attention representation for multi-label image recognition with graph networksNeurocomputing10.1016/j.neucom.2022.03.057491:C(14-23)Online publication date: 28-Jun-2022
    • (2022)CS-AFNeurocomputing10.1016/j.neucom.2022.03.042491:C(206-216)Online publication date: 28-Jun-2022
    • (2022)Meta hyperbolic networks for zero-shot learningNeurocomputing10.1016/j.neucom.2022.03.040491:C(57-66)Online publication date: 28-Jun-2022
    • (2022)Elastic gradient boosting decision tree with adaptive iterations for concept drift adaptationNeurocomputing10.1016/j.neucom.2022.03.038491:C(288-304)Online publication date: 28-Jun-2022
    • (2022)Multi-source temporal knowledge graph embedding for edge computing enabled internet of vehiclesNeurocomputing10.1016/j.neucom.2021.12.036491:C(597-606)Online publication date: 28-Jun-2022
    • (2022)Learning edge-preserved image stitching from multi-scale deep homographyNeurocomputing10.1016/j.neucom.2021.12.032491:C(533-543)Online publication date: 28-Jun-2022
    • Show More Cited By

    View Options

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media