Real-Time Traffic Event Detection From Social Media

Authors:

Ahmad Al-Rubaie,

Sandra Stinčić Clarke,

John Davies

ACM Transactions on Internet Technology (TOIT), Volume 18, Issue 1

Article No.: 9, Pages 1 - 23

https://doi.org/10.1145/3122982

Published: 04 November 2017

Abstract

Smart communities are composed of groups, organizations, and individuals who share information and make use of that shared information for better decision making. Shared information can come from many sources, particularly, but not exclusively, from sensors and social media. Social media has become an important source of near-instantaneous user-generated information that can be shared and analyzed to support better decision making. One domain where social media data can add value is transportation and traffic management. This article looks at the exploitation of Twitter data in the traffic reporting domain. A key challenge is how to identify relevant information from a huge amount of user-generated data and then analyze the relevant data for automatic geocoded incident detection. The article proposes an instant traffic alert and warning system based on a novel latent Dirichlet allocation (LDA) approach (“tweet-LDA”). The system is evaluated and shown to perform better than related approaches.
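
As a rough illustration of the kind of topic modelling the abstract refers to, the sketch below fits standard LDA with collapsed Gibbs sampling to a handful of invented tweets. It is not the article's tweet-LDA, whose details are not given here; the function names (`tokenize`, `fit_lda`), the hyperparameter values, and the sample tweets are all assumptions made for illustration.

```python
import random


def tokenize(text):
    """Lowercase the text and keep alphabetic tokens longer than two characters."""
    words = "".join(c if c.isalpha() else " " for c in text.lower()).split()
    return [w for w in words if len(w) > 2]


def fit_lda(docs, num_topics=2, alpha=0.1, beta=0.01, iters=200, seed=0):
    """Standard LDA via collapsed Gibbs sampling (NOT the article's tweet-LDA).

    docs is a list of token lists. Returns the per-document topic mixtures,
    the topic-word count table, and the vocabulary.
    """
    rng = random.Random(seed)
    vocab = sorted({w for doc in docs for w in doc})
    word_id = {w: i for i, w in enumerate(vocab)}
    vocab_size = len(vocab)

    doc_topic = [[0] * num_topics for _ in docs]                  # tokens per (doc, topic)
    topic_word = [[0] * vocab_size for _ in range(num_topics)]    # tokens per (topic, word)
    topic_total = [0] * num_topics                                # tokens per topic
    assignments = []

    # Start from a random topic assignment for every token.
    for d, doc in enumerate(docs):
        z_d = []
        for w in doc:
            k = rng.randrange(num_topics)
            z_d.append(k)
            doc_topic[d][k] += 1
            topic_word[k][word_id[w]] += 1
            topic_total[k] += 1
        assignments.append(z_d)

    # Resample each token's topic conditioned on all other assignments.
    for _ in range(iters):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                v = word_id[w]
                k = assignments[d][i]
                doc_topic[d][k] -= 1
                topic_word[k][v] -= 1
                topic_total[k] -= 1
                weights = [
                    (doc_topic[d][t] + alpha)
                    * (topic_word[t][v] + beta)
                    / (topic_total[t] + vocab_size * beta)
                    for t in range(num_topics)
                ]
                k = rng.choices(range(num_topics), weights=weights)[0]
                assignments[d][i] = k
                doc_topic[d][k] += 1
                topic_word[k][v] += 1
                topic_total[k] += 1

    # Smoothed topic mixture for each document; each row sums to 1.
    theta = [
        [(c + alpha) / (len(doc) + num_topics * alpha) for c in counts]
        for doc, counts in zip(docs, doc_topic)
    ]
    return theta, topic_word, vocab


# Invented example tweets (hypothetical data, not taken from the article).
tweets = [
    "Accident on M25 near junction 10, traffic queuing",
    "Long queues on the M25 after an accident, avoid junction 10",
    "Heavy traffic and delays on A2 due to accident",
    "Great coffee and cake at the new cafe this morning",
    "Loved the cake at that cafe, best coffee in town",
    "Morning coffee with friends at the cafe",
]
docs = [tokenize(t) for t in tweets]
theta, topic_word, vocab = fit_lda(docs)
for tweet, mix in zip(tweets, theta):
    print([round(p, 2) for p in mix], tweet)
```

Each printed row is one tweet's estimated topic mixture. In a filtering pipeline of the sort the abstract describes, a mixture dominated by a traffic-like topic could be used to flag a tweet as relevant before any geocoding step, although how the article itself does this is not stated in the abstract.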

Index Terms

Real-Time Traffic Event Detection From Social Media
1. Information systems
  1. Information systems applications
    1. Data mining
      1. Data stream mining

Published In

ACM Transactions on Internet Technology Volume 18, Issue 1

Special Issue on Connected Communities

February 2018

250 pages

ISSN:1533-5399

EISSN:1557-6051

DOI:10.1145/3155100

Editor:
Munindar P. Singh
Department of Computer Science, North Carolina State University

Copyright © 2017 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 04 November 2017

Accepted: 01 July 2017

Revised: 01 June 2017

Received: 01 June 2016

Published in TOIT Volume 18, Issue 1

Qualifiers

Research-article
Research
Refereed
