Abstract
The practical applications of social media have raised the bar for real-time event detection all over the globe. It has been deemed useful for extracting important data that enables people across the world to access information within mere seconds of its occurrence. Among the many use cases of the aforementioned applications lies detection of issues regarding the smooth mobility of traffic on the road. Machine learning models that were developed earlier used support vector machines with Bag of Words. However, BoW (Bag of Words) suffers from problems such as the inability to manage semantic relationships between words, limited representation, and high dimensional representation. Our model developed for a similar cause fetches the required data from Twitter to make the target authorities aware of the issues like potholes, accidents, and high traffic density (congestion) of the Chandigarh tri-city area. The data collected then undergoes multiple stages of pre-processing. After that, multiple word embedding models come into play to build semantic relationships between the words and intra software jargon. The resultant is then processed through the several recently introduced deep learning based natural language processing. The current study aims to perform a comparative evaluation of the several advanced state-of-the-art classification models at the target traffic event detection activity. The developed models make a fact-based prediction to make multi-class classification into the aforementioned categories. From the comparative evaluation, it has been observed that the proposed language processing model (RoBERTa based) pipeline outperforms the existing approaches and is 97% accurate at classifying the real-time tweets. Moreover, the proposed pipeline achieves 96% recall at segregating the traffic events efficiently.










Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Data availability statement
The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request.
References
Sarrab M, Pulparambil S, Awadalla M (2020) Development of an iot based real-time traffic monitoring system for city governance. Glob Transit 2:230–245
Zhu F, Lv Y, Chen Y, Wang X, Xiong G, Wang F-Y (2019) Parallel transportation systems: toward iot-enabled smart urban traffic control and management. IEEE Trans Intell Transp Syst 21(10):4063–4071
Singh R, Sharma R, Akram SV, Gehlot A, Buddhi D, Malik PK, Arya R (2021) Highway 4.0: Digitalization of highways for vulnerable road safety development with intelligent iot sensors and machine learning. Safety Sci 143:105407
Silva BN, Khan M, Han K (2018) Towards sustainable smart cities: a review of trends, architectures, components, and open challenges in smart cities. Sustain Cities Soc 38:697–713
Afrin T, Yodo N (2020) A survey of road traffic congestion measures towards a sustainable and resilient transportation system. Sustainability 12(11):4660
D‘Andrea E, Ducange P, Lazzerini B, Marcelloni F (2015) Real-time detection of traffic from twitter stream analysis. IEEE Trans Intell Transp Syst 16(4):2269–2283
Ilieva RT, McPhearson T (2018) Social-media data for urban sustainability. Nat Sustain 1(10):553–565
Hysa B, Zdonek I, Karasek A (2022) Social media in sustainable tourism recovery. Sustainability 14(2):760
Dixon S (2022) Countries with most twitter users 2022. https://www.statista.com/statistics/242606/number-of-active-twitter-users-in-selected-countries. Accessed: (April 3, 2023)
Duan HK, Vasarhelyi MA, Codesso M, Alzamil Z (2023) Enhancing the government accounting information systems using social media information: An application of text mining and machine learning. Int J Account Inf Syst 48:100600
Savastano M, Suciu M-C, Gorelova I, Stativă G-A (2023) How smart is mobility in smart cities? an analysis of citizens‘ value perceptions through ict applications. Cities 132:104071
Agarwal A, Toshniwal D, Bedi J, (2019) Can twitter help to predict outcome of, Indian general election: A deep learning based study. Joint Eur Conf Mach Learn Knowl Discov Databases. Springer 2019:38–53
Bedi J, Toshniwal D (2022) Citenergy: a bert based model to analyse citizens‘ energy-tweets. Sustain Cities Soc 80:103706
Mohit B (2014) Named entity recognition. In: Natural language processing of semitic languages. Springer, pp 221–245
Voutilainen A (2003) Part-of-speech tagging. In: The Oxford handbook of computational linguistics, pp 219–232
Torregrossa F, Allesiardo R, Claveau V, Kooli N, Gravier G (2021) A survey on training and evaluation of word embeddings. Int J Data Sci Anal 11:85–103
Devlin J, Chang M-W, Lee K, Toutanova K (2018) Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint: arXiv:1810.04805
Yang Z, Dai Z, Yang Y, Carbonell J, Salakhutdinov RR (2019) Le QV, Xlnet: generalized autoregressive pretraining for language understanding. Adv Neural Inf Process Syst, 32
Liu Y, Ott M, Goyal N, Du J, Joshi M, Chen D, Levy O, Lewis M, Zettlemoyer L, Stoyanov V (2019) Roberta: a robustly optimized bert pretraining approach. arXiv preprint: arXiv:1907.11692
Hu D, Wu J, Tian K, Liao L, Xu M, Du Y (2017) Urban air quality, meteorology and traffic linkages: evidence from a sixteen-day particulate matter pollution event in december 2015, Beijing. J Environ Sci 59:30–38
Giuliano G, Lu Y (2021) Analyzing traffic impacts of planned major events. Transp Res Record 2675(8):432–442
da Silva Barboza F, Stumpf L, Pauletto EA, de Lima CLR, Pinto LFS, Jardim TM, Pimentel JP, Albert RP, Vivan GA (2021) Impact of machine traffic events on the physical quality of a minesoil after topographic reconstruction. Soil Tillage Res 210:104981
Ribeiro Jr. SS, Davis Jr. CA, Oliveira DRR, Meira Jr. W, Gonçalves TS, Pappa GL (2012) Traffic observatory: a system to detect and locate traffic events and conditions using twitter. In: Proceedings of the 5th ACM SIGSPATIAL international workshop on location-based social networks, pp 5–11
Gu Y, Qian ZS, Chen F (2016) From twitter to detector: real-time traffic incident detection using social media data. Transp Res Part C Emerg Tech 67:321–342
Dabiri S, Heaslip K (2019) Developing a twitter-based traffic event detection model using deep learning architectures. Exp Syst Appl 118:425–439
Albuquerque FC, Casanova MA, Lopes H, Redlich LR, de Macedo JAF, Lemos M, de Carvalho MTM, Renso C (2016) A methodology for traffic-related twitter messages interpretation. Comput Ind 78:57–69
Essien A, Petrounias I, Sampaio P, Sampaio S (2021) A deep-learning model for urban traffic flow prediction with traffic events mined from twitter. World Wide Web 24(4):1345–1368
Alomari E, Katib I, Mehmood R (2020) Iktishaf: a big data road-traffic event detection tool using twitter and spark machine learning. Mobile Netw Appl, pp 1–16
Agarwal S, Mittal N, Sureka A (2018) Potholes and bad road conditions: mining twitter to extract information on killer roads. In: Proceedings of the ACM India joint international conference on data science and management of data, pp 67–77
Chaturvedi N, Toshniwal D, Parida M (2019) Twitter to transport: geo-spatial sentiment analysis of traffic tweets to discover people‘s feelings for urban transportation issues. J Eastern Asia Soc Transp Stud 13:210–220
Suat-Rojas N, Gutierrez-Osorio C, Pedraza C (2022) Extraction and analysis of social networks data to detect traffic accidents. Information 13(1):26
Yao W, Qian S (2021) From twitter to traffic predictor: next-day morning traffic prediction using social media data. Transp Res Part C Emerg Technol 124:102938
Milusheva S, Marty R, Bedoya G, Williams S, Resor E, Legovini A (2021) Applying machine learning and geolocation techniques to social media data (twitter) to develop a resource for urban planning. PloS One 16(2):e0244317
Azhar A, Rubab S, Khan MM, Bangash YA, Alshehri MD, Illahi F, Bashir AK (2022) Detection and prediction of traffic accidents using deep learning techniques. Cluster Comput, pp 1–17
Deb S, Chanda AK (2022) Comparative analysis of contextual and context-free embeddings in disaster prediction from twitter data. Mach Learn Appl, p 100253
Sqlalchemy MB (2012) In: Brown A, Wilson G (eds) The architecture of open source applications volume II: structure, scale, and a few more fearless hacks. aosabook.org. http://aosabook.org/en/sqlalchemy.html
Singh T, Kumari M (2016) Role of text pre-processing in twitter sentiment analysis. Proc Comput Sci 89:549–554
Symeonidis S, Effrosynidis D, Arampatzis A (2018) A comparative evaluation of pre-processing techniques and their interactions for twitter sentiment analysis. Exp Syst Appl 110:298–310
Tarwani KM, Edem S (2017) Survey on recurrent neural network in natural language processing. Int J Eng Trends Technol 48:301–304
Mikolov T, Chen K, Corrado G, Dean J (2013) Efficient estimation of word representations in vector space. arXiv preprint: arXiv:1301.3781
Pennington J, Socher R, Manning CD (2014) Glove: global vectors for word representation. In: Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), pp 1532–1543
Bojanowski P, Grave E, Joulin A, Mikolov T (2017) Enriching word vectors with subword information. Trans Assoc Comput Linguist 5:135–146
Aggarwal CC et al (2018) Neural networks and deep learning. Springer 10:978–3
Bengio Y, Goodfellow I, Courville A (2017) Deep learning, vol 1. MIT press Cambridge, MA, USA
Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735–1780
Chung J, Gulcehre C, Cho K, Bengio Y (2014) Empirical evaluation of gated recurrent neural networks on sequence modeling. arXiv preprint: arXiv:1412.3555
Xu M, Zhang X, Guo L (2019) Jointly detecting and extracting social events from twitter using gated bilstm-crf. IEEE Access 7:148462–148471
Prasad R, Udeme AU, Misra S, Bisallah H (2023) Identification and classification of transportation disaster tweets using improved bidirectional encoder representations from transformers. Int J Inf Manage Data Insights 3(1):100154
Tan P-N, Steinbach M, Kumar V (2016) Introduction to data mining. Pearson Education India
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The author(s) declare(s) that there is no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Babbar, S., Bedi, J. Real-time traffic, accident, and potholes detection by deep learning techniques: a modern approach for traffic management. Neural Comput & Applic 35, 19465–19479 (2023). https://doi.org/10.1007/s00521-023-08767-8
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-023-08767-8