Skip to main content
Log in

Twitter-based traffic delay detection based on topic propagation analysis using railway network topology

  • Original Article
  • Published:
Personal and Ubiquitous Computing Aims and scope Submit manuscript

Abstract

Twitter has become one of the most popular social media platforms, evidently stirred by a very popular trend of event detection with many applications, including delay detection and traffic congestion on the public transport network. In this paper, we propose a Twitter-based railway delay detection method based on topic propagation analysis of geo-tagged tweets between railway stations. In particular, we aim to discover delay events and to predict train delays due to traffic accidents by analyzing topic propagation using railway network topology of real space. To realize this, first, we construct the topology of the railway network (the physical space) as a graph in which nodes are railway stations and edges are represented as routes between them. Then, we extract the topology of the social network that is mapped on the railway network, based on topic propagation analysis of accident delays between stations and by analyzing geo-tagged tweets of each station with a neural network. This allows us to observe the influence of delays on railway stations even if there are a few tweets on them and to predict stations affected by delays with the tweets which contain indirect topics about delays such as “crowded!” and “raining!”. Overall, this paper proposes the method which enables us to analyze the topic propagation of geo-tagged tweets in order to predict accident delays by considering the railway topology of real space. In addition, we also evaluate the performance of the proposed method on datasets derived from Twitter with the actual delay information from 488 stations of 62 routes in Tokyo area in Japan.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

Notes

  1. The MeCab Japanese morphological analyzer: https://taku910.github.io/mecab/

References

  1. Twitter: http://twitter.com/

  2. Foursquare: https://foursquare.com/

  3. Tumblr: https://www.tumblr.com/

  4. Jorudan: https://world.jorudan.co.jp/mln/en/?sub_lang=ja

  5. Tokyo Metro Subway Map: http://www.tokyometro.jp/en/subwaymap/pdf/rosen_en_1702.pdf

  6. Twitter Streaming API: https://dev.twitter.com/streaming/overview

  7. Google Places API v3: https://developers.google.com/place

  8. World Urbanization Prospects (2014) The 2014 revision population database, vol ST/ESA/SE.A/352. United Nations

  9. Ardon S, Bagchi A, Mahanti A, Ruhela A, Seth A, Tripathy RM, Triukose S (2013) Spatio-temporal and events based analysis of topic popularity in twitter. In: Proceedings of the 22nd ACM international conference on information & knowledge management, CIKM ’13, pp 219–228. https://doi.org/10.1145/2505515.2505525. http://doi.acm.org/10.1145/2505515.2505525

  10. Auxilia R, Gandhi M (2016) Earthquake reporting system development by tweet analysis with approach earthquake alarm systems. European Journal of Applied Sciences 8(3):176–180. https://doi.org/10.5829/idosi.ejas.2016.8.3.23003

    Google Scholar 

  11. Carvalho J, Marques M, Costeira JP (2017) Understanding people flow in transportation hubs. IEEE Trans Intell Transp Syst 19(10):1–10

    Google Scholar 

  12. Daly EM, Lecue F, Bicer V (2013) Westland row why so slow?: Fusing social media and linked data sources for understanding real-time traffic conditions. In: Proceedings of the 2013 international conference on intelligent user interfaces, IUI ’13, pp 203–212. https://doi.org/10.1145/2449396.2449423

  13. D’Andrea E, Ducange P, Lazzerini B, Marcelloni F (2015) Real-time detection of traffic from twitter stream analysis. IEEE Trans Intell Transp Syst 16(4):2269–2283. https://doi.org/10.1109/TITS.2015.2404431

    Article  Google Scholar 

  14. Dong G, Yang W, Zhu F, Wang W (2017) Discovering burst patterns of burst topic in twitter. Comput Electr Eng 58(C):551–559. https://doi.org/10.1016/j.compeleceng.2016.06.012

    Article  Google Scholar 

  15. Eleta I, Golbeck J (2014) Multilingual use of twitter: social networks at the language frontier. Comput Hum Behav 41:424–432

    Article  Google Scholar 

  16. Endarnoto SK, Pradipta S, Nugroho AS, Purnama J (2011) Traffic condition information extraction & visualization from social media twitter for android mobile application. In: Proceedings of the international conference on electronics engineering and informatics, ICEEI ’11, pp 1–4. https://doi.org/10.1109/ICEEI.2011.6021743

  17. Goonetilleke O, Sellis T, Zhang X, Sathe S (2014) Twitter analytics: a big data management perspective. ACM SIGKDD Explorations Newsletter 16(1):11–20. https://doi.org/10.1145/2674026.2674029. http://doi.acm.org/10.1145/2674026.2674029

    Article  Google Scholar 

  18. Günnemann N, Pfeffer J (2015) Finding non-redundant multi-word events on twitter. In: Proceedings of the 2015 IEEE/ACM international conference on advances in social networks analysis and mining 2015, ASONAM ’15, pp 520–525. https://doi.org/10.1145/2808797.2809390

  19. Gutierrez C, Figueiras P, Oliveira P, Costa R, Jardim-Goncalves R (2015) Twitter mining for traffic events detection. In: IEEE science and information conference 2015, SAI 2015. https://doi.org/10.1109/SAI.2015.7237170

  20. Itoh M, Yoshinaga N, Toyoda M (2016) Spatio-temporal event visualization from a geo-parsed microblog stream. In: Companion publication of the 21st international conference on intelligent user interfaces, IUI ’16 Companion, pp 58–61. https://doi.org/10.1145/2876456.2879486. http://doi.acm.org/10.1145/2876456.2879486

  21. Kabalan B, Leurent F, Christoforou Z, Dubroca-Voisin M (2017) Framework for centralized and dynamic pedestrian management in railway stations. Transportation Research Procedia 27:712–719. https://doi.org/10.1016/j.trpro.2017.12.091

    Article  Google Scholar 

  22. Kalloubi F, Nfaoui EH, El Beqqali O (2017) Harnessing semantic features for large-scale content-based hashtag recommendations on microblogging platforms. International Journal on Semantic Web & Information Systems 13(1):48–67. https://doi.org/10.4018/IJSWIS.2017010104

    Article  Google Scholar 

  23. Lee R, Sumiya K (2010) Measuring geographical regularities of crowd behaviors for twitter-based geo-social event detection. In: Proceedings of the 2nd ACM SIGSPATIAL international workshop on location based social networks, LBSN ’10. ACM, New York, pp 1–10. https://doi.org/10.1145/1867699.1867701

  24. Lee R, Wakamiya S, Sumiya K (2011) Discovery of unusual regional social activities using geo-tagged microblogs. World Wide Web 14(4):321–349. https://doi.org/10.1007/s11280-011-0120-x

    Article  Google Scholar 

  25. Liu M, Fu K, Lu CT, Chen G, Wang H (2014) A search and summary application for traffic events detection based on twitter data. In: Proceedings of the 22nd ACM SIGSPATIAL international conference on advances in geographic information systems, SIGSPATIAL ’14, pp 549–552. https://doi.org/10.1145/2666310.2666366. http://doi.acm.org/10.1145/2666310.2666366

  26. Mallela D, Ahlers D, Pera MS (2017) Mining twitter features for event summarization and rating. In: Proceedings of the international conference on web intelligence, WI ’17, pp 615–622. https://doi.org/10.1145/3106426.3106487

  27. Morioka M, Kuramochi K, Mishina Y, Akiyama T, Taniguchi N (2015) City management platform using big data from people and traffic flows. Hitachi Review 64(1):53

    Google Scholar 

  28. Nugroho R, Zhao W, Yang J, Paris C, Nepal S (2017) Using time-sensitive interactions to improve topic derivation in twitter. World Wide Web 20(1):61–87. https://doi.org/10.1007/s11280-016-0417-x

    Article  Google Scholar 

  29. Ozkurt C, Camci F (2009) Automatic traffic density estimation and vehicle classification for traffic surveillance systems using neural networks. Mathematical and Computational Application 14(3):187–196. https://doi.org/10.3390/mca14030187

    Article  Google Scholar 

  30. Pla F, Hurtado LF (2016) Language identification of multilingual posts from twitter: a case study. Knowl Inf Syst 51(3):1–25. https://doi.org/10.1007/s10115-016-0997-x

    Google Scholar 

  31. Raghavi KC, Chinnakotla MK, Shrivastava M (2015) “answer ka type kya he?”: Learning to classify questions in code-mixed language. In: Proceedings of the 24th international conference on World Wide Web, WWW ’15 companion. ACM, New York, pp 853–858. https://doi.org/10.1145/2740908.2743006

  32. Ritter A, Mausam, Etzioni O, Clark S (2012) Open domain event extraction from twitter. In: Proceedings of the 18th ACM SIGKDD international conference on knowledge discovery and data mining, KDD ’12, pp 1104–1112. https://doi.org/10.1145/2339530.2339704. http://doi.acm.org/10.1145/2339530.2339704

  33. Sakaki T, Okazaki M, Matsuo Y (2013) Tweet analysis for real-time event detection and earthquake reporting system development. IEEE Trans Knowl Data Eng 25(4):919–931. https://doi.org/10.1109/TKDE.2012.29

    Article  Google Scholar 

  34. Stilo G, Velardi P (2014) Time makes sense: event discovery in twitter using temporal similarity. In: Proceeidngs of the 2014 IEEE/WIC/ACM international joint conferences on Web Intelligence (WI) and intelligent agent technologies (IAT) - Volume 02, WI-IAT ’14, pp 186–193. https://doi.org/10.1109/WI-IAT.2014.97

  35. Sureesha B, Priyadarshini V (2016) Monitoring and analysis of dynamic traffic analyzer using twitter. IEEE Trans Intell Transp Syst 7(4):136–139

    Google Scholar 

  36. Wakamiya S, Lee R, Sumiya K (2011) Crowd-powered tv viewing rates: measuring relevancy between tweets and tv programs. In: International conference on database systems for advanced applications. Springer, pp 390–401

  37. Wakamiya S, Lee R, Sumiya K (2011) Towards better tv viewing rates: Exploiting crowd’s media life logs over twitter for tv rating. In: Proceedings of the 5th international conference on ubiquitous information management and communication, ICUIMC ’11. ACM, New York, pp 39:1–39:10. https://doi.org/10.1145/1968613.1968661

  38. Wang S, Zhang X, Cao J, He L, Stenneth L, Yu PS, Li Z, Huang Z (2017) Computing urban traffic congestions by incorporating sparse gps probe data and social media data. ACM Trans Inf Syst (TOIS) 35 (4):40:1–40:30. https://doi.org/10.1145/3057281

    Google Scholar 

  39. Wang Y, Yasui G, Hosokawa Y, Kawai Y, Akiyama T, Sumiya K (2014) Location-based microblog viewing system synchronized with web pages. In: 2014 IEEE 33rd international symposium on reliable distributed systems workshops (SRDSW). IEEE, pp 70–75. https://doi.org/10.1109/SRDSW.2014.18

  40. Wang Y, Yasui G, Kawai Y, Akiyama T, Sumiya K, Ishikawa Y (2016) Dynamic mapping of dense geo-tweets and web pages based on spatio-temporal analysis. In: Proceedings of the 31st annual ACM symposium on applied computing, SAC ’16, pp 1170–1173. https://doi.org/10.1145/2851613.2851985. http://doi.acm.org/10.1145/2851613.2851985

  41. Yuan Y, Lint HV, Wageningen-Kessels FV, Hoogendoorn S (2014) Network-wide traffic state estimation using loop detector and floating car data. J Intell Transp Syst Technol Plann Oper 18(1):41–50. https://doi.org/10.1080/15472450.2013.773225

    Article  Google Scholar 

  42. Zhao F, Zhu Y, Jin H, Yang LT (2016) A personalized hashtag recommendation approach using lda-based topic model in microblog environment, vol 65, pp 196–206. https://doi.org/10.1016/j.future.2015.10.012

  43. Zheng Y (2015) Methodologies for cross-domain data fusion: an overview. IEEE Transactions on Big Data 1 (1):16–34. https://doi.org/10.1109/TBDATA.2015.2465959

    Article  Google Scholar 

Download references

Funding

This work was partially supported by SCOPE of the Ministry of Internal Affairs and Communications of Japan (#171507010), JSPS KAKENHI Grant Numbers 16H01722, 17K12686, 15K00162, and 17H01822.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yuanyuan Wang.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wang, Y., Siriaraya, P., Kawai, Y. et al. Twitter-based traffic delay detection based on topic propagation analysis using railway network topology. Pers Ubiquit Comput 23, 233–247 (2019). https://doi.org/10.1007/s00779-019-01204-5

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00779-019-01204-5

Keywords

Navigation