skip to main content
10.1145/2676440.2676444acmconferencesArticle/Chapter ViewAbstractPublication PagesgisConference Proceedingsconference-collections
research-article

Explorative public transport flow analysis from uncertain social media data

Published:04 November 2014Publication History

ABSTRACT

In this paper, we propose a framework to detect human mobility transportation hubs and infer public transport flows from unstructured georeferenced social media data using semantic topic modeling and spatial clustering techniques. An infrastructure for receiving and storing large sets of social media data has been developed together with an ad hoc processing framework in order to consider the high uncertainty of our retrieved data. Given the detected and extracted social media signals indicating human mobility, we compared the results with the public transport network from OpenStreetMap and classified observed mobility patterns for an exemplary case study. To analyze collected datasets a web based visualization tool has been setup.

References

  1. Tapscott, D. 1996. The digital economy: Promise and peril in the age of networked intelligence. Educom Review 31.3. 52--57.Google ScholarGoogle Scholar
  2. o'Reilly, T. 2009. What is web 2.0.Google ScholarGoogle Scholar
  3. Resch, B. 2013. People as sensors and collective sensing-contextual observations complementing geo-sensor network measurements. Progress in Location-Based Services. (2013), 391--406.Google ScholarGoogle Scholar
  4. Goodchild, M. 2007. Citizens as sensors: the world of volunteered geography. Geo. Journal. 69, 4 (2007), 211--221.Google ScholarGoogle Scholar
  5. Boyd, D. M. and Ellison, N. B. 2007. Social Network Sites: Definition, History, and Scholarship. Journal of Computer-Mediated Communication. 13, 1 (Oct. 2007), 210--230.Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Zheng, Y. 2011. Location-based social networks: Users. Computing with Spatial Trajectories. Springer New York. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Steiger, E., Lauer, J., Ellersiek, T., Zipf, A. 2014. Towards a framework for automatic geographic feature extraction from Twitter. Eighth International Conference on Geographic Information Science. Vienna.Google ScholarGoogle Scholar
  8. Cho, E., Myers, S. A. and Leskovec, J. 2011. Friendship and mobility. Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining - KDD '11. ACM, New York, NY, Aug. 2011, 1082--1090. DOI= http://doi.acm.org/10.1145/2020408.2020579. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Noulas, A., Scellato, S., Mascolo, C. and Pontil, M. 2011. Exploiting Semantic Annotations for Clustering Geographic Areas and Users in Location-based Social Networks. The Social Mobile Web 11 (2011).Google ScholarGoogle Scholar
  10. Wakamiya, S., Lee, R. and Sumiya, K. 2011. Crowd-based Urban Characterization: Extracting Crowd Behavioral Patterns in Urban Areas from Twitter. Proceedings of the 3rd ACM SIGSPATIAL International Workshop on Location-Based Social Networks. ACM, New York, NY, 2011, 77--84. DOI= http://doi.acm.org/10.1145/2063212.2063225. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Wanichayapong, N., Pruthipunyaskul, W., Pattaraatikom, W. and Chaovalit, P. 2011. Social-based Traffic Information Extraction and Classification. ITS Telecommunications (ITST), 2011 11th International Conference on. IEEE (2011), 107--112.Google ScholarGoogle Scholar
  12. Sakaki, T., Okazaki, M. and Matsuo, Y. 2010. Earthquake shakes Twitter users: real-time event detection by social sensors. Proceedings of the 19th international conference on World wide web. ACM, New York, NY, 2010, 851--860. DOI= http://doi.acm.org/10.1145/1772690.1772777. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Gerais, M., Oliveira, D. R. R., Gonçalves, T. S., Jr, C. A. D. and Pappa, G. L. 2012. Traffic Observatory: a system to detect and locate traffic events and conditions using Twitter. Proceedings of the 5th International Workshop on Location-Based Social Networks. ACM, New York, NY, 2012, 5--11. DOI= http://doi.acm.org/10.1145/2442796.2442800. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Kosala, R. and Adi, E. 2012. Harvesting Real Time Traffic Information from Twitter. Procedia Engineering. 50, Icasce (Jan. 2012), 1--11.Google ScholarGoogle Scholar
  15. Kling, F., Kildare, C. and Pozdnoukhov, A. 2012. When a City Tells a Story: Urban Topic Analysis. Proceedings of the 20th International Conference on Advances in Geographic Information Systems. ACM, New York, NY, 2012, 482--485. DOI= http://doi.acm.org/10.1145/2424321.2424395. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Ferrari, L., Rosi, A., Mamei, M. and Zambonelli, F. 2011. Extracting urban patterns from location-based social networks. Proceedings of the 3rd ACM SIGSPATIAL International Workshop on Location-Based Social Networks - LBSN '11. ACM, New York, NY, 2011, 9--16. DOI= http://doi.acm.org/10.1145/2063212.2063226. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Cranshaw, J., Schwartz, R., Hong, J. and Sadeh, N. 2012. The Livehoods Project: Utilizing Social Media to Understand the Dynamics of a City. ICWSM (2012).Google ScholarGoogle Scholar
  18. Hasan, S., Zhan, X. and Ukkusuri, S. V. 2013. Understanding urban human activity and mobility patterns using large-scale location-based data from online social media. Proceedings of the 2nd ACM SIGKDD International Workshop on Urban Computing - UrbComp '13. ACM, New York, NY, Aug. 2013. DOI= http://doi.acm.org/10.1145/2505821.2505823. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Coffey, C. and Pozdnoukhov, A. 2013. Temporal decomposition and semantic enrichment of mobility flows. Proceedings of the 6th ACM SIGSPATIAL International Workshop on Location-Based Social Networks - LBSN '13. ACM, New York, NY, Nov. 2013, 34--43. DOI= http://doi.acm.org/10.1145/2536689.2536806. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. https://dev.twitter.com/docs/api/streaming, https://www.flickr.com/services/api/, https://api.instagram.com/v1/media/popular?client_id=C LIENT-IDGoogle ScholarGoogle Scholar
  21. Lewis, D. D., Yang, Y., Rose, T. G. and Li, F. 2004. RCV1: A New Benchmark Collection for Text Categorization Research. The Journal of Machine Learning Research. 5, (Dec. 2004), 361--397. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Blei, D., Ng, A. and Jordan, M. 2003. Latent dirichlet allocation. Journal of machine Learning research. (2003), 993--1022. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Griffiths, T. and Steyvers, M. 2004. Finding scientific topics. Proceedings of the National academy of Sciences of the United States of America (2004), 5228--5235.Google ScholarGoogle Scholar
  24. Ester, M., Kriegel, H. P., Sander, J., and Xu, X. 1996. A density-based algorithm for discovering clusters in large spatial databases with noise. In Kdd(Vol. 96, pp. 226--231).Google ScholarGoogle Scholar
  25. Zandbergen, P. a. and Barbeau, S. J. 2011. Positional Accuracy of Assisted GPS Data from High-Sensitivity GPS-enabled Mobile Phones. Journal of Navigation. 64, 03 (Jun. 2011), 381--399.Google ScholarGoogle ScholarCross RefCross Ref
  26. Mierswa, I., Wurst, M. and Klinkenberg, R. 2006. Yale: Rapid prototyping for complex data mining tasks. Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, New York, NY, 2006, 935--940. DOI= http://doi.acm.org/10.1145/1150402.1150531. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Ding, A., Zhao, X. and Jiao, L. 2002. Traffic flow time series prediction based on statistics learning theory. Proceedings. The IEEE 5th International Conference on Intelligent Transportation Systems (2002), 727--730.Google ScholarGoogle ScholarCross RefCross Ref
  28. Nagel, K. and Schreckenberg, M. 1992. A cellular automaton model for freeway traffic. Journal de physique I. (1992).Google ScholarGoogle Scholar
  29. Birant, D. and Kut, A. 2007. ST-DBSCAN: An algorithm for clustering spatial--temporal data. Data & Knowledge Engineering. (2007). Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Hengl, T. 2006. Finding the right pixel size. Computers & Geosciences. 32, 9 (Nov. 2006), 1283--1298. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Explorative public transport flow analysis from uncertain social media data

            Recommendations

            Comments

            Login options

            Check if you have access through your login credentials or your institution to get full access on this article.

            Sign in
            • Published in

              cover image ACM Conferences
              GeoCrowd '14: Proceedings of the 3rd ACM SIGSPATIAL International Workshop on Crowdsourced and Volunteered Geographic Information
              November 2014
              44 pages
              ISBN:9781450331333
              DOI:10.1145/2676440

              Copyright © 2014 ACM

              Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

              Publisher

              Association for Computing Machinery

              New York, NY, United States

              Publication History

              • Published: 4 November 2014

              Permissions

              Request permissions about this article.

              Request Permissions

              Check for updates

              Qualifiers

              • research-article

              Acceptance Rates

              GeoCrowd '14 Paper Acceptance Rate5of10submissions,50%Overall Acceptance Rate17of30submissions,57%

            PDF Format

            View or Download as a PDF file.

            PDF

            eReader

            View online with eReader.

            eReader