ABSTRACT
Mobile crowdsensing can provide mobility researchers with fine grained spatio-temporal location data. But crowdsourcing impacts data quality both due to device and OS heterogeneity, and to annotation errors. Additionally, it is often necessary to deal with multimodality, i.e. participants using different travel modes often in the same trip. In this paper, we address how to draw value from a crowdsensed dataset for characterising mobility demand through origin-destination (OD) matrices, highlighting challenges and providing some solutions.
First, we identify typical errors in heterogeneous location data, propose and compare methods to automatically improve data quality. Then, we devise a method to detect among 5 transport modes (walk, car, bus, metro, bike) offline a posteriori. We segment trips on stopped periods and propose a random forest model to detect transportation mode per segment using only location data. Our results show that with adequate pre-processing and robust features, an RF classifier is able to achieve accuracy and precision of 85% in trip segments. This is similar to the literature, but our work uses a very heterogeneous crowdsourced trajectory dataset when compared to the others. Finally, we quantify the impact of the model on mulit-modal OD matrices and whole trip characterisation. We can correctly identify used transportation modes accurately, but the precision is impaired by the high likelihood of at least one false positive in the whole trip.
- Filip Biljecki, Hugo Ledoux, and Peter Van Oosterom. 2013. Transportation modebased segmentation and classification of movement trajectories. International Journal of Geographical Information Science 27, 2 (2013), 385--407. Google ScholarDigital Library
- N. Caceres, J.P. Wideberg, and F.G. Benitez. 2007. Deriving origin-destination data from a mobile phone network. IET Intelligent Transport Systems 1, 1 (2007), 15.Google ScholarCross Ref
- Francesco Calabrese, Giusy Di Lorenzo, Liang Liu, and Carlo Ratti. 2011. Estimating origin-destination flows using mobile phone location data. IEEE Pervasive Computing 10, 4 (4 2011), 36--44. Google ScholarDigital Library
- Giuseppe Cardone, Luca Foschini, Paolo Bellavista, Antonio Corradi, Cristian Borcea, Manoop Talasila, and Reza Curtmola. 2013. Fostering participaction in smart cities: a geo-social crowdsensing platform. IEEE Communications Magazine 51, 6 (6 2013), 112--119.Google ScholarCross Ref
- Anke Huss, Johan Beekhuizen, Hans Kromhout, and Roel Vermeulen. 2014. Using GPS-derived speed patterns for recognition of transport modes in adults. International journal of health geographics 13, 1 (2014), 40.Google Scholar
- Nancy McGuckin and Yukiko Nakamoto. 2004. Trips, Chains and Tours-Using an Operational Definition. In National Household Travel Survey Conference.Google Scholar
- Joao G. P. Rodrigues, Ana Aguiar, and Joao Barros. 2014. SenseMyCity: Crowdsourcing an Urban Sensor. arXiv preprint arXiv:1412.2070 (12 2014), 1--10. http://arxiv.org/abs/1412.2070Google Scholar
- Joao G. P. Rodrigues, Ana Aguiar, and Cristina Queiros. 2016. Opportunistic mobile crowdsensing for gathering mobility information: Lessons learned. In 2016 IEEE 19th International Conference on Intelligent Transportation Systems (ITSC). IEEE, 1654--1660.Google ScholarCross Ref
- Luis Sanchez, Luis Muñoz, Jose Antonio Galache, Pablo Sotres, Juan R. Santana, Veronica Gutierrez, Rajiv Ramdhany, Alex Gluhak, Srdjan Krco, Evangelos Theodoridis, and Dennis Pfisterer. 2014. SmartSantander: IoT experimentation over a smart city testbed. Computer Networks 61 (3 2014), 217--238.Google Scholar
- Nadine Schuessler and Kay Axhausen. 2009. Processing raw data from global positioning systems without additional information. Transportation Research Record: Journal of the Transportation Research Board 2105 (2009), 28--36.Google ScholarCross Ref
- Leon Stenneth, Ouri Wolfson, Philip S Yu, and Bo Xu. 2011. Transportation mode detection using mobile phones and GIS information. In Proceedings of the 19th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems. ACM, 54--63. Google ScholarDigital Library
- Lijuan Zhang, Sagi Dalyot, Daniel Eggert, and Monika Sester. 2011. Multi-stage approach to travel-mode segmentation and classification of gps traces. In Proceedings of the ISPRS Guilin 2011 Workshop on International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, Guilin, China, Vol. 2021. 8793.Google Scholar
- Yu Zheng, Like Liu, Longhao Wang, and Xing Xie. 2008. Learning transportation mode from raw gps data for geographic applications on the web. In Proceedings of the 17th international conference on World Wide Web. ACM, 247--256. Google ScholarDigital Library
Index Terms
- Impact of Crowdsourced Data Quality on Travel Pattern Estimation
Recommendations
A Crowd Sensing Approach to Video Classification of Traffic Accident Hotspots
Machine Learning and Data Mining in Pattern RecognitionAbstractDespite various initiatives over the recent years, the number of traffic accidents has been steadily increasing and has reached over 1.2 million fatalities per year world wide. Recent research has highlighted the positive effects that come from ...
Collecting Quality Data for Database Mining
AI '01: Proceedings of the 14th Australian Joint Conference on Artificial Intelligence: Advances in Artificial IntelligenceData collecting is necessary to some organizations such as nuclear power plants and earthquake bureaus, which have very small databases. Traditional data collecting is to obtain necessary data from internal and external data-sources and join all data ...
Real-time passenger location estimation using CDRs and train objects generated from crowdsourced timetables
UbiComp/ISWC'15 Adjunct: Adjunct Proceedings of the 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing and Proceedings of the 2015 ACM International Symposium on Wearable ComputersThis paper shows that cell phone Call Detail Records (CDRs) and train objects in GIS, generated from crowdsourced timetable information, can be used to estimate the train on which a specific passenger is riding. Passenger train ride likelihood ...
Comments