Abstract
In recent years, shared bikes have gradually emerged into public life as a new way to travel and helped solve the last-mile problem of residents’ travel. While this development has brought convenient travel to users, a series of problems exist, a prominent one is the uneven distribution of bikes at each shared bike station. Accurately predicting bike usage in a bike-sharing system can help solve this problem. In this paper, we investigate how to improve the accuracy of predicting the usage of bikes in bike-sharing system. First, considering both geographic location information of shared bike stations and the migration trend of bikes between stations, we design a two-level fuzzy c-means clustering algorithm to cluster shared bicycle stations into groups, which can better capture the connection between shared bicycle stations and improve the clustering accuracy of shared bicycle sites, then, we combine the two-level fuzzy c-means clustering algorithm with the multi-similarity reference model to predict the usage of bikes, which can significantly improve the accuracy of the forecast. To evaluate the performance of our model, we validate our model in the New York Bike-Sharing System. The results shows that our model obtained significantly better results than other models.
Similar content being viewed by others
References
Demaio P (2009) Bike-sharing: history, impacts, models of provision, and future. J Publ Transp 12(4)
Goodwin P (2004) The economic costs of road traffic congestion. Ucl the Rail Frght Group, London
Wang H, Odoni A (2016) Approximating the performance of a “last mile” transportation system. Transp Sci 50(2):659–675
Schuijbroek J, Hampshire R. C, Van Hoeve W. J (2017) Inventory rebalancing and vehicle routing in bike sharing systems. Eur J Oper Res 257(3):992–1004
Li Y, Zheng Y (2019) Citywide bike usage prediction in a bike-sharing system. IEEE Trans Knowl Data Eng 32(6):1079–1091
Ghosh S, Varakantham P, Adulyasak Y, Jaillet P (2017) Dynamic repositioning to reduce lost demand in bike sharing systems. J Artif Intell Res 58:387–430
Hoang M. X, Zheng Y, Singh A. K (2016) FCCF: forecasting citywide crowd flows based on big data. In: Proceedings of the 24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, pp 1–10
Jia W, Tan Y, Liu L, Li J, Zhang H, Zhao K (2019) Hierarchical prediction based on two-level Gaussian mixture model clustering for bike-sharing system. Knowl-Based Syst 178:84–97
Han Y, Zhu L, Cheng Z, Li J, Liu X (2018) Discrete optimal graph clustering. IEEE transactions on cybernetics
Frey B. J, Dueck D (2007) Clustering by passing messages between data points. Science 315(5814):972–976
Zhang J, Zheng Y, Sun J, Qi D (2019) Flow prediction in spatio-temporal networks based on multitask deep learning. IEEE Trans Knowl Data Eng 32(3):468–478
Seyedarabi M H, Shamsi H, Borzabadi E (2011) A Modified Fuzzy C-Means Clustering with Spatial Information for Image Segmentation// International Conference on Information and Computer Applications (ICICA 2011)
Chen J, Yuan W, Cao J et al (2020) Traffic-flow prediction via granular computing and stacked autoencoder. Granular Comput 5:449–459
Friedman J. H. (2001) Greedy function approximation: a gradient boosting machine. Ann Stat:1189–1232
Li Y, Zheng Y, Zhang H, Chen L (2015) Traffic prediction in a bike-sharing system. In: Proceedings of the 23rd SIGSPATIAL International Conference on Advances in Geographic Information Systems, pp 1–10)
Zhang J, Zheng Y, Qi D, Li R, Yi X, Li T (2018) Predicting citywide crowd flows using deep spatio-temporal residual networks. Artif Intell 259:147–166
Campbell A. A, Cherry C. R, Ryerson M. S, Yang X (2016) Factors influencing the choice of shared bicycles and shared electric bikes in Beijing. Transp Res Part C: Emerg Technol 67:399–414
Nannan Z, Chao L (2019) Adaptive online time series prediction based on a novel dynamic fuzzy cognitive map. J Intell Fuzzy Syst 36(6):5291–5303
Liang Y, Ouyang K, Jing L, Ruan S, Liu Y, Zhang J, Zheng Y (2019) Urbanfm: Inferring fine-grained urban flows. In: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 3132–3142
Bargar A, Gupta A, Gupta S, Ma D (2014) Interactive visual analytics for multi-city bikeshare data analysis. In: The 3rd International Workshop on Urban Computing (UrbComp 2014), vol 45, New York
Etienne C, Latifa O (2014) Model-Based Count Series Clustering for Bike Sharing System Usage Mining: A Case Study with the Vélib’ System of Paris. ACM Trans Intell Syst Technol (TIST) 5(3):1–21
Kaltenbrunner A, Meza R, Grivolla J, Codina J, Banchs R (2010) Urban cycles and mobility patterns: Exploring and predicting trends in a bicycle-based public transport system. Pervasive Mob Comput 6(4):455–466
Bezdek J. C, Ehrlich R, Full W (1984) FCM: The fuzzy c-means clustering algorithm. Comput Geosci 10(2-3):191–203
Jia W, Tan Y, Li J (2018) Hierarchical prediction based on two-level affinity propagation clustering for bike-sharing system. IEEE Access 6:45875–45885
Savaresi S. M, Boley D. L (2004) A comparative analysis on the bisecting K-means and the PDDP clustering algorithms. Intell Data Anal 8(4):345–362
Zhang H, Cao L (2014) A spectral clustering based ensemble pruning approach. Neurocomputing 139:289–297
Böttcher A, Wenzel D (2008) The Frobenius norm and the commutator. Linear Algebra Appl 429(8-9):1864-1885
Rasmussen C. E (2003) Gaussian processes in machine learning. In: Summer School on Machine Learning. Springer, Berlin, pp 63–71
Aljuaid T, Sasi S (2017) Proper Imputation Techniques for Missing Values in Data sets. International Conference on Data Science & Engineering. IEEE
Zhang J, Zheng Y, Qi D, Li R, Yi X (2016) DNN-based prediction model for spatio-temporal data. In: Proceedings of the 24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, pp 1–4
Luo C, Tan C, Wang X, Zheng Y (2019) An evolving recurrent interval type-2 intuitionistic fuzzy neural network for online learning and time series prediction. Appl Soft Comput 78:150–163
Zhang M. L, Zhou Z. H (2007) ML-KNN: A lazy learning approach to multi-label learning. Pattern Recogn 40(7):2038–2048
Liu W, Shoji Y (2019) DeepVM: RNN-based vehicle mobility prediction to support intelligent vehicle applications. IEEE Trans Ind Inf 16(6):3997–4006
Yang B, Sun S, Li J et al (2019) Traffic flow prediction using LSTM with feature enhancement. Neurocomputing 332:320–327
Cho K, Van Merri̇ënboer B, Gulcehre C et al (2014) Learning phrase representations using RNN encoder-decoder for statistical machine translation. arXiv:1406.1078
Wu T, Xie K, Xinpin D, Song G (2012) A online boosting approach for traffic flow forecasting under abnormal conditions. In: 2012 9th International Conference on Fuzzy Systems and Knowledge Discovery. IEEE, pp 2555–2559
Acknowledgments
This work is partially supported by the National Natural Science Foundation of China (Nos. U1836216, 61702310, 61772322), the major fundamental research project of Shandong, China(No. ZR2019ZD03), and the Taishan Scholar Project of Shandong, China.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Wang, B., Tan, Y. & Jia, W. TL-FCM: A hierarchical prediction model based on two-level fuzzy c-means clustering for bike-sharing system. Appl Intell 52, 6432–6449 (2022). https://doi.org/10.1007/s10489-021-02186-9
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-021-02186-9