ABSTRACT
With the growth of the economy and population, the transportation infrastructure of cities faces various challenges such as lacks of parking spots and traffic jams. One of the alternatives is to take public transport combined with proper utilization of demand-responsive transport such as Taxi or Uber. Therefore, it is valuable to improve the efficiency of these responsive transport. This research evaluates the predicting performance of different prediction models such as GBDT, XGBoost, and Random forest on taxi trip duration. At the same time, new elements were added to the modified data to contribute a higher accuracy rate including the snow status and precipitation data. Moreover, exploratory data analysis and data mining was conducted taking concerns of the Covid-19 effect of the year 2020.
- [1] "Nyc Taxi & Ridehailing Stats Dashboard." https://toddwschneider.com/dashboards/nyc-taxi-ridehailing-uber-lyft-data/.Google Scholar
- [2] Commission, NYC and Limousince. "Covid-19 Impact on the Nyc for-Hire Industry." https://www1.nyc.gov/assets/tlc/downloads/pdf/covid-impact-report.pdf.Google Scholar
- [3] Rogoff, E. G. "The Limitation of New York City Taxicab Medallions: Economic Thoery and Political Reality of the Haas Law." Journal of Transportation Law Logistics & Policy, 1997.Google Scholar
- [4] Natashia, C. and S. R. Velu. "Data Exploratory on Taxi Data in New York City." 4th Global Conference on Computing & Media Technology (2020), 2020.Google Scholar
- [5] Antoniades, C. "Fare and Duration Prediction: A Study of New York City Taxi Rides."Google Scholar
- [6] Fan, Junliang "Evaluation of Svm, Elm and Four Tree-Based Ensemble Models for Predicting Daily Reference Evapotranspiration Using Limited Meteorological Data in Different Climates of China." Agricultural and Forest Meteorology, vol. 263, 2018, pp. 225-241, doi:10.1016/j.agrformet.2018.08.019.Google ScholarCross Ref
- [7] Han, Shuwen "Predicting Biomarkers from Classifier for Liver Metastasis of Colorectal Adenocarcinomas Using Machine Learning Models." Cancer Medicine, vol. 9, no. 18, 2020, pp. 6667-6678, doi:10.1002/cam4.3289.Google ScholarCross Ref
- [8] Chen, Tianqi Xgboost: A Scalable Tree Boosting System. 2016. Kdd'16: Proceedings of the 22nd Acm Sigkdd International Conference on Knowledge Discovery and Data Mining.Google Scholar
- [9] Ke, Jintao "Short-Term Forecasting of Passenger Demand under on-Demand Ride Services: A Spatio-Temporal Deep Learning Approach." Transportation Research Part C-Emerging Technologies, vol. 85, 2017, pp. 591-608, doi:10.1016/j.trc.2017.10.016.Google ScholarCross Ref
- [10] Breiman, L. "Random Forests." Machine Learning, vol. 45, no. 1, 2001, pp. 5-32, doi:10.1023/a:1010933404324Google ScholarDigital Library
- [11] Bennett, C. L. "First-Year Wilkinson Microwave Anisotropy Probe (Wmap) Observations: Preliminary Maps and Basic Results." Astrophysical Journal Supplement Series, vol. 148, no. 1, 2003, pp. 1-27, doi:10.1086/377253.Google ScholarCross Ref
- [12] Akgun, Aykut. "A Comparison of Landslide Susceptibility Maps Produced by Logistic Regression, Multi-Criteria Decision, and Likelihood Ratio Methods: A Case Study at Izmir, Turkey." Landslides, vol. 9, no. 1, 2012, pp. 93-106, doi:10.1007/s10346-011-0283-7.Google ScholarCross Ref
- [13] Austin, Peter C. and Ewout W. Steyerberg. "Graphical Assessment of Internal and External Calibration of Logistic Regression Models by Using Loess Smoothers." Statistics in Medicine, vol. 33, no. 3, 2014, pp. 517-535, doi:10.1002/sim.5941.Google ScholarCross Ref
- [14] Wang, Cuiyan "Immediate Psychological Responses and Associated Factors During the Initial Stage of the 2019 Coronavirus Disease (Covid-19) Epidemic among the General Population in China." International Journal of Environmental Research and Public Health, vol. 17, no. 5, 2020, doi:10.3390/ijerph17051729.Google ScholarCross Ref
- [15] Nicola, Maria "The Socio-Economic Implications of the Coronavirus Pandemic (Covid-19): A Review." International Journal of Surgery, vol. 78, 2020, pp. 185-193, doi:10.1016/j.ijsu.2020.04.018.Google ScholarCross Ref
Index Terms
- Analyze the impact of the epidemic on New York taxis by machine learning algorithms and recommendations for optimal prediction algorithms
Recommendations
Situation assessment and decision making for lane change assistance using ensemble learning methods
Lane change assistance system was developed using ensemble learning methods.Proposed system has potential to prevent lane change crashes and thus reducing injuries and fatalities.Random forest and AdaBoost outperformed Bayes/Decision tree ...
Viability Analysis of Electric Taxis Using New York City Dataset
e-Energy '17: Proceedings of the Eighth International Conference on Future Energy SystemsThis paper examines the viability of electric taxis, namely whether it will be profitable for taxi drivers to adopt electric taxis, in comparison with conventional taxis with internal combustion engines. This paper provides a data analytic investigation ...
Prediction of lane change trajectory of autonomous vehicles based on vehicle-environment data fusion
ICBICC '22: Proceedings of the 2022 International Conference on Big Data, IoT, and Cloud ComputingAbstract. Vehicle lane changing is a current research hotspot in the field of autonomous driving. However, the current research does not consider the environmental data of a large number of vehicles as well as the lane change decision risk and lane ...
Comments