Abstract
Transit agencies require a constant stream of operations performance data to support standard planning, scheduling and operations management activities. Ridership and revenue statistics play a critical role in strategic system design, policy development, and budgeting decisions at all levels of transit management. Many agencies rely on electronic fare collection devices as a primary source for ridership and revenue data. The quality of this data will greatly affect transit-related reporting and decision making. This study proposes a systematic, data-driven approach to process revenue and ridership data pulled off electronic farebox equipment installed on a bus fleet operating in the St. Louis region. Three major farebox data errors are identified and impacts of these data errors are further evaluated and discussed at the system and trip level. Results indicate ridership and revenue may be overestimated by up to 8.05 and 9.95 %, respectively, due to farebox data errors. The results of this development effort offer a range of low-cost error identification and processing techniques that transit staff could easily and quickly implement. Even though the St. Louis Metro Transit data was used for analysis, these proposed approaches can be considered as a general framework and used by other transit agencies.
Similar content being viewed by others
References
Breunig MM, Kriegel HP, Ng RT, Sander J (2000) LOF: identifying density-based local outliers. In: Proceedings of the 2000 ACM Sigmod international conference on management of data, pp 93–104 (Association for Computing Machinery)
Corey J, Lao Y, Wu Y, Wang Y (2012) Detection and correction of inductive loop detector sensitivity errors by using Gaussian Mixture Models. Transp Res Rec J Transp Res Board 2256(2011):120–129
Furth PG (1995) Integrating electronic fareboxes with other on-board equipment (pp. 1–30). http://onlinepubs.trb.org/onlinepubs/archive/studies/idea/finalreports/transit/Transit4_Final_Report.pdf. Accessed 10 Sep 2014
Furth PG (1996) Integration of fareboxes with other. Transp Res Rec J Transp Res Board 1557:21–27. doi:10.3141/1557-04 (Public Transportation 1996: Bus, Rural and Intercity, and Paratransit)
Greater Cleveland Regional Transit Authority (2005) How to Use RTA’s Fareboxes Fares Greater Cleveland Regional Transit Authority. http://www.riderta.com/fares/farebox.asp
Hardin J, Rocke DM (2005) The distribution of robust distances. J Comput Gr Stat 14(4):928–946
Hofmann M, O’Mahony M (2005) Transfer journey identification and analyses from electronic fare collection data. In: Proceedings of the 2005 IEEE Intelligent Transportation Systems, pp 34–39
Knorr EM, Ng RT, Tucakov V (2000) Distance-based outliers: algorithms and applications. VLDB J Int J Very Large Data Bases 8(3–4):237–253
Lee H, Coifman B (2012) Identifying and correcting pulse-breakup errors from freeway loop detectors. Transp Res Rec J Transp Res Board 2256(2011):68–78
Lee S, Hickman M (2014) Trip purpose inference using automated fare collection data. Public Transp 6:1–20. doi:10.1007/s12469-013-0077-5
Metro Transit-St. Louis (2010) Fare chart. http://www.metrostlouis.org/FaresPasses/FareChart.aspx. Accessed 10 September 2014
Lu A, Reddy A (2012) An algorithm to measure daily bus passenger miles using electronic farebox data for national transit database (NTD) section 15 reporting. Transp Res Rec J Transp Res Board 2216(2011):19–32
Ma X, Wu YJ, Wang Y, Chen F, Liu J (2013) Mining smartcard data for transit riders’ travel patterns. Transp Res Part C Emerg Technol 36:1–12. doi:10.1016/j.trc.2013.07.010
Moore, G. R. (2002). Transit ridership efficiency as a function of fares. J Public Transp 5(1)
Navick DS, Furth PG (2007) Estimating passenger miles, origin-destination patterns, and loads with location-stamped farebox data. Transp Res Rec J Transp Res Board 1799(2002):107–113
O’Malley M, Brown AG, Swaim-Staley BK, Mobley DB (2011) Farebox recovery attainment and operational requirements. Hanover, Maryland. http://dlslibrary.state.md.us/publications/Exec/MDOT/MTA/TR7-208(b)_2011.pdf
Pelletier M-P, Trépanier M, Morency C (2011) Smart card data use in public transit: a literature review. Transp Res Part C Emerg Technol 19(4):557–568. doi:10.1016/j.trc.2010.12.003
Peña D, Prieto FJ (2001) Multivariate outlier detection and robust covariance matrix estimation. Technometrics 43(3):286–300 (American Statistical Association and American Society for Quality)
Robinson S, Narayanan B, Toh N, Pereira F (2014) Methods for pre-processing smartcard data to improve data quality. Transp Res Part C 49:43–58. doi:10.1016/j.trc.2014.10.006
Wong ASF (1997) The role of new technology in improving data collection for public transportation. Thesis, Massachusetts Institute of Technology
Wu Y, Zhang G, Wang Y (2010) Volume data correction for single-channel advance loop detectors at signalized intersections. Transp Res Rec J Transp Res Board 2160(2010):128–139
Acknowledgments
The authors would like to thank Metro Transit, St. Louis for data support.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Yang, S., Wu, YJ., Marion, B. et al. Identification of transit farebox data errors: impacts on transit planning. Public Transp 7, 457–473 (2015). https://doi.org/10.1007/s12469-015-0107-6
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12469-015-0107-6