Skip to main content
Log in

Data-driven traffic congestion patterns analysis: a case of Beijing

  • Original Research
  • Published:
Journal of Ambient Intelligence and Humanized Computing Aims and scope Submit manuscript

Abstract

With the rapid increase of urban population and the number of motor vehicles, the traffic congestion problem is becoming more and more serious in megacities. This paper aims to identify the traffic congestion patterns and analyze their spatial–temporal variations by conducting cluster analysis for daily traffic congestion index curves. First, since the importance of sampling points in different time segments is not exactly the same, the coefficient of variation is taken to assign weight for improving K-means clustering algorithm. The improved weighted K-means clustering algorithm is proposed to identify the representative traffic congestion patterns. Second, the paired t-test method is used to analyze the spatial and temporal variations of traffic congestion patterns. Finally, case studies are conducted based on the real-life traffic congestion index data in Beijing, including over 670, 000 records covering six districts from January 1, 2017 to December 31, 2017. The results illustrate that traffic congestion patterns are both temporal dependent and spatial dependent, and the automobile license plate restriction has significant influence on the traffic congestion patterns. This study could be instructive for formulating specific traffic optimization and control schemes to alleviate congestion and promote the balance of traffic conditions.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig.2
Fig. 3
Fig. 4

Similar content being viewed by others

Notes

  1. [1] http://jtw.beijing.gov.cn/

References

  • Angayarkanni SA, Sivakumar R, Ramana Rao YV (2021) Hybrid grey wolf: Bald eagle search optimized support vector regression for traffic flow forecasting. J Ambient Intell Humaniz Comput 12:1293–1304

    Article  Google Scholar 

  • Arachchige CNPG, Prendergast LA, Staudte RG (2020) Robust analogs to the coefficient of variation. J Appl Stat. https://doi.org/10.1080/02664763.2020.1808599

    Article  MATH  Google Scholar 

  • Beckers JM, Rixen M (2003) EOF calculations and data filling from incomplete oceanographic datasets. J Atmos Oceanic Tech 20(12):1839–1856

    Article  Google Scholar 

  • Chauhan S, Agarwal N, Kar AK (2016) Addressing big data challenges in smart cities: a systematic literature review. Info 18(4):1–10

    Article  Google Scholar 

  • Chen YC, Chen YL, Lu JY (2021a) MK-means: detecting evolutionary communities in dynamic networks. Expert Syst Appl 176:114807

    Article  Google Scholar 

  • Chen YY, Chen C, Wu Q, Ma JM, Zhang GH, Milton J (2021b) Spatial-temporal traffic congestion identification and correlation extraction using floating car data. J Intell Transp Syst 25(3):263–280

    Article  Google Scholar 

  • Degen WLF (2007) Sharp error bounds for piecewise linear interpolation of planar curves. Computing 79:143–151

    Article  MathSciNet  MATH  Google Scholar 

  • Hong YM, Lee YJ (2017) A general approach to testing volatility models in time series. J Manag Sci Eng 2(1):1–33

    MathSciNet  Google Scholar 

  • Jiang MR, Chen W, Li X (2021) S-GCN-GRU-NN: a novel hybrid model by combining a spatiotemporal graph convolutional network and a gated recurrent units neural network for short-term traffic speed forecasting. J Data, Inform Manag 3:1–20

    Article  Google Scholar 

  • Kan ZH, Tang LL, Kwan MP, Ren C, Liu D, Li QQ (2019) Traffic congestion analysis at the turn level using taxis’ GPS trajectory data. Comput Environ Urban Syst 74:229–243

    Article  Google Scholar 

  • Ke JT, Yang H, Zheng ZF (2020) On ride-pooling and traffic congestion. Transp Res Part B: Methodol 142:213–231

    Article  Google Scholar 

  • Kim J, Kwan MP (2019) Beyond commuting: ignoring individuals’ activity-travel patterns may lead to inaccurate assessments of their exposure to traffic congestion. Int J Environ Res Public Health 16(1):89

    Article  Google Scholar 

  • Kim J, Ryu JH (2015) Quantifying a threshold of missing values for gap filling processes in daily precipitation series. Water Resour Manag 29:4173–4184

    Article  Google Scholar 

  • Klos A, Bogusz J, Figurski M, Kosek W (2015) On the handling of outliers in the GNSS time series by means of the noise and probability analysis. Int Assoc Geod Symp 143:657–664

    Article  Google Scholar 

  • Konietschke F, Pauly M (2014) Bootstrapping and permuting paired t-test type statistics. Stat Comput 24:283–296

    Article  MathSciNet  MATH  Google Scholar 

  • Kruschke JK, Aguinis H, Joo H (2012) The time has come: Bayesian methods for data analysis in the organizational sciences. Org Res Methods 15:722–752

    Article  Google Scholar 

  • Lee SY, Xia YM (2006) Maximum likelihood methods in treating outliers and symmetrically heavy-tailed distributions for nonlinear structural equation models with missing data. Psychometrika 71:565–585

    Article  MathSciNet  MATH  Google Scholar 

  • Li X, Wong HS, Wu S (2012) A fuzzy minimax clustering model and its applications. Inf Sci 186:114–125

    Article  Google Scholar 

  • Li L, Su XN, Wang YW, Lin YT, Li ZH, Li YB (2015) Robust causal dependence mining in big data network and its application to traffic flow predictions. Transp Res Part C: Emerg Technol 58:292–307

    Article  Google Scholar 

  • Li YC, Xiong WT, Wang XP (2019) Does polycentric and compact development alleviate urban traffic congestion? A case study of 98 Chinese cities. Cities 88:100–111

    Article  Google Scholar 

  • Lu ZD, Hui YV (2003) L1 linear interpolator for missing values in time series. Ann Inst Stat Math 55:197–216

    Article  MATH  Google Scholar 

  • Praveen DS, Raj DP (2021) Smart traffic management system in metropolitan cities. J Ambient Intell Humaniz Comput 12:7529–7541

    Article  Google Scholar 

  • Retallack AE, Ostendorf B (2019) Current understanding of the effects of congestion on traffic accidents. Int J Environ Res Public Health 16(18):3400

    Article  Google Scholar 

  • Rousseeuw PJ (1987) Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J Comput Appl Math 20:53–65

    Article  MATH  Google Scholar 

  • Sanchez-Cambronero S, Jimenez P, Rivas A, Gallego I (2017) Plate scanning tools to obtain travel times in traffic networks. J Intell Transp Syst 21(5):390–408

    Article  Google Scholar 

  • Shelke M, Malhotra A, Mahalle PN (2019) Fuzzy priority based intelligent traffic congestion control and emergency vehicle management using congestion-aware routing algorithm. J Ambient Intell Humaniz Comput. https://doi.org/10.1007/s12652-019-01523-8

    Article  Google Scholar 

  • Shi L, Chen GM (2008) Detection of outliers in multilevel models. J Stat Plan Inference 138:3189–3199

    Article  MathSciNet  MATH  Google Scholar 

  • ShirMohammadi MM, Esmaeilpour M (2020) The traffic congestion analysis using traffic congestion index and artificial neural network in main streets of electronic city (case study: Hamedan city). Program Comput Softw 46:433–442

    Article  Google Scholar 

  • Simolo C, Brunetti M, Maugeri M, Nanni T (2009) Improving estimation of missing values in daily precipitation series by a probability density function-preserving approach. Int J Climatol 30(10):1564–1576

    Article  Google Scholar 

  • Su Y, Sun W (2019) Dynamic differential models for studying traffic flow and density. J Ambient Intell Humaniz Comput 10:315–320

    Article  Google Scholar 

  • Sun QX, Sun YX, Sun L, Li Q, Zhao JL, Zhang Y, He H (2019) Research on traffic congestion characteristics of city business circles based on TPI data: the case of Qingdao, China. Physica A 534:122214

    Article  Google Scholar 

  • Sun QX, Zhang Y, Sun L, Li Q, Gao P, He H (2021) Spatial–temporal differences in operational performance of urban trunk roads based on TPI data: the case of Qingdao. Physica A 568:125696

    Article  Google Scholar 

  • Tian Q, Yang H, Huang HJ (2010) Novel travel cost functions based on morning peak commuting equilibrium. Oper Res Lett 38(3):195–200

    Article  MathSciNet  MATH  Google Scholar 

  • Torkjazi M, Mirjafari PS, Poorzahedy H (2018) Reliability-based network flow estimation with day-to-day variation: a model validation on real large-scale urban networks. J Intell Transp Syst 22(2):121–143

    Article  Google Scholar 

  • Wang MJ, Yang S, Sun Y, Gao J (2017) Discovering urban mobility patterns with PageRank based traffic modeling and prediction. Physica A 485:23–34

    Article  Google Scholar 

  • Wang WX, Guo RJ, Yu J (2018) Research on road traffic congestion index based on comprehensive parameters: taking Dalian city as an example. Adv Mech Eng 10(6):1–8

    Article  Google Scholar 

  • Wen HM, Sun JP, Zhang X (2014) Study on traffic congestion patterns of large city in China taking Beijing as an example. Procedia Soc Behav Sci 138:482–491

    Article  Google Scholar 

  • Wu X, Zhu X, Wu GQ, Ding W (2014) Data mining with big data. IEEE Trans Knowl Data Eng 26(1):97–107

    Article  Google Scholar 

  • Xu SJ, Chan HK, Ch’ng E, Tan KH (2020) A comparison of forecasting methods for medical device demand using trend-based clustering scheme. J Data Inf Manag 2:85–94

    Article  Google Scholar 

  • Yang Y, Zhou JD, Li X (2018) Energy-efficient stochastic chance-constrained programming model for train timetable optimization. J Syst Eng 33(2):197–211

    MATH  Google Scholar 

  • Yaqoob I, Hashem IAT, Gani A, Mokhtar S, Ahmed E, Anuar NB, Vasilakos AV (2016) Big data: from beginning to future. Int J Inf Manag 36(6):1231–1247

    Article  Google Scholar 

  • Zhao PJ, Hu HY (2019) Geographical patterns of traffic congestion in growing megacities: big data analytics from Beijing. Cities 92:164–174

    Article  Google Scholar 

Download references

Acknowledgements

This work was supported by grants from the National Natural Science Foundation of China (Nos. 71722007 & 71931001), the Funds for First-class Discipline Construction (XK1802-5), the Key Program of NSFC-FRQSC Joint Project (NSFC No. 72061127002 and FRQSC No. 295837), the Fundamental Research Funds for the Central Universities (buctrc201926).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Xiang Li.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Li, X., Gui, J. & Liu, J. Data-driven traffic congestion patterns analysis: a case of Beijing. J Ambient Intell Human Comput 14, 9035–9048 (2023). https://doi.org/10.1007/s12652-022-04409-4

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12652-022-04409-4

Keywords

Navigation