Dynamic graph convolutional recurrent imputation network for spatiotemporal traffic missing data

doi:10.1016/j.knosys.2022.110188

Knowledge-Based Systems

Volume 261, 15 February 2023, 110188

https://doi.org/10.1016/j.knosys.2022.110188 Get rights and content

Abstract

In real-world intelligent transportation systems, the spatiotemporal traffic data collected from sensors often exhibit missing or corrupted data, significantly hindering the development of traffic data research. Missing data imputation is a classic research topic that encompasses a wide range of methods. However, these methods are typically underdeveloped in two aspects: the dynamic spatial dependencies of the road network over time, and the information extraction and utilization of diverse data. In this study, we design a novel deep learning architecture – Dynamic Graph Convolutional Recurrent Imputation Network (DGCRIN) – as a tool to impute missing traffic data. The DGCRIN employs a graph generator and dynamic graph convolutional gated recurrent unit (DGCGRU) to perform fine-grained modeling of the dynamic spatiotemporal dependencies of road network. Additionally, an auxiliary GRU learns the missing pattern information of the data, and a fusion layer with a decay mechanism is introduced to fuse a diverse range of information. This architecture enables the DGCRIN to be highly adaptable to complex scenarios involving missing data. Extensive experiments on two datasets demonstrate the superiority of DGCRIN over multiple baseline models.

Introduction

Spatiotemporal traffic data collected from the real world via sensor devices are essential and fundamental for traffic research and applications [1]. By analyzing and mining traffic data, researchers can address a wide range of problems, including traffic speed prediction [2], traffic pattern recognition [3], and traffic data generation [4]. However, owing to the inherent unpredictability of the data collection and storage processes, the collected traffic information frequently contains missing data, which degrades model performance for downstream tasks if not handled properly. Therefore, it is necessary to perform imputation carefully on spatiotemporal traffic data.

Generally, the essence of traffic data imputation is to productively extract effective latent information, such as temporal correlations and spatial dependencies, from observed data to estimate the missing data. A wide range of methods have been developed to achieve this goal. Early approaches attempted to directly utilize statistical features, such as zeros, historical averages [5], and last observations [6], to fill in or simply eliminate the gaps in data. These rudimentary methods consider solely a site’s own historical data, whereas a more effective approach is to combine information from multiple sites. A typical KNN-based imputation method estimated missing data by averaging the known values of the $k$ neighbors. Recently, matrix- and tensor-based decomposition techniques [7], [8], [9] have exhibited great potential as tools to solve traffic data imputation problems. However, these methods rely on a global low rank while ignoring local spatiotemporal consistency. Therefore, these models may have certain limitations in capturing globally complex spatiotemporal dependencies. In contrast, deep learning-based approaches have exhibited extraordinary nonlinear modeling capabilities for various tasks [10], [11], [12], [13], [14], [15]. Recent studies have attempted to use neural networks (NN), such as recurrent neural networks (RNN) [16], convolutional neural networks (CNN) [17], and graph neural networks (GNN) [18], to handle data imputation tasks. Among them, the GNN-based models are highly effective in capturing spatial dependencies among irregular road networks than other methods. Although these methods have achieved some success in addressing the issue of missing data, two important research gaps remain.

Dynamic spatial dependencies: Most existing GNN-based methods model spatial correlations via predefined static graph structures based on geographic distance or road connectivity. Consequently, these methods maintain a constant graphical structure over time. However, traffic data often exhibit strong dynamic correlations in the spatiotemporal dimension, which may not be fully modeled by a static graph structure. Several recent studies have proposed the construction of dynamic graph structures to capture the dynamic dependencies of traffic data, and achieved promising performance in traffic prediction tasks [19], [20], [21]. However, this issue is rarely considered in the context of traffic data imputation. As a result, the construction of a dynamic graph that models dynamic dependencies, and provides more accurate and effective inference information for data reconstruction, remains a challenge.

Various complex temporal information: Traffic data are essentially time-series data collected through various physical devices at regular or irregular time intervals. In an environment prone to missing data, a variety of time-series datasets can be obtained from different perspectives. For example, according to the missing position and data collection timestamp, a masking matrix dataset and a time-lag matrix dataset can be generated. The former indicates the original data states that includes missing patterns, and the latter helps in studying the contributions of observations to the missing value estimates [22]. Both approaches are beneficial for the imputation task. Nevertheless, many prior studies [1], [23], [24] on the imputation of traffic data have neglected the analysis and utilization of such datasets. Therefore, the extraction and fusion of information from diverse and complex temporal data warrant further examination.

To address the aforementioned issues, a novel spatiotemporal deep learning approach for traffic data imputation called Dynamic Graph Convolutional Recurrent Imputation Networks (DGCRIN) is proposed in this study. A graph generator was developed to model dynamic spatial correlations and a dynamic graph convolutional gated recurrent unit (DGCGRU) was used to capture spatiotemporal dependencies. Furthermore, to extract more useful information from different types of data, we employed an auxiliary gated recurrent unit (GRU) to model the missing patterns of masking data, and then introduced a fusion layer with a decay mechanism to fuse information from disparate data. The missing values of different road segments at each timestamp are imputed in a bidirectional process using the spatial dependencies and temporal correlations learned from historical observations. To evaluate the effectiveness of the proposed model, we conducted experiments on two real-world traffic datasets, demonstrating the superior performance of our method over that of existing approaches. In addition, corresponding hyperparameter sensitivity and ablation experiments were also carried out.

Our main contributions can be summarized as follows:

•
We propose a novel traffic data imputation framework to infer missing values in the spatiotemporal input data. The model can effectively model the dynamic spatiotemporal characteristics of incomplete traffic data by considering the road network’s dynamics and diverse temporal information including data missing patterns and observation slot, to achieve more accurate estimation.

•
We propose a dynamic graph generation technique to perform fine-grained modeling of the spatial correlations between road network nodes with incomplete traffic data via using the recurrent generated imputation and historical information.

•
We demonstrate the capabilities and advantages of the proposed model by applying it to two real-world traffic datasets under three types of missing patterns. Compared with the baselines, our model significantly reduced the imputation error while achieving a higher imputation accuracy.

The remainder of this paper is organized as follows. In Section 2, we first review existing studies pertaining to traffic data imputation by category. The imputation problem is formulated in Section 3, and the methodology is described in Section 4. The experiments conducted within this study are presented in Section 5. Finally, we conclude the paper in Section 6.

Section snippets

Related work

This section provides a detailed review of existing studies related to traffic data imputation. We classified the literature among three categories: RNN-based, GAN-based, and GNN-based methods.

Preliminaries

Generally, a traffic network with $N$ detectors can be defined as an undirected graph $G = {V, E, SA}$ , where $V = {v_{i}}$ is a set of $N$ detectors corresponding to the nodes in the graph, and $E = e {v_{i}, v_{j}}$ represents the spatial connectivity between two detectors. The adjacency matrix of graph $G$ is structured using matrix $SA \in R^{N \times N}$ , according to the following rule: ${SA}_{i j} = \{\begin{matrix} 1, & e_{i j} = 1 \\ 0, & e_{i j} = 0 \end{matrix}$ where $e_{i j}$ denotes the connectivity between the graph nodes. Because speed data is a widely-used form of traffic data, it is the

Methodology

This section introduces the proposed missing imputation framework for traffic data. We first provide an overview of the proposed model, and then describe each component in detail.

Experiment

To investigate the effectiveness of our proposed model, we conducted a series of experiments on two real traffic datasets with different missing data scenarios. The following section first introduces our dataset and evaluation metrics, and then provides a brief description of the comparison methods and experimental setting. Finally, relevant experimental results are analyzed and visualized.

Conclusion

In this study, we modeled the dynamic spatial dependencies of road networks under the conditions of incomplete traffic data, and developed DGCRIN for traffic data imputation. Inspired by the iterative generation characteristics of RNNs, we designed a novel graph generator to model the dynamic spatial correlations between road network nodes at each moment using the recurrent generated imputation data and historical information. Based on the dynamic graph, we employed a DGCGRU module to

CRediT authorship contribution statement

Xiangjie Kong: Conceptualization, Methodology, Investigation. Wenfeng Zhou: Methodology, Writing – original draft, Validation, Software. Guojiang Shen: Supervision, Project administration, Funding acquisition. Wenyi Zhang: Writing - review, Validation, Visualization. Nali Liu: Writing – review, Validation. Yao Yang: Validation.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

This work was supported in part by the Zhejiang Provincial Natural Science Foundation, China under Grant LR21F020003, in part by the National Natural Science Foundation of China under Grant 62072409 and Grant 62073295, in part by the “Pioneer” and “Leading Goose” R&D Program of Zhejiang under Grant 2022C01050, and in part by Key Research Project of ZheJiang Lab under Grant 2022NF0AC01.

References (53)

DuanYanjie et al.
An efficient realization of deep learning for traffic data imputation
Transp. Res. C
(2016)
ShenGuojiang et al.
Motif discovery based traffic pattern mining in attributed road networks
Knowl.-Based Syst.
(2022)
ChenXinyu et al.
A Bayesian tensor decomposition approach for spatiotemporal traffic data imputation
Transp. Res. C
(2019)
ChenXinyu et al.
A nonconvex low-rank tensor completion model for spatiotemporal traffic data imputation
Transp. Res. C
(2020)
LiDongda et al.
One model packs thousands of items with recurrent conditional query learning
Knowl.-Based Syst.
(2022)
SongXiangyu et al.
JKT: A joint graph convolutional network based deep knowledge tracing
Inform. Sci.
(2021)
LiGuopeng et al.
Multistep traffic forecasting by dynamic graph convolution: Interpretations of real-time spatial correlations
Transp. Res. C
(2021)
LiDong et al.
Smoothed LSTM-AE: A spatio-temporal deep model for multiple time-series missing imputation
Neurocomputing
(2020)
TianYan et al.
LSTM-Based traffic flow prediction with missing data
Neurocomputing
(2018)
CuiZhiyong et al.
Stacked bidirectional and unidirectional LSTM recurrent neural network for forecasting network-wide traffic state with missing values
Transp. Res. C
(2020)

WangYufeng et al.

PC-GAIN: Pseudo-label conditional generative adversarial imputation networks for incomplete data

Neural Netw.

(2021)

SongXiangyu et al.

Bi-CLKT: Bi-graph contrastive learning based knowledge tracing

Knowl.-Based Syst.

(2022)

ZhangZhengchao et al.

A customized deep learning approach to integrate network-scale online traffic data imputation and prediction

Transp. Res. C

(2021)

ZhangMingwei et al.

METoNR: A meta explanation triplet oriented news recommendation model

Knowl.-Based Syst.

(2022)

XueGuotong et al.

Dynamic network embedding survey

Neurocomputing

(2022)

Liangzhe Han, Bowen Du, Leilei Sun, Yanjie Fu, Yisheng Lv, Hui Xiong, Dynamic and multi-faceted spatio-temporal deep...

KongXiangjie et al.

RMGen: A tri-layer vehicular trajectory data generation model exploring urban region division and mobility pattern

IEEE Trans. Veh. Technol.

(2022)

YinWeihao et al.

Imputing erroneous data of single-station loop detectors for nonincident conditions: Comparison between temporal and spatial methods

J. Intell. Transp. Syst.

(2012)

LittleRoderick J.A. et al.

Statistical Analysis with Missing Data, Vol. 793

(2019)

ChenXinyu et al.

Bayesian temporal factorization for multidimensional time series prediction

IEEE Trans. Pattern Anal. Mach. Intell.

(2021)

ShenGuojiang et al.

An attention-based digraph convolution network enabled framework for congestion recognition in three-dimensional road networks

IEEE Trans. Intell. Transp. Syst.

(2021)

XuCai et al.

Uncertainty-aware multi-view deep learning for Internet of Things applications

IEEE Trans. Ind. Inform.

(2022)

ZhouXiaokang et al.

Siamese neural network based few-shot learning for anomaly detection in industrial cyber-physical systems

IEEE Trans. Ind. Inform.

(2020)

KongXiangjie et al.

Spatial-temporal-cost combination based taxi driving fraud detection for collaborative Internet of Vehicles

IEEE Trans. Ind. Inform.

(2021)

BengioYoshua et al.

Recurrent neural networks for missing or asynchronous data

Adv. Neural Inf. Process. Syst.

(1995)

ZhuangYifan et al.

Innovative method for traffic data imputation based on convolutional neural network

IET Intell. Transp. Syst.

(2019)

Cited by (36)

Hierarchical spatio-temporal graph convolutional neural networks for traffic data imputation
2024, Information Fusion
The quality of traffic services depends on the accuracy and completeness of the collected traffic data. However,the existing traffic data imputation methods usually only rely on the predefined road network structure to capture the spatio-temporal features and only consider the imputation effect from a single perspective, which are very limited for imputation of different missing patterns of road traffic data. In this paper, we propose a novel deep learning framework called Hierarchical Spatio-temporal Graph Convolutional Neural Networks(HSTGCN) to impute traffic data,through the macro layer and the road layer. The model constructs macro graph of the road network based on the data temporal correlation clustering, which can mine the temporal dependencies of road traffic data from a hierarchical perspective. Besides, a temporal attention mechanism and adaptive adjacency matrix are introduced in the road layer to better extract the spatio-temporal information of the road traffic data. Finally, we use graph convolution neural networks to learn the spatio-temporal feature representations of the road layer and macro layer, which are then fused to achieve data imputation. To illustrate the efficient performance of the model, experiments are conducted on traffic data collected from California and Seattle. The proposed model performs better than the comparison model for traffic data imputation.
Graph Spatial-Temporal Transformer Network for Traffic Prediction
2024, Big Data Research
Traffic information can reflect the operating status of a city, and accurate traffic forecasting is critical in intelligent transportation systems (ITS) and urban planning. However, traffic information has complex nonlinearity and dynamic spatial-temporal dependencies due to human mobility, bringing new traffic forecasting challenges. This paper proposed a graph spatial-temporal transformer network for traffic prediction (GSTTN) to cope with the above problems. Specifically, the proposed framework explores spatial characteristics of the across-road network of traffic information hidden in human behavior patterns via a multi-view graph convolutional network (GCN). Furthermore, the transformer network with a multi-head attention mechanism is adopted to capture the random disturbance in the time series characteristics of traffic information. As a result, these two components can be used to model spatial relations and temporal trends. Finally, we examine real-world datasets, and the experiments show that the proposed framework outperforms the current state-of-the-art baselines.
Dynamic spatial–temporal graph convolutional recurrent networks for traffic flow forecasting
2024, Expert Systems with Applications
Traffic flow forecasting is crucial for making appropriate route guidance and vehicle scheduling schemes in intelligent transportation systems. However, recent graph-based methods leverage pre-defined static graphs to describe the spatial–temporal characteristic of road networks. The interactions of two road segments are changeable under the influence of natural environmental and socioeconomic factors, while these methods are not sufficient to capture the complicated dynamic correlations of different nodes. To address these problems, we propose a novel dynamic graph-based deep learning framework with dynamic graph recurrent network for traffic flow forecasting, called Dynamic Spatial–temporal Graph Recurrent Neural Networks. In this framework, a novel dynamic graph generator is designed to obtain the dynamic representation of nodes, which employs multi-head attention network and dynamic node embedding to capture hidden spatial dependency more effectively. To infer the edge status of dynamic graph at different times, the generated dynamic graph is trained as special time series data via dynamic graph recurrent neural network for downstream time-series prediction. In contrast to methods straightforwardly concatenating static graphs and dynamic graphs, a novel fusion framework integrates two-channel convolutional networks with penalty terms and a gate fusion layer to extract dynamic spatial dependency from multiple graphs for improving forecasting accuracy and reducing computational consumption. Experiments on three real-world datasets are carried out to evaluate the superior performance of our model. Compared with previous state-of-the-art baselines, the proposed method performs much better with 10%–26% improvements on three datasets. The results also indicate that our model is robust against emergent traffic situations.
Road network pixelization: A traffic flow imputation method based on image restoration techniques
2024, Expert Systems with Applications
The complete traffic network data is crucial for accurate traffic data prediction in intelligent transportation systems. Inspired by the success of Generative Adversarial Networks (GANs) in image restoration, this study proposes a traffic flow imputation method by employing image restoration techniques. First, we propose a trajectory data representation method called Trajectory2Matrix, that converts the trajectory data into a two-dimensional spatiotemporal relation feature map. Consequently, the data imputation scale and scope are increased. Second, a spatiotemporal feature map generation module based on a graph convolutional network is designed to optimize the GAN generator, thus utilizing its advantages for non-Euclidean data and dynamic spatiotemporal correlation. Finally, a heterogeneous multisource data fusion module based on a channel attention mechanism is proposed to merge dynamic/static external attributes and multimode characteristics in time. The proposed method improved imputation accuracy and robustness for multitype missing data patterns, especially in high missing rate situations.
Semantic understanding and prompt engineering for large-scale traffic data imputation
2024, Information Fusion
Intelligent Transportation Systems (ITS) face the formidable challenge of large-scale missing data, particularly in the imputation of traffic data. Existing studies have mainly relied on modeling network-level spatiotemporal correlations to address this issue. However, these methods often overlook the rich semantic information (e.g., road infrastructure, sensor location, etc.) inherent in road networks when capturing network-wide spatiotemporal correlations. We address this limitation by presenting the Graph Transformer-based Traffic Data Imputation (GT-TDI) model, which imputes missing values in extensive traffic data by leveraging spatiotemporal semantic understanding of road networks. The proposed model leverages semantic descriptions that capture the spatial and temporal dynamics of traffic across road networks, enhancing its capacity to infer comprehensive spatiotemporal relationships. Moreover, to augment the model’s capabilities, we employ a Large Language Model (LLM) and prompt engineering to enable natural and intuitive interactions with the traffic data imputation system, allowing users to query and request in plain language, without requiring expert knowledge or complex mathematical models. The proposed model, GT-TDI, utilizes Graph Neural Networks (GNN) and Transformer architectures to perform large-scale traffic data imputation using deficient observations, sensor social connectivity, and semantic descriptions as inputs. We evaluate the GT-TDI model on the PeMS freeway dataset and benchmark it against cutting-edge models. The experimental evidence demonstrates that GT-TDI surpasses the cutting-edge approaches in scenarios with intricate patterns and varying rates of missing data.
Semi-supervised learning with missing values imputation
2024, Knowledge-Based Systems
Incomplete instances with various missing attributes in many real-world applications have brought challenges to the classification tasks. Unsupervised imputation is often employed to replace the missing values with substitute values before supervised classification. However, this process often separates the imputation and classification, which may lead to inferior performance since the separated two tasks ignore the data distribution and label information contained in each other. Besides, traditional methods may rely on improper assumptions to initialize the missing values, whereas the unreliability of such initialization might degrade the performance. To address these problems, a novel semi-supervised conditional normalizing flow (SSCFlow) is proposed in this paper. SSCFlow combines unsupervised imputation and supervised classification as a joint semi-supervised task, which estimates the conditional distribution of incomplete instances to facilitate the imputation and classification simultaneously. Moreover, SSCFlow treats the initialized missing values as corrupted initial imputations and iteratively reconstructs their latent representations to approximate their true conditional distribution. Experiments on real-world datasets demonstrate the robustness and effectiveness of the proposed algorithm.

View all citing articles on Scopus

View full text

Dynamic graph convolutional recurrent imputation network for spatiotemporal traffic missing data

Abstract

Introduction

Section snippets

Related work

Preliminaries

Methodology

Experiment

Conclusion

CRediT authorship contribution statement

Declaration of Competing Interest

Acknowledgments

Transp. Res. C

Knowl.-Based Syst.

Transp. Res. C

Transp. Res. C

Knowl.-Based Syst.

Inform. Sci.

Transp. Res. C

Neurocomputing

Neurocomputing

Transp. Res. C

Neural Netw.

Knowl.-Based Syst.

Transp. Res. C

Knowl.-Based Syst.

Neurocomputing

RMGen: A tri-layer vehicular trajectory data generation model exploring urban region division and mobility pattern

IEEE Trans. Veh. Technol.

Imputing erroneous data of single-station loop detectors for nonincident conditions: Comparison between temporal and spatial methods

J. Intell. Transp. Syst.

Statistical Analysis with Missing Data, Vol. 793

Bayesian temporal factorization for multidimensional time series prediction

IEEE Trans. Pattern Anal. Mach. Intell.

An attention-based digraph convolution network enabled framework for congestion recognition in three-dimensional road networks

IEEE Trans. Intell. Transp. Syst.

Uncertainty-aware multi-view deep learning for Internet of Things applications

IEEE Trans. Ind. Inform.

Siamese neural network based few-shot learning for anomaly detection in industrial cyber-physical systems

IEEE Trans. Ind. Inform.

Spatial-temporal-cost combination based taxi driving fraud detection for collaborative Internet of Vehicles

IEEE Trans. Ind. Inform.

Recurrent neural networks for missing or asynchronous data

Adv. Neural Inf. Process. Syst.

Innovative method for traffic data imputation based on convolutional neural network

IET Intell. Transp. Syst.