Posture and Appearance Fusion Network for Driver Distraction Recognition

Yu, Hao; Zhao, Chong; Wei, Xing; Zhai, Yan; Chen, Zhen; Sun, Guangling; Lu, Yang

doi:10.1007/978-3-031-19208-1_14

Hao Yu¹¹,
Chong Zhao^12,13,
Xing Wei^11,12,14,
Yan Zhai¹¹,
Zhen Chen¹⁵,
Guangling Sun¹⁶ &
…
Yang Lu^11,14

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13471))

Included in the following conference series:

International Conference on Wireless Algorithms, Systems, and Applications

1769 Accesses

Abstract

Distracted driving is the act of driving while engaged in other activities, such as using a cell phone, texting, eating, or reading, which takes the driver’ attention away from the road. Nowadays, the distracted driving detection models based on deep learning can extract critical information from video data to characterize the driving behavior process. But the distraction driving method based solely on appearance features cannot essentially eliminate the noise impact of the complex environment on the model, and the distracted driving recognition method based solely on skeletal information is unable to recognize the joint action of the human body and the objects. Therefore, the development of an accurate distracted driving detection model has become challenging. In this paper, we propose a distracted driving recognition model MFD-former based on the fusion of posture and appearance. First, a feature extraction module is proposed to extract skeleton data(i.e., posture) and appearance features(i.e., descriptors), which are merged by a graph neural network. Then, the two kinds of information are input into the MFD-former encoder module, and the self-attention mechanism quickly extracts the sparse data. Finally, the classification results of distracted driving are obtained by extracting the classification labels through the MLP Head. The MFD-former model outperforms existing models. It achieved $95.1\%$ accuracy on the State Farm dataset and $90.24\%$ accuracy on the self-built Train Drivers dataset.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Distracted driver classification using deep learning

Article 06 November 2019

DMD: A Large-Scale Multi-modal Driver Monitoring Dataset for Attention and Alertness Analysis

Driver Anomaly Detection Using Skeleton Images

References

Abadal, S., Jain, A., Guirado, R., López-Alonso, J., Alarcón, E.: Computing graph neural networks: a survey from algorithms to accelerators. ACM Comput. Surv. (CSUR) 54(9), 1–38 (2021)
Article Google Scholar
Ahuja, K., Shen, V., Fang, C.M., Riopelle, N., Kong, A., Harrison, C.: Controllerpose: inside-out body capture with VR controller cameras. In: CHI Conference on Human Factors in Computing Systems, pp. 1–13 (2022)
Google Scholar
Cao, Z., Simon, T., Wei, S.E., Sheikh, Y.: Realtime multi-person 2d pose estimation using part affinity fields. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7291–7299 (2017)
Google Scholar
Farm, S.: State farm distracted driver detection. Technical report (2016). https://www. kaggle. com/c/state . . .(2016)
Google Scholar
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)
Koesdwiady, A., Bedawi, S.M., Ou, C., Karray, F.: End-to-End deep learning for driver distraction recognition. In: Karray, F., Campilho, A., Cheriet, F. (eds.) ICIAR 2017. LNCS, vol. 10317, pp. 11–18. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-59876-5_2
Chapter Google Scholar
Lemley, J., Bazrafkan, S., Corcoran, P.: Transfer learning of temporal information for driver action classification. In: MAICS, pp. 123–128 (2017)
Google Scholar
Moslemi, N., Azmi, R., Soryani, M.: Driver distraction recognition using 3d convolutional neural networks. In: 2019 4th International Conference on Pattern Recognition and Image Analysis (IPRIA), pp. 145–151. IEEE (2019)
Google Scholar
Moslemi, N., Soryani, M., Azmi, R.: Computer vision-based recognition of driver distraction: a review. Concurrency Comput.: Pract. Experience 33(24), e6475 (2021)
Article Google Scholar
Peng, W., Hong, X., Chen, H., Zhao, G.: Learning graph convolutional network for skeleton-based human action recognition by neural searching. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, pp. 2669–2676 (2020)
Google Scholar
Plizzari, C., Cannici, M., Matteucci, M.: Skeleton-based action recognition via spatial and temporal transformer networks. Comput. Vis. Image Underst. 208, 103219 (2021)
Article Google Scholar
Shi, L., Zhang, Y., Cheng, J., Lu, H.: Two-stream adaptive graph convolutional networks for skeleton-based action recognition. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 12026–12035 (2019)
Google Scholar
Vaswani, A., et al.: Attention is all you need. In: Advances in neural information processing systems, vol. 30 (2017)
Google Scholar
Yan, S., Xiong, Y., Lin, D.: Spatial temporal graph convolutional networks for skeleton-based action recognition. In: Thirty-Second AAAI Conference on Artificial Intelligence (2018)
Google Scholar
Zhang, C., Song, D., Huang, C., Swami, A., Chawla, N.V.: Heterogeneous graph neural network. In: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 793–803 (2019)
Google Scholar

Download references

Acknowledgement

This work was supported by Joint Fund of Natural Science Foundation of Anhui Province in 2020 (2008085UD08), Anhui Provincial Key R &D Program (202004a05020004), Open fund of Intelligent Interconnected Systems Laboratory of Anhui Province (PA2021AKSK0107), Intelligent Networking and New Energy Vehicle Special Project of Intelligent Manufacturing Institute of HFUT (IMIWL2019003, IMIDC2019002).

Author information

Authors and Affiliations

School of Computer and Information, Hefei University of Technology, Hefei, China
Hao Yu, Xing Wei, Yan Zhai & Yang Lu
Intelligent Manufacturing Institute of Hefei University of Technology, Baohe, China
Chong Zhao & Xing Wei
Engineering Quality Education Center of Undergraduate School, Hefei University of Technology, Hefei, China
Chong Zhao
Engineering Research Center of Safety Critical Industrial Measurement and Control Technology, Ministry of Education, Beijing, China
Xing Wei & Yang Lu
School of Computer Science and Technology, Anhui University, Hefei, China
Zhen Chen
School of Electronic and Information Engineering, Anhui Jianzhu University, Hefei, China
Guangling Sun

Authors

Hao Yu
View author publications
You can also search for this author in PubMed Google Scholar
Chong Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Xing Wei
View author publications
You can also search for this author in PubMed Google Scholar
Yan Zhai
View author publications
You can also search for this author in PubMed Google Scholar
Zhen Chen
View author publications
You can also search for this author in PubMed Google Scholar
Guangling Sun
View author publications
You can also search for this author in PubMed Google Scholar
Yang Lu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Chong Zhao .

Editor information

Editors and Affiliations

Dalian University of Technology, Dalian, China
Lei Wang
Ben-Gurion University of the Negev, Beer-Sheva, Israel
Michael Segal
Chang Gung University, Taiwan, China
Jenhui Chen
Tianjin University, Tianjin, China
Tie Qiu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Yu, H. et al. (2022). Posture and Appearance Fusion Network for Driver Distraction Recognition. In: Wang, L., Segal, M., Chen, J., Qiu, T. (eds) Wireless Algorithms, Systems, and Applications. WASA 2022. Lecture Notes in Computer Science, vol 13471. Springer, Cham. https://doi.org/10.1007/978-3-031-19208-1_14

Download citation

DOI: https://doi.org/10.1007/978-3-031-19208-1_14
Published: 17 November 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-19207-4
Online ISBN: 978-3-031-19208-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Posture and Appearance Fusion Network for Driver Distraction Recognition