Abstract
Lightweight model development has emerged as an important study subject in computer vision in response to the need for resource-efficient solutions. These models attempt to strike a balance between model size, computing requirements, and accuracy. They give benefits such as efficient resource use, faster inference times, and improved accessibility. For 3D facial reconstruction models, lightweight architectures present an opportunity for implementation in less demanding hardware, since these algorithms usually rely on powerful processors such as NVIDIA graphic cards. The following research paper provides a benchmark comparison between diverse state-of-the-art lightweight models in a facial reconstruction model, with the aim to reduce its computational complexity so that it can be tested on a mobile device.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Basak, S., Corcoran, P., McDonnell, R., Schukat, M.: 3D face-model reconstruction from a single image: a feature aggregation approach using hierarchical transformer with weak supervision. Neural Netw. 156, 108–122 (2022)
Chang, X., Li, Y., Oymak, S., Thrampoulidis, C.: Provable benefits of overparameterization in model compression: from double descent to pruning neural networks. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, pp. 6974–6983 (2021)
Chen, Y., et al.: Mobile-former: bridging MobileNet and transformer. arXiv arXiv:2108.05895 (2021)
Chen, Z., Sun, Y., Bi, X., Yue, J.: Lightweight image de-snowing: a better trade-off between network capacity and performance. Neural Netw. 165, 896–908 (2023)
Deng, J., Guo, J., Zhang, D., Deng, Y., Lu, X., Shi, S.: Lightweight face recognition challenge. In: Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops (2019)
Esteva, A., et al.: Deep learning-enabled medical computer vision. npj Digit. Med. 4(1), 5 (2021)
Feng, M., Gilani, S.Z., Wang, Y., Mian, A.: 3D face reconstruction from light field images: a model-free approach. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11214, pp. 508–526. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01249-6_31
Gao, S.H., Cheng, M.M., Zhao, K., Zhang, X.Y., Yang, M.H., Torr, P.: Res2Net: a new multi-scale backbone architecture. IEEE Trans. Pattern Anal. Mach. Intell. 43(2), 652–662 (2019)
Goel, A., Tung, C., Lu, Y.H., Thiruvathukal, G.K.: A survey of methods for low-power deep learning and computer vision. In: 2020 IEEE 6th World Forum on Internet of Things (WF-IoT), pp. 1–6. IEEE (2020)
Han, K., Wang, Y., Tian, Q., Guo, J., Xu, C., Xu, C.: GhostNet: more features from cheap operations. In: CVPR (2020)
Hodges, C., An, S., Rahmani, H., Bennamoun, M.: Deep learning for driverless vehicles. In: Balas, V.E., Roy, S.S., Sharma, D., Samui, P. (eds.) Handbook of Deep Learning Applications. SIST, vol. 136, pp. 83–99. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-11479-4_4
Howard, A.G., et al.: MobileNets: efficient convolutional neural networks for mobile vision applications. arXiv preprint arXiv:1704.04861 (2017)
Lee, Y.J., Lee, S.J., Park, K.R., Jo, J., Kim, J.: Single view-based 3D face reconstruction robust to self-occlusion. EURASIP J. Adv. Sig. Process. 2012, 1–20 (2012)
Li, Y., Liu, J., Wang, L.: Lightweight network research based on deep learning: a review. In: 2018 37th Chinese Control Conference (CCC), pp. 9021–9026. IEEE (2018)
Luo, X., Xie, Y., Zhang, Y., Qu, Y., Li, C., Fu, Y.: LatticeNet: towards lightweight image super-resolution with lattice block. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12367, pp. 272–289. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58542-6_17
Mhalla, A., Chateau, T., Gazzah, S., Amara, N.E.B.: An embedded computer-vision system for multi-object detection in traffic surveillance. IEEE Trans. Intell. Transp. Syst. 20(11), 4006–4018 (2018)
Nalbant, K.G., Uyanik, Ş: Computer vision in the metaverse. J. Metaverse 1(1), 9–12 (2021)
O’Mahony, N., et al.: Deep learning vs. traditional computer vision. In: Arai, K., Kapoor, S. (eds.) CVC 2019. AISC, vol. 943, pp. 128–144. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-17795-9_10
Shang, J., Chen, Y.: 3D-FERNet: a facial expression recognition network utilizing 3D information. In: 2022 26th International Conference on Pattern Recognition (ICPR), pp. 3265–3272. IEEE (2022)
Tan, M., Le, Q.: EfficientNet: rethinking model scaling for convolutional neural networks. In: International Conference on Machine Learning, pp. 6105–6114. PMLR (2019)
Vasu, P.K.A., Gabriel, J., Zhu, J., Tuzel, O., Ranjan, A.: MobileOne: an improved one millisecond mobile backbone. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 7907–7917 (2023)
Wang, C.C., Chiu, C.T., Chang, J.Y.: EfficientNet-eLite: extremely lightweight and efficient CNN models for edge devices by network candidate search. J. Sig. Process. Syst. 95, 657–669 (2022). https://doi.org/10.1007/s11265-022-01808-w
Xia, M., Huang, Z., Tian, L., Wang, H., Chang, V., Zhu, Y., Feng, S.: SparkNoC: an energy-efficiency FPGA-based accelerator using optimized lightweight CNN for edge computing. J. Syst. Architect. 115, 101991 (2021)
Yao, D., Liu, H., Yang, J., Li, X.: A lightweight neural network with strong robustness for bearing fault diagnosis. Measurement 159, 107756 (2020)
Zhang, X., et al.: A lightweight feature optimizing network for ship detection in SAR image. IEEE Access 7, 141662–141678 (2019)
Zhou, J., Li, Y.: Detection-by-simulation: exposing DeepFake via simulating forgery using face reconstruction. In: 2022 IEEE 5th International Conference on Multimedia Information Processing and Retrieval (MIPR), pp. 210–215. IEEE (2022)
Zhou, Y., Chen, S., Wang, Y., Huan, W.: Review of research on lightweight convolutional neural networks. In: 2020 IEEE 5th Information Technology and Mechatronics Engineering Conference (ITOEC), pp. 1713–1720. IEEE (2020)
Zhu, X., Lei, Z., Liu, X., Shi, H., Li, S.Z.: Face alignment across large poses: a 3D solution. CoRR abs/1511.07212 (2015). http://arxiv.org/abs/1511.07212
Zhu, X., Liu, X., Lei, Z., Li, S.Z.: Face alignment in full pose range: a 3D total solution. IEEE Trans. Pattern Anal. Mach. Intell. 41(1), 78–92 (2017)
Zollhöfer, M., et al.: State of the art on monocular 3D face reconstruction, tracking, and applications. Comput. Graph. Forum 37, 523–550 (2018)
Acknowledgment
The authors would like to acknowledge the financial support of Tecnologico de Monterrey through the program “Challenge-Based Research Funding Program 2022”. Project ID # E120 - EIC-GI06 - B-T3 - D.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Hernández-Manrique, V., González-Mendoza, M., Vilchis, C., Méndez-Ruiz, M., Pérez-Guerrero, C. (2024). Benchmark Analysis for Backbone Optimization in a Facial Reconstruction Model. In: Calvo, H., Martínez-Villaseñor, L., Ponce, H. (eds) Advances in Computational Intelligence. MICAI 2023. Lecture Notes in Computer Science(), vol 14391. Springer, Cham. https://doi.org/10.1007/978-3-031-47765-2_11
Download citation
DOI: https://doi.org/10.1007/978-3-031-47765-2_11
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-47764-5
Online ISBN: 978-3-031-47765-2
eBook Packages: Computer ScienceComputer Science (R0)