Benchmark Analysis for Backbone Optimization in a Facial Reconstruction Model

Hernández-Manrique, Victor; González-Mendoza, Miguel; Vilchis, Carlos; Méndez-Ruiz, Mauricio; Pérez-Guerrero, Carmina

doi:10.1007/978-3-031-47765-2_11

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 14391)

Included in the following conference series:

Mexican International Conference on Artificial Intelligence

Abstract

Lightweight model development has emerged as an important research topic in computer vision, driven by the need for resource-efficient solutions. These models attempt to strike a balance between model size, computing requirements, and accuracy. They offer benefits such as efficient resource use, faster inference times, and improved accessibility. For 3D facial reconstruction, lightweight architectures present an opportunity for deployment on less demanding hardware, since these algorithms usually rely on powerful processors such as NVIDIA graphics cards. This paper provides a benchmark comparison of diverse state-of-the-art lightweight models used as the backbone of a facial reconstruction model, with the aim of reducing its computational complexity so that it can be tested on a mobile device.

Acknowledgment

The authors would like to acknowledge the financial support of Tecnologico de Monterrey through the program “Challenge-Based Research Funding Program 2022”. Project ID # E120 - EIC-GI06 - B-T3 - D.

Author information

Authors and Affiliations

Tecnologico de Monterrey, Escuela de Ingeniería y Ciencias, Monterrey, Nuevo León, Mexico
Victor Hernández-Manrique, Miguel González-Mendoza & Carlos Vilchis
Eugenia Virtual Humans S.A. de C.V., Laboratorio de Investigación, Naucalpan de Juárez, Estado de México, Mexico
Mauricio Méndez-Ruiz & Carmina Pérez-Guerrero

Corresponding author

Correspondence to Victor Hernández-Manrique.

Editor information

Editors and Affiliations

Center for Computing Research, Instituto Politécnico Nacional, Ciudad de México, Distrito Federal, Mexico
Hiram Calvo
Facultad de Ingeniería, Universidad Panamericana, Ciudad de México, Mexico
Lourdes Martínez-Villaseñor
Facultad de Ingeniería, Universidad Panamericana, Ciudad de México, Mexico
Hiram Ponce

About this paper

Cite this paper

Hernández-Manrique, V., González-Mendoza, M., Vilchis, C., Méndez-Ruiz, M., Pérez-Guerrero, C. (2024). Benchmark Analysis for Backbone Optimization in a Facial Reconstruction Model. In: Calvo, H., Martínez-Villaseñor, L., Ponce, H. (eds) Advances in Computational Intelligence. MICAI 2023. Lecture Notes in Computer Science, vol 14391. Springer, Cham. https://doi.org/10.1007/978-3-031-47765-2_11

DOI: https://doi.org/10.1007/978-3-031-47765-2_11
Published: 09 November 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-47764-5
Online ISBN: 978-3-031-47765-2
eBook Packages: Computer Science, Computer Science (R0)
