Deepfake Detection via Fine-Grained Classification and Global-Local Information Fusion

Li, Tonghui; Guo, Yuanfang; Wang, Yunhong

doi:10.1007/978-981-99-8537-1_25

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 14430)

Included in the following conference series:

Chinese Conference on Pattern Recognition and Computer Vision (PRCV)

Abstract

In response to the increasing amount of deepfake content on the internet, a large number of deepfake detection methods have recently been developed. To the best of our knowledge, existing methods simply perform binary classification, i.e., they treat the deepfake images generated by different forgery methods as a single category. Unfortunately, different forgery methods usually generate deepfake images with different artifacts and appearances. Under such circumstances, a simple binary classification mechanism may limit the learning ability of the detection models, i.e., they may ignore certain forgery traces. Therefore, we propose a novel deepfake detection method via fine-grained classification and global-local information fusion. Specifically, we replace the binary classification task with a fine-grained classification mechanism, such that the deepfake detection model can learn more precise features for fake images from different forgery methods. Besides, we construct a global-local information fusion architecture to emphasize the important information in certain local regions and fuse it with global semantic information. In addition, we design a global center loss, which makes the real image features more cohesive and enlarges the distance between real and fake image features, to further enhance the generalization ability of the detection model. Extensive experiments demonstrate the effectiveness and superiority of our method.
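The abstract does not give formulas, so the following is only a rough sketch of two of the ideas it names, under stated assumptions. It assumes class 0 means "real" and every other class is one forgery method, that a binary decision is recovered by summing the probabilities of all fake classes, and that a center-style loss pulls real features toward their mean while penalizing fake features that fall within a margin of it. The names `REAL_CLASS`, `fake_probability`, `center_style_loss`, and `margin` are hypothetical and are not taken from the paper, whose actual global center loss may differ.

```python
import math

# Assumption: class 0 is "real"; classes 1..K each denote one forgery method.
REAL_CLASS = 0


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def fake_probability(logits):
    """Collapse fine-grained class probabilities into one real-vs-fake score.

    The probability of "fake" is taken as the total mass on all non-real classes.
    """
    return 1.0 - softmax(logits)[REAL_CLASS]


def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def center_style_loss(features, labels, margin=1.0):
    """Illustrative center-style loss (an assumed form, not the paper's definition).

    pull: mean squared distance of real features to the real-feature mean.
    push: mean hinge penalty for fake features whose squared distance to that
          mean is smaller than `margin`.
    """
    real = [f for f, y in zip(features, labels) if y == REAL_CLASS]
    fake = [f for f, y in zip(features, labels) if y != REAL_CLASS]
    if not real:
        return 0.0  # no real samples in the batch, so no center to compute
    dim = len(real[0])
    center = [sum(f[d] for f in real) / len(real) for d in range(dim)]
    pull = sum(squared_distance(f, center) for f in real) / len(real)
    push = 0.0
    if fake:
        push = sum(
            max(0.0, margin - squared_distance(f, center)) for f in fake
        ) / len(fake)
    return pull + push
```

In this sketch the fine-grained labels are used only during training; at test time `fake_probability` reduces the multi-class output back to the usual real-versus-fake score, so the detector can be compared with binary baselines.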

Author information

Authors and Affiliations

Laboratory of Intelligent Recognition and Image Processing, School of Computer Science and Engineering, Beihang University, Beijing, 100191, China
Tonghui Li, Yuanfang Guo & Yunhong Wang

Corresponding author

Correspondence to Yuanfang Guo.

Editor information

Editors and Affiliations

Nanjing University of Information Science and Technology, Nanjing, China
Qingshan Liu
Xiamen University, Xiamen, China
Hanzi Wang
Beijing University of Posts and Telecommunications, Beijing, China
Zhanyu Ma
Sun Yat-sen University, Guangzhou, China
Weishi Zheng
Peking University, Beijing, China
Hongbin Zha
Chinese Academy of Sciences, Beijing, China
Xilin Chen
Chinese Academy of Sciences, Beijing, China
Liang Wang
Xiamen University, Xiamen, China
Rongrong Ji

About this paper

Cite this paper

Li, T., Guo, Y., Wang, Y. (2024). Deepfake Detection via Fine-Grained Classification and Global-Local Information Fusion. In: Liu, Q., et al. Pattern Recognition and Computer Vision. PRCV 2023. Lecture Notes in Computer Science, vol 14430. Springer, Singapore. https://doi.org/10.1007/978-981-99-8537-1_25

DOI: https://doi.org/10.1007/978-981-99-8537-1_25
Published: 26 December 2023
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-8536-4
Online ISBN: 978-981-99-8537-1
eBook Packages: Computer Science, Computer Science (R0)