Region-Based Face Alignment with Convolution Neural Network Cascade

Zhang, Yu; Jiang, Fei; Shen, Ruimin

doi:10.1007/978-3-319-70090-8_31

Conference paper
First Online: 28 October 2017

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 10636)

Abstract

Most face alignment approaches perform landmark detection over the entire face. However, it has been shown that the difficulty of landmark detection is unbalanced across facial parts. In this paper, we therefore propose a novel region-based facial landmark detection algorithm built on a two-level cascade of convolutional neural networks (CNNs). In the first level, we partition the whole face into four regions: three facial components (eyebrow-eyes, nose, and mouth) and the face contour. Regions are detected by an improved CNN model that incorporates a feature fusion scheme. To simultaneously detect the three facial components and the face contour landmarks, we present a novel weighted loss function that combines bounding box regression with landmark localization. In the second level, landmarks are detected separately for each of the three facial components. Experimental results on public benchmarks demonstrate the superiority of the proposed algorithm over several state-of-the-art face alignment algorithms.
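The abstract does not give the exact form of the weighted loss, so the following is only a minimal sketch of one plausible reading: a weighted sum of a bounding box regression term (one box per facial component) and a landmark localization term (the face contour points). The function names, the use of mean squared error, and the weights `box_weight` and `contour_weight` are assumptions made for illustration and are not taken from the paper.

```python
from typing import Sequence


def mean_squared_error(pred: Sequence[float], target: Sequence[float]) -> float:
    """Mean of squared differences between two equal-length coordinate vectors."""
    if len(pred) != len(target) or len(pred) == 0:
        raise ValueError("pred and target must be non-empty and of equal length")
    return sum((p - t) ** 2 for p, t in zip(pred, target)) / len(pred)


def weighted_region_loss(
    pred_boxes: Sequence[Sequence[float]],
    true_boxes: Sequence[Sequence[float]],
    pred_contour: Sequence[float],
    true_contour: Sequence[float],
    box_weight: float = 1.0,      # hypothetical weight, not from the paper
    contour_weight: float = 1.0,  # hypothetical weight, not from the paper
) -> float:
    """Weighted sum of a box regression term and a contour landmark term.

    Each box is a flat list such as [x_min, y_min, x_max, y_max]; the contour
    is a flat list [x1, y1, x2, y2, ...]. Both terms use mean squared error,
    which is an assumption rather than the loss stated by the authors.
    """
    # Box term: sum of per-component errors (e.g. eyebrow-eyes, nose, mouth).
    box_term = sum(
        mean_squared_error(p, t) for p, t in zip(pred_boxes, true_boxes)
    )
    # Landmark term: error over the face contour coordinates.
    contour_term = mean_squared_error(pred_contour, true_contour)
    return box_weight * box_term + contour_weight * contour_term
```

In a sketch like this, raising `contour_weight` relative to `box_weight` would push training toward the contour landmarks, which is one way to compensate for the unbalanced difficulty across facial parts that the abstract mentions.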

Notes

1. http://opencv.org/

Acknowledgments

The authors would like to thank the editor and all the anonymous reviewers of this paper for their constructive suggestions and comments. This work is supported by NSFC (No. 61671290) in China, the Key Program for International S&T Cooperation Project of China (No. 2016YFE0129500), and the Shanghai Committee of Science and Technology, China (No. 17511101903).

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Shanghai Jiao Tong University, No.800 Dongchuan Road, Minhang District, Shanghai, China
Yu Zhang, Fei Jiang & Ruimin Shen

Corresponding author

Correspondence to Ruimin Shen.

Editor information

Editors and Affiliations

Guangdong University of Technology, Guangzhou, China
Derong Liu
Guangdong University of Technology, Guangzhou, China
Shengli Xie
South China University of Technology, Guangzhou, China
Yuanqing Li
Institute of Automation, Chinese Academy of Sciences, Beijing, China
Dongbin Zhao
King Fahd University of Petroleum and Minerals, Dhahran, Saudi Arabia
El-Sayed M. El-Alfy

Cite this paper

Zhang, Y., Jiang, F., Shen, R. (2017). Region-Based Face Alignment with Convolution Neural Network Cascade. In: Liu, D., Xie, S., Li, Y., Zhao, D., El-Alfy, ES. (eds) Neural Information Processing. ICONIP 2017. Lecture Notes in Computer Science, vol 10636. Springer, Cham. https://doi.org/10.1007/978-3-319-70090-8_31

DOI: https://doi.org/10.1007/978-3-319-70090-8_31
Published: 28 October 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-70089-2
Online ISBN: 978-3-319-70090-8
eBook Packages: Computer Science, Computer Science (R0)
