Abstract
This paper presents a novel classification framework derived from AdaBoost to classify facial expressions. The proposed framework adopts rotation-reversal invariant HOG as features. The Framework is implemented through configuring the Area under ROC curve (AUC) of the weak classifier with HOG, which is a discriminative classification framework. The proposed classification framework is evaluated with two very popular and representative public databases: MMI and AFEW. As a result, it outperforms the state-of-the-arts methods.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR), vol. 1, pp. 886–893 (2005)
Takacs, G., Chandrasekhar, V., Tsai, S., Chen, D., Grzeszczuk, R., Girod, B.: Fast computation of rotation-invariant image features by an approximate radial gradient transform. IEEE Trans. Image Proc. (TIP) 22, 2970–2982 (2013)
Viola, P., Jones, M.: Robust real-time face detection. Int. J. Comput. Vis. (IJCV) 57, 137–154 (2004)
Ferri, C., Flach, P.A., Hernández-Orallo, J.: Learning decision trees using the area under the ROC curve. In: Proceedings of International Conference Machine Learning (ICML), pp. 139–146 (2002)
Valstar, M.F., Pantic, M.: Induced disgust, happiness and surprise: an addition to the MMI facial expression database. In: Proceedings of International Conference on Language Resources and Evaluation, Workshop on Emotion, pp. 65–70 (2010)
Pantic, M., Valstar, M.F., Rademaker, R., Maat, L.: Web-based database for facial expression analysis. In: Proceedings of IEEE International Conference on Multimedia and Expo (ICME), pp. 317–321 (2005)
Dhall, A., Goecke, R., Lucey, S., Gedeon, T.: Collecting large, richly annotated facial-expression databases from movies. IEEE MultiMedia 19, 34–41 (2012)
Thurau, C., Hlavac, V.: Pose primitive based human action recognition in videos or still images. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1–8 (2008)
Fan, R.E., Chang, K.W., Hsieh, C.J., Wang, X.R., Lin, C.J.: LIBLINEAR: a library for large linear classification. J. Mach. Learn. Res. 9, 1871–1874 (2008)
Li, J., Wang, T., Zhang, Y.: Face detection using SURF cascade. In: Proceedings of IEEE International Conference on Computer Vision (ICCV) Workshops, pp. 2183–2190 (2011)
Long, P., Servedio, R.: Boosting the area under the ROC Curve. In: Proceedings of Advances in Neural Information Processing Systems (NIPS), pp. 945–952 (2007)
Li, S.Z., Zhang, Z., Shum, H.Y., Zhang, H.: FloatBoost learning for classification. In: Proceedings of Advances in Neural Information Processing Systems (NIPS), pp. 993–1000 (2002)
Li, J., Zhang, Y.: Learning SURF cascade for fast and accurate object detection. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3468–3475 (2013)
Xiao, R., Zhu, H., Sun, H., Tang, X.: Dynamic cascades for face detection. In: Proceedings of IEEE International Conference on Computer Vision (ICCV), pp. 1–8 (2007)
Bourdev, L., Brandt, J.: Robust object detection via soft cascade. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR), vol. 2, pp. 236–243 (2005)
Sochman, J., Matas, J.: WaldBoost - learning for time constrained sequential detection. In: Proceedings of Conference on Computer Vision and Pattern Recognition (CVPR), vol. 2, pp. 150–156 (2005)
Brubaker, S., Wu, J., Sun, J., Mullin, M., Rehg, J.: On the design of cascades of boosted ensembles for face detection. Int. J. Comput. Vis. (IJCV) 77, 65–86 (2008)
Lucey, P., Cohn, J.F., Kanade, T., Saragih, J., Ambadar, Z., Matthews, I.: The extended Cohn-Kanade dataset (CK+): a complete dataset for action unit and emotion-specified expression. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, pp. 94–101 (2010)
Liu, M., Shan, S., Wang, R., Chen, X.: Learning expressionlets on spatio-temporal manifold for dynamic facial expression recognition. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1749–1756 (2014)
Chen, J., Ariki, Y., Takiguchi, T.: Robust facial expressions recognition using 3d average face and ameliorated adaboost. In: Proceedings of ACM Multimedia Conference (MM), pp. 661–664 (2013)
Wang, L., Qiao, Y., Tang, X.: Motionlets: mid-level 3d parts for human motion recognition. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2674–2681 (2013)
Zhao, G., Pietikainen, M.: Dynamic texture recognition using local binary patterns with an application to facial expressions. IEEE Trans. Pattern Anal. Mach. Intell. (TPAMI) 29, 915–928 (2007)
Klaeser, A., Marszalek, M., Schmid, C.: A spatio-temporal descriptor based on 3d-gradients. In: Proceedings of British Machine Vision Conference (BMVC), pp. 99.1–99.10 (2008)
Scovanner, P., Ali, S., Shah, M.: A 3-dimensional sift descriptor and its application to action recognition. In: Proceedings of ACM Multimedia Conference (MM), pp. 357–360 (2007)
Rudovic, O., Pavlovic, V., Pantic, M.: Multi-output Laplacian dynamic ordinal regression for facial expression recognition and intensity estimation. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2634–2641 (2012)
Liu, M., Li, S., Shan, S., Wang, R., Chen, X.: Deeply learning deformable facial action parts model for dynamic expression analysis. In: Cremers, D., Reid, I., Saito, H., Yang, M.-H. (eds.) ACCV 2014. LNCS, vol. 9006, pp. 143–157. Springer, Heidelberg (2015). doi:10.1007/978-3-319-16817-3_10
Trzcinski, T., Christoudias, M., Lepetit, V.: Learning image descriptors with boosting. IEEE Trans. Pattern Anal. Mach. Intell. (TPAMI) 37, 597–610 (2015)
Chen, D., Ren, S., Wei, Y., Cao, X., Sun, J.: Joint cascade face detection and alignment. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8694, pp. 109–122. Springer, Heidelberg (2014). doi:10.1007/978-3-319-10599-4_8
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Chen, J., Luo, Z., Takiguchi, T., Ariki, Y. (2017). Expression Recognition with Ri-HOG Cascade. In: Chen, CS., Lu, J., Ma, KK. (eds) Computer Vision – ACCV 2016 Workshops. ACCV 2016. Lecture Notes in Computer Science(), vol 10118. Springer, Cham. https://doi.org/10.1007/978-3-319-54526-4_38
Download citation
DOI: https://doi.org/10.1007/978-3-319-54526-4_38
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-54525-7
Online ISBN: 978-3-319-54526-4
eBook Packages: Computer ScienceComputer Science (R0)