Data augmentation based on shape space exploration for low-size datasets: application to 2D shape classification

Ghorbel, Emna; Ghorbel, Faouzi

doi:10.1007/s00521-024-09798-5

Data augmentation based on shape space exploration for low-size datasets: application to 2D shape classification

Review
Published: 29 April 2024

Volume 36, pages 10031–10054, (2024)
Cite this article

Neural Computing and Applications Aims and scope Submit manuscript

275 Accesses
Explore all metrics

Abstract

This article introduces a novel 2D shape data augmentation approach based on intra-class shape space exploration. The proposed method relies on a geodesic interpolation between shapes, leveraging invariant-based morphing techniques. By blending a 2D shape pair belonging to a given class, we are able to generate nonlinear augmentations, hence covering more variations within the shape space. In particular, we formulate data augmentation as an optimization problem that minimizes the deformations between two shapes using the Generalized Finite Fourier Invariant Descriptor. The proposed augmentation technique is evaluated using numerous Convolution Neural Network architectures for 2D shape classification. The results indicate the superiority of the proposed method as compared to state-of-the-art techniques when considering small-scale datasets.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A Deeper Look at 3D Shape Classifiers

Non-rigid 3D Shape Classification Based on Low-Level Features

Convolutional Shape-Aware Representation for 3D Object Classification

Article 19 May 2018

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Data availability

The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request.

References

Guo Y, Liu Y, Oerlemans A, Lao S, Wu S, Lew MS (2016) Deep learning for visual understanding: a review. Neurocomputing 187:27–48
Article Google Scholar
Li Y, Wang S, Tian Q, Ding X (2015) Feature representation for statistical-learning-based object detection: a review. Pattern Recognit 48(11):3542–3559
Article Google Scholar
Paulin M, Revaud J, Harchaoui Z, Perronnin F, Schmid C (2014) Transformation pursuit for image classification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3646–3653
Ben Khlifa A, Ghorbel F (2019) An almost complete curvature scale space representation: Euclidean case. Signal Process Image Commun 75:32–43
Article Google Scholar
Perez L, Wang J (2017) The effectiveness of data augmentation in image classification using deep learning. arXiv preprint arXiv:1712.04621
Summers C, Dinneen MJ (2019) Improved mixed-example data augmentation. In: 2019 IEEE winter conference on applications of computer vision (WACV). IEEE, pp 1262–1270
Inoue H (2018) Data augmentation by pairing samples for images classification. arXiv preprint arXiv:1801.02929
Zhang H, Cisse M, Dauphin YN, Lopez-Paz D (2017) mixup: beyond empirical risk minimization. arXiv preprint arXiv:1710.09412
Kim J-H, Choo W, Song HO (2020) Puzzle mix: exploiting saliency and local statistics for optimal mixup. In: International conference on machine learning. PMLR, pp 5275–5285
Zou Y, Verma V, Mittal S, Tang WH, Pham H, Kannala J, Bengio Y, Solin A, Kawaguchi K (2023) Mixupe: understanding and improving mixup from directional derivative perspective. In: Uncertainty in artificial intelligence (2023). PMLR, pp 2597–2607
Zhu J, Shi L, Yan J, Zha H (2020) Automix: mixup networks for sample interpolation via cooperative barycenter learning. In: Computer Vision–ECCV 2020: 16th European conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part X 16. Springer, pp 633–649
Verma V, Lamb A, Beckham C, Najafi A, Mitliagkas I, Lopez-Paz D, Bengio Y (2019) Manifold mixup: better representations by interpolating hidden states. In: International conference on machine learning. PMLR, pp 6438–6447
Kang G, Dong X, Zheng L, Yang Y (2017) Patchshuffle regularization. arXiv preprint arXiv:1707.07103
Zhong Z, Zheng L, Kang G, Li S, Yang Y (2020) Random erasing data augmentation. In: Proceedings of the AAAI conference on artificial intelligence, vol 34, pp 13001–13008
Gatys LA, Ecker AS, Bethge M (2015) A neural algorithm of artistic style. arXiv preprint arXiv:1508.06576
Konno T, Iwazume M (2018) Icing on the cake: an easy and quick post-learnig method you can try after deep learning. arXiv preprint arXiv:1807.06540
Bowles C, Chen L, Guerrero R, Bentley P, Gunn R, Hammers A, Dickie DA, Hernández MV, Wardlaw J, Rueckert D (2018) Gan augmentation: augmenting training data using generative adversarial networks. arXiv preprint arXiv:1810.10863 (2018)
Su J, Vargas DV, Sakurai K (2019) One pixel attack for fooling deep neural networks. IEEE Trans Evolut Comput 23(5):828–841
Article Google Scholar
El-Sawy A, Hazem E-B, Loey M (2016) Cnn for handwritten Arabic digits recognition based on lenet-5. In: International conference on advanced intelligent systems and informatics. Springer, pp 566–575
Patel V, Mujumdar N, Balasubramanian P, Marvaniya S, Mittal A (2019) Data augmentation using part analysis for shape classification. In: 2019 IEEE winter conference on applications of computer vision (WACV), pp 1223–1232. https://doi.org/10.1109/WACV.2019.00135
Ciregan D, Meier U, Schmidhuber J (2012) Multi-column deep neural networks for image classification. In: 2012 IEEE conference on computer vision and pattern recognition. IEEE, pp 3642–3649
Sato I, Nishimura H, Yokoi K (2015) Apac: augmented pattern classification with neural networks. arXiv preprint arXiv:1505.03229
Yin D, Lopes RG, Shlens J, Cubuk ED, Gilmer J (2019) A Fourier perspective on model robustness in computer vision. arXiv preprint arXiv:1906.08988
Chatfield K, Simonyan K, Vedaldi A, Zisserman A (2014) Return of the devil in the details: delving deep into convolutional nets. arXiv preprint arXiv:1405.3531
Simard PY, Steinkraus D, Platt JC et al (2003) Best practices for convolutional neural networks applied to visual document analysis. In: ICDAR, vol 3
Ghorbel E, Ghorbel F, M’Hiri S (2022) A fast and efficient shape blending by stable and analytically invertible finite descriptors. IEEE Trans Image Process 31:5788–5800. https://doi.org/10.1109/TIP.2022.3199105
Article Google Scholar
Surazhsky T, Elber G (2002) Metamorphosis of planar parametric curves via curvature interpolation. Int J Shape Model 8(02):201–216
Article Google Scholar
Hirano M, Watanabe Y, Ishikawa M (2017) Rapid blending of closed curves based on curvature flow. Comput Aided Geom Design 52:217–230
Article MathSciNet Google Scholar
Saba M, Schneider T, Hormann K, Scateni R (2014) Curvature-based blending of closed planar curves. Graph Models 76(5):263–272
Article Google Scholar
Sederberg TW, Gao P, Wang G, Mu H (1993) 2-d shape blending: an intrinsic solution to the vertex path problem. In: Proceedings of the 20th annual conference on computer graphics and interactive techniques. ACM, New York, NY, pp 15–18
Klassen E, Srivastava A, Mio M, Joshi SH (2004) Analysis of planar shapes using geodesic paths on shape spaces. IEEE Trans Pattern Anal Mach Intell 26(3):372–383
Article Google Scholar
Srivastava A, Klassen E, Joshi SH, Jermyn IH (2010) Shape analysis of elastic curves in Euclidean spaces. IEEE Trans Pattern Anal Mach Intell 33(7):1415–1428
Article Google Scholar
Jin L, Wen Z, Hu Z (2021) Topology-preserving nonlinear shape registration on the shape manifold. Multimed Tools Appl 80(11):17377–17389
Article Google Scholar
Shapira M, Rappoport A (1995) Shape blending using the star-skeleton representation. IEEE Comput Graph Appl 15(2):44–50
Article Google Scholar
Yang W, Feng J (2009) 2d shape morphing via automatic feature matching and hierarchical interpolation. Comput Graph 33(3):414–423
Article Google Scholar
Hahmann S, Bonneau G-P, Caramiaux B, Cornillac M (2007) Multiresolution morphing for planar curves. Computing 79(2):197–209
Article MathSciNet Google Scholar
Yang W, Wang X, Wang G (2014) Part-to-part morphing for planar curves. Vis Comput 30(6):919–928
Article Google Scholar
Ghorbel E, Ghorbel F, Sakly I, M’Hiri S (2021) Fast blending of planar shapes based on invariant invertible and stable descriptors. In: 2020 25th international conference on pattern recognition (ICPR). IEEE, pp 10259–10265
Zhong Z, Zheng L, Kang G, Li S, Yang Y (2020) Random erasing data augmentation. In: Proceedings of the AAAI conference on artificial intelligence, vol. 34, pp 13001–13008
Cubuk ED, Zoph B, Mane D, Vasudevan V, Le QV (2018) Autoaugment: learning augmentation policies from data. arXiv preprint arXiv:1805.09501
Jiang X, Bunke H, Abegglen K, Kandel A (2002) Curve morphing by weighted mean of strings. In: Object recognition supported by user interaction for service robots, vol 4. IEEE, New York City at 3 Park Ave, pp 192–195
Crimmins TR (1982) A complete set of Fourier descriptors for two-dimensional shapes. IEEE Trans Syst Man Cybern 12(6):848–855
Article MathSciNet Google Scholar
Ghorbel F (1992) Stability of invariant fourier descriptors and its inference in the shape classification. In: International conference on pattern recognition. IEEE Computer Society Press, New York City at 3 Park Ave, pp 130–130
Elghoul S, Ghorbel F (2021) A fast and robust affine-invariant method for shape registration under partial occlusion. Int J Multimed Inf Retr 11:1–21
Google Scholar
Belogay E, Cabrelli C, Molter U, Shonkwiler R (1997) Calculating the hausdorff distance between curves. Inf Process Lett 64(1):17–22
Article MathSciNet Google Scholar
Latecki LJ, Lakamper R, Eckhardt T (2000) Shape descriptors for non-rigid shapes with a single closed contour. In: Proceedings IEEE conference on computer vision and pattern recognition. CVPR 2000 (Cat. No.PR00662), vol 1. IEEE, New York City at 3 Park Ave, pp 424–4291
LeCun Y, Boser B, Denker JS, Henderson D, Howard RE, Hubbard W, Jackel LD (1989) Backpropagation applied to handwritten zip code recognition. Neural Comput 1(4):541–551
Article Google Scholar
Yuan A, Bai G, Jiao L, Liu Y (2012) Offline handwritten English character recognition based on convolutional neural network. In: 2012 10th IAPR international workshop on document analysis systems. IEEE, New York City at 3 Park Ave, pp 125–129
Ozdemir MA, Elagoz B, Alaybeyoglu A, Sadighzadeh R, Akan A (2019) Real time emotion recognition from facial expressions using CNN architecture. In: 2019 Medical technologies congress (tiptekno). IEEE, New York City at 3 Park Ave, pp 1–4
Kayed M, Anter A, Mohamed H (2020) Classification of garments from fashion mnist dataset using CNN lenet-5 architecture. In: 2020 international conference on innovative trends in communication and computer engineering (ITCE). IEEE, pp 238–243
Szegedy C, Vanhoucke V, Ioffe S, Shlens J, Wojna Z (2016) Rethinking the inception architecture for computer vision. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2818–2826
Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2015) Going deeper with convolutions. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1–9
Sato I, Nishimura H, Yokoi K (2015) Apac: Augmented pattern classification with neural networks. arXiv preprint arXiv:1505.03229
Wang X, Feng B, Bai X, Liu W, Latecki LJ (2014) Bag of contour fragments for robust shape classification. Pattern Recognit 47(6):2116–2125
Article Google Scholar
Shen W, Jiang Y, Gao W, Zeng D, Wang X (2016) Shape recognition by bag of skeleton-associated contour parts. Pattern Recognit Lett 83:321–329
Article Google Scholar
Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556

Download references

Author information

Authors and Affiliations

CRISTAL Laboratory GRIFT Group, National School of Computer Science, Manouba University, 2010, Manouba, Tunisia
Emna Ghorbel & Faouzi Ghorbel

Authors

Emna Ghorbel
View author publications
You can also search for this author inPubMed Google Scholar
Faouzi Ghorbel
View author publications
You can also search for this author inPubMed Google Scholar

Corresponding author

Correspondence to Emna Ghorbel.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Ghorbel, E., Ghorbel, F. Data augmentation based on shape space exploration for low-size datasets: application to 2D shape classification. Neural Comput & Applic 36, 10031–10054 (2024). https://doi.org/10.1007/s00521-024-09798-5

Download citation

Received: 20 March 2023
Accepted: 25 March 2024
Published: 29 April 2024
Issue Date: June 2024
DOI: https://doi.org/10.1007/s00521-024-09798-5

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Data augmentation based on shape space exploration for low-size datasets: application to 2D shape classification

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

A Deeper Look at 3D Shape Classifiers

Non-rigid 3D Shape Classification Based on Low-Level Features

Convolutional Shape-Aware Representation for 3D Object Classification

Explore related subjects

Data availability

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now