Abstract
Recent advances in deep learning architectures have increased their utility in real-life applications. Deep learning models require a large amount of training data to avoid overfitting. In many application domains, such as marketing, computer vision, and medical science, only a limited set of data is available for training neural networks, because collecting new data is either infeasible or resource-intensive. One data-space solution to the problem of limited data is data augmentation. This study focuses on data augmentation techniques that can be used to further improve the accuracy of a neural network. Augmenting the available data saves the cost and time required to collect new data for training deep neural networks. It also regularizes the model and improves its ability to generalize. This survey also covers the need for large datasets in fields such as computer vision, natural language processing, security, and healthcare. The goal of this paper is to provide a comprehensive survey of recent advancements in data augmentation techniques and their applications in various domains.
Index Terms
- A Systematic Review on Data Scarcity Problem in Deep Learning: Solution and Applications