skip to main content
survey

Multi-Class Imbalanced Data Handling with Concept Drift in Fog Computing: A Taxonomy, Review, and Future Directions

Published: 07 October 2024 Publication History

Abstract

A network of actual physical objects or “IoT components” linked to the internet and equipped with sensors, electronics, software, and network connectivity is known as the Internet of Things (IoT). This ability of the IoT components to gather and share data is made possible by this network connectivity. Many IoT devices are currently operating, which generate a lot of data. When these IoT devices started collecting data, the cloud was the only place to analyze, filter, pre-process, and aggregate it. However, when it comes to IoT, the cloud has restrictions regarding latency and a more centralized method of distributing programs. A new form of computing called Fog computing has been proposed to address the shortcomings of current cloud computing. In an IoT context, sensors regularly communicate signal information, and edge devices process the data obtained from these sensors using Fog computing. The sensors’ internal or external problems, security breaches, or the integration of heterogeneous equipment contribute to the imbalanced data, i.e., comparatively speaking, one class has more instances than the other. As a result of this data, the pattern extraction is imbalanced. Recent attempts have concentrated heavily on binary-class imbalanced concerns with exactly two classes. However, the classification of multi-class imbalanced data is an issue that needs to be fixed in Fog computing, even if it is widespread in other fields, including text categorization, human activity detection, and medical diagnosis. The study intends to deal with this problem. It presents a systematic, thorough, and in-depth comparative analysis of several binary-class and multi-class imbalanced data handling strategies for batch and streaming data in IoT networks and Fog computing. There are five major objectives in this study. First, reviewing the Fog computing concept. Second, outlining the optimization metric used in Fog computing. Third, focusing on binary and multi-class batch data handling for IoT networks and Fog computing. Fourth, reviewing and comparing the current imbalanced data handling methodologies for multi-class data streams. Fifth, explaining how to cope with the concept drift, including novel and recurring classes, targeted optimization measures, and evaluation tools. Finally, the best performance metrics and tools for concept drift, binary-class (batch and stream) data, and multi-class (batch and stream) data are highlighted.

References

[1]
Shaik Masthan Babu, A. Jaya Lakshmi, and B. Thirumala Rao. 2015. A study on cloud based Internet of Things: CloudIoT. In Global Conference on Communication Technologies (GCCT’15). IEEE, 60–65.
[2]
Bushra Jamil, Humaira Ijaz, Mohammad Shojafar, Kashif Munir, and Rajkumar Buyya. 2022. Resource allocation and task scheduling in fog computing and internet of everything environments: A taxonomy, review, and future directions. ACM Computing Surveys (CSUR) 54, 11s (2022), 1–38.
[3]
Enkhtur Tsogbaatar, Monowar H. Bhuyan, Yuzo Taenaka, Doudou Fall, Khishigjargal Gonchigsumlaa, Erik Elmroth, and Youki Kadobayashi. 2021. DeL-IoT: A deep ensemble learning approach to uncover anomalies in IoT. Internet Things 14 (2021), 100391.
[4]
A. Jaokar. 2016. Data Science for Internet of Things (IoT): Ten Differences From Traditional Data Science. KDnuggets.
[5]
D. Friedman. 2015. Data Science for Internet of Things (IoT): Ten Differences From Traditional Data Science. ReadWrite. https://readwrite.com/five-types-data-internet-of-things/
[6]
Alessio Botta, Walter De Donato, Valerio Persico, and Antonio Pescapé. 2016. Integration of cloud computing and Internet of Things: A survey. Fut. Gen. Comput. Syst. 56 (2016), 684–700.
[7]
Flavio Bonomi, Rodolfo Milito, Jiang Zhu, and Sateesh Addepalli. 2012. Fog computing and its role in the Internet of Things. In 1st Edition of the MCC Workshop on Mobile Cloud Computing. 13–16.
[8]
P. PunithaIlayarani and M. Maria Dominic. 2019. Anatomization of fog computing and edge computing. In IEEE International Conference on Electrical, Computer and Communication Technologies (ICECCT’19). IEEE, 1–6.
[9]
Swati Malik and Kamali Gupta. 2019. Resource scheduling in fog: Taxonomy and related aspects. J. Comput. Theoret. Nanosci. 16, 10 (2019), 4313–4319.
[10]
Aparna Kumari, Sudeep Tanwar, Sudhanshu Tyagi, Neeraj Kumar, Reza M. Parizi, and Kim-Kwang Raymond Choo. 2019. Fog data analytics: A taxonomy and process model. J. Netw. Comput. Applic. 128 (2019), 90–104.
[11]
Ricardo Barandela, José Salvador Sánchez, Vicente Garcıa, and Edgar Rangel. 2003. Strategies for learning in class imbalance problems. Pattern Recog. 36, 3 (2003), 849–851.
[12]
Pattaramon Vuttipittayamongkol and Eyad Elyan. 2020. Neighbourhood-based undersampling approach for handling imbalanced and overlapped data. Inf. Sci. 509 (2020), 47–70.
[13]
Silvia Cateni, Valentina Colla, and Marco Vannucci. 2014. A method for resampling imbalanced datasets in binary classification tasks for real-world problems. Neurocomputing 135 (2014), 32–41.
[14]
Wei-Chao Lin, Chih-Fong Tsai, Ya-Han Hu, and Jing-Shang Jhang. 2017. Clustering-based undersampling in class-imbalanced data. Inf. Sci. 409 (2017), 17–26.
[15]
Debashree Devi, Suyel Namasudra, and Seifedine Kadry. 2020. A boosting-aided adaptive cluster-based undersampling approach for treatment of class imbalance problem. Int. J. Data Warehous. Min. 16, 3 (2020), 60–86.
[16]
José Antonio Sanz, Dario Bernardo, Francisco Herrera, Humberto Bustince, and Hani Hagras. 2014. A compact evolutionary interval-valued fuzzy rule-based classification system for the modeling and prediction of real-world financial applications with imbalanced data. IEEE Trans. Fuzzy Syst. 23, 4 (2014), 973–990.
[17]
Georgios Douzas, Fernando Bacao, and Felix Last. 2018. Improving imbalanced learning through a heuristic oversampling method based on k-means and SMOTE. Inf. Sci. 465 (2018), 1–20.
[18]
Lina Gong, Shujuan Jiang, and Li Jiang. 2019. Tackling class imbalance problem in software defect prediction through cluster-based over-sampling with filtering. IEEE Access 7 (2019), 145725–145737.
[19]
György Kovács. 2019. An empirical comparison and evaluation of minority oversampling techniques on a large number of imbalanced datasets. Appl. Soft Comput. 83 (2019), 105662.
[20]
Dech Thammasiri, Dursun Delen, Phayung Meesad, and Nihat Kasap. 2014. A critical assessment of imbalanced class distribution problem: The case of predicting freshmen student attrition. Expert Syst. Applic. 41, 2 (2014), 321–330.
[21]
Pin Lim, Chi Keong Goh, and Kay Chen Tan. 2016. Evolutionary cluster-based synthetic oversampling ensemble (eco-ensemble) for imbalance learning. IEEE Trans. Cybern. 47, 9 (2016), 2850–2861.
[22]
György Kovács. 2019. Smote-variants: A Python implementation of 85 minority oversampling techniques. Neurocomputing 366 (2019), 352–354.
[23]
Dong-Sheng Cao, Qing-Song Xu, Yi-Zeng Liang, Liang-Xiao Zhang, and Hong-Dong Li. 2010. The boosting: A new idea of building models. Chemomet. Intell. Lab. Syst. 100, 1 (2010), 1–11.
[24]
You-Shyang Chen. 2016. An empirical study of a hybrid imbalanced-class DT-RST classification procedure to elucidate therapeutic effects in uremia patients. Med. Biol. Eng. Comput. 54, 6 (2016), 983–1001.
[25]
Sobhan Sarkar, Nikhil Khatedi, Anima Pramanik, and J. Maiti. 2020. An ensemble learning-based undersampling technique for handling class-imbalance problem. In International Conference on Emerging Trends in Information Technology (ICETIT’19). Springer, 586–595.
[26]
Guo Haixiang, Li Yijing, Jennifer Shang, Gu Mingyun, Huang Yuanyue, and Gong Bing. 2017. Learning from class-imbalanced data: Review of methods and applications. Expert Syst. Applic. 73 (2017), 220–239.
[27]
Hengyi Wei, Baocheng Sun, and Mingming Jing. 2014. BalancedBoost: A hybrid approach for real-time network traffic classification. In 23rd International Conference on Computer Communication and Networks (ICCCN’14). IEEE, 1–6.
[28]
José F. Díez-Pastor, Juan J. Rodríguez, Cesar Garcia-Osorio, and Ludmila I. Kuncheva. 2015. Random balance: Ensembles of variable priors classifiers for imbalanced data. Knowl.-based Syst. 85 (2015), 96–111.
[29]
Lara Lusa and others. 2017. Gradient boosting for high-dimensional prediction of rare events. Computational Statistics & Data Analysis 113 (2017), 19–37.
[30]
Sarah Vluymans, Isaac Triguero, Chris Cornelis, and Yvan Saeys. 2016. EPRENNID: An evolutionary prototype reduction based ensemble for nearest neighbor classification of imbalanced data. Neurocomputing 216 (2016), 596–610.
[31]
Sergio Gónzalez, Salvador García, Marcelino Lázaro, Aníbal R. Figueiras-Vidal, and Francisco Herrera. 2017. Class switching according to nearest enemy distance for learning from highly imbalanced data-sets. Pattern Recog. 70 (2017), 12–24.
[32]
Dan Gan, Jiang Shen, Bang An, Man Xu, and Na Liu. 2020. Integrating TANBN with cost sensitive classification algorithm for imbalanced data in medical diagnosis. Comput. Industr. Eng. 140 (2020), 106266.
[33]
Salman H. Khan, Munawar Hayat, Mohammed Bennamoun, Ferdous A. Sohel, and Roberto Togneri. 2017. Cost-sensitive learning of deep feature representations from imbalanced data. IEEE Trans. Neural Netw. Learn. Syst. 29, 8 (2017), 3573–3587.
[34]
Chong Zhang, Kay Chen Tan, and Ruoxu Ren. 2016. Training cost-sensitive deep belief networks on imbalance data problems. In International Joint Conference on Neural Networks (IJCNN’16). IEEE, 4362–4367.
[35]
Chong Zhang, Kay Chen Tan, Haizhou Li, and Geok Soon Hong. 2018. A cost-sensitive deep belief network for imbalanced classification. IEEE Trans. Neural Netw. Learn. Syst. 30, 1 (2018), 109–122.
[36]
Weijie Zheng and Hong Zhao. 2020. Cost-sensitive hierarchical classification for imbalance classes. Appl. Intell. 50, 8 (2020), 2328–2338.
[37]
Victoria López, Alberto Fernández, María José Del Jesus, and Francisco Herrera. 2013. A hierarchical genetic fuzzy system based on genetic programming for addressing classification with highly imbalanced and borderline data-sets. Knowl.-based Syst. 38 (2013), 85–104.
[38]
Harshita Patel and Ghanshyam Singh Thakur. 2017. Classification of imbalanced data using a modified fuzzy-neighbor weighted approach. Int. J. Intell. Eng. Syst. 10, 1 (2017), 56–64.
[39]
Harshita Patel and G. S. Thakur. 2019. An improved fuzzy k-nearest neighbor algorithm for imbalanced data using adaptive approach. IETE J. Res. 65, 6 (2019), 780–789.
[40]
Maede Zolanvari, Marcio A. Teixeira, and Raj Jain. 2018. Effect of imbalanced datasets on security of industrial IoT using machine learning. In IEEE International Conference on Intelligence and Security Informatics (ISI’18). IEEE, 112–117.
[41]
Alberto Fernández, María José Del Jesus, and Francisco Herrera. 2010. Multi-class imbalanced data-sets with linguistic fuzzy rule based classification systems based on pairwise learning. In International Conference on Information Processing and Management of Uncertainty in Knowledge-based Systems. Springer, 89–98.
[42]
Xuebing Yang, Qiuming Kuang, Wensheng Zhang, and Guoping Zhang. 2017. AMDO: An over-sampling technique for multi-class imbalanced problems. IEEE Trans. Knowl. Data Eng. 30, 9 (2017), 1672–1685.
[43]
Zhongliang Zhang, Bartosz Krawczyk, Salvador Garcia, Alejandro Rosales-Pérez, and Francisco Herrera. 2016. Empowering one-vs-one decomposition with ensemble learning for multi-class imbalanced data. Knowl.-based Syst. 106 (2016), 251–263.
[44]
Qianmu Li, Yanjun Song, Jing Zhang, and Victor S. Sheng. 2020. Multiclass imbalanced learning with one-versus-one decomposition and spectral clustering. Expert Syst. Applic. 147 (2020), 113152.
[45]
Nutthaporn Junsomboon and Tanasanee Phienthrakul. 2017. Combining over-sampling and under-sampling techniques for imbalance dataset. In 9th International Conference on Machine Learning and Computing. 243–247.
[46]
Nitesh V. Chawla, Kevin W. Bowyer, Lawrence O. Hall, and W. Philip Kegelmeyer. 2002. SMOTE: Synthetic minority over-sampling technique. J. Arti. Intell. Res. 16 (2002), 321–357.
[47]
Barnan Das, Narayanan C. Krishnan, and Diane J. Cook. 2014. RACOG and wRACOG: Two probabilistic oversampling techniques. IEEE Trans. Knowl. Data Eng. 27, 1 (2014), 222–234.
[48]
Lida Abdi and Sattar Hashemi. 2015. To combat multi-class imbalanced problems by means of over-sampling techniques. IEEE Trans. Knowl. Data Eng. 28, 1 (2015), 238–251.
[49]
Show-Jane Yen and Yue-Shi Lee. 2009. Cluster-based under-sampling approaches for imbalanced data distributions. Expert Syst. Applic. 36, 3 (2009), 5718–5727.
[50]
Hu Li, Peng Zou, Xiang Wang, and Rongze Xia. 2013. A new combination sampling method for imbalanced data. In Chinese Intelligent Automation Conference. Springer, 547–554.
[51]
Ali Arshad, Saman Riaz, and Licheng Jiao. 2019. Semi-supervised deep fuzzy C-mean clustering for imbalanced multi-class classification. IEEE Access 7 (2019), 28100–28112.
[52]
Tanapol Kosolwattana, Chenang Liu, Renjie Hu, Shizhong Han, Hua Chen, and Ying Lin. 2023. A self-inspected adaptive SMOTE algorithm (SASMOTE) for highly imbalanced data classification in healthcare. BioData Min. 16, 1 (2023), 15.
[53]
Masoumeh Soleimani and Akram Sadat Mirshahzadeh. 2023. Multi-class classification of imbalanced intelligent data using deep neural network. EAI Endors. Trans. AI Robot. 2 (2023).
[54]
Sukarna Barua, Md Monirul Islam, and Kazuyuki Murase. 2015. GOS-IL: A generalized over-sampling based online imbalanced learning framework. In Neural Information Processing: 22nd International Conference, ICONIP 2015, Istanbul, Turkey, November 9-12, 2015, Proceedings, Part I 22. Springer, 680–687.
[55]
Shuo Wang, Leandro L. Minku, and Xin Yao. 2016. Dealing with multiple classes in online class imbalance learning. In International Joint Conference on Artificial Intelligence (IJCAI’16). 2118–2124.
[56]
Tahseen Al-Khateeb, Mohammad M. Masud, Khaled M. Al-Naami, Sadi Evren Seker, Ahmad M. Mustafa, Latifur Khan, Zouheir Trabelsi, Charu Aggarwal, and Jiawei Han. 2015. Recurring and novel class detection using class-based ensemble for evolving data stream. IEEE Trans. Knowl. Data Eng. 28, 10 (2015), 2752–2764.
[57]
Zahraa S. Abdallah, Mohamed Medhat Gaber, Bala Srinivasan, and Shonali Krishnaswamy. 2016. AnyNovel: Detection of novel concepts in evolving data streams. Evolv. Syst. 7, 2 (2016), 73–93.
[58]
Ahmad M. Mustafa, Gbadebo Ayoade, Khaled Al-Naami, Latifur Khan, Kevin W. Hamlen, Bhavani Thuraisingham, and Frederico Araujo. 2017. Unsupervised deep embedding for novel class detection over data stream. In IEEE International Conference on Big Data (Big Data’17). IEEE, 1830–1839.
[59]
Imen Khamassi, Moamar Sayed-Mouchaweh, Moez Hammami, and Khaled Ghédira. 2018. Discussion and review on evolving data streams and concept drift adapting. Evolv. Syst. 9, 1 (2018), 1–23.
[60]
Lucas Baier, Josua Reimold, and Niklas Kühl. 2020. Handling concept drift for predictions in business process mining. In IEEE 22nd Conference on Business Informatics (CBI’20). IEEE, 76–83.
[61]
Manzoor Ahmed Hashmani, Syed Muslim Jameel, Mobashar Rehman, and Atsushi Inoue. 2020. Concept drift evolution in machine learning approaches: A systematic literature review. Int. J. Smart Sens. Intell. Syst. 13, 1 (2020), 1.
[62]
Sheng Chen and Haibo He. 2009. SERA: Selectively recursive approach towards nonstationary imbalanced stream data mining. In International Joint Conference on Neural Networks. IEEE, 522–529.
[63]
Farnaz Sadeghi and Herna L. Viktor. 2021. Online-MC-queue: Learning from imbalanced multi-class streams. In 3rd International Workshop on Learning with Imbalanced Domains: Theory and Applications. PMLR, 21–34.
[64]
Shuo Wang, Leandro L. Minku, and Xin Yao. 2018. A systematic study of online class imbalance learning with concept drift. IEEE Trans. Neural Netw. Learn. Syst. 29, 10 (2018), 4802–4821.
[65]
S. Priya and R. Annie Uthra. 2021. RETRACTED ARTICLE: Comprehensive analysis for class imbalance data with concept drift using ensemble based classification. J. Amb. Intell. Human. Comput. 12, 5 (2021), 4943–4956.
[66]
Roberto S. M. Barros, Danilo R. L. Cabral, Paulo M. Gonçalves Jr, and Silas G. T. C. Santos. 2017. RDDM: Reactive drift detection method. Expert Syst. Applic. 90 (2017), 344–355.
[67]
Roberto Souto Maior Barros and Silas Garrido T. Carvalho Santos. 2018. A large-scale comparison of concept drift detectors. Inf. Sci. 451 (2018), 348–370.
[68]
Tegjyot Singh Sethi and Mehmed Kantardzic. 2018. Handling adversarial concept drift in streaming data. Expert Syst. Applic. 97 (2018), 18–40.
[69]
Moritz Heusinger, Christoph Raab, and Frank-Michael Schleif. 2022. Passive concept drift handling via variations of learning vector quantization. Neural Computing and Applications 34, 1 (2022), 89–100.
[70]
Arif Budiman, Mohamad Ivan Fanany, and Chan Basaruddin. 2016. Adaptive convolutional ELM for concept drift handling in online stream data. arXiv preprint arXiv:1610.02348 (2016).
[71]
Tao Peng, Sana Sellami, and Omar Boucelma. 2019. IoT data imputation with incremental multiple linear regression. Open J. Internet Things 5, 1 (2019), 69–79.
[72]
Shuo Wang and Leandro L. Minku. 2020. AUC estimation and concept drift detection for imbalanced data streams with multiple classes. In International Joint Conference on Neural Networks (IJCNN’20). IEEE, 1–8.
[73]
Łukasz Korycki and Bartosz Krawczyk. 2021. Concept drift detection from multi-class imbalanced data streams. In IEEE 37th International Conference on Data Engineering (ICDE’21). IEEE, 1068–1079.
[74]
S. Ancy and D. Paulraj. 2020. Handling imbalanced data with concept drift by applying dynamic sampling and ensemble classification model. Computer Communications 153 (2020), 553–560.
[75]
Zeng Li, Wenchao Huang, Yan Xiong, Siqi Ren, and Tuanfei Zhu. 2020. Incremental learning imbalanced data streams with concept drift: The dynamic updated ensemble algorithm. Knowl.-based Syst. 195 (2020), 105694.
[76]
Dariusz Brzezinski, Leandro L. Minku, Tomasz Pewinski, Jerzy Stefanowski, and Artur Szumaczuk. 2021. The impact of data difficulty factors on classification of imbalanced and concept drifting data streams. Knowl. Inf. Syst. 63, 6 (2021), 1429–1469.
[77]
Gregory Ditzler and Robi Polikar. 2012. Incremental learning of concept drift from streaming imbalanced data. IEEE Trans. Knowl. Data Eng. 25, 10 (2012), 2283–2301.
[78]
Jing Gao, Wei Fan, Jiawei Han, and Philip S. Yu. 2007. A general framework for mining concept-drifting data streams with skewed distributions. In Siam International Conference on Data Mining. SIAM, 3–14.
[79]
Josephine Akosa. 2017. Predictive accuracy: A misleading performance measure for highly imbalanced data. In SAS Global Forum.
[80]
Hung Cao and Monica Wachowicz. 2019. An edge-fog-cloud architecture of streaming analytics for Internet of Things applications. Sensors 19, 16 (2019), 3594.
[81]
Mercedes E. Paoletti, Oscar Mogollon-Gutierrez, Sergio Moreno-Álvarez, Jose Carlos Sancho, and Juan M. Haut. 2023. A comprehensive survey of imbalance correction techniques for hyperspectral data classification. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing 16 (2023), 5297–5314.
[82]
Debashree Devi, Saroj K. Biswas, and Biswajit Purkayastha. 2020. A review on solution to class imbalance problem: Undersampling approaches. In International Conference on Computational Performance Evaluation (ComPE’20). IEEE, 626–631.
[83]
Sanchita Pandey and Kuldeep Kumar. 2023. Software fault prediction for imbalanced data: A survey on recent developments. Proced. Comput. Sci. 218 (2023), 1815–1824.
[84]
Abhisar Sharma, Anuradha Purohit, and Himani Mishra. 2021. A survey on imbalanced data handling techniques for classification. Int. J. Emerg. Trends Eng. Res. 9, 10 (2021).
[85]
Shaik Johny Basha, Srinivasa Rao Madala, Kolla Vivek, Eedupalli Sai Kumar, and Tamminina Ammannamma. 2022. A review on imbalanced data classification techniques. In International Conference on Advanced Computing Technologies and Applications (ICACTA’22). IEEE, 1–6.
[86]
Megha Ashok Patil, Sunil Kumar, Sandeep Kumar, and Muskan Garg. 2021. Concept drift detection for social media: A survey. In 3rd International Conference on Advances in Computing, Communication Control and Networking (ICAC3N’21). IEEE, 12–16.
[87]
Meng Han, Zhiqiang Chen, Muhang Li, Hongxin Wu, and Xilong Zhang. 2022. A survey of active and passive concept drift handling methods. Comput. Intell. 38, 4 (2022), 1492–1535.
[88]
Jiguo Yu, Yingying Qi, Guanghui Wang, and Xin Gu. 2012. A cluster-based routing protocol for wireless sensor networks with nonuniform node distribution. AEU-Int. J. Electron. Commun. 66, 1 (2012), 54–61.
[89]
Nawel Yala, Belkacem Fergani, Laurent Clavier, and others. 2014. Soft margin SVM modeling for handling imbalanced human activity datasets in multiple homes. In 2014 International Conference on Multimedia Computing and Systems (ICMCS). IEEE, 421–426.
[90]
Hong Zhou and Kun-Ming Yu. 2017. Imbalanced data classification for defective product prediction based on industrial wireless sensor network. In 6th International Conference on Future Generation Communication Technologies (FGCT’17). IEEE, 1–6.
[91]
Hang Yang, Simon Fong, Raymond Wong, and Guangmin Sun. 2013. Optimizing classification decision trees by using weighted naïve Bayes predictors to reduce the imbalanced class problem in wireless sensor network. Int. J. Distrib. Sensor Netw. 9, 1 (2013), 460641.
[92]
Harshita Patel, Dharmendra Singh Rajput, G. Thippa Reddy, Celestine Iwendi, Ali Kashif Bashir, and Ohyun Jo. 2020. A review on classification of imbalanced data for wireless sensor networks. Int. J. Distrib. Sensor Netw. 16, 4 (2020), 1550147720916404.
[93]
Sireesha Rodda and Uma Shankar Rao Erothi. 2016. Class imbalance problem in the network intrusion detection systems. In International Conference on Electrical, Electronics, and Optimization Techniques (ICEEOT’16). IEEE, 2685–2688.
[94]
Sitaram Asur and Srinivasan Parthasarathy. 2007. Correlation-based feature partitioning for rare event detection in wireless sensor networks. In 1st International Workshop on Knowledge Discovery from Sensor Data (Sensor-KDD’07).
[95]
B. Abidine M’hamed and Belkacem Fergani. 2014. A new multi-class WSVM classification to imbalanced human activity dataset. J. Comput. 9, 7 (2014), 1560–1565.
[96]
Yujing Chen, Zheng Chai, Yue Cheng, and Huzefa Rangwala. 2021. Asynchronous federated learning for sensor data with concept drift. In IEEE International Conference on Big Data (Big Data’21). IEEE, 4822–4831.
[97]
Shenglan Liu, Lin Feng, Jun Wu, Gang Hou, and Guangjie Han. 2017. Concept drift detection for data stream learning based on angle optimized global embedding and principal component analysis in sensor networks. Comput. Electric. Eng. 58 (2017), 327–336.
[98]
Evangelos Bagkis, Theodosios Kassandros, and Kostas Karatzas. 2022. Learning calibration functions on the fly: Hybrid batch online stacking ensembles for the calibration of low-cost air quality sensor networks in the presence of concept drift. Atmosphere 13, 3 (2022), 416.
[99]
Swaraj Dube, Wong Yee Wan, and Hermawan Nugroho. 2021. A novel approach of IoT stream sampling and model update on the IoT edge device for class incremental learning in an edge-cloud system. IEEE Access 9 (2021), 29180–29199.
[100]
Akbar Telikani and Amir H. Gandomi. 2021. Cost-sensitive stacked auto-encoders for intrusion detection in the Internet of Things. Internet Things 14 (2021), 100122.
[101]
Bharath Sudharsan, John G. Breslin, and Muhammad Intizar Ali. 2021. Imbal-OL: Online machine learning from imbalanced data streams in real-world IoT. In IEEE International Conference on Big Data (Big Data’21). IEEE, 4974–4978.
[102]
Gaurav Mohindru, Koushik Mondal, and Haider Banka. 2021. Different hybrid machine intelligence techniques for handling IoT-based imbalanced data. CAAI Trans. Intell. Technol. 6, 4 (2021), 405–416.
[103]
Razan Abdulhammed, Miad Faezipour, Abdelshakour Abuzneid, and Arafat AbuMallouh. 2018. Deep and machine learning approaches for anomaly-based intrusion detection of imbalanced network traffic. IEEE Sensors Lett. 3, 1 (2018), 1–4.
[104]
Mohamed Amine Ferrag, Leandros Maglaras, Ahmed Ahmim, Makhlouf Derdour, and Helge Janicke. 2020. RDTIDS: Rules and decision tree-based intrusion detection system for internet-of-things networks. Fut. Internet 12, 3 (2020), 44.
[105]
JooHwa Lee and KeeHyun Park. 2021. GAN-based imbalanced data intrusion detection system. Person. Ubiq. Comput. 25, 1 (2021), 121–128.
[106]
Chun-Cheng Lin, Der-Jiunn Deng, Chin-Hung Kuo, and Linnan Chen. 2019. Concept drift detection and adaption in big imbalance industrial IoT data using an ensemble learning method of offline classifiers. IEEE Access 7 (2019), 56198–56207.
[107]
Sikha Bagui and Kunqi Li. 2021. Resampling imbalanced data for network intrusion detection datasets. J. Big Data 8, 1 (2021), 1–41.
[108]
Pandit Byomakesha Dash, Janmenjoy Nayak, Bighnaraj Naik, Etuari Oram, and S. K. Hafizul Islam. 2020. Model based IoT security framework using multiclass adaptive boosting with SMOTE. Secur. Privac. 3, 5 (2020), e112.
[109]
Pankaj Kumar Keserwani, Mahesh Chandra Govil, Emmanuel S. Pilli, and Prajjval Govil. 2021. A smart anomaly-based intrusion detection system for the Internet of Things (IoT) network using GWO–PSO–RF model. J. Reliab. Intell. Environ. 7, 1 (2021), 3–21.
[110]
Mashaal A. Alfhaid and Manal A. Abdullah. 2022. ICSM: Imbalanced chunk-based stream model. Int. J. Innov., Creativ. Change 16 (2022).
[111]
Vinay Arora, Rohan Singh Leekha, Kyungroul Lee, and Aman Kataria. 2020. Facilitating user authorization from imbalanced data logs of credit cards using artificial intelligence. Mobile Information Systems 2020, 1 (2020), 8885269.
[112]
Siqi Ren, Bo Liao, Wen Zhu, Zeng Li, Wei Liu, and Keqin Li. 2018. The gradual resampling ensemble for mining imbalanced data streams with concept drift. Neurocomputing 286 (2018), 150–166.
[113]
Rafiq Ahmed Mohammed, Kok-Wai Wong, Mohd Fairuz Shiratuddin, and Xuequn Wang. 2020. PWIDB: A framework for learning to classify imbalanced data streams with incremental data re-balancing technique. Proced. Comput. Sci. 176 (2020), 818–827.
[114]
Li Yang, Dimitrios Michael Manias, and Abdallah Shami. 2021. PWPAE: An ensemble framework for concept drift adaptation in IoT data streams. In IEEE Global Communications Conference (GLOBECOM’21). IEEE, 01–06.
[115]
Asim Roy. 2016. Two-layered ensemble Kohonen nets for imbalanced streaming data. In IEEE Congress on Evolutionary Computation (CEC’16). IEEE, 5215–5221.
[116]
Seba Susan and Amitesh Kumar. 2021. The balancing trick: Optimized sampling of imbalanced datasets–A brief survey of the recent state of the art. Eng. Rep. 3, 4 (2021), e12298.
[117]
Amin Shahraki, Mahmoud Abbasi, Amir Taherkordi, and Anca Delia Jurcut. 2022. A comparative study on online machine learning techniques for network traffic streams analysis. Comput. Netw. 207 (2022), 108836.
[118]
Hongle Du, Yan Zhang, Ke Gang, Lin Zhang, and Yeh-Cheng Chen. 2021. Online ensemble learning algorithm for imbalanced data stream. Applied Soft Computing 107 (2021), 107378. DOI:
[119]
Manal Almuammar and Maria Fasli. 2018. Learning patterns from imbalanced evolving data streams. In IEEE International Conference on Big Data (Big Data’18). IEEE, 2048–2057.
[120]
Yange Sun, Yi Sun, and Honghua Dai. 2020. Two-stage cost-sensitive learning for data streams with concept drift and class imbalance. IEEE Access 8 (2020), 191942–191955.
[121]
Debanjan Borthakur, Harishchandra Dubey, Nicholas Constant, Leslie Mahler, and Kunal Mankodiya. 2017. Smart fog: Fog computing framework for unsupervised clustering analytics in wearable Internet of Things. In IEEE Global Conference on Signal and Information Processing (GlobalSIP’17). IEEE, 472–476.
[122]
Branka Mikavica and Aleksandra Kostić-Ljubisavljević. 2019. Fog Computing in Logistics Systems. Logic.
[123]
Yohan Joo, Jaehyeong Lee, and Jongpil Jeong. 2020. Ensemble fog computing architecture for unstable state detection of hydraulic system. Proced. Comput. Sci. 175 (2020), 230–236.
[124]
Mohammed Anis Benblidia, Bouziane Brik, Leila Merghem-Boulahia, and Moez Esseghir. 2019. Ranking fog nodes for tasks scheduling in fog-cloud environments: A fuzzy logic approach. In 15th International Wireless Communications & Mobile Computing Conference (IWCMC’19). IEEE, 1451–1457.
[125]
Arman Anzanpour, Humayun Rashid, Amir M. Rahmani, Axel Jantsch, Nikil Dutt, and Pasi Liljeberg. 2019. Energy-efficient and reliable wearable Internet-of-Things through fog-assisted dynamic goal management. Proced. Comput. Sci. 151 (2019), 493–500.
[126]
Shengjie Xu, Yi Qian, and Rose Qingyang Hu. 2019. A semi-supervised learning approach for network anomaly detection in fog computing. In IEEE International Conference on Communications (ICC’19). IEEE, 1–6.
[127]
Saurabh Shukla, Mohd Fadzil Hassan, Muhammad Khalid Khan, Low Tang Jung, and Azlan Awang. 2019. An analytical model to minimize the latency in healthcare internet-of-things in fog computing environment. PLoS One 14, 11 (2019), e0224934.
[128]
Saurabh Shukla, Mohd Fadzil Hassan, Low Tang Jung, Azlan Awang, and Muhammad Khalid Khan. 2019. A 3-tier architecture for network latency reduction in healthcare internet-of-things using fog computing and machine learning. In 8th International Conference on Software and Computer Applications. 522–528.
[129]
Jakub Jalowiczor, Jan Rozhon, and Miroslav Voznak. 2021. Study of the efficiency of fog computing in an optimized LoRaWAN cloud architecture. Sensors 21, 9 (2021), 3159.
[130]
Nader Mohamed, Jameela Al-Jaroodi, Sanja Lazarova-Molnar, and Imad Jawhar. 2021. Applications of integrated IoT-fog-cloud systems to smart cities: A survey. Electronics 10, 23 (2021), 2918.
[131]
AIA Alzahrani, A. Al-Rasheed, A. Ksibi, M. Ayadi, M. M. Asiri, and M. Zakariah. 2022. Anomaly detection in fog computing architectures using custom tab transformer for internet of things. Electronics 11, 23 (2022), 4017.
[132]
Shusen Yang. 2017. IoT stream processing and analytics in the fog. IEEE Commun. Mag. 55, 8 (2017), 21–27.
[133]
Shaimaa A. Hussein, Ahmed I. Saleh, Hossam El-Din Mostafa, and Marwa I. Obay. 2021. A Hybrid Security Strategy (HS2) for Reliable Video Streaming in Fog Computing (Retraction of Vol 51, art no 102412, 2020). Elsevier Radarweg 29, 1043 NX Amsterdam, Netherlands.
[134]
Jyun-Yao Huang, Wei-Chih Hong, Po-Shin Tsai, and I-En Liao. 2017. A model for aggregation and filtering on encrypted XML streams in fog computing. Int. J. Distrib. Sensor Netw. 13, 5 (2017), 1550147717704158.
[135]
Brenno M. Alencar, Ricardo A. Rios, Cleber Santana, and Cássio Prazeres. 2020. FoT-Stream: A fog platform for data stream analytics in IoT. Comput. Commun. 164 (2020), 77–87.
[136]
Elarbi Badidi and Karima Moumane. 2019. Enhancing the processing of healthcare data streams using fog computing. In IEEE Symposium on Computers and Communications (ISCC’19). IEEE, 1113–1118.
[137]
Paulo Bezerra, Adalberto Melo, Allan Douglas, Hugo Santos, Denis Rosário, and Eduardo Cerqueira. 2019. A collaborative routing protocol for video streaming with fog computing in vehicular ad hoc networks. Int. J. Distrib. Sensor Netw. 15, 3 (2019), 1550147719832839.
[138]
Ivan Merelli, Lucia Morganti, Elena Corni, Carmelo Pellegrino, Daniele Cesini, Luca Roverelli, Gabriele Zereik, and Daniele D’Agostino. 2018. Low-power portable devices for metagenomics analysis: Fog computing makes bioinformatics ready for the Internet of Things. Fut. Gen. Comput. Syst. 88 (2018), 467–478.
[139]
Juan Luis Pérez, Alberto Gutierrez-Torre, Josep Lluís Berral, and David Carrera. 2018. A resilient and distributed near real-time traffic forecasting application for Fog computing environments. Fut. Gen. Comput. Syst. 87 (2018), 198–212.
[140]
Maria A. Rodriguez, Ramamohanarao Kotagiri, and Rajkumar Buyya. 2018. Detecting performance anomalies in scientific workflows using hierarchical temporal memory. Fut. Gen. Comput. Syst. 88 (2018), 624–635.
[141]
Asif Muhammad and Muhammad Aleem. 2021. A3-Storm: Topology-, traffic-, and resource-aware storm scheduler for heterogeneous clusters. J. Supercomput. 77, 2 (2021), 1059–1093.
[142]
Daniele De Sensi, Tiziano De Matteis, and Marco Danelutto. 2018. Simplifying self-adaptive and power-aware computing with Nornir. Fut. Gen. Comput. Syst. 87 (2018), 136–151.
[143]
Ivan Walulya, Dimitris Palyvos-Giannas, Yiannis Nikolakopoulos, Vincenzo Gulisano, Marina Papatriantafilou, and Philippas Tsigas. 2018. Viper: A module for communication-layer determinism and scaling in low-latency stream processing. Fut. Gen. Comput. Syst. 88 (2018), 297–308.
[144]
Valeria Cardellini, Francesco Lo Presti, Matteo Nardelli, and Gabriele Russo Russo. 2018. Decentralized self-adaptation for elastic data stream processing. Fut. Gen. Comput. Syst. 87 (2018), 171–185.
[145]
Claudia Misale, Maurizio Drocco, Guy Tremblay, Alberto R. Martinelli, and Marco Aldinucci. 2018. PiCo: High-performance data analytics pipelines in modern C++. Fut. Gen. Comput. Syst. 87 (2018), 392–403.
[146]
Zoltán Zvara, Péter G. N. Szabó, Barnabás Balázs, and András Benczúr. 2019. Optimizing distributed data stream processing by tracing. Fut. Gen. Comput. Syst. 90 (2019), 578–591.
[147]
Arvind Kumar Gangwar and Sandeep Kumar. 2023. Concept drift in software defect prediction: A method for detecting and handling the drift. ACM Trans. Internet Technol. 23, 2 (2023), 1–28.
[148]
Ature Angbera and Huah Yong Chan. 2024. An adaptive XGBoost-based optimized sliding window for concept drift handling in non-stationary spatiotemporal data streams classifications. J. Supercomput. 80, 6 (2024), 7781–7811.
[149]
Ketan Sanjay Desale and Swati V. Shinde. 2023. Concept drift detection and adaption framework using optimized deep learning and adaptive sliding window approach. Expert Syst. 40, 9 (2023), e13394.
[150]
Tao Huang, Boyi Xu, Hongming Cai, Jiawei Du, Kuo-Ming Chao, and Chengxi Huang. 2018. A fog computing based concept drift adaptive process mining framework for mobile APPs. Fut. Gen. Comput. Syst. 89 (2018), 670–684.
[151]
Brenno M. Alencar, João Paulo Canário, Ruivaldo Lobão Neto, Cássio Prazeres, Abert Bifet, and Ricardo A. Rios. 2023. Fog-DeepStream: A new approach combining LSTM and concept drift for data stream analytics on Fog computing. Internet Things 22 (2023), 100731.
[152]
Amir Hossein Estiri and Muthucumaru Maheswaran. 2021. Attentive federated learning for concept drift in distributed 5G edge networks. arXiv preprint arXiv:2111.07457 (2021).
[153]
Hassan Mehmood, Panos Kostakos, Marta Cortes, Theodoros Anagnostopoulos, Susanna Pirttikangas, and Ekaterina Gilman. 2021. Concept drift adaptation techniques in distributed environment for real-world data streams. Smart Cities 4, 1 (2021), 349–371.
[154]
I. Made Murwantara and Pujianto Yugopuspito. 2021. An adaptive IoT architecture using combination of concept-drift and dynamic software product line engineering. TELKOMNIKA (Telecommun. Comput. Electron. Contr.) 19, 4 (2021), 1226–1233.
[155]
Pallavi Kulkarni and Roshani Ade. 2014. Incremental learning from unbalanced data with concept class, concept drift and missing features: A review. Int. J. Data Min. Knowl. Manag. Process 4, 6 (2014), 15.
[156]
Shikha Mehta and others. 2017. Concept drift in streaming data classification: algorithms, platforms and issues. Procedia Computer Science 122 (2017), 804–811.

Cited By

View all
  • (2024)A Survey on Reduction of Energy Consumption in Fog Networks—Communications and ComputationsSensors10.3390/s2418606424:18(6064)Online publication date: 19-Sep-2024
  • (2024)Multi-classifier semi-supervised data stream classification algorithm based on online learningProceedings of the 2024 2nd International Conference on Electronics, Computers and Communication Technology10.1145/3705754.3705777(127-132)Online publication date: 25-Oct-2024

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Computing Surveys
ACM Computing Surveys  Volume 57, Issue 1
January 2025
984 pages
EISSN:1557-7341
DOI:10.1145/3696794
  • Editors:
  • David Atienza,
  • Michela Milano
Issue’s Table of Contents

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 October 2024
Online AM: 22 August 2024
Accepted: 06 August 2024
Revised: 13 July 2024
Received: 28 January 2023
Published in CSUR Volume 57, Issue 1

Check for updates

Author Tags

  1. Cloud computing
  2. fog computing
  3. Internet of Things (IoT)
  4. multi-class imbalanced data stream
  5. concept drift

Qualifiers

  • Survey

Funding Sources

  • EU HORIZON-TMA-MSCA-SE project TRACE-V2X

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)497
  • Downloads (Last 6 weeks)87
Reflects downloads up to 07 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2024)A Survey on Reduction of Energy Consumption in Fog Networks—Communications and ComputationsSensors10.3390/s2418606424:18(6064)Online publication date: 19-Sep-2024
  • (2024)Multi-classifier semi-supervised data stream classification algorithm based on online learningProceedings of the 2024 2nd International Conference on Electronics, Computers and Communication Technology10.1145/3705754.3705777(127-132)Online publication date: 25-Oct-2024

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Full Text

View this article in Full Text.

Full Text

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media