Skip to main content
Log in

Employing Feature Extraction, Feature Selection, and Machine Learning to Classify Electricity Consumption as Normal or Electricity Theft

  • Original Research
  • Published:
SN Computer Science Aims and scope Submit manuscript

Abstract

One of the main causes of revenue loss in the energy sector across the globe has been non-technical losses. Electricity theft is a non-technical loss that affects the power supply’s quality. Detecting electricity theft is crucial for conserving energy and making use of it effectively. This research proposes a method through which higher accuracy in electricity theft detection can be obtained using fewer features. Different feature extraction and feature selection techniques are examined to find the best method for selecting the features that are more relevant in electricity theft detection. Various experiments are carried out using feature selection and feature extraction methods, such as mutual information, low variance filtering, and Principal Component Analysis. Various machine learning-based classifiers are used that include Random Forest, Support Vector Machine, K-Nearest Neighbours, Naive Bayes, and Decision Tree. Results of the experiments are tabulated based on standard performance measures, namely accuracy, precision, recall, and Area Under the Receiver Operating Characteristic Curve (AUC-ROC) score. According to experimental findings, the Random Forest classifier with 30 components for PCA outperformed other methods by producing the best accuracy of 95.82%, recall of 0.938, and AUC-ROC-score of 0.989.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

Data availability

Data will be made available based on request.

References

  1. Wu R, Wang L, Hu T. Adaboost-SVM for electrical theft detection and GRNN for stealing time periods identification. In: Proceedings: IECON 2018-44th Annual Conference of the IEEE Industrial Electronics Society, 2018; p. 3073–3078.

  2. Buzau MM, Tejedor-Aguilera J, Cruz-Romero P, Gómez-Expósito A. Hybrid deep neural networks for detection of non-technical losses in electricity smart meters. IEEE Trans Power Syst. 2020;35:1254–63. https://doi.org/10.1109/TPWRS.2019.2943115.

    Article  Google Scholar 

  3. Gunturi SK, Sarkar D. Ensemble machine learning models for the detection of energy theft. Electr Power Syst Res. 2021;192: 106904. https://doi.org/10.1016/j.epsr.2020.106904.

    Article  Google Scholar 

  4. Kong X, Zhao X, Liu C, Li Q, Dong DL, Li Y. Electricity theft detection in low-voltage stations based on similarity measure and DT-KSVM. Int J Electric Power Energy Syst. 2021. https://doi.org/10.1016/j.ijepes.2020.106544.

    Article  Google Scholar 

  5. Ullah A, Javaid N, Asif M, Javed MU, Yahaya AS. AlexNet, AdaBoost and Artificial Bee Colony based hybrid model for electricity theft detection in smart grids. IEEE Access. 2022;10:18681–94. https://doi.org/10.1109/ACCESS.2022.3150016.

    Article  Google Scholar 

  6. Zheng Z, Yang Y, Niu X, Dai HN, Zhou Y. Wide and deep convolutional neural networks for electricity-theft detection to secure smart grids. IEEE Trans Ind Inf. 2018;14:1606–15. https://doi.org/10.1109/TII.2017.2785963.

    Article  Google Scholar 

  7. Huang Y, Xu Q. Electricity theft detection based on stacked sparse denoising autoencoder. Int J Electr Power Energy Syst. 2021;125: 106448. https://doi.org/10.1016/j.ijepes.2020.106448.

    Article  Google Scholar 

  8. Gao HX, Kuenzel S, Zhang XY. A hybrid ConvLSTM-based anomaly detection approach for combating energy theft. IEEE Trans Instrum Meas XX. 2022. https://doi.org/10.1109/TIM.2022.3201569.

    Article  Google Scholar 

  9. Alsalemi A, Himeur Y, Bensaali F, Amira A, Sardianos C, Chronis C, Varlamis I, Dimitrakopoulos G. A micro-moment system for domestic energy efficiency analysis. IEEE Syst J. 2021;15:1256–63. https://doi.org/10.1109/JSYST.2020.2997773.

    Article  Google Scholar 

  10. Hussain S, Mustafa MW, Jumani TA, Baloch SK, Alotaibi H, Khan I, Khan A. A novel feature engineered-CatBoost-based supervised machine learning framework for electricity theft detection. Energy Rep. 2021;7:4425–36. https://doi.org/10.1016/j.egyr.2021.07.008.

    Article  Google Scholar 

  11. Khan IU, Javeid N, Taylor CJ, Gamage KAA, Ma X. A stacked machine and deep learning-based approach for analysing electricity theft in smart grids. IEEE Trans Smart Grid. 2022;13:1633–44. https://doi.org/10.1109/TSG.2021.3134018.

    Article  Google Scholar 

  12. Xia X, Xiao Y, Liang W, Cui J. Detection methods in smart meters for electricity thefts: a survey. Proc IEEE. 2022;110:273–319. https://doi.org/10.1109/JPROC.2021.3139754.

    Article  Google Scholar 

  13. Yan Z, Wen H. Electricity theft detection base on extreme gradient boosting in AMI. IEEE Trans Instrum Meas. 2021. https://doi.org/10.1109/TIM.2020.3048784.

    Article  Google Scholar 

  14. Peng Y, Yang Y, Xu Y, Xue Y, Song R, Kang J, Zhao H. Electricity theft detection in AMI based on clustering and local outlier factor. IEEE Access. 2021;9:107250–9. https://doi.org/10.1109/ACCESS.2021.3100980.

    Article  Google Scholar 

  15. Lepolesa LJ, Achari S, Cheng L. Electricity theft detection in smart grids based on deep neural network. IEEE Access. 2022;10:39638–55. https://doi.org/10.1109/ACCESS.2022.3166146.

    Article  Google Scholar 

  16. Shehzad F, Javaid N, Aslam S, Umar Javaid M. Electricity theft detection using big data and genetic algorithm in electric power systems. Electr Power Syst Res. 2022;209: 107975. https://doi.org/10.1016/j.epsr.2022.107975.

    Article  Google Scholar 

  17. Liao W, Yang Z, Liu K, Zhang B, Chen X, Song R. Electricity theft detection using Euclidean and graph convolutional neural networks. IEEE Trans Power Syst. 2022. https://doi.org/10.1109/TPWRS.2022.3196403.

    Article  Google Scholar 

  18. Ahmed M, Khan A, Ahmed M, Tahir M, Jeon G, Fortino G, Piccialli F. Energy theft detection in smart grids: taxonomy, comparative analysis, challenges, and future research directions. IEEE/CAA J Autom Sin. 2022. https://doi.org/10.1109/JAS.2022.105404.

    Article  Google Scholar 

  19. Althobaiti A, Jindal A, Marnerides AK, Roedig U. Energy theft in smart grids: a survey on data-driven attack strategies and detection methods. IEEE Access. 2021;9:159291–312. https://doi.org/10.1109/ACCESS.2021.3131220.

    Article  Google Scholar 

  20. Kulkarni Y, Sayf Hussain Z, Ramamritham K, Somu N. EnsembleNTLDetect: an intelligent framework for electricity theft detection in smart grid. In: IEEE International Conference on Data Mining Workshops, ICDMW 2021-December, 2021; p. 527–536. https://doi.org/10.1109/ICDMW53433.2021.00070.

  21. Mangat G, Divya D, Gupta V, Sambyal N. Power theft detection using deep neural networks. Electr Power Compon Syst. 2021;49:458–73. https://doi.org/10.1080/15325008.2021.1970055.

    Article  Google Scholar 

  22. Arif A, Alghamdi TA, Khan ZA, Javaid N. Towards efficient energy utilization using big data analytics in smart cities for electricity theft detection. Big Data Res. 2022;27: 100285. https://doi.org/10.1016/j.bdr.2021.100285.

    Article  Google Scholar 

  23. Aldegheishem A, Anwar M, Javaid N, Alrajeh N, Shafiq M, Ahmed H. Towards sustainable energy efficiency with intelligent electricity theft detection in smart grids emphasising enhanced neural networks. IEEE Access. 2021;9:25036–61. https://doi.org/10.1109/ACCESS.2021.3056566.

    Article  Google Scholar 

  24. Wen M, Xie R, Lu K, Wang L, Zhang K. FedDetect: a novel privacy-preserving federated learning framework for energy theft detection in smart grid. IEEE Internet Things J. 2022;9:6069–80.

    Article  Google Scholar 

  25. Jia W, Sun M, Lian J, Hou S. Feature dimensionality reduction: a review. Complex Intel Syst. 2022;8:2663–93. https://doi.org/10.1007/s40747-021-00637-x.

    Article  Google Scholar 

  26. Bommert A, Sun X, Bischl B, Rahnenführer J, Lang M. Benchmark for filter methods for feature selection in high-dimensional classification data. Comput Stat Data Anal. 2020. https://doi.org/10.1016/j.csda.2019.106839.

    Article  MathSciNet  MATH  Google Scholar 

Download references

Funding

This manuscript’s preparation is not supported financially in any way.

Author information

Authors and Affiliations

Authors

Contributions

Rajesh Nayak: Conceptualization of this study, Data collection, Methodology, Original draft preparation. Jaidhar C.D.: Conceptualization of this study, Methodology, Review and Editing, Supervision.

Corresponding author

Correspondence to Rajesh Nayak.

Ethics declarations

Conflict of interest

On behalf of all authors, the corresponding author states that there is no conflict of interest.

Compliance with Ethical Standards

This article does not contain any studies with human or animal subjects.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This article is part of the topical collection “Research Trends in Computational Intelligence” guest edited by Anshul Verma, Pradeepika Verma, Vivek Kumar Singh and S. Karthikeyan.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Nayak, R., Jaidhar, C.D. Employing Feature Extraction, Feature Selection, and Machine Learning to Classify Electricity Consumption as Normal or Electricity Theft. SN COMPUT. SCI. 4, 483 (2023). https://doi.org/10.1007/s42979-023-01911-0

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s42979-023-01911-0

Keywords

Navigation