Skip to main content

Advertisement

Log in

An adaptive ensemble classification framework for real-time data streams by distributed control systems

  • S.I. : Emerging Trends of Applied Neural Computation - E_TRAINCO
  • Published:
Neural Computing and Applications Aims and scope Submit manuscript

Abstract

Smart Grids are critical infrastructure networks. They play a critical role in the survival of our postmodern economies, as all other areas depend on their availability. An interruption in their operation may have a direct impact on the availability of other services (e.g., health, transportation). The problem is particularly intense when no backup networks are available, or the required recovery time is beyond backup autonomy. The transition to a decentralized management and control system for these networks requires digital technologies, advanced interconnected system communications, and Internet access. These technologies expose critical infrastructure networks to external threats that require careful assessment of cyber-security risks and appropriate countermeasures. An important factor that enhances the range of threats is the heterogeneity of Smart Grids, which incorporate industrial control systems such as the SCADA, distributed control system, and programmable logic controllers to which security improvements may not have been made since they were installed. Υet, another serious problem arises from the fact that older technologies were designed at times when cyber-security was not part of their technical design specifications. At the same time, it should be seriously considered that many of the systems of these networks that can be cyber-attacked may not be easily disconnected, as this could potentially lead to generalized operational problems. In this scientific research, a sophisticated active security framework is proposed, which is based solely on advanced computational intelligence methods and concerns the digital security of critical infrastructure networks. Specifically, this research introduces a sophisticated adaptive ensemble classification framework for real-time data streams by distributed control systems. It is a “Kappa” architecture framework that is based on a two-step online ensemble learning model based on bagging and boosting methods. The aim is performance of real-time analysis and evaluation of data flows from Smart Grids, toward the effective identification of APT attacks.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

References

  1. Babu B, Ijyas T, Muneer P, Varghese J (2017) Security issues in SCADA based industrial control systems. In: 2017 2nd international conference on anti-cyber crimes (ICACC), Abha, 2017, pp 47–51. https://doi.org/10.1109/anti-cybercrime.2017.7905261

  2. Haq EU, Xu H, Pan L, Khattak MI Smart grid security: threats and solutions. In: 2017 13th international conference on semantics, knowledge and grids (SKG), Beijing, China, 2017, pp 188–193. https://doi.org/10.1109/skg.2017.00039

  3. Tan S, De D, Song WZ, Yang J, Das SK (2017) Survey of security advances in smart grid: a data driven approach. IEEE Commun Surv Tutor 19(1):397–422. https://doi.org/10.1109/comst.2016.2616442

    Article  Google Scholar 

  4. Raj VS, Chezhian RM, Mrithulashri M (2014) Advanced persistent threats & recent high profile cyber threat encounters. Int J Innovative Res Comput Commun Eng (An ISO 3297: 2007 Certified Organization) 2(1)

  5. Virvilis N, Gritzalis D, Apostolopoulos T (2013) Trusted computing vs. advanced persistent threats: can a defender win this game?, In: Proceedings of 10th IEEE international conference on autonomic and trusted computing (ATC-2013), IEEE Press, Italy, pp 396–403

  6. www.damballa.com. Accessed 5 Nov 2019

  7. www.crowdstrike.com. Accessed 5 Nov 2019

  8. Aretz K, Bartram SM, Pope PF (2011) Asymmetric loss functions and the rationality of expected stock returns. Int J Forecast 27(2):413–437. https://doi.org/10.1016/j.ijforecast.2009.10.008.SSRN889323

    Article  Google Scholar 

  9. Kushner H, Yin GG (1997) Stochastic approximation algorithms and applications. Springer, New York 2nd ed., titled Stochastic approximation and recursive algorithms and applications, 2003. ISBN: 0-387-00894-2. ISBN 0-387-94916-X

    Book  Google Scholar 

  10. Kurlej B, Wozniak M (2012) Active learning approach to concept drift problem. Logic J IGPL 20(3):550–559. https://doi.org/10.1093/jigpal/jzr011

    Article  MathSciNet  Google Scholar 

  11. Kiran M, Murphy P, Monga I, Dugan J, Baveja SS, Lambda architecture for cost-effective batch and speed big data processing. In: 2015 IEEE International conference on big data (big data), Santa Clara, CA, 2015, pp 2785–2792. https://doi.org/10.1109/bigdata.2015.7364082

  12. Yamato Y, Kumazaki H, Fukumoto Y (2016) Proposal of Lambda architecture adoption for real time predictive maintenance. In: 2016 fourth international symposium on computing and networking (CANDAR), pp 713–715. https://doi.org/10.1109/CANDAR.2016.0130

  13. Lin J (2017) The lambda and the kappa. IEEE Int Comput 21(5):60–66. https://doi.org/10.1109/MIC.2017.3481351

    Article  Google Scholar 

  14. Breiman L (2001) Random forests. Mach Learn 45(1):5–32

    Article  Google Scholar 

  15. Gomes HM, Bifet A, Read J, Barddal JP, Enembreck F, Pfharinger B, Holmes G, Abdessalem T (2017) Adaptive random forests for evolving data stream classification. Mach Learn. https://doi.org/10.1007/s10994-017-5642-8

    Article  MathSciNet  MATH  Google Scholar 

  16. Zhou ZH (2012) Ensemble methods foundations and algorithms. CRC Press, Boca Raton

    Book  Google Scholar 

  17. Kuncheva L (2004) Combining pattern classifiers: methods and algorithms. Wiley, Hoboken

    Book  Google Scholar 

  18. Bonab HR, Can F (2016) A theoretical framework on the ideal number of classifiers for online ensembles in data streams. In: CIKM. ACM, USA, p 2053

  19. Dietterich TG (2001) Ensemble methods in machine learning. In: Kittler J, Roli F (eds) Multiple classifier systems. LNCS vol 1857. Springer, Cham, pp 1–15

  20. Webb GI, Zheng Z (2004) Multistrategy ensemble learning: Reducing error by combining ensemble learning techniques. IEEE Trans Knowl Data Eng 16(8):980–991. https://doi.org/10.1007/s00521-016-2591-2

    Article  Google Scholar 

  21. Tsoumakas G, Angelis L, Vlahavas IP (2005) Selective fusion of heterogeneous classifiers. Intell Data Anal 9(6):511–525

    Article  Google Scholar 

  22. Strutz T (2010) Data fitting and uncertainty (a practical introduction to weighted least squares and beyond). Vieweg + Teubner, Berlin. ISBN 978-3-8348-1022-9

    Google Scholar 

  23. Brzezinski D, Stefanowski J (2014) Combining block-based and online methods in learning ensembles from concept drifting data streams. Inf Sci 265:50–67

    Article  MathSciNet  Google Scholar 

  24. Barddal JP, Gomes HM, Enembreck F (2015) SNCStream: a social network-based data stream clustering algorithm. In: Proceedings of the 30th annual ACM symposium on applied computing, SAC’15. ACM, New York, NY, pp 935–940

  25. Parker BS, Khan L (2015). Detecting and tracking concept class drift and emergence in non-stationary fast data streams. In: Twenty-ninth AAAI conference on artificial intelligence

  26. Bifet A, Holmes G, Pfahringer B, Kirkby R, Gavaldà R (2009). New ensemble methods for evolving data streams. In: Proceedings of the 15th ACM SIGKDD international conference on knowledge discovery and data mining. ACM SIGKDD, pp 139–148

  27. Bifet A, Holmes G, Pfahringer B (2010) Leveraging bagging for evolving data streams. In: PKDD, pp 135–150

  28. Baena-Garcia M, del Campo-Avila J, Fidalgo R, Bifet A, Gavalda R, Morales-Bueno R (2006) Early drift detection method. In: ECML PKDD 2006 workshop on knowledge discovery from data streams

  29. Gomes HM, Enembreck F (2014) Sae2: advances on the social adaptive ensemble classifier for data streams. In: Proceedings of the 29th annual ACM symposium on applied computing (SAC), SAC 2014. ACM, pp 199–206

  30. Pan S, Morris T, Adhikari U (2014) Developing a hybrid intrusion detection system using data mining for power systems. IEEE Trans Smart Grid. https://doi.org/10.1109/tsg.2015.2409775

    Article  Google Scholar 

  31. Pan S, Morris T, Adhikari U (2015) Classification of disturbances and cyber-attacks in power systems using heterogeneous time-synchronized data. IEEE Trans Ind Inf. https://doi.org/10.1109/tii.2015.2420951

    Article  Google Scholar 

  32. Pan S, Morris T, Adhikari U (2015) A specification-based intrusion detection framework for cyber-physical environment in electric power system. Int J Netw Secur (IJNS) 17(2):174–188

    Google Scholar 

  33. Beaver J, Borges R, Buckner M, Morris T, Adhikari U, Pan S (2014) Machine learning for power system disturbance and cyber-attack discrimination. In: Proceedings of the 7th international symposium on resilient control systems, August 19–21, 2014, Denver, CO, USA

  34. Dodge Y (2003) The oxford dictionary of statistical terms. OUP, Oxford (entry for normalization of scores). ISBN: 0-19-920613-9

    MATH  Google Scholar 

  35. Zwillinger D, Kokoska S (2000) CRC standard probability and statistics tables and formulae. CRC Press, Boca Raton, p 18. ISBN: 1-58488-059-7

    MATH  Google Scholar 

  36. Žliobaitė I, Bifet A, Read J et al (2015) Evaluation methods and decision theory for classification of streaming data with temporal dependence. Mach Learn 98(3):455–482

    Article  MathSciNet  Google Scholar 

  37. Vinagre J, Jorge AM, Gama J (2014) Evaluation of recommender systems in streaming environments. In: Workshop on recommender systems evaluation: dimensions and design (REDD 2014), held in conjunction with RecSys. October 10, 2014, Silicon Valley, United States https://doi.org/10.13140/2.1.4381.5367

  38. Demertzis K, Iliadis L, Spartalis S (2017) A spiking one-class anomaly detection framework for cyber-security on industrial control systems. In: Boracchi G, Iliadis L, Jayne C, Likas A (eds) Engineering applications of neural networks. EANN 2017. Communications in Computer and Information Science, vol 744. Springer, Cham

  39. Cruz T, Proença J, Simões P, Aubigny M, Ouedrago M, Graziano A, Yasakhetu L (2014) Improving cyber-security awareness on industrial control systems: the CockpitCI approach. J Inf Warf 13(4):27–41

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Wang Sufang.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Sufang, W. An adaptive ensemble classification framework for real-time data streams by distributed control systems. Neural Comput & Applic 32, 4139–4149 (2020). https://doi.org/10.1007/s00521-020-04759-0

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00521-020-04759-0

Keywords

Navigation