Skip to main content
Log in

Review on novelty detection in the non-stationary environment

  • Review
  • Published:
Knowledge and Information Systems Aims and scope Submit manuscript

Abstract

Novelty detection and concept drift detection are essential for the plethora of machine learning applications. The statistical properties of application generated data change over time in the streaming environment, known as concept drift. These changes develop a profound influence on the learning model’s performance. Along with concept drift, the new class emergence (i.e., novel class/novelty detection) is also challenging in the non-stationary distribution of data. Novel class detection finds whether the identifying data points of a data stream are unknown or unusual. The paper presents a survey focusing on the challenges encountered while dealing with real-time data. In addition to this, the chronological discussion on the various existing novelty detectors with their advantages, limitations, critical points, the different research prospect, and future directions are also incorporated in the paper.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

Data availability

Not applicable.

References

  1. Agrahari S, Singh AK (2022) Concept drift detection in data stream mining: a literature review. J King Saud Univ-Comput Inf Sci 34:9523–9540

    Google Scholar 

  2. Agrahari S, Singh AK (2022) Disposition-based concept drift detection and adaptation in data stream. Arab J Sci Eng 47:1–17

    Google Scholar 

  3. Chikushi RTM, Maior RS, de Barros M, Gomes N, da Silva M, Maciel BIF (2021) Using spectral entropy and Bernoulli map to handle concept drift. Expert Syst Appl 167:114114

    Google Scholar 

  4. Gajbhiye MPR, Vaidya SG (2016) Classification and adaptive novel class detection of feature-evolving data streams. Int J Eng Res General Sci 4(3):616–622

    Google Scholar 

  5. da Silva TP, de Arruda CH (2020) Possibilistic approach for novelty detection in data streams. In: 2020 IEEE International conference on fuzzy systems (FUZZ-IEEE), pp 1–8. IEEE

  6. Markou M, Singh S (2003) Novelty detection: a review-part 1: statistical approaches. Signal Process 83(12):2481–2497

    Google Scholar 

  7. Markou M, Singh S (2003) Novelty detection: a review-part 2: neural network based approaches. Signal Process 83(12):2499–2521

    Google Scholar 

  8. Hodge V, Austin J (2004) A survey of outlier detection methodologies. Artif Intell Rev 22(2):85–126

    Google Scholar 

  9. Pimentel MAF, Clifton DA, Clifton L, Tarassenko L (2014) A review of novelty detection. Signal Process 99:215–249

    Google Scholar 

  10. Faria ER, Gonçalves IJCR, de Carvalho ACPLF, Gama J (2016) Novelty detection in data streams. Artif Intell Rev 45(2):235–269

    Google Scholar 

  11. Boukerche A, Zheng L, Alfandi O (2020) Outlier detection: methods, models, and classification. ACM Comput Surv (CSUR) 53(3):1–37

    Google Scholar 

  12. Manzoor E, Lamba H, Akoglu L (2018) xstream: outlier detection in feature-evolving data streams. In: Proceedings of the 24th ACM SIGKDD international conference on knowledge discovery & data mining, pp 1963–1972

  13. Park CH (2019) Outlier and anomaly pattern detection on data streams. J Supercomput 75(9):6118–6128

    Google Scholar 

  14. Souiden I, Brahmi Z, Toumi H (2016) A survey on outlier detection in the context of stream mining: review of existing approaches and recommadations. In: International conference on intelligent systems design and applications, pp 372–383. Springer

  15. Leevy JL, Khoshgoftaar TM, Bauder RA, Seliya N (2018) A survey on addressing high-class imbalance in big data. J Big Data 5(1):1–30

    Google Scholar 

  16. Nguyen H-L, Woon Y-K, Ng W-K (2015) A survey on data stream clustering and classification. Knowl Inf Syst 45(3):535–569

    Google Scholar 

  17. Agrahari S, Singh AK (2022) Adaptive PCA-based feature drift detection using statistical measure. Clust Comput 25(6):4481–4494

    Google Scholar 

  18. Gruhl C, Hannan A, Huang Z, Nivarthi C, Vogt S (2021) The problem with real-world novelty detection-issues in multivariate probabilistic models. In: 2021 IEEE international conference on autonomic computing and self-organizing systems companion (ACSOS-C), pp 204–209. IEEE

  19. Salehi M, Leckie C, Bezdek JC, Vaithianathan T, Zhang X (2016) Fast memory efficient local outlier detection in data streams. IEEE Trans Knowl Data Eng 28(12):3246–3260

    Google Scholar 

  20. Tamboli J, Shukla M (2016) A survey of outlier detection algorithms for data streams. In: 2016 3rd international conference on computing for sustainable global development (INDIACom), pp 3535–3540. IEEE

  21. Pokrajac D, Lazarevic A, Latecki LJ (2007) Incremental local outlier detection for data streams. In: 2007 IEEE symposium on computational intelligence and data mining, pp 504–515. IEEE

  22. Ding S, Fulin W, Qian J, Jia H, Jin F (2015) Research on data stream clustering algorithms. Artif Intell Rev 43(4):593–600

    Google Scholar 

  23. Hashemi S, Yang Y (2009) Flexible decision tree for data stream classification in the presence of concept change, noise and missing values. Data Min Knowl Discov 19(1):95–131

    MathSciNet  Google Scholar 

  24. Masud MM , Al-Khateeb TM=, Khan L, Aggarwal C, Gao J, Han J, Thuraisingham B (2011) Detecting recurring and novel classes in concept-drifting data streams. In: 2011 IEEE 11th international conference on data mining, pp 1176–1181. IEEE

  25. ZareMoodi P, Beigy H, Siahroudi SK (2015) Novel class detection in data streams using local patterns and neighborhood graph. Neurocomputing 158:234–245

    Google Scholar 

  26. Danielsson P-E (1980) Euclidean distance mapping. Comput Graph Image Process 14(3):227–248

    Google Scholar 

  27. Rolf N, Peter S (1996) On the Manhattan distance between points on space filling mesh indexings. Citeseer

    Google Scholar 

  28. Yin J, Sun S (2022) Incomplete multi-view clustering with cosine similarity. Pattern Recognit 123:108371

    Google Scholar 

  29. Bury M, Schwiegelshohn C, Sorella M (2019) Similarity search for dynamic data streams. IEEE Trans Knowl Data Eng 32(11):2241–2253

    Google Scholar 

  30. Spinosa EJ, de Leon F de Carvalho AP, Gama J (2007) Olindda: a cluster-based approach for detecting novelty and concept drift in data streams. In: Proceedings of the 2007 ACM symposium on Applied computing, pp 448–452. ACM

  31. Tan SC, Ting KM, Liu TF (2011) Fast anomaly detection for streaming data. In: Twenty-second international joint conference on artificial intelligence

  32. MZ Hayat, MR Hashemi (2010) A DCT based approach for detecting novelty and concept drift in data streams. In: 2010 international conference of soft computing and pattern recognition, pp 373–378. IEEE

  33. Krawczyk B, Wozniak M (2013) Incremental learning and forgetting in one-class classifiers for data streams. In: Proceedings of the 8th international conference on computer recognition systems CORES 2013, pp 319–328. Springer

  34. Krawczyk B, Woźniak M (2015) One-class classifiers with incremental learning and forgetting for data streams with concept drift. Soft Comput 19(12):3387–3400

    Google Scholar 

  35. Rusiecki A (2012) Robust neural network for novelty detection on data streams. In: International conference on artificial intelligence and soft computing, pp 178–186. Springer

  36. Spinosa EJ, de Leon de Carvalho AP, Gama J et al (2009) Novelty detection with application to data streams. Intell Data Anal, 13(3):405–422

  37. Sun J, Yang L, Zhang J, Liu F, Halappanavar M, Fan D, Cao Y (2021) Gradient-based novelty detection boosted by self-supervised binary classification. arXiv preprint arXiv:2112.09815

  38. Masud MM, Chen Q, Khan L, Aggarwal C, Gao J, Han J, Thuraisingham B (2010) Addressing concept-evolution in concept-drifting data streams. In: 2010 IEEE International conference on data mining, pp 929–934. IEEE

  39. Faria ER, Gama J, Carvalho A CPLF (2013) Novelty detection algorithm for data streams multi-class problems. In: Proceedings of the 28th annual ACM symposium on applied computing, pp 795–800

  40. JDC Júnior, Faria ER, Silva JA, Gama J, Cerri R (2019) Novelty detection for multi-label stream classification. In: 2019 8th Brazilian conference on intelligent systems (BRACIS), pp 144–149. IEEE

  41. Al-Khateeb T, Masud MM, Khan L, Aggarwal C, Han J, Thuraisingham B (2012) Stream classification with recurring and novel class detection using class-based ensemble. In: 2012 IEEE 12th international conference on data mining, pp 31–40. IEEE

  42. de Faria ER, de Leon Ferreira Carvalho André CP, Gama J (2016) Minas: multiclass learning algorithm for novelty detection in data streams. Data mining and knowledge discovery, 30:640–680

  43. Din SU, Shao J (2020) Exploiting evolving micro-clusters for data stream classification with emerging class detection. Inf Sci 507:404–420

    Google Scholar 

  44. Alippi C, Boracchi G, Roveri M, Ditzler G, Polikar R, Zhou MC, Li HX, Weijnen M (2015) Adaptive classifiers for nonstationary environments. Contemporary issues in systems science and engineering. IEEE Press/Wiley, Hoboken, pp 265–288

    Google Scholar 

  45. Tan CH, Lee VC, Salehi M, Marusic S, Jayawardena S, Lucke D (2021) A fully unsupervised and efficient anomaly detection approach with drift detection capability. In: 2021 international conference on data mining workshops (ICDMW), pp 312–321. IEEE

  46. Albertini MK, de Mello RF (2007) A self-organizing neural network for detecting novelties. In: Proceedings of the 2007 ACM symposium on Applied computing, pp 462–466. ACM

  47. Bishop CM (1994) Novelty detection and neural network validation. IEE Proc-Vis Image Sig Process 141(4):217–222

    Google Scholar 

  48. Bivens A, Palagiri C, Smith R, Szymanski B, Embrechts M et al (2002) Network-based intrusion detection using neural networks. Intell Eng Syst Through Artif Neural Netw 12(1):579–584

    Google Scholar 

  49. Rivera-Illingworth F, Callaghan V, Hagras H (2005) A neural network agent based approach to activity detection in AMI environments. In: Proceedings of the IEE international workshop on intelligent environments, pp 92–100

  50. Ma J, Perkins S (2003) Online novelty detection on temporal sequences. In: Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining, pp 613–618

  51. Singh S, Markou M (2004) An approach to novelty detection applied to the classification of image regions. IEEE Trans Knowl Data Eng 16(4):396–407

    Google Scholar 

  52. Augusteijn MF, Folkert BA (2002) Neural network classification and novelty detection. Int J Remote Sens 23(14):2891–2902

    Google Scholar 

  53. Richter C, Roy N (2017) Safe visual navigation via deep learning and novelty detection

  54. Banko L, Maffettone PM, Naujoks D, Olds D, Ludwig A (2021) Deep learning for visualization and novelty detection in large x-ray diffraction datasets. npj Comput Mater 7(1):1–6

    Google Scholar 

  55. Cristiani A, de Arruda CH (2021) A fuzzy multi-class novelty detector for data streams under intermediate latency. In: 2021 IEEE International conference on fuzzy systems (FUZZ-IEEE), pp 1–6. IEEE

  56. Francesca C, Alberto R, Marco B, Francesco GG, Lorenzo V (2021) Feature-based multi-class classification and novelty detection for fault diagnosis of industrial machinery. Appl Sci 11(20):9580

    Google Scholar 

  57. Leite Bernardo, Abdalrahman Amr, Castro João, Frade Julieta, Moreira João, Soares Carlos (2021) Novelty detection in physical activity. In: ICAART, vol 2, pp 859–865

  58. Ziffer G, Bernardo A, Della Valle E, Bifet A (2021) Kalman filtering for learning with evolving data streams. In: 2021 IEEE international conference on big data (big data), pp 5337–5346. IEEE

  59. Tegjyot Singh Sethi and Mehmed Kantardzic (2017) On the reliable detection of concept drift from streaming unlabeled data. Expert Syst Appl 82:77–99

    Google Scholar 

  60. Gruhl C, Sick B, Tomforde S (2021) Novelty detection in continuously changing environments. Future Gener Comput Syst 114:138–154

    Google Scholar 

  61. Gruhl C, Sick B (2018) Novelty detection with candies: a holistic technique based on probabilistic models. Int J Mach Learn Cybern 9(6):927–945

    CAS  Google Scholar 

  62. Garcia KD (2021) Unsupervised learning approaches for non-stationary data streams. PhD thesis, Universidade de São Paulo

  63. Jodelka O, Anagnostopoulos C, Kolomvatsos K (2021) Adaptive novelty detection over contextual data streams at the edge using one-class classification. In: 2021 12th international conference on information and communication systems (ICICS), pp 213–219. IEEE

  64. Dai R, Lefort M, Armetta F, Guillermin M, Duffner S (2021) Novelty detection for unsupervised continual learning in image sequences. In: 2021 IEEE 33rd international conference on tools with artificial intelligence (ICTAI), pp 493–500. IEEE

  65. Dearo GK (2021) Unsupervised learning approaches for non-stationary data streams

  66. Surapaneni RK, Nimmagadda S, Pragathi K (2021) Unsupervised classification approach for anomaly detection in big data streams. In: Next generation of internet of things, pp 71–79. Springer

  67. Andri A, Mahardhika P (2022) Unsupervised continual learning in streaming environments. IEEE Trans Neural Netw Learn Syst. https://doi.org/10.1109/TNNLS.2022.3163362

    Article  Google Scholar 

  68. Hulten G, Spencer L, Domingos P (2001) Mining time-changing data streams. In: Proceedings of the seventh ACM SIGKDD international conference on knowledge discovery and data mining, pp 97–106. ACM

  69. Wang H, Fan W, Yu PS, Han J (2003) Mining concept-drifting data streams using ensemble classifiers. In: Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining, pp 226–235. ACM

  70. Masud Mohammad M, Gao J, Khan L, Han J, Thuraisingham B (2010) Classification and novel class detection in data streams with active mining. In: Pacific-Asia conference on knowledge discovery and data mining, pp 311–324. Springer

  71. Al-Khateeb TM, Masud MM, Khan L, Thuraisingham B (2012) Cloud guided stream classification using class-based ensemble. In: 2012 IEEE fifth international conference on cloud computing, pp 694–701. IEEE

  72. Farid DM, Rahman CM (2012) Novel class detection in concept-drifting data stream mining employing decision tree. In: 2012 7th international conference on electrical and computer engineering, pp 630–633. IEEE

  73. Farid DM, Li Z, Hossain A, Rahman CM, Rebecca S, Graham S, Keshav D (2013) An adaptive ensemble classifier for mining concept drifting data streams. Expert Syst Appl 40(15):5895–5906

    Google Scholar 

  74. Mohamed-Rafik B, Slawomir N, Payberah Amir H (2018) An adaptive algorithm for anomaly and novelty detection in evolving data streams. Data Min Knowl Discov 32(6):1597–1633

    MathSciNet  Google Scholar 

  75. Nguyen D, Kirsebom OS, Frazão Fábio, Fablet R, Matwin S (2019) Recurrent neural networks with stochastic layers for acoustic novelty detection. In: ICASSP 2019-2019 IEEE international conference on acoustics, speech and signal processing (ICASSP), pp 765–769. IEEE

  76. Cejnek M, Bukovsky I (2018) Concept drift robust adaptive novelty detection for data streams. Neurocomputing 309:46–53

    Google Scholar 

  77. Wang S, Minku LL, Yao X (2018) A systematic study of online class imbalance learning with concept drift. IEEE Trans Neural Netw Learn Syst 99:1–20

    Google Scholar 

  78. Ross Gordon J, Adams Niall M, Tasoulis Dimitris K, Hand David J (2012) Exponentially weighted moving average charts for detecting concept drift. Pattern Recognit Lett 33(2):191–198

    ADS  Google Scholar 

  79. Gama J (2010) Knowledge discovery from data streams. Chapman and Hall/CRC

    Google Scholar 

  80. Mahdi OA, Pardede E, Ali N (2021) A hybrid block-based ensemble framework for the multi-class problem to react to different types of drifts. Cluster Comput 24:2327–2340

    Google Scholar 

  81. Mahdi OA, Pardede E, Ali N, Cao J (2020) Fast reaction to sudden concept drift in the absence of class labels. Appl Sci 10(2):606

    Google Scholar 

  82. Gülcan EB (2022) Implicit concept drift detection for multi-label data streams. PhD thesis, Bilkent Universitesi (Turkey)

  83. Ab Ghani NL, Aziz IA, Mehat M (2020) Concept drift detection on unlabeled data streams: a systematic literature review. In: 2020 IEEE conference on big data and analytics (ICBDA), pp 61–65. IEEE

  84. Zhao L, Zhang Y, Ji Y, An Z, Gu F, Luo X (2022) Heterogeneous drift learning: classification of mix-attribute data with concept drifts. In: 2022 IEEE 9th international conference on data science and advanced analytics (DSAA), pp 1–10. IEEE

  85. Campos Guilherme O, Arthur Z, Jörg S, Campello Ricardo JGB, Barbora M, Erich S, Ira A, Houle Michael E (2016) On the evaluation of unsupervised outlier detection: measures, datasets, and an empirical study. Data Min Knowl Discov 30(4):891–927

    MathSciNet  Google Scholar 

Download references

Acknowledgements

Not applicable.

Funding

Not applicable.

Author information

Authors and Affiliations

Authors

Contributions

Supriya Agrahari wrote the main manuscript text. All authors reviewed the manuscript.

Corresponding author

Correspondence to Supriya Agrahari.

Ethics declarations

Conflict of interest

The authors declare no competing interests.

Ethical approval

Not applicable.

Consent to participate

Not applicable.

Consent for publication

Not applicable.

Human and animal rights

Not applicable.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Agrahari, S., Srivastava, S. & Singh, A.K. Review on novelty detection in the non-stationary environment. Knowl Inf Syst 66, 1549–1574 (2024). https://doi.org/10.1007/s10115-023-02018-x

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10115-023-02018-x

Keywords

Navigation