Skip to main content
Log in

An incremental approach to feature selection using the weighted dominance-based neighborhood rough sets

  • Original Article
  • Published:
International Journal of Machine Learning and Cybernetics Aims and scope Submit manuscript

Abstract

Dominance-based neighborhood rough set (DNRS) is capable to give qualitative and quantitative descriptions of the relations between ordered objects. In spite of its effectiveness in feature selection, DNRS ignores the various significance of features. In fact, different features exert different impacts on decision-making. Once we explore these differences in advance, it is easier to find out features with high correlation and dependency. Likewise, it is inevitable that in big-data era the objects may update from time to time, which calls for efficient attribute reduction. However, the existing approaches are inappropriate for the weighted and ordered data. Motivated by these two deficiencies, first, we assign different weights to conditional attributes and establish the weighted dominance-based neighborhood rough set (WDNRS). Then a kind of conditional entropy in matrix form and ensuing updating principles are put forward to evaluate the significance of the attributes. In addition, grounded on the entropy, we come up with the heuristic algorithm and corresponding incremental mechanism when objects increase. Finally, twelve experiments are carried out to verify that it is effective and efficient for the designed method to select features in dynamic datasets.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

References

  1. Zhong J, Wang J, Peng W, Zhang Z, Li M (2015) A feature selection method for prediction essential protein. Tsinghua Sci Technol 20(5):491–499

    MathSciNet  Google Scholar 

  2. Bang S, Kang J, Jhun M, Kim E (2017) Hierarchically penalized support vector machine with grouped variables. Int J Mach Learn Cyb 8(4):1211–1221

    Google Scholar 

  3. Abedini M, Kirley M (2013) An enhanced XCS rule discovery module using feature ranking. Int J Mach Learn Cyb 4(3):173–187

    Google Scholar 

  4. Gu S, Cheng R, Jin Y (2018) Feature selection for high-dimensional classification using a competitive swarm optimizer. Soft Comput 22(3):811–822

    Google Scholar 

  5. Pilyugina N, Tsukahara A, Tanaka K (2021) Comparing methods of feature extraction of brain activities for octave illusion classification using machine learning. Sensors 21(19):6407–6407

    Google Scholar 

  6. Ali W (2017) Phishing website detection based on supervised machine learning with wrapper features selection. Int J Adv Comput 8(9):72

    Google Scholar 

  7. Tuo Q, Zhao H, Hu Q (2018) Hierarchical feature selection with subtree based graph regularization. Knowl-Based Syst 163:996–1008

    Google Scholar 

  8. Roffo S, Melzi S, Castellani U, Vinciarelli A, Cristani M (2020) Infinite feature selection: a raph-based feature filtering approach. IEEE Trans Pattern Anal 43(12):4396–4410

    Google Scholar 

  9. Pawlak Z (1982) Rough sets. Int J Comput Inform Sci 11(5):341–365

    MATH  Google Scholar 

  10. Yao Y (1998) Relational interpretations of neighborhood operators and rough set approximation operators. Inform Sci 111(1–4):239–259

    MathSciNet  MATH  Google Scholar 

  11. Ziarko W (1993) Variable precision rough set model. J Comput Syst Sci 46(1):39–59

    MathSciNet  MATH  Google Scholar 

  12. Greco S, Matarazzo B, Slowinski R (1999) Rough approximation of a preference relation by dominance relations. Eur J Oper Res 117(1):63–83

    MATH  Google Scholar 

  13. Wang P, Wu Q, He J, Shang X (2018) Approximation operator based on neighborhood systems. Symmetry-basel 10(11):539–539

    MATH  Google Scholar 

  14. Hu Q, Yu D, Me Z (2008) Neighborhood classifiers. Expert Syst Appl 34(2):866–876

    Google Scholar 

  15. Sun L, Zhang X, Qian Y, Xu J, Zhang S (2019) Feature selection using neighborhood entropy-based uncertainty measures for gene expression data classification. Inform Sci 502:18–41

    MathSciNet  MATH  Google Scholar 

  16. Sun L, Yin T, Ding W, Qian Y, Xu J (2022) Feature selection with missing labels using multilabel fuzzy neighborhood rough sets and maximum relevance minimum redundancy. IEEE Trans Fuzzy Syst 30(5):1197–1211

    Google Scholar 

  17. Sun L, Li M, Ding W, Zhang E, Mu X, Xu J (2022) AFNFS: Adaptive fuzzy neighborhood-based feature selection with adaptive synthetic over-sampling for imbalanced data. Inform Sci 612:724–744

    Google Scholar 

  18. Greco S, Matarazzo B, Slowinski R (2001) Rough sets theory for multicriteria decision analysis. Eur J Oper Res 129(1):1–47

    MATH  Google Scholar 

  19. Zhang X, Chen D, Tsang E (2016) Generalized dominance rough set models for the dominance intuitionistic fuzzy information systems. Inform Sci 378:1–25

    MathSciNet  MATH  Google Scholar 

  20. Ali A, Ali MI, Rehman N (2019) Soft dominance based rough sets with applications in information systems. Int J Approx Reason 113:171–195

    MathSciNet  MATH  Google Scholar 

  21. Chen H, Li T, Luo C, Hu J (2015) Dominance-based neighborhood rough sets and its attribute reduction. Rough sets and knowledge technology, lecture notes in computer science. Springer, Berlin, Heidelberg, pp 89–99

    Google Scholar 

  22. Chen H, Li T, Cai Y, Luo C, Fujita H (2016) Parallel attribute reduction in dominance-based neighborhood rough set. Inform Sci 373:351–368

    MATH  Google Scholar 

  23. Wan J, Chen H, Yuan Z, Li T, Yang X, Sang B (2017) A novel hybrid feature selection method considering feature interaction in neighborhood rough set. Knowl-Based Syst 227:107167

    Google Scholar 

  24. Wang S, Li X, Xia J, Xia J, Zhang X (2010) Weighted neighborhood classifier for the classification of imbalanced tumor dataset. J Circuit Syst Copm 19(1):259–273

    Google Scholar 

  25. Tsang E, Hu Q, Chen D (2016) Feature and instance reduction for PNN classifiers based on fuzzy rough sets. Int J Mach Learn Cyb 7(1):1–11

    Google Scholar 

  26. Hu M, Tsang E, Guo Y, Chen D, Xu W (2021) A novel approach to attribute reduction based on weighted neighborhood rough sets. Knowl-Based Syst 220:106908

    Google Scholar 

  27. Liang J, Wang F, Dang C, Qian Y (2014) A group incremental approach to feature selection applying rough set technique. IEEE Trans Knowl Data En 26(2):294–308

    Google Scholar 

  28. Sang B, Chen H, Yang L, Zhou D, Li T, Xu W (2021) Incremental attribute reduction approaches for ordered data with time-evolving objects. Knowl-Based Syst 212:106583

    Google Scholar 

  29. Sang B, Chen H, Yang L, Li T, Xu W (2022) Incremental feature selection using a conditional entropy based on fuzzy dominance neighborhood rough sets. IEEE Trans Fuzzy Syst 30(6):1683–1697

    Google Scholar 

  30. Yuan K, Xu W, Li W, Ding W (2021) An incremental learning mechanism for object classification based on progressive fuzzy three-way concept. Inform Sci 584:127–147

    Google Scholar 

  31. Jing Y, Li T, Huang J, Zhang Y (2016) An incremental attribute reduction approach based on knowledge granularity under the attribute generalization. Int J Approx Reason 76:80–95

    MathSciNet  MATH  Google Scholar 

  32. Chen D, Dong L, Mi J (2019) Incremental mechanism of attribute reduction based on discernible relations for dynamically increasing attribute. Soft Comput 24(1):321–332

    MATH  Google Scholar 

  33. Dong L, Chen D (2020) Incremental attribute reduction with rough set for dynamic datasets with simultaneously increasing samples and attributes. Int J Mach Learn Cyb 11(6):1339–1355

    Google Scholar 

  34. Zhang X, Chen X, Xu W, Ding W (2022) Dynamic information fusion in multi-source incomplete interval-valued information system with variation of information sources and attributes. Inform Sci 608:1–27

    Google Scholar 

  35. Xu W, Yuan K, Li W (2022) Dynamic updating approximations of local generalized multi-granulation neighborhood rough set. Appl Intell 52(8):9148–9173

    Google Scholar 

  36. Huang Y, Li T, Luo C, Fujita H, Horng S (2017) Matrix-based dynamic updating rough fuzzy approximations for data mining. Knowl-Based Syst 119:273–283

    Google Scholar 

  37. Wang S, Li T, Luo C, Fujita H (2016) Efficient updating rough approximations with multi-dimensional variation of ordered data. Inform Sci 372:690–708

    MATH  Google Scholar 

  38. Wang S, Li T, Luo C, Chen H, Fujita H (2019) Domain-wise approaches for updating approximations with multi-dimensional variation of ordered information systems. Inform Sci 478:100–124

    MathSciNet  MATH  Google Scholar 

  39. Shannon C, Weaver W (1948) The mathematical theory of communication. Bell Syst Tech J 27(3/4):373–423

    MathSciNet  Google Scholar 

  40. Hu Q, Yu D, Xie Z, Liu J (2006) Fuzzy probabilistic approximation spaces and their information measures. IEEE Trans Fuzzy Syst 14(2):191–201

    Google Scholar 

  41. Hu Q, Che X, Zhang L, Zhang D, Guo M, Yu D (2012) Rank entropy-based decision trees for monotonic classification. IEEE Trans Knowl Data En 24(11):2052–2064

    Google Scholar 

  42. Yuan Z, Chen H, Yang X, Li T, Liu K (2021) Fuzzy complementary entropy using hybrid-kernel function and its unsupervised attribute reduction. Knowl-Based Syst 231:107398

    Google Scholar 

Download references

Acknowledgements

This paper is supported by the National Natural Science Foundation of China (NO. 61976245).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Weihua Xu.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Pan, Y., Xu, W. & Ran, Q. An incremental approach to feature selection using the weighted dominance-based neighborhood rough sets. Int. J. Mach. Learn. & Cyber. 14, 1217–1233 (2023). https://doi.org/10.1007/s13042-022-01695-4

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s13042-022-01695-4

Keywords

Navigation