Abstract
Continual learning has attracted much attention in recent years, and many continual learning methods based on deep neural networks have been proposed. However, several important problems about these methods may lead to high decision cost and affect the practical application of continual learning networks. First, continual learning networks treat all categories equally, although the unbalance of misclassification cost happens in real-world cases. Second, there is a trade-off between learning new knowledge and keep old knowledge, which leads to the forgetting of old knowledge (i.e., the catastrophic forgetting). Third, even if low confidence of a sample, the continual learning methods based on the neural network will still give a clear classification result. We propose a sequential three-way decision model for continual learning to address these problems, named Incremental Sequential Three-Way Decision model (ISTWD). Introducing cost-sensitive sequential three-way decision to continual learning network, ISTWD reduces the decision cost of continual learning, which may alleviate the potentially high cost caused by the accuracy loss in continual learning. Besides, ISTWD includes a checkpoint procedure to judge whether the process of continual learning should stop. Experimental results on CIFAR-100 and Tiny-ImageNet verify the effectiveness of our method.






Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Yao YY (2009) Three-way decision: an interpretation of rules in rough set theory. In: International conference on rough sets and knowledge technology, pp 642–649
Yao YY (2018) Three-way decision and granular computing. Int J Approx Reason 103:107–123
Yao YY (2019) Three-way conflict analysis: reformulations and extensions of the pawlak model. Knowl-Based Syst 180:26–37
Yang JL, Yao YY (2020) Semantics of soft sets and three-way decision with soft sets. Knowl-Based Syst 194:105538
Zhan JM, Jiang HB, Yao YY (2020) Three-way multi-attribute decision-making based on outranking relations. IEEE Trans Fuzzy Syst 13(8):1384
Liu JB, Li HX, Zhou XZ, Huang B, Wang TX (2019) An optimization-based formulation for three-way decisions. Inf Sci 495:185–214
Liu JB, Li HX, Huang B, Liu Y, Liu D (2021) Convex combination-based consensus analysis for intuitionistic fuzzy three-way group decision. Inf Sci 574:542–566
Yao YY, Deng XF (2011) Sequential three-way decisions with probabilistic rough sets. In: International conference on cognitive informatics and cognitive computing, pp 120–125
Li HX, Zhang LB, Zhou XZ, Huang B (2017) Cost-sensitive sequential three-way decision modeling using a deep neural network. Int J Approx Reason 85:68–78
Li HX, Zhang LB, Huang B, Zhou XZ (2020) Cost-sensitive dual-bidirectional linear discriminant analysis. Inf Sci 510:283–303
Liang DC, Liu D (2014) A novel risk decision making based on decision-theoretic rough sets under hesitant fuzzy information. IEEE Trans Fuzzy Syst 23(2):237–247
Savchenko AV (2019) Sequential three-way decisions in multi-category image recognition with deep features based on distance factor. Inf Sci 489:18–36
Hao C, Li JH, Fan M, Liu WQ, Tsang EC (2017) Optimal scale selection in dynamic multi-scale decision tables based on sequential three-way decisions. Inf Sci 415:213–232
Yang X, Li TR, Fujita H, Liu D (2019) A sequential three-way approach to multi-class decision. Int J Approx Reason 104:108–125
Qian J, Liu CH, Yue XD (2019) Multigranulation sequential three-way decisions based on multiple thresholds. Int J Approx Reason 105:396–416
Zhang LB, Li HX, Zhou XZ, Huang B (2020) Sequential three-way decision based on multi-granular autoencoder features. Inf Sci 507:630–643
Li HX, Zhang LB, Huang B, Zhou XZ (2016) Sequential three-way decision and granulation for cost-sensitive face recognition. Knowl-Based Syst 91:241–251
French RM (1999) Catastrophic forgetting in connectionist networks. Trends Cogn Sci 3(4):128–135
Goodfellow IJ, Mirza M, Xiao D, Courville A, Bengio Y (2013) An empirical investigation of catastrophic forgetting in gradient-based neural networks. arXiv preprint arXiv: 13126211
Parisi GI, Kemker R, Part JL, Kanan C, Wermter S (2019) Continual lifelong learning with neural networks: a review. Neural Netw 113:54–71
Qian YH, Liang XY, Lin GP, Guo Q, Liang JY (2017) Local multigranulation decision-theoretic rough sets. Int J Approx Reason 82:119–137
Li JH, Huang CC, Qi JJ, Qian YH, Liu WQ (2017) Three-way cognitive concept learning via multi-granularity. Inf Sci 378:244–263
Liu D, Li TR, Liang DC (2014) Incorporating logistic regression to decision-theoretic rough sets for classifications. Int J Approx Reason 55(1):197–210
Min F, Zhang ZH, Zhai WJ, Shen RP (2020) Frequent pattern discovery with tri-partition alphabets. Inf Sci 507:715–732
Yao JT, Azam N (2014) Web-based medical decision support systems for three-way medical decision making with game-theoretic rough sets. IEEE Trans Fuzzy Syst 23(1):3–15
Huang B, Guo CX, Li HX, Feng GF, Zhou XZ (2016) Hierarchical structures and uncertainty measures for intuitionistic fuzzy approximation space. Inf Sci 336:92–114
Liu D, Ye XQ (2020) A matrix factorization based dynamic granularity recommendation with three-way decisions. Knowl-Based Syst 191:105423
Yu H, Zhang C, Wang GY (2016) A tree-based incremental overlapping clustering method using the three-way decision theory. Knowl-Based Syst 91:189–203
Yu H (2018) Three-way decisions and three-way clustering. In: International joint conference on rough sets, pp 13–28
Min F, Zhang SM, Ciucci D, Wang M (2020) Three-way active learning through clustering selection. Int J Mach Learn Cybern 11(5):1033–1046
Liu D, Liang DC, Wang CC (2016) A novel three-way decision model based on incomplete information system. Knowl-Based Syst 91:32–45
Wu WZ, Qian YH, Li TJ, Gu SM (2017) On rule acquisition in incomplete multi-scale decision tables. Inf Sci 378:282–302
Liu D, Yang X, Li TR (2020) Three-way decisions: beyond rough sets and granular computing. Int J Mach Learn Cybern 11(5):989–1002
Li JH, Liu ZM (2020) Granule description in knowledge granularity and representation. Knowl-Based Syst 203:106160
Shin H, Lee JK, Kim J, Kim J (2017) Continual learning with deep generative replay. arXiv preprint arXiv: 170508690
Delange M, Aljundi R, Masana M, Parisot S, Jia X, Leonardis A, Slabaugh G, Tuytelaars T (2021) A continual learning survey: defying forgetting in classification tasks. IEEE Trans Pattern Anal Mach Intell 10.1109/TPAMI20213057446
Cichon J, Gan WB (2015) Branch-specific dendritic ca 2+ spikes cause persistent synaptic plasticity. Nature 520(7546):180–185
Kirkpatrick J, Pascanu R, Rabinowitz N, Veness J, Desjardins G, Rusu AA, Milan K, Quan J, Ramalho T, Grabska-Barwinska A et al (2017) Overcoming catastrophic forgetting in neural networks. Proc Natl Acad Sci 114(13):3521–3526
Schwarz J, Czarnecki W, Luketina J, Grabska-Barwinska A, Teh YW, Pascanu R, Hadsell R (2018) Progress & compress: a scalable framework for continual learning. In: International conference on machine learning, pp 4528–4537
Mnih V, Kavukcuoglu K, Silver D, Rusu AA, Veness J, Bellemare MG, Graves A, Riedmiller M, Fidjeland AK, Ostrovski G et al (2015) Human-level control through deep reinforcement learning. Nature 518(7540):529–533
Rebuffi SA, Kolesnikov A, Sperl G, Lampert CH (2017) icarl: Incremental classifier and representation learning. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2001–2010
Li ZZ, Hoiem D (2017) Learning without forgetting. IEEE Trans Pattern Anal Mach Intell 40(12):2935–2947
He KM, Zhang XY, Ren SQ, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778
Hinton G, Vinyals O, Dean J (2015) Distilling the knowledge in a neural network. arXiv preprint arXiv: 150302531
Russakovsky O, Deng J, Su H, Krause J, Satheesh S, Ma S, Huang Z, Karpathy A, Khosla A, Bernstein M et al (2015) Imagenet large scale visual recognition challenge. Int J Comput Vision 115(3):211–252
Acknowledgements
This work was partially supported by National Natural Science Foundation of China (Nos. 62176116, 71732003, 61773208) and the National Key Research and Development Program of China (No. 2018YFB1402600).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Li, H., Yu, H., Min, F. et al. Incremental sequential three-way decision based on continual learning network. Int. J. Mach. Learn. & Cyber. 13, 1633–1645 (2022). https://doi.org/10.1007/s13042-021-01472-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13042-021-01472-9