Abstract
Hierarchical classification is a research hotspot in machine learning due to the widespread existence of data with hierarchical class structures. Existing hierarchical classification methods based on granular computing can effectively reduce the computational complexity by considering the granularity of classes. However, their predictive accuracy is affected by inter-level error propagation within the hierarchy. In this paper, we propose a hierarchical classification method with multi-path selection based on coarse- and fine-grained class relationships, which mitigates the inter-level error propagation problem. Firstly, we use a top-down recursive method to calculate the probabilities of the hierarchical classes by logistic regression classification. Secondly, the current class probability is calculated by combining the parent and current classes probabilities. We select multiple possible fine-grained classes at the current level according to their sibling relationships. Compared with existing methods, the proposed method reduces the possibility of misclassification from the upper layer. Finally, the multi-path prediction result is provided to a classical classifier for final prediction. Our hierarchical classification method is evaluated on six benchmark datasets to demonstrate that it provides better classification performance than existing state-of-the-art hierarchical methods.
Similar content being viewed by others
References
Alshamaa D, Chehade FM, Honeine P (2018) A hierarchical classification method using belief functions. Signal Process 148:68–77
Bhowmick B, Patra S, Chatterjee A, Govindu VM, Banerjee S (2017) Divide and conquer: a hierarchical approach to large-scale structure-from-motion. Comput Vis Image Underst 157:190–205
Boser BE (2008) A training algorithm for optimal margin classifiers. In: Proceedings of annual ACM workshop on computational learning theory, vol. 5, pp 144–152
Breiman L (2001) Random forests. Mach Learn 45(1):5–32
Cai ZL, Zhu W (2018) Multi-label feature selection via feature manifold learning and sparsity regularization. Int J Mach Learn Cybern 9(8):1321–1334
Cutler A, Cutler D, Stevens J (2011) Random forests. Mach Learn 45(1):157–176
Cucchiara A (2012) Applied logistic regression. Technometrics 34(3):39–56
Dai JH, Hu H, Wu WZ, Qian YH, Huang DB (2018) Maximal discernibility pairs based approach to attribute reduction in fuzzy rough sets. IEEE Trans Fuzzy Syst 26(4):2174–2187
Dekel O, Keshet J, Singer Y (2004) Large margin hierarchical classification. In: International conference on machine learning, pp 1–8
Demiar J, Schuurmans D (2006) Statistical comparisons of classifiers over multiple data sets. J Mach Learn Res 7(1):1–30
Deng J, Berg AC, Li K, Li FF (2010) What does classifying more than 10,000 image categories tell us? In: European conference on computer vision, pp 71–84
Dimitrovski I, Kocev D, Loskovska S, Džeroski S (2011) Hierarchical annotation of medical images. Pattern Recognit 44(10–11):2436–2449
Ding CH, Dubchak I (2001) Multi-class protein fold recognition using support vector machines and neural networks. Bioinformatics 17(4):349–358
Dunn OJ (1961) Multiple comparisons among means. J Am Stat Assoc 56(293):52–64
Everingham M, Van Gool L, Williams CKI, Winn J, Zisserman A (2010) The pascal visual object classes (VOC) challenge. Int J Comput Vis 88(2):303–338
Fan J, Zhang J, Mei K, Peng J, Gao L (2015) Cost-sensitive learning of hierarchical tree classifiers for large-scale image classification and novel category detection. Pattern Recognit 48(5):1673–1687
Friedman M (1940) A comparison of alternative tests of significance for the problem of m rankings. Ann Math Stat 11(1):86–92
Ghazi D, Inkpen D, Szpakowicz S (2010) Hierarchical versus flat classification of emotions in text. In: NAACL HLT workshop on computational approaches to analysis & generation of emotion in text, pp 140–146
Holden N, Freitas AA (2009) Hierarchical classification of protein function with ensembles of rules and particle swarm optimisation. Soft Comput 13(3):259–272
Ke SW, Lin WC, Tsai CF, Hu YH (2017) Soft estimation by hierarchical classification and regression. Neurocomputing 234(C):27–37
Keller JM, Gray MR, Givens JA (2012) Fuzzy K-nearest neighbor algorithm. IEEE Trans Syst Man Cybern SMC–15(4):580–585
Koller D, Sahami M (1997) Hierarchically classifying documents using very few words. In: International conference on machine learning, vol. 223, pp 170–178
Kosmopoulos A, Partalas I, Gaussier E, Paliouras G, Androutsopoulos I (2015) Evaluation measures for hierarchical classification: a unified view and novel approaches. Data Min Knowl Discov 29(3):820–865
Krause J, Stark M, Deng J, Li FF (2013) 3D object representations for fine-grained categorization. In: International IEEE workshop on 3D representation and recognition, pp 554–561
Li L, Umbach DM, Terry P, Taylor JA (2004) Application of the GA/KNN method to seldi proteomics data. Bioinformatics 20(10):1638–1640
Li D, Ju Y, Zou Q (2016) Protein folds prediction with hierarchical structured SVM. Curr Proteom 13(2):79–85
Liaw A, Wiener M (2002) Classification and regression by randomforest. R News 23:18–22
Merz CJ, Murphy PM (1998) UCI repository of machine learning databases. In: Neural information processing systems
Murzin AG, Brenner SE, Hubbard TJP, Chothia C (1995) SCOP: a structural classification of proteins database for the investigation of sequences and structures. J Mol Biol 247(4):536–540
Naik A, Rangwala H (2017) Inconsistent node flattening for improving top-down hierarchical classification. In: IEEE international conference on data science and advanced analytics, pp 1–10
Nalepa J, Kawulok M (2019) Selecting training sets for support vector machines: a review. Artif Intell Rev 52(2):857–900
Qu Y, Lin L, Shen F, Lu C, Wu Y, Xie Y, Tao D (2017) Joint hierarchical category structure learning and large-scale image classification. IEEE Trans Image Process 26(9):4331–4346
Rong F (2016) Audio classification method based on machine learning. In: International conference on intelligent transportation
Shrivastava A, Tripathy AK, Dalal PK (2019) A SVM-based classification approach for obsessive compulsive disorder by oxidative stress biomarkers. J Comput Sci 36:1–9
Silla CN, Freitas AA (2009) Novel top-down approaches for hierarchical classification and their application to automatic music genre classification. In: IEEE international conference on systems, pp 1–6
Tan A, Wu WZ, Li J, Li T (2020) Reduction foundation with multigranulation rough sets using discernibility. Artif Intell Rev 53:2425–2452
Trojacanec K, Madjarov G, Loskovska S, Gjorgjevikj D (2011) Hierarchical classification architectures applied to magnetic resonance images. In: ITI international conference on information technology interfaces, pp 501–506
Wan C, Freitas AA (2017) An empirical evaluation of hierarchical feature selection methods for classification in bioinformatics datasets with gene ontology-based features. Artif Intell Rev 50(2):1–40
Wang SP, Zhu W (2018) Sparse graph embedding unsupervised feature selection. IEEE Trans Syst Man Cybern Syst 48(3):329–341
Wang Y, Chen H, Li S, Zhang J, Chao G (2017) Object tracking by color distribution fields with adaptive hierarchical structure. Vis Comput 33(2):1–13
Xiao L, Ling CX, Wang H (2013) Effective top-down active learning for hierarchical text classification. In: Pacific-Asia conference on knowledge discovery & data mining, pp 233–244
Yan E, Song J, Liu C, Luan J, Hong W (2020) Comparison of support vector machine, back propagation neural network and extreme learning machine for syndrome element differentiation. Artif Intell Rev 53:2453–2481
Yang X, Li T, Fujita H, Liu D (2019) A sequential three-way approach to multi-class decision. Int J Approx Reason 104:108–125
Yao Y (2013) Perspectives of granular computing. In: IEEE international conference on granular computing, pp 85–90
Yu W, Hu Q, Zhou Y, Hong Z, Qian Y, Liang J (2017) Local Bayes risk minimization based stopping strategy for hierarchical classification. In: IEEE international conference on data mining, pp 1–10
Zhang CC, Dai JH, Chen JL (2020) Knowledge granularity based incremental attribute reduction for incomplete decision systems. Int J Mach Learn Cybern 11(5):1141–1157
Zheng W, Zhao H (2020) Cost-sensitive hierarchical classification for imbalance classes. Appl Intell pp 1–16
Zhu W (2009) Relationship among basic concepts in covering-based rough sets. Inf Sci 179(14):2478–2486
Acknowledgements
This work was supported by the National Natural Science Foundation of China under Grant No. 61703196, the Natural Science Foundation of Fujian Province under Grant No. 2018J01549, and the President’s Fund of Minnan Normal University under Grant No. KJ19021.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Guo, S., Zhao, H. Hierarchical classification with multi-path selection based on granular computing. Artif Intell Rev 54, 2067–2089 (2021). https://doi.org/10.1007/s10462-020-09899-2
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10462-020-09899-2