Abstract
Rough set theory, supplying an effective model for representation of uncertain knowledge, has been widely used in knowledge engineering and data mining. Especially, rough set theory has been used as an attribute selection method with much success. However, current rough set approaches for attribute reduction are unsuitable for semi-supervised learning as no enough labeled data can guarantee to calculate the dependency degree. We propose a new attribute selection strategy based on rough sets, called DualPOS. It provides mutual function mechanism of multi-attributes, and generates the most consistent one as a candidate. Experiments are carried out to test the performances of classification and clustering of the proposed algorithm. The results show that DualPOS is valid for attribute selection in semi-supervised learning.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Blum, A.L., Langley, P.: Selection of relevant features and examples in machine learning. Artif. Intell. 97(1C2), 245–271 (1997)
Guyon, I., Elisseeff, A.: An introduction to variable and feature selection. J. Mach. Learn. Res. 3, 1157–1182 (2003)
Bae, C., Yeh, W.C., Chung, Y.Y., Liu, S.L.: Feature selection with intelligent dynamic swarm and rough set. Expert Syst. Appl. 37(10), 7026–7032 (2010)
Pawlak, Z.: Rough sets. Int. J. Comput. Inform. Sci. 11(5), 341–356 (1982)
Pawlak, Z.: Rough sets and fuzzy sets. Fuzzy Sets Syst. 17(1), 99–102 (1985)
Revett, K., Iantovics, B.: A survey of electronic fetal monitoring: a computational perspective. Stud. Comput. Intell. 486, 135–141 (2014)
Skowron, A., Rauszer, C.: The discernibility matrices and functions in information systems. In: Slowiński, R. (ed.) Intelligent Decision Support. Theory and Decision Library, vol. 11, pp. 331–362. Springer, Netherlands (1992)
Vafaie, H., Imam, I.F.: Feature selection methods: genetic algorithms vs. greedy-like search. In: Proceedings of the International Conference on Fuzzy and Intelligent Control Systems, pp. 39–43 (1994)
Hu, X., Cercone, N.: Learning in relational databases: a rough set approach. Comput. Intell. 11(2), 323–338 (1995)
Hu, X.: Knowledge discovery in databases: an attribute-oriented rough set approach. Ph.D. thesis, Citeseer (1995)
Susmaga, R.: Reducts and constructs in attribute reduction. Fundamenta Informaticae 61(2), 159–181 (2004)
Dai, J., Wang, W., Xu, Q.: An uncertainty measure for incomplete decision tables and its applications. IEEE Trans. Cybern. 43(4), 1277–1289 (2013)
Dai, J., Wang, W., Tian, H., Liu, L.: Attribute selection based on a new conditional entropy for incomplete decision systems. Knowl.-Based Syst. 39, 207–213 (2013)
Dai, J., Xu, Q., Wang, W., Tian, H.: Conditional entropy for incomplete decision systems and its application in data mining. Int. J. Gen. Syst. 41(7), 713–728 (2012)
Dai, J., Xu, Q.: Attribute selection based on information gain ratio in fuzzy rough set theory with application to tumor classification. Appl. Soft Comput. 13(1), 211–221 (2013)
Dai, J., Li, Y.X., Liu, Q.: Hybrid genetic algorithm for reduct of attributes in decision system based on rough set theory. Wuhan Univ. J. Nat. Sci. 7(3), 285–289 (2002)
Dai, J., Chen, W., Gu, H., Pan, Y.: Particle swarm algorithm for minimal attribute reduction of decision data tables. In: Proceedings First International Multi-Symposiums on Computer and Computational Sciences (IMSCCS 2006), Hangzhou, China, I, pp. 572–575, April 2006
Bazan, J.G., Nguyen, H.S., Nguyen, S.H., Synak, P., Wróblewski, J.: Rough set algorithms in classification problem. In: Polkowski, L., Tsumoto, S., Lin, T.Y. (eds.) Rough Set Methods and Applications. Studies in Fuzziness and Soft Computing, vol. 56, pp. 49–88. Springer, Heidelberg (2000)
Wroblewski, J.: Finding minimal reducts using genetic algorithms. In: Proccedings of the 2nd Annual Join Conference on Infromation Science, pp. 186–189 (1995)
Zhu, X., Goldberg, A.B.: Introduction to semi-supervised learning. Synth. Lect. Artif. Intell. Mach. Learn. 3(1), 1–130 (2009)
Pawlak, Z., Sowinski, R.: Rough set approach to multi-attribute decision analysis. Eur. J. Oper. Res. 72(3), 443–459 (1994)
Pawlak, Z., Grzymala-Busse, J., Slowinski, R., Ziarko, W.: Rough sets. Commun. ACM 38(11), 88–95 (1995)
Dai, J., Xu, Q.: Approximations and uncertainty measures in incomplete information systems. Inf. Sci. 198, 62–80 (2012)
Dai, J., Wang, W., Xu, Q., Tian, H.: Uncertainty measurement for interval-valued decision systems based on extended conditional entropy. Knowl.-Based Syst. 27, 443–450 (2012)
Fayyad, U., Irani, K.: Multi-interval discretization of continuous-valued attributes for classification learning. In: Proceedings of the 13th International Join Conference on Artificial Intelligence, pp. 1022–1027 (1993)
Jain, A., Zongker, D.: Feature selection: evaluation, application, and small sample performance. IEEE Trans. Pattern Anal. Mach. Intell. 19(2), 153–158 (1997)
Zhu, H., Zhou, M.: Efficient role transfer based on kuhn-munkres algorithm. IEEE Trans. Syst. Man Cybern. Part A Syst. Hum. 42(2), 491–496 (2012)
Acknowledgements
This work was partially supported by the National Natural Science Foundation of China (No. 61473259, No. 61070074, No. 60703038), the Zhejiang Provincial Natural Science Foundation (No. Y14F020118), the National Science & Technology Support Program of China (2015BAK26B00, 2015BAK26B02) and the PEIYANG Young Scholars Program of Tianjin University (2016XRX-0001).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Dai, J., Han, H., Hu, H., Hu, Q., Zhang, J., Wang, W. (2016). DualPOS: A Semi-supervised Attribute Selection Approach for Symbolic Data Based on Rough Set Theory. In: Cui, B., Zhang, N., Xu, J., Lian, X., Liu, D. (eds) Web-Age Information Management. WAIM 2016. Lecture Notes in Computer Science(), vol 9659. Springer, Cham. https://doi.org/10.1007/978-3-319-39958-4_31
Download citation
DOI: https://doi.org/10.1007/978-3-319-39958-4_31
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-39957-7
Online ISBN: 978-3-319-39958-4
eBook Packages: Computer ScienceComputer Science (R0)