Abstract
The paper describes research on filtering decision rules with continuous and discretised condition attributes while combining characteristics of these attributes returned from supervised discretisation with their ranking. Numbers of intervals required for partitioning of attributes values imposed their grouping into corresponding categories, and for each group separately ranking procedures with Relief algorithm were executed. Information about numbers of bins combined with ranking positions were next exploited for selection of rules induced within rough set approaches. Filtering rules was performed directly by their conditions, or by calculating defined measures based on attribute weights, returning shortened decision algorithms with at least the same or improved classification accuracy.
References
Argamon, S., Burns, K., Dubnov, S. (eds.): The Structure of Style: Algorithmic Approaches to Understanding Manner and Meaning. Springer, Berlin (2010). doi:10.1007/978-3-642-12337-5
Baron, G.: Comparison of cross-validation and test sets approaches to evaluation of classifiers in authorship attribution domain. In: CzachĂłrski, T., Gelenbe, E., Grochla, K., Lent, R. (eds.) ISCIS 2016. CCIS, vol. 659, pp. 81â89. Springer, Cham (2016). doi:10.1007/978-3-319-47217-1_9
Baron, G., HarÄĆŒlak, K.: On approaches to discretization of datasets used for evaluation of decision systems. In: Czarnowski, I., Caballero, A.M., Howlett, R.J., Jain, L.C. (eds.) Intelligent Decision Technologies 2016. SIST, vol. 57, pp. 149â159. Springer, Cham (2016). doi:10.1007/978-3-319-39627-9_14
Bazan, J.G., Szczuka, M.: The rough set exploration system. In: Peters, J.F., Skowron, A. (eds.) Transactions on Rough Sets III. LNCS, vol. 3400, pp. 37â56. Springer, Heidelberg (2005). doi:10.1007/11427834_2
BĆaszczyĆski, J., Greco, S., SĆowiĆski, R.: Inductive discovery of laws using monotonic rules. Eng. Appl. Artif. Intell. 25, 284â294 (2012)
Burrows, J.: Textual analysis. In: Schreibman, S., Siemens, R., Unsworth, J. (eds.) A Companion to Digital Humanities. Blackwell, Oxford (2004)
Chikalov, I., Lozin, V., Lozina, I., Moshkov, M., Nguyen, H., Skowron, A., Zielosko, B.: Three Approaches to Data Analysis - Test Theory, Rough Sets and Logical Analysis of Data. Intelligent Systems Reference Library, vol. 41. Springer, Heidelberg (2013). doi:10.1007/978-3-642-28667-4
Craig, H.: Stylistic analysis and authorship studies. In: Schreibman, S., Siemens, R., Unsworth, J. (eds.) A Companion to Digital Humanities. Blackwell, Oxford (2004)
Cyran, K., Stanczyk, U.: Indiscernibility relation for continuous attributes: application in image recognition. In: Kryszkiewicz, M., Peters, J.F., Rybinski, H., Skowron, A. (eds.) RSEISP 2007. LNCS, vol. 4585, pp. 726â735. Springer, Heidelberg (2007). doi:10.1007/978-3-540-73451-2_76
Dougherty, J., Kohavi, R., Sahami, M.: Supervised and unsupervised discretization of continuous features. In: Machine Learning Proceedings 1995: Proceedings of the 12th International Conference on Machine Learning, pp. 194â202. Elsevier (1995)
Fayyad, U., Irani, K.: Multi-interval discretization of continuous valued attributes for classification learning. In: The 13th International Joint Conference on Artificial Intelligence, vol. 2, pp. 1022â1027. Morgan Kaufmann Publishers (1993)
Greco, S., Matarazzo, B., SĆowiĆski, R.: The use of rough sets and fuzzy sets in MCDM. In: Gal, T., Hanne, T., Stewart, T. (eds.) Advances in Multiple Criteria Decision Making, chap. 14, pp. 14.1â14.59. Kluwer Academic Publishers, Dordrecht (1999)
Greco, S., Matarazzo, B., SĆowiĆski, R.: Dominance-based rough set approach as a proper way of handling graduality in rough set theory. In: Peters, J.F., Skowron, A., Marek, V.W., OrĆowska, E., SĆowiĆski, R., Ziarko, W. (eds.) Transactions on Rough Sets VII. LNCS, vol. 4400, pp. 36â52. Springer, Heidelberg (2007). doi:10.1007/978-3-540-71663-1_3
Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.: The WEKA data mining software: an update. SIGKDD Explor. 11(1), 10â18 (2009)
Jockers, M., Witten, D.: A comparative study of machine learning methods for authorship attribution. Lit. Linguist. Comput. 25(2), 215â223 (2010)
Kononenko, I.: Estimating attributes: analysis and extensions of RELIEF. In: Bergadano, F., Raedt, L. (eds.) ECML 1994. LNCS, vol. 784, pp. 171â182. Springer, Heidelberg (1994). doi:10.1007/3-540-57868-4_57
Koppel, M., Schler, J., Argamon, S.: Computational methods in authorship attribution. J. Am. Soc. Inf. Sci. Technol. 60(1), 9â26 (2009)
Kotsiantis, S., Kanellopoulos, D.: Discretization techniques: a recent survey. GESTS Int. Trans. Comput. Sci. Eng. 32(1), 47â58 (2006)
Michalak, M., Sikora, M., WrĂłbel, L.: Rule quality measures settings in a sequential covering rule induction algorithm - an empirical approach. In: Proceedings of the 2015 Federated Conference on Computer Science and Information Systems. ACSIS-Annals of Computer Science and Information Systems, vol. 5, pp. 109â118 (2015)
Moshkov, M., Zielosko, B.: Combinatorial Machine Learning - A Rough Set Approach. Studies in Computational Intelligence, vol. 360. Springer, Heidelberg (2011). doi:10.1007/978-3-642-20995-6
Pawlak, Z.: Rough sets and intelligent data analysis. Inf. Sci. 147, 1â12 (2002)
Pawlak, Z., Skowron, A.: Rough sets and Boolean reasoning. Inf. Sci. 177(1), 41â73 (2007)
Peng, R., Hengartner, H.: Quantitative analysis of literary styles. Am. Stat. 56(3), 15â38 (2002)
SĆowiĆski, R., Greco, S., Matarazzo, B.: Dominance-based rough set approach to reasoning about ordinal data. In: Kryszkiewicz, M., Peters, J.F., Rybinski, H., Skowron, A. (eds.) RSEISP 2007. LNCS, vol. 4585, pp. 5â11. Springer, Heidelberg (2007). doi:10.1007/978-3-540-73451-2_2
StaĆczyk, U.: Attribute ranking driven filtering of decision rules. In: Kryszkiewicz, M., Cornelis, C., Ciucci, D., Medina-Moreno, J., Motoda, H., RaĆ, Z.W. (eds.) RSEISP 2014. LNCS, vol. 8537, pp. 217â224. Springer, Cham (2014). doi:10.1007/978-3-319-08729-0_21
StaĆczyk, U.: RELIEF-based selection of decision rules. Procedia Comput. Sci. 35, 299â308 (2014)
StaĆczyk, U.: Measuring quality of decision rules through ranking of conditional attributes. In: Czarnowski, I., Caballero, A.M., Howlett, R.J., Jain, L.C. (eds.) Intelligent Decision Technologies 2016. SIST, vol. 56, pp. 269â279. Springer, Cham (2016). doi:10.1007/978-3-319-39630-9_22
StaĆczyk, U.: Weighting and pruning of decision rules by attributes and attribute rankings. In: CzachĂłrski, T., Gelenbe, E., Grochla, K., Lent, R. (eds.) ISCIS 2016. CCIS, vol. 659, pp. 106â114. Springer, Cham (2016). doi:10.1007/978-3-319-47217-1_12
WrĂłbel, L., Sikora, M., Michalak, M.: Rule quality measures settings in classification, regression and survival rule induction â an empirical approach. Fundam. Inform. 149, 419â449 (2016)
Acknowledgments
In the research there was used RSES system, developed at the Institute of Mathematics, Warsaw University (http://logic.mimuw.edu.pl/~rses/) [4], 4eMka Software developed at the Laboratory of Intelligent Decision Support Systems, PoznaĆ [24], and WEKA workbench [14]. The research was performed at the Silesian University of Technology, Gliwice, within the project BK/RAu2/2017, and at the University of Silesia, Sosnowiec, within the project âMethods of artificial intelligence in information systemsâ.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
StaĆczyk, U., Zielosko, B. (2017). On Combining Discretisation Parameters and Attribute Ranking for Selection of Decision Rules. In: Polkowski, L., et al. Rough Sets. IJCRS 2017. Lecture Notes in Computer Science(), vol 10313. Springer, Cham. https://doi.org/10.1007/978-3-319-60837-2_28
Download citation
DOI: https://doi.org/10.1007/978-3-319-60837-2_28
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-60836-5
Online ISBN: 978-3-319-60837-2
eBook Packages: Computer ScienceComputer Science (R0)