Selecting the Most Informative Inputs in Modelling Problems with Vague Data Applied to the Search of Informative Code Metrics for Continuous Assessment in Computer Science Online Courses

Otero, José; Del Rosario Suárez, Maria; Palacios, Ana; Couso, Inés; Sánchez, Luciano

doi:10.1007/978-3-319-08644-6_31

Selecting the Most Informative Inputs in Modelling Problems with Vague Data Applied to the Search of Informative Code Metrics for Continuous Assessment in Computer Science Online Courses

José Otero²⁵,
Maria Del Rosario Suárez²⁵,
Ana Palacios²⁶,
Inés Couso²⁷ &
…
Luciano Sánchez²⁵

Conference paper

879 Accesses
2 Citations

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8536))

Abstract

Sorting a set of inputs for relevance in modeling problems may be ambiguous if the data is vague. A general extension procedure is proposed in this paper that allows applying different deterministic or random feature selection algorithms to fuzzy data. This extension is based on a model of the relevance of a feature as a possibility distribution. The possibilistic relevances are ordered with the help of a fuzzy ranking. A practical problem where the most informative software metrics are searched for in an automatic grading problem is solved with this technique.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Abdellatief, M., et al.: A mapping study to investigate component-based software system metrics. Journal of Systems and Software 86(3), 587–603 (2013)
Article Google Scholar
Arnow, D., Barshay, O.: On-line programming examinations using Web to teach. In: Proceedings of the 4th Annual SIGCSE/SIGCUE ITiCSE Conference on Innovation and Technology in Computer Science Education (ITiCSE 1999), pp. 21–24 (1999)
Google Scholar
Battiti, R.: Using mutual information for selecting features in supervised neural net learning. IEEE Trans. Neural Nets. 5(4) (July 1994)
Google Scholar
Boehm, B., et al.: Cost models for future software life cycle processes: COCOMO 2.0. Annals of Software Engineering 1(1), 57–94 (1995)
Article Google Scholar
Bortolan, G., Degani, R.: A review of some methods for ranking fuzzy subsets. Fuzzy Sets and Systems 15(1) (1985)
Google Scholar
Breiman, L., Friedman, L.J., Olshen, A., Stone, C.: Classification and Regression Trees. Wadsworth (1984)
Google Scholar
Breiman, L.: Random Forests. Machine Learning 45(1), 5–32 (2001)
Article MATH Google Scholar
Casillas, J., Martinez-Lopez, F., Martinez, F.: Fuzzy association rules for estimating consumer behaviour models and their application to explaining trust in internet shopping. Fuzzy Economic Review IX(2), 3–26 (2004)
Google Scholar
Couso, I., Montes, S., Gil, P.: The necessity of the strong alpha-cuts of a fuzzy set. International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems 9(2), 249–262 (2001)
Article MathSciNet MATH Google Scholar
Couso, I., Sanchez, L.: Higher order models for fuzzy random variables. Fuzzy Sets and Systems 159(3), 237–258 (2008)
Article MathSciNet MATH Google Scholar
Cheang, B., Kurnia, A., Lim, A., Oon, W.: On automated grading of programming assignments in an academic institution. Comput. Educ. 41(2), 121–131 (2003)
Article Google Scholar
Dubois, D., Prade, H.: Fuzzy sets - a convenient fiction for modeling vagueness and possibility. IEEE Transactions on Fuzzy Systems 2(1), 16–21 (1994)
Article Google Scholar
Haykin, S.: Neural Networks: A Comprehensive Foundation, 2nd edn. Prentice Hall (1998)
Google Scholar
Jurado, F., Redondo, M., Ortega, M.: Using fuzzy logic applied to software metrics and test cases to assess programming assignments and give advice. J. Netw. Comput. Appl. 35(2) (2012)
Google Scholar
Kurnia, A., Lim, A., Cheang, B.: Online judge. Comput. Educ. 36(4), 299–315 (2001)
Article Google Scholar
McCabe, T.: A complexity measure. IEEE Trans. on Software Engineering 2(4), 308–320 (1976)
Article MathSciNet MATH Google Scholar
Reek, K.A.: A software infrastructure to support introductory computer science courses. In: Klee, K.J. (ed.) Proceedings of the Twenty-Seventh SIGCSE Technical Symposium on Computer Science Education (SIGCSE 1996), pp. 125–129. ACM, New York (1996)
Chapter Google Scholar
Saeys, Y., Abeel, T., Van de Peer, Y.: Robust Feature Selection Using Ensemble Feature Selection Techniques. In: Daelemans, W., Goethals, B., Morik, K. (eds.) ECML PKDD 2008, Part II. LNCS (LNAI), vol. 5212, pp. 313–325. Springer, Heidelberg (2008)
Chapter Google Scholar
Sanchez, L., Otero, J., Couso, I.: Obtaining linguistic fuzzy rule-based regression models from imprecise data with multiobjective genetic algorithms. Soft Comput. 13(5), 467–479 (2008)
Article Google Scholar
Sanchez, L., Couso, I., Casillas, J.: Genetic learning of fuzzy rules on low quality data. Fuzzy Sets and Systems 160(17), 2524–2552 (2009)
Article MathSciNet MATH Google Scholar
Smola, A.J., Schölkopf, B.: A tutorial on support vector regression. Statistics and Computing 14(3), 199–222 (2004)
Article MathSciNet Google Scholar
Vujosevic-Janicica, M., Nikolica, M., Tosica, D., Kuncak, V.: Software verification and graph similarity for automated evaluation of students assignments. Information and Software Technology 55(6), 1004–1016 (2013)
Article Google Scholar
Wang, T., Su, X., Ma, P., Wang, Y., Wang, K.: Ability-training-oriented automated assessment in introductory programming course. Comput. Educ. 56(1), 220–226 (2011)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Departamento de Informática, Edificio Departamental Oeste, Universidad de Oviedo, Campus de Viesques, Gijón, 33204, Asturias, España
José Otero, Maria Del Rosario Suárez & Luciano Sánchez
Departamento de Ciencias de la Computación e Inteligencia Artificial, ETSIIyT, Universidad de Granada, Granada, 18071, Granada, España
Ana Palacios
Departamento de Estadística e I.O. y D. M. EPI Gijón, Edificio Principal, Universidad de Oviedo, Campus de Viesques, Gijón, 33204, Asturias, España
Inés Couso

Authors

José Otero
View author publications
You can also search for this author in PubMed Google Scholar
Maria Del Rosario Suárez
View author publications
You can also search for this author in PubMed Google Scholar
Ana Palacios
View author publications
You can also search for this author in PubMed Google Scholar
Inés Couso
View author publications
You can also search for this author in PubMed Google Scholar
Luciano Sánchez
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science and Artificial Intelligence, University of Granada, Calle del Periodista Daniel Saucedo Aranda s/n, 18071, Granada, Spain
Chris Cornelis
Institute of Computer Science, Warsaw University of Technology, Nowowiejska 15/19, 00-665, Warsaw, Poland
Marzena Kryszkiewicz
University of Warsaw, Poland
Dominik Ślȩzak
Polytechnic University of Madrid, Spain
Ernestina Menasalvas Ruiz
Deparment of Computer Sciences, Universidad Central Marta Abreu de las Villas, Santa Clara, Villa Clara, Cuba
Rafael Bello
Department of Computer Science and Technology, Nanjing University, 210023, Nanjing, China
Lin Shang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Otero, J., Del Rosario Suárez, M., Palacios, A., Couso, I., Sánchez, L. (2014). Selecting the Most Informative Inputs in Modelling Problems with Vague Data Applied to the Search of Informative Code Metrics for Continuous Assessment in Computer Science Online Courses. In: Cornelis, C., Kryszkiewicz, M., Ślȩzak, D., Ruiz, E.M., Bello, R., Shang, L. (eds) Rough Sets and Current Trends in Computing. RSCTC 2014. Lecture Notes in Computer Science(), vol 8536. Springer, Cham. https://doi.org/10.1007/978-3-319-08644-6_31

Download citation

DOI: https://doi.org/10.1007/978-3-319-08644-6_31
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-08643-9
Online ISBN: 978-3-319-08644-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics