Skip to main content

Selecting the Most Informative Inputs in Modelling Problems with Vague Data Applied to the Search of Informative Code Metrics for Continuous Assessment in Computer Science Online Courses

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8536))

Abstract

Sorting a set of inputs for relevance in modeling problems may be ambiguous if the data is vague. A general extension procedure is proposed in this paper that allows applying different deterministic or random feature selection algorithms to fuzzy data. This extension is based on a model of the relevance of a feature as a possibility distribution. The possibilistic relevances are ordered with the help of a fuzzy ranking. A practical problem where the most informative software metrics are searched for in an automatic grading problem is solved with this technique.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Abdellatief, M., et al.: A mapping study to investigate component-based software system metrics. Journal of Systems and Software 86(3), 587–603 (2013)

    Article  Google Scholar 

  2. Arnow, D., Barshay, O.: On-line programming examinations using Web to teach. In: Proceedings of the 4th Annual SIGCSE/SIGCUE ITiCSE Conference on Innovation and Technology in Computer Science Education (ITiCSE 1999), pp. 21–24 (1999)

    Google Scholar 

  3. Battiti, R.: Using mutual information for selecting features in supervised neural net learning. IEEE Trans. Neural Nets. 5(4) (July 1994)

    Google Scholar 

  4. Boehm, B., et al.: Cost models for future software life cycle processes: COCOMO 2.0. Annals of Software Engineering 1(1), 57–94 (1995)

    Article  Google Scholar 

  5. Bortolan, G., Degani, R.: A review of some methods for ranking fuzzy subsets. Fuzzy Sets and Systems 15(1) (1985)

    Google Scholar 

  6. Breiman, L., Friedman, L.J., Olshen, A., Stone, C.: Classification and Regression Trees. Wadsworth (1984)

    Google Scholar 

  7. Breiman, L.: Random Forests. Machine Learning 45(1), 5–32 (2001)

    Article  MATH  Google Scholar 

  8. Casillas, J., Martinez-Lopez, F., Martinez, F.: Fuzzy association rules for estimating consumer behaviour models and their application to explaining trust in internet shopping. Fuzzy Economic Review IX(2), 3–26 (2004)

    Google Scholar 

  9. Couso, I., Montes, S., Gil, P.: The necessity of the strong alpha-cuts of a fuzzy set. International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems 9(2), 249–262 (2001)

    Article  MathSciNet  MATH  Google Scholar 

  10. Couso, I., Sanchez, L.: Higher order models for fuzzy random variables. Fuzzy Sets and Systems 159(3), 237–258 (2008)

    Article  MathSciNet  MATH  Google Scholar 

  11. Cheang, B., Kurnia, A., Lim, A., Oon, W.: On automated grading of programming assignments in an academic institution. Comput. Educ. 41(2), 121–131 (2003)

    Article  Google Scholar 

  12. Dubois, D., Prade, H.: Fuzzy sets - a convenient fiction for modeling vagueness and possibility. IEEE Transactions on Fuzzy Systems 2(1), 16–21 (1994)

    Article  Google Scholar 

  13. Haykin, S.: Neural Networks: A Comprehensive Foundation, 2nd edn. Prentice Hall (1998)

    Google Scholar 

  14. Jurado, F., Redondo, M., Ortega, M.: Using fuzzy logic applied to software metrics and test cases to assess programming assignments and give advice. J. Netw. Comput. Appl. 35(2) (2012)

    Google Scholar 

  15. Kurnia, A., Lim, A., Cheang, B.: Online judge. Comput. Educ. 36(4), 299–315 (2001)

    Article  Google Scholar 

  16. McCabe, T.: A complexity measure. IEEE Trans. on Software Engineering 2(4), 308–320 (1976)

    Article  MathSciNet  MATH  Google Scholar 

  17. Reek, K.A.: A software infrastructure to support introductory computer science courses. In: Klee, K.J. (ed.) Proceedings of the Twenty-Seventh SIGCSE Technical Symposium on Computer Science Education (SIGCSE 1996), pp. 125–129. ACM, New York (1996)

    Chapter  Google Scholar 

  18. Saeys, Y., Abeel, T., Van de Peer, Y.: Robust Feature Selection Using Ensemble Feature Selection Techniques. In: Daelemans, W., Goethals, B., Morik, K. (eds.) ECML PKDD 2008, Part II. LNCS (LNAI), vol. 5212, pp. 313–325. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  19. Sanchez, L., Otero, J., Couso, I.: Obtaining linguistic fuzzy rule-based regression models from imprecise data with multiobjective genetic algorithms. Soft Comput. 13(5), 467–479 (2008)

    Article  Google Scholar 

  20. Sanchez, L., Couso, I., Casillas, J.: Genetic learning of fuzzy rules on low quality data. Fuzzy Sets and Systems 160(17), 2524–2552 (2009)

    Article  MathSciNet  MATH  Google Scholar 

  21. Smola, A.J., Schölkopf, B.: A tutorial on support vector regression. Statistics and Computing 14(3), 199–222 (2004)

    Article  MathSciNet  Google Scholar 

  22. Vujosevic-Janicica, M., Nikolica, M., Tosica, D., Kuncak, V.: Software verification and graph similarity for automated evaluation of students assignments. Information and Software Technology 55(6), 1004–1016 (2013)

    Article  Google Scholar 

  23. Wang, T., Su, X., Ma, P., Wang, Y., Wang, K.: Ability-training-oriented automated assessment in introductory programming course. Comput. Educ. 56(1), 220–226 (2011)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Otero, J., Del Rosario Suárez, M., Palacios, A., Couso, I., Sánchez, L. (2014). Selecting the Most Informative Inputs in Modelling Problems with Vague Data Applied to the Search of Informative Code Metrics for Continuous Assessment in Computer Science Online Courses. In: Cornelis, C., Kryszkiewicz, M., Ślȩzak, D., Ruiz, E.M., Bello, R., Shang, L. (eds) Rough Sets and Current Trends in Computing. RSCTC 2014. Lecture Notes in Computer Science(), vol 8536. Springer, Cham. https://doi.org/10.1007/978-3-319-08644-6_31

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-08644-6_31

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-08643-9

  • Online ISBN: 978-3-319-08644-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics