Skip to main content

Higher Education Programming Competencies: A Novel Dataset

  • Conference paper
  • First Online:
Artificial Neural Networks and Machine Learning – ICANN 2023 (ICANN 2023)

Abstract

Students’ challenges in introductory programming courses have long been subject to research. In fact, learners are faced with cognitively complex tasks, such as modeling and writing programs. At the same time, educators are known to experience challenges with the classification of a competency’s cognitive complexity. In this paper, we present a text dataset with competency goals expected in basic programming courses. We then apply a deep learning approach to the dataset to classify the competency-based learning objectives as a use case. A manually annotated dataset of 35 German universities and their learning objectives in 129 introductory programming courses was processed into a machine-readable format to achieve these goals. It contains 1015 competency goals (both in German and English) and their classification into dimensions of complexity. Different state-of-the-art machine learning (ML) models, e.g., BERT, along with Natural Language Processing techniques, i.e., parts-of-speech-tagging, were combined to train a deep learning model in a supervised manner for the classification of competencies. The proof-of-concept shows that knowledge can be derived from the dataset. In the presented use case, the ML classification achieved a maximum accuracy of 81.4%. This work has several implications for educators, as it is the foundation for an application that classifies competency goals according to their cognitive complexity. The dataset can further be used to test language models as a baseline performance task. Moreover, the dataset can be extended, e.g., with data from other countries and languages. The dataset is available online under a Creative Commons license (https://github.com/nkiesler-cs/HEPComp-Dataset).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 69.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 89.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. ACM: Computer Science Curricula 2013: Curriculum Guidelines for Undergraduate Degree Programs in Computer Science. Association for Computing Machinery, New York (2013)

    Google Scholar 

  2. Anderson, L.W., et al.: A Taxonomy for Learning, Teaching, and Assessing: A Revision of Bloom’s Taxonomy of Educational Objectives. Addison Wesley Longman, New York (2001)

    Google Scholar 

  3. Bain, G., Barnes, I.: Why is programming so hard to learn? In: Proceedings of the 2014 Conference on Innovation & Technology in Computer Science Education, p. 356 (2014)

    Google Scholar 

  4. Cai, X., Dong, S., Hu, J.: A deep learning model incorporating part of speech and self-matching attention for named entity recognition of Chinese electronic medical records. BMC Med. Inform. Decis. Mak. 19(S2), 101–109 (2019)

    Article  Google Scholar 

  5. Clark, K., Luong, M.T., Le, Q.V., Manning, C.D.: ELECTRA: pre-training text encoders as discriminators rather than generators (2020)

    Google Scholar 

  6. Clear, A., et al.: Computing Curricula 2020 Paradigms for Global Computing Education. ACM, New York (2020)

    Google Scholar 

  7. Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. arXiv:1810.04805 (2018)

  8. Fink, L.D.: Creating Significant Learning Experiences: An Integrated Approach to Designing College Courses. Wiley, San Francisco (2013)

    Google Scholar 

  9. Gluga, R., Kay, J., Lister, R., Kleitman, S., Lever, T.: Coming to terms with bloom: an online tutorial for teachers of programming fundamentals. In: Proceedings of the Fourteenth Australasian Computing Education Conference, ACE 2012, vol. 123, pp. 147–156. Australian Computer Society Inc, AUS (2012)

    Google Scholar 

  10. HRK, S., Hippler, H.: Hochschulkompass. Online (2018). https://www.hochschulkompass.de/service/impressum.html

  11. Kiesler, N.: Kompetenzmodellierung für die grundlegende Programmierausbildung-Eine kritische Diskussion zu Vorzügen und Anwendbarkeit der Anderson Krathwohl Taxonomie im Vergleich zum Kompetenzmodell der GI. In: Zender, R., Ifenthaler, D., an Clara Schumacher, T.L. (eds.) DELFI 2020-Die 18. Fachtagung Bildungstechnologien der Gesellschaft für Informatik e.V., Online, 14–18 September 2020. LNI, vol. P-308, pp. 187–192. Gesellschaft für Informatik e.V. (2020). https://dl.gi.de/20.500.12116/34158

  12. Kiesler, N.: Towards a competence model for the novice programmer using bloom’s revised taxonomy - an empirical approach. In: Proceedings of the 2020 ACM Conference on Innovation and Technology in Computer Science Education, ITiCSE 2020, pp. 459–465. ACM, New York (2020). https://doi.org/10.1145/3341525.3387419

  13. Kiesler, N.: Zur modellierung und klassifizierung von kompetenzen in der grundlegenden programmierausbildung anhand der anderson krathwohl taxonomie. CoRR abs/2006.16922 (2020). arXiv:2006.16922

  14. Kiesler, N.: Kompetenzförderung in der Programmierausbildung durch Modellierung von Kompetenzen und informativem Feedback. Dissertation, Johann Wolfgang Goethe-Universität, Frankfurt am Main, January 2022. Fachbereich Informatik und Mathematik

    Google Scholar 

  15. Kiesler, N., Schiffner, D.: Why we need open data in computer science education research? In: Proceedings of the 2023 Conference on Innovation and Technology in Computer Science Education, ITiCSE 2023, vol. 1. ACM, New York (2023). https://doi.org/10.1145/3587102.3588860. ISBN: 979-8-4007-0138-2/23/07

  16. Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. arXiv:1412.6980 (2014)

  17. Lan, Z., Chen, M., Goodman, S., Gimpel, K., Sharma, P., Soricut, R.: ALBERT: a lite BERT for self-supervised learning of language representations. arXiv:1909.11942 (2019)

  18. Loshchilov, I., Hutter, F.: Decoupled weight decay regularization. arXiv:1711.05101 (2017)

  19. Masapanta-Carrión, S., Velázquez-Iturbide, J.A.: A systematic review of the use of bloom’s taxonomy in computer science education. In: Proceedings of the 49th ACM Technical Symposium on Computer Science Education, SIGCSE 2018, pp. 441–446. Association for Computing Machinery, New York (2018). https://doi.org/10.1145/3159450.3159491

  20. Mayring, P.: Qualitative content analysis forum qualitative sozialforschung. In: Forum: Qualitative Social Research, vol. 1 (2000)

    Google Scholar 

  21. Mayring, P.: Combination and integration of qualitative and quantitative analysis. In: Forum Qualitative Sozialforschung/Forum: Qualitative Social Research, vol. 2, p. Art. 6 (2001)

    Google Scholar 

  22. Mayring, P.: Qualitative Inhaltsanalyse: Grundlagen und Techniken. Beltz, Weinheim, 12. auflage edn. (2015)

    Google Scholar 

  23. McCracken, M., et al.: A multi-national, multi-institutional study of assessment of programming skills of first-year CS students. In: Working Group Reports from ITiCSE on Innovation and Technology in Computer Science Education, pp. 125–180. ACM, New York (2001)

    Google Scholar 

  24. Raj, R., et al.: Professional competencies in computing education: pedagogies and assessment. In: Proceedings of the 2021 Working Group Reports on Innovation and Technology in Computer Science Education, pp. 133–161. ACM, New York (2021). https://doi.org/10.1145/3502870.3506570

  25. Raj, R.K., et al.: Toward practical computing competencies. In: Proceedings of the 26th ACM Conference on Innovation and Technology in Computer Science Education, ITiCSE 2021, vol. 2, pp. 603–604. ACM, New York (2021). https://doi.org/10.1145/3456565.3461442

  26. Sabin, M., et al.: Information Technology Curricula 2017: Curriculum Guidelines for Baccalaureate Degree Programs in Information Technology. ACM, New York, December 2017

    Google Scholar 

  27. Scott, T.: Bloom’s taxonomy applied to testing in computer science classes. J. Comput. Sci. Coll. 19(1), 267–274 (2003)

    Google Scholar 

  28. Shazeer, N., Lan, Z., Cheng, Y., Ding, N., Hou, L.: Talking-heads attention. arXiv preprint arXiv:2003.02436 (2020)

  29. Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I., Salakhutdinov, R.: Dropout: a simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 15(56), 1929–1958 (2014). https://jmlr.org/papers/v15/srivastava14a.html

  30. Turc, I., Chang, M.W., Lee, K., Toutanova, K.: Well-read students learn better: on the importance of pre-training compact models. arXiv:1908.08962v2 (2019)

  31. Wartena, C.: A probabilistic morphology model for German lemmatization. In: Proceedings of the 15th Conference on Natural Language Processing (KONVENS 2019), pp. 40–49 (2019). https://doi.org/10.25968/opus-1527

  32. Weinert, F.E.: Concept of competence: a conceptual clarification. (2001)

    Google Scholar 

  33. Whalley, J.L., et al.: An Australasian study of reading and comprehension skills in novice programmers, using the bloom and solo taxonomies. In: Proceedings of the 8th Australasian Conference on Computing Education, ACE 2006, vol. 52, pp. 243–252. Australian Computer Society Inc, AUS (2006)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Natalie Kiesler .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Kiesler, N., Pfülb, B. (2023). Higher Education Programming Competencies: A Novel Dataset. In: Iliadis, L., Papaleonidas, A., Angelov, P., Jayne, C. (eds) Artificial Neural Networks and Machine Learning – ICANN 2023. ICANN 2023. Lecture Notes in Computer Science, vol 14261. Springer, Cham. https://doi.org/10.1007/978-3-031-44198-1_27

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-44198-1_27

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-44197-4

  • Online ISBN: 978-3-031-44198-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics