Abstract
Students’ challenges in introductory programming courses have long been subject to research. In fact, learners are faced with cognitively complex tasks, such as modeling and writing programs. At the same time, educators are known to experience challenges with the classification of a competency’s cognitive complexity. In this paper, we present a text dataset with competency goals expected in basic programming courses. We then apply a deep learning approach to the dataset to classify the competency-based learning objectives as a use case. A manually annotated dataset of 35 German universities and their learning objectives in 129 introductory programming courses was processed into a machine-readable format to achieve these goals. It contains 1015 competency goals (both in German and English) and their classification into dimensions of complexity. Different state-of-the-art machine learning (ML) models, e.g., BERT, along with Natural Language Processing techniques, i.e., parts-of-speech-tagging, were combined to train a deep learning model in a supervised manner for the classification of competencies. The proof-of-concept shows that knowledge can be derived from the dataset. In the presented use case, the ML classification achieved a maximum accuracy of 81.4%. This work has several implications for educators, as it is the foundation for an application that classifies competency goals according to their cognitive complexity. The dataset can further be used to test language models as a baseline performance task. Moreover, the dataset can be extended, e.g., with data from other countries and languages. The dataset is available online under a Creative Commons license (https://github.com/nkiesler-cs/HEPComp-Dataset).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
ACM: Computer Science Curricula 2013: Curriculum Guidelines for Undergraduate Degree Programs in Computer Science. Association for Computing Machinery, New York (2013)
Anderson, L.W., et al.: A Taxonomy for Learning, Teaching, and Assessing: A Revision of Bloom’s Taxonomy of Educational Objectives. Addison Wesley Longman, New York (2001)
Bain, G., Barnes, I.: Why is programming so hard to learn? In: Proceedings of the 2014 Conference on Innovation & Technology in Computer Science Education, p. 356 (2014)
Cai, X., Dong, S., Hu, J.: A deep learning model incorporating part of speech and self-matching attention for named entity recognition of Chinese electronic medical records. BMC Med. Inform. Decis. Mak. 19(S2), 101–109 (2019)
Clark, K., Luong, M.T., Le, Q.V., Manning, C.D.: ELECTRA: pre-training text encoders as discriminators rather than generators (2020)
Clear, A., et al.: Computing Curricula 2020 Paradigms for Global Computing Education. ACM, New York (2020)
Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. arXiv:1810.04805 (2018)
Fink, L.D.: Creating Significant Learning Experiences: An Integrated Approach to Designing College Courses. Wiley, San Francisco (2013)
Gluga, R., Kay, J., Lister, R., Kleitman, S., Lever, T.: Coming to terms with bloom: an online tutorial for teachers of programming fundamentals. In: Proceedings of the Fourteenth Australasian Computing Education Conference, ACE 2012, vol. 123, pp. 147–156. Australian Computer Society Inc, AUS (2012)
HRK, S., Hippler, H.: Hochschulkompass. Online (2018). https://www.hochschulkompass.de/service/impressum.html
Kiesler, N.: Kompetenzmodellierung für die grundlegende Programmierausbildung-Eine kritische Diskussion zu Vorzügen und Anwendbarkeit der Anderson Krathwohl Taxonomie im Vergleich zum Kompetenzmodell der GI. In: Zender, R., Ifenthaler, D., an Clara Schumacher, T.L. (eds.) DELFI 2020-Die 18. Fachtagung Bildungstechnologien der Gesellschaft für Informatik e.V., Online, 14–18 September 2020. LNI, vol. P-308, pp. 187–192. Gesellschaft für Informatik e.V. (2020). https://dl.gi.de/20.500.12116/34158
Kiesler, N.: Towards a competence model for the novice programmer using bloom’s revised taxonomy - an empirical approach. In: Proceedings of the 2020 ACM Conference on Innovation and Technology in Computer Science Education, ITiCSE 2020, pp. 459–465. ACM, New York (2020). https://doi.org/10.1145/3341525.3387419
Kiesler, N.: Zur modellierung und klassifizierung von kompetenzen in der grundlegenden programmierausbildung anhand der anderson krathwohl taxonomie. CoRR abs/2006.16922 (2020). arXiv:2006.16922
Kiesler, N.: Kompetenzförderung in der Programmierausbildung durch Modellierung von Kompetenzen und informativem Feedback. Dissertation, Johann Wolfgang Goethe-Universität, Frankfurt am Main, January 2022. Fachbereich Informatik und Mathematik
Kiesler, N., Schiffner, D.: Why we need open data in computer science education research? In: Proceedings of the 2023 Conference on Innovation and Technology in Computer Science Education, ITiCSE 2023, vol. 1. ACM, New York (2023). https://doi.org/10.1145/3587102.3588860. ISBN: 979-8-4007-0138-2/23/07
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. arXiv:1412.6980 (2014)
Lan, Z., Chen, M., Goodman, S., Gimpel, K., Sharma, P., Soricut, R.: ALBERT: a lite BERT for self-supervised learning of language representations. arXiv:1909.11942 (2019)
Loshchilov, I., Hutter, F.: Decoupled weight decay regularization. arXiv:1711.05101 (2017)
Masapanta-Carrión, S., Velázquez-Iturbide, J.A.: A systematic review of the use of bloom’s taxonomy in computer science education. In: Proceedings of the 49th ACM Technical Symposium on Computer Science Education, SIGCSE 2018, pp. 441–446. Association for Computing Machinery, New York (2018). https://doi.org/10.1145/3159450.3159491
Mayring, P.: Qualitative content analysis forum qualitative sozialforschung. In: Forum: Qualitative Social Research, vol. 1 (2000)
Mayring, P.: Combination and integration of qualitative and quantitative analysis. In: Forum Qualitative Sozialforschung/Forum: Qualitative Social Research, vol. 2, p. Art. 6 (2001)
Mayring, P.: Qualitative Inhaltsanalyse: Grundlagen und Techniken. Beltz, Weinheim, 12. auflage edn. (2015)
McCracken, M., et al.: A multi-national, multi-institutional study of assessment of programming skills of first-year CS students. In: Working Group Reports from ITiCSE on Innovation and Technology in Computer Science Education, pp. 125–180. ACM, New York (2001)
Raj, R., et al.: Professional competencies in computing education: pedagogies and assessment. In: Proceedings of the 2021 Working Group Reports on Innovation and Technology in Computer Science Education, pp. 133–161. ACM, New York (2021). https://doi.org/10.1145/3502870.3506570
Raj, R.K., et al.: Toward practical computing competencies. In: Proceedings of the 26th ACM Conference on Innovation and Technology in Computer Science Education, ITiCSE 2021, vol. 2, pp. 603–604. ACM, New York (2021). https://doi.org/10.1145/3456565.3461442
Sabin, M., et al.: Information Technology Curricula 2017: Curriculum Guidelines for Baccalaureate Degree Programs in Information Technology. ACM, New York, December 2017
Scott, T.: Bloom’s taxonomy applied to testing in computer science classes. J. Comput. Sci. Coll. 19(1), 267–274 (2003)
Shazeer, N., Lan, Z., Cheng, Y., Ding, N., Hou, L.: Talking-heads attention. arXiv preprint arXiv:2003.02436 (2020)
Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I., Salakhutdinov, R.: Dropout: a simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 15(56), 1929–1958 (2014). https://jmlr.org/papers/v15/srivastava14a.html
Turc, I., Chang, M.W., Lee, K., Toutanova, K.: Well-read students learn better: on the importance of pre-training compact models. arXiv:1908.08962v2 (2019)
Wartena, C.: A probabilistic morphology model for German lemmatization. In: Proceedings of the 15th Conference on Natural Language Processing (KONVENS 2019), pp. 40–49 (2019). https://doi.org/10.25968/opus-1527
Weinert, F.E.: Concept of competence: a conceptual clarification. (2001)
Whalley, J.L., et al.: An Australasian study of reading and comprehension skills in novice programmers, using the bloom and solo taxonomies. In: Proceedings of the 8th Australasian Conference on Computing Education, ACE 2006, vol. 52, pp. 243–252. Australian Computer Society Inc, AUS (2006)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Kiesler, N., Pfülb, B. (2023). Higher Education Programming Competencies: A Novel Dataset. In: Iliadis, L., Papaleonidas, A., Angelov, P., Jayne, C. (eds) Artificial Neural Networks and Machine Learning – ICANN 2023. ICANN 2023. Lecture Notes in Computer Science, vol 14261. Springer, Cham. https://doi.org/10.1007/978-3-031-44198-1_27
Download citation
DOI: https://doi.org/10.1007/978-3-031-44198-1_27
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-44197-4
Online ISBN: 978-3-031-44198-1
eBook Packages: Computer ScienceComputer Science (R0)