Higher Education Programming Competencies: A Novel Dataset

Kiesler, Natalie; Pfülb, Benedikt

doi:10.1007/978-3-031-44198-1_27

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14261))

Included in the following conference series:

International Conference on Artificial Neural Networks

672 Accesses
8 Citations

Abstract

Students’ challenges in introductory programming courses have long been subject to research. In fact, learners are faced with cognitively complex tasks, such as modeling and writing programs. At the same time, educators are known to experience challenges with the classification of a competency’s cognitive complexity. In this paper, we present a text dataset with competency goals expected in basic programming courses. We then apply a deep learning approach to the dataset to classify the competency-based learning objectives as a use case. A manually annotated dataset of 35 German universities and their learning objectives in 129 introductory programming courses was processed into a machine-readable format to achieve these goals. It contains 1015 competency goals (both in German and English) and their classification into dimensions of complexity. Different state-of-the-art machine learning (ML) models, e.g., BERT, along with Natural Language Processing techniques, i.e., parts-of-speech-tagging, were combined to train a deep learning model in a supervised manner for the classification of competencies. The proof-of-concept shows that knowledge can be derived from the dataset. In the presented use case, the ML classification achieved a maximum accuracy of 81.4%. This work has several implications for educators, as it is the foundation for an application that classifies competency goals according to their cognitive complexity. The dataset can further be used to test language models as a baseline performance task. Moreover, the dataset can be extended, e.g., with data from other countries and languages. The dataset is available online under a Creative Commons license (https://github.com/nkiesler-cs/HEPComp-Dataset).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 69.99; Price excludes VAT (USA)

Softcover Book: USD 89.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

ACM: Computer Science Curricula 2013: Curriculum Guidelines for Undergraduate Degree Programs in Computer Science. Association for Computing Machinery, New York (2013)
Google Scholar
Anderson, L.W., et al.: A Taxonomy for Learning, Teaching, and Assessing: A Revision of Bloom’s Taxonomy of Educational Objectives. Addison Wesley Longman, New York (2001)
Google Scholar
Bain, G., Barnes, I.: Why is programming so hard to learn? In: Proceedings of the 2014 Conference on Innovation & Technology in Computer Science Education, p. 356 (2014)
Google Scholar
Cai, X., Dong, S., Hu, J.: A deep learning model incorporating part of speech and self-matching attention for named entity recognition of Chinese electronic medical records. BMC Med. Inform. Decis. Mak. 19(S2), 101–109 (2019)
Article Google Scholar
Clark, K., Luong, M.T., Le, Q.V., Manning, C.D.: ELECTRA: pre-training text encoders as discriminators rather than generators (2020)
Google Scholar
Clear, A., et al.: Computing Curricula 2020 Paradigms for Global Computing Education. ACM, New York (2020)
Google Scholar
Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. arXiv:1810.04805 (2018)
Fink, L.D.: Creating Significant Learning Experiences: An Integrated Approach to Designing College Courses. Wiley, San Francisco (2013)
Google Scholar
Gluga, R., Kay, J., Lister, R., Kleitman, S., Lever, T.: Coming to terms with bloom: an online tutorial for teachers of programming fundamentals. In: Proceedings of the Fourteenth Australasian Computing Education Conference, ACE 2012, vol. 123, pp. 147–156. Australian Computer Society Inc, AUS (2012)
Google Scholar
HRK, S., Hippler, H.: Hochschulkompass. Online (2018). https://www.hochschulkompass.de/service/impressum.html
Kiesler, N.: Kompetenzmodellierung für die grundlegende Programmierausbildung-Eine kritische Diskussion zu Vorzügen und Anwendbarkeit der Anderson Krathwohl Taxonomie im Vergleich zum Kompetenzmodell der GI. In: Zender, R., Ifenthaler, D., an Clara Schumacher, T.L. (eds.) DELFI 2020-Die 18. Fachtagung Bildungstechnologien der Gesellschaft für Informatik e.V., Online, 14–18 September 2020. LNI, vol. P-308, pp. 187–192. Gesellschaft für Informatik e.V. (2020). https://dl.gi.de/20.500.12116/34158
Kiesler, N.: Towards a competence model for the novice programmer using bloom’s revised taxonomy - an empirical approach. In: Proceedings of the 2020 ACM Conference on Innovation and Technology in Computer Science Education, ITiCSE 2020, pp. 459–465. ACM, New York (2020). https://doi.org/10.1145/3341525.3387419
Kiesler, N.: Zur modellierung und klassifizierung von kompetenzen in der grundlegenden programmierausbildung anhand der anderson krathwohl taxonomie. CoRR abs/2006.16922 (2020). arXiv:2006.16922
Kiesler, N.: Kompetenzförderung in der Programmierausbildung durch Modellierung von Kompetenzen und informativem Feedback. Dissertation, Johann Wolfgang Goethe-Universität, Frankfurt am Main, January 2022. Fachbereich Informatik und Mathematik
Google Scholar
Kiesler, N., Schiffner, D.: Why we need open data in computer science education research? In: Proceedings of the 2023 Conference on Innovation and Technology in Computer Science Education, ITiCSE 2023, vol. 1. ACM, New York (2023). https://doi.org/10.1145/3587102.3588860. ISBN: 979-8-4007-0138-2/23/07
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. arXiv:1412.6980 (2014)
Lan, Z., Chen, M., Goodman, S., Gimpel, K., Sharma, P., Soricut, R.: ALBERT: a lite BERT for self-supervised learning of language representations. arXiv:1909.11942 (2019)
Loshchilov, I., Hutter, F.: Decoupled weight decay regularization. arXiv:1711.05101 (2017)
Masapanta-Carrión, S., Velázquez-Iturbide, J.A.: A systematic review of the use of bloom’s taxonomy in computer science education. In: Proceedings of the 49th ACM Technical Symposium on Computer Science Education, SIGCSE 2018, pp. 441–446. Association for Computing Machinery, New York (2018). https://doi.org/10.1145/3159450.3159491
Mayring, P.: Qualitative content analysis forum qualitative sozialforschung. In: Forum: Qualitative Social Research, vol. 1 (2000)
Google Scholar
Mayring, P.: Combination and integration of qualitative and quantitative analysis. In: Forum Qualitative Sozialforschung/Forum: Qualitative Social Research, vol. 2, p. Art. 6 (2001)
Google Scholar
Mayring, P.: Qualitative Inhaltsanalyse: Grundlagen und Techniken. Beltz, Weinheim, 12. auflage edn. (2015)
Google Scholar
McCracken, M., et al.: A multi-national, multi-institutional study of assessment of programming skills of first-year CS students. In: Working Group Reports from ITiCSE on Innovation and Technology in Computer Science Education, pp. 125–180. ACM, New York (2001)
Google Scholar
Raj, R., et al.: Professional competencies in computing education: pedagogies and assessment. In: Proceedings of the 2021 Working Group Reports on Innovation and Technology in Computer Science Education, pp. 133–161. ACM, New York (2021). https://doi.org/10.1145/3502870.3506570
Raj, R.K., et al.: Toward practical computing competencies. In: Proceedings of the 26th ACM Conference on Innovation and Technology in Computer Science Education, ITiCSE 2021, vol. 2, pp. 603–604. ACM, New York (2021). https://doi.org/10.1145/3456565.3461442
Sabin, M., et al.: Information Technology Curricula 2017: Curriculum Guidelines for Baccalaureate Degree Programs in Information Technology. ACM, New York, December 2017
Google Scholar
Scott, T.: Bloom’s taxonomy applied to testing in computer science classes. J. Comput. Sci. Coll. 19(1), 267–274 (2003)
Google Scholar
Shazeer, N., Lan, Z., Cheng, Y., Ding, N., Hou, L.: Talking-heads attention. arXiv preprint arXiv:2003.02436 (2020)
Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I., Salakhutdinov, R.: Dropout: a simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 15(56), 1929–1958 (2014). https://jmlr.org/papers/v15/srivastava14a.html
Turc, I., Chang, M.W., Lee, K., Toutanova, K.: Well-read students learn better: on the importance of pre-training compact models. arXiv:1908.08962v2 (2019)
Wartena, C.: A probabilistic morphology model for German lemmatization. In: Proceedings of the 15th Conference on Natural Language Processing (KONVENS 2019), pp. 40–49 (2019). https://doi.org/10.25968/opus-1527
Weinert, F.E.: Concept of competence: a conceptual clarification. (2001)
Google Scholar
Whalley, J.L., et al.: An Australasian study of reading and comprehension skills in novice programmers, using the bloom and solo taxonomies. In: Proceedings of the 8th Australasian Conference on Computing Education, ACE 2006, vol. 52, pp. 243–252. Australian Computer Society Inc, AUS (2006)
Google Scholar

Download references

Author information

Authors and Affiliations

DIPF Leibniz Institute for Research and Information in Education, Frankfurt, Germany
Natalie Kiesler & Benedikt Pfülb

Authors

Natalie Kiesler
View author publications
You can also search for this author in PubMed Google Scholar
Benedikt Pfülb
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Natalie Kiesler .

Editor information

Editors and Affiliations

Democritus University of Thrace, Xanthi, Greece
Lazaros Iliadis
Democritus University of Thrace, Xanthi, Greece
Antonios Papaleonidas
Lancaster University, Lancaster, UK
Plamen Angelov
Teesside University, Middlesbrough, UK
Chrisina Jayne

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kiesler, N., Pfülb, B. (2023). Higher Education Programming Competencies: A Novel Dataset. In: Iliadis, L., Papaleonidas, A., Angelov, P., Jayne, C. (eds) Artificial Neural Networks and Machine Learning – ICANN 2023. ICANN 2023. Lecture Notes in Computer Science, vol 14261. Springer, Cham. https://doi.org/10.1007/978-3-031-44198-1_27

Download citation

DOI: https://doi.org/10.1007/978-3-031-44198-1_27
Published: 22 September 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-44197-4
Online ISBN: 978-3-031-44198-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics