research-article

Identifying Common Errors in Open-Ended Machine Learning Projects

Authors:

James Skripchuk,

Thomas PriceAuthors Info & Claims

SIGCSE 2022: Proceedings of the 53rd ACM Technical Symposium on Computer Science Education - Volume 1

Pages 216 - 222

https://doi.org/10.1145/3478431.3499397

Published: 22 February 2022 Publication History

Abstract

Machine learning (ML) is one of the fastest growing subfields in Computer Science, and it is important to identify ways to improve ML education. A key way to do so is by understanding the common errors that students make when writing ML programs, so they can be addressed. Prior work investigating ML errors has focused on an instructor perspective, but has not looked at student programming artifacts, such as projects and code submissions to understand how these errors occur and which are most common. To address this, we qualitatively coded over 2,500 cells of code from 19 final team projects (63 students) in an upper-division machine learning course. By isolating and codifying common errors and misconceptions across projects, we can identify what ML errors students struggle with. In our results, we found that library usage, hyperparameter tuning, and misusing test data were among the most common errors, and we give examples of how and when they occur. We then provide suggestions on why these misconceptions may occur, and how instructors and software designers can possibly mitigate these errors.

Supplementary Material

MP4 File (SIGCSE22-V1fp545v.mp4)

Identifying Common Errors in Open-Ended Machine Learning Projects - Presentation

Download
275.27 MB

References

[1]

Virginia Braun and Victoria Clarke. 2012. Thematic analysis. (2012).

[2]

Neil CC Brown and Amjad Altadmri. 2017. Novice Java programming mistakes: Large-scale data vs. educator beliefs. ACM Transactions on Computing Education (TOCE), Vol. 17, 2 (2017), 1--21.

Digital Library

[3]

Ricardo Caceffo, Pablo Frank-Bolton, Renan Souza, and Rodolfo Azevedo. 2019. Identifying and validating java misconceptions toward a cs1 concept inventory. In Proceedings of the 2019 ACM Conference on Innovation and Technology in Computer Science Education. 23--29.

Digital Library

[4]

Souti Chattopadhyay, Ishita Prasad, Austin Z Henley, Anita Sarma, and Titus Barik. 2020. What's Wrong with Computational Notebooks? Pain Points, Needs, and Design Opportunities. In Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems. 1--12.

Digital Library

[5]

Alexandra Chouldechova. 2017. Fair prediction with disparate impact: A study of bias in recidivism prediction instruments. Big data, Vol. 5, 2 (2017), 153--163.

[6]

Holger Danielsiek, Wolfgang Paul, and Jan Vahrenhold. 2012. Detecting and understanding students' misconceptions related to algorithms and data structures. In Proceedings of the 43rd ACM technical symposium on Computer Science Education. 21--26.

Digital Library

[7]

Amit Datta, Michael Carl Tschantz, and Anupam Datta. 2014. Automated experiments on ad privacy settings: A tale of opacity, choice, and discrimination. arXiv preprint arXiv:1408.6491 (2014).

[8]

Preet Kamal Dhillon and Gurleen Sidhu. 2012. Can software faults be analyzed using bad code smells?: An empirical study. Int J Sci Res Publ, Vol. 2, 10 (2012), 1--7.

[9]

Yihuan Dong, Samiha Marwan, Veronica Catete, Thomas Price, and Tiffany Barnes. 2019. Defining tinkering behavior in open-ended block-based programming assignments. In Proceedings of the 50th ACM Technical Symposium on Computer Science Education. 1204--1210.

Digital Library

[10]

Gao Gao, Finn Voichick, Michelle Ichinco, and Caitlin Kelleher. 2020. Exploring Programmers' API Learning Processes: Collecting Web Resources as External Memory. In 2020 IEEE Symposium on Visual Languages and Human-Centric Computing (VL/HCC). IEEE, 1--10.

[11]

Trevor Hastie, Robert Tibshirani, and Jerome Friedman. 2001. The Elements of Statistical Learning .Springer New York Inc., New York, NY, USA.

[12]

Felienne Hermans and Efthimia Aivaloglou. 2016. Do code smells hamper novice programming? A controlled experiment on Scratch programs. In 2016 IEEE 24th International Conference on Program Comprehension (ICPC). IEEE, 1--10.

[13]

Sean Kross and Philip J Guo. 2019. Practitioners teaching data science in industry and academia: Expectations, workflows, and challenges. In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems. 1--14.

Digital Library

[14]

Roberta Kwok. 2019. Junior AI researchers are in demand by universities and industry. Nature, Vol. 568, 7752 (2019), 581--584.

[15]

Gustavo A Lujan-Moreno, Phillip R Howard, Omar G Rojas, and Douglas C Montgomery. 2018. Design of experiments and response surface methodology to tune machine learning hyperparameters, with a random forest case-study. Expert Systems with Applications, Vol. 109 (2018), 195--205.

Digital Library

[16]

Joshua J Michalenko, Andrew S Lan, and Richard G Baraniuk. 2017. Data-mining textual responses to uncover misconception patterns. In Proceedings of the Fourth (2017) ACM Conference on Learning@ Scale. 245--248.

Digital Library

[17]

Philipp Probst, Anne-Laure Boulesteix, and Bernd Bischl. 2019. Tunability: importance of hyperparameters of machine learning algorithms. The Journal of Machine Learning Research, Vol. 20, 1 (2019), 1934--1965.

Digital Library

[18]

Jonathan G Richens, Ciarán M Lee, and Saurabh Johri. 2020. Improving the accuracy of medical diagnosis with causal machine learning. Nature communications, Vol. 11, 1 (2020), 1--9.

[19]

R Benjamin Shapiro and Rebecca Fiebrink. 2019. Introduction to the special section: Launching an agenda for research on learning machine learning.

[20]

Elisabeth Sulmont, Elizabeth Patitsas, and Jeremy R Cooperstock. 2019 a. Can You Teach Me To Machine Learn?. In Proceedings of the 50th ACM Technical Symposium on Computer Science Education. 948--954.

Digital Library

[21]

Elisabeth Sulmont, Elizabeth Patitsas, and Jeremy R Cooperstock. 2019 b. What is hard about teaching machine learning to non-majors? Insights from classifying instructors' learning goals. ACM Transactions on Computing Education (TOCE), Vol. 19, 4 (2019), 1--16.

Digital Library

[22]

Kyle Thayer, Sarah E Chasins, and Amy J Ko. 2021. A theory of robust API knowledge. ACM Transactions on Computing Education (TOCE), Vol. 21, 1 (2021), 1--32.

Digital Library

[23]

Gavriel Yarmish and Danny Kopec. 2007. Revisiting novice programmer errors. ACM SIGCSE Bulletin, Vol. 39, 2 (2007), 131--137.

Digital Library

Cited By

Hernández-Cuevas BLewis MJunkins WCrawford CDenham ALuo FStone JYuen TShoop LRebelsky SPrather J(2025)PhysioML: A Web-Based Tool for Machine Learning Education with Real-Time Physiological DataProceedings of the 56th ACM Technical Symposium on Computer Science Education V. 110.1145/3641554.3701815(485-491)Online publication date: 12-Feb-2025
https://dl.acm.org/doi/10.1145/3641554.3701815
Meijer WCombemale BWimmer MChechik MEgyed A(2024)Contract-based Validation of Conceptual Design Bugs for Engineering Complex Machine Learning SoftwareProceedings of the ACM/IEEE 27th International Conference on Model Driven Engineering Languages and Systems10.1145/3652620.3688201(155-161)Online publication date: 22-Sep-2024
https://dl.acm.org/doi/10.1145/3652620.3688201
Ishmakhametov NNaser MBhattacharya S(2024)Assessing the Rigor of Machine Learning in Physiological Signal Processing ApplicationsSoutheastCon 202410.1109/SoutheastCon52093.2024.10500274(1525-1533)Online publication date: 15-Mar-2024
https://doi.org/10.1109/SoutheastCon52093.2024.10500274
Show More Cited By

Index Terms

Identifying Common Errors in Open-Ended Machine Learning Projects
1. Computing methodologies
  1. Machine learning
2. Social and professional topics
  1. Professional topics
    1. Computing education

Recommendations

Can You Teach Me To Machine Learn?
SIGCSE '19: Proceedings of the 50th ACM Technical Symposium on Computer Science Education

Machine learning (ML) has become an important topic for students across disciplines to understand because of its useful applications and its societal impacts. At the same time, there is little existing work on ML education, particularly about teaching ...
Common Errors in Machine Learning Projects: A Second Look
Koli Calling '23: Proceedings of the 23rd Koli Calling International Conference on Computing Education Research

While machine learning (ML) has proved impactful in many disciplines, design decisions involved in building ML models are difficult for novices to make, and mistakes can cause harm. Prior work by Skripchuk et al. [35] identified common errors made by ML ...
What Is Hard about Teaching Machine Learning to Non-Majors? Insights from Classifying Instructors’ Learning Goals
Special Section on ML Education and Regular Articles

Given its societal impacts and applications to numerous fields, machine learning (ML) is an important topic to understand for many students outside of computer science and statistics. However, machine-learning education research is nascent, and research ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

SIGCSE 2022: Proceedings of the 53rd ACM Technical Symposium on Computer Science Education - Volume 1

February 2022

1049 pages

ISBN:9781450390705

DOI:10.1145/3478431

General Chairs:
Larry Merkle
Air Force Institute of Technology, USA
,
Maureen Doyle
Northern Kentucky University, USA
,
Program Chairs:
Judithe Sheard
Monash University, Australia
,
Leen-Kiat Soh
University of Nebraska-Lincoln, USA
,
Brian Dorn
University of Nebraska at Omaha, USA

Copyright © 2022 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

SIGCSE: ACM Special Interest Group on Computer Science Education

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 22 February 2022

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Conference

SIGCSE 2022

Sponsor:

SIGCSE

SIGCSE 2022: The 53rd ACM Technical Symposium on Computer Science Education

March 3 - 5, 2022

RI, Providence, USA

Acceptance Rates

Overall Acceptance Rate 1,787 of 5,146 submissions, 35%

Upcoming Conference

SIGCSE TS 2025

Sponsor:
sigcse

The 56th ACM Technical Symposium on Computer Science Education

February 26 - March 1, 2025

Pittsburgh , PA , USA

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

4
Total Citations
View Citations
349
Total Downloads

Downloads (Last 12 months)95
Downloads (Last 6 weeks)10

Reflects downloads up to 20 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

Hernández-Cuevas BLewis MJunkins WCrawford CDenham ALuo FStone JYuen TShoop LRebelsky SPrather J(2025)PhysioML: A Web-Based Tool for Machine Learning Education with Real-Time Physiological DataProceedings of the 56th ACM Technical Symposium on Computer Science Education V. 110.1145/3641554.3701815(485-491)Online publication date: 12-Feb-2025
https://dl.acm.org/doi/10.1145/3641554.3701815
Meijer WCombemale BWimmer MChechik MEgyed A(2024)Contract-based Validation of Conceptual Design Bugs for Engineering Complex Machine Learning SoftwareProceedings of the ACM/IEEE 27th International Conference on Model Driven Engineering Languages and Systems10.1145/3652620.3688201(155-161)Online publication date: 22-Sep-2024
https://dl.acm.org/doi/10.1145/3652620.3688201
Ishmakhametov NNaser MBhattacharya S(2024)Assessing the Rigor of Machine Learning in Physiological Signal Processing ApplicationsSoutheastCon 202410.1109/SoutheastCon52093.2024.10500274(1525-1533)Online publication date: 15-Mar-2024
https://doi.org/10.1109/SoutheastCon52093.2024.10500274
Di Nunzio GMinzoni R(2023)A Thorough Reproducibility Study on Sentiment Classification: Methodology, Experimental Setting, ResultsInformation10.3390/info1402007614:2(76)Online publication date: 28-Jan-2023
https://doi.org/10.3390/info14020076

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten