Abstract
Many universities have developed Automated Program Assessment Systems to automate the tasks of assessing students’ computer programs, so as to enhance students’ learning and relieve instructors’ workload. These systems typically evaluate the correctness of a program by comparing its actual outputs with the instructor’s pre-defined expected outputs. However, an actual output may still be correct even if it deviates from the expected output. One challenge in building such a system is to devise an automated mechanism for determining program output correctness that matches the instructor’s own judgment. This is difficult when instructors’ individual judgments differ. This paper reports an exploratory empirical study that evaluates instructors’ agreement on the correctness of students’ program outputs. Our study demonstrates reasonably good overall agreement between the instructors and reveals the categories of program output variants for which they are more likely to agree or disagree.
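The output-comparison strategy described in the abstract can be illustrated with a minimal sketch. The normalization rules below (collapsing whitespace and folding letter case) are illustrative assumptions, not the mechanisms of any particular system surveyed in the paper; they show how a tolerant comparator may accept output variants that a strict string match would reject, which is precisely where instructor judgments can diverge.

```python
# Illustrative sketch (assumed rules, not the paper's method): a simple
# output-correctness check that treats common variants -- extra
# whitespace and letter case -- as equivalent to the expected output.

def normalize(output: str) -> str:
    """Collapse runs of whitespace and lower-case the text,
    so trivially different outputs compare equal."""
    return " ".join(output.lower().split())

def is_correct(actual: str, expected: str, tolerant: bool = True) -> bool:
    """Compare a student's actual output against the expected output,
    either strictly or after normalization."""
    if tolerant:
        return normalize(actual) == normalize(expected)
    return actual == expected
```

For example, `is_correct("Sum =  10\n", "sum = 10")` accepts the variant, while the strict check `is_correct("Sum =  10\n", "sum = 10", tolerant=False)` rejects it; whether such a variant should count as correct is exactly the kind of judgment the study probes.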
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Tang, C.M., Yu, Y.T. (2013). An Exploratory Study on Instructors’ Agreement on the Correctness of Computer Program Outputs. In: Cheung, S.K.S., Fong, J., Fong, W., Wang, F.L., Kwok, L.F. (eds) Hybrid Learning and Continuing Education. ICHL 2013. Lecture Notes in Computer Science, vol 8038. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-39750-9_7
DOI: https://doi.org/10.1007/978-3-642-39750-9_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-39749-3
Online ISBN: 978-3-642-39750-9