Evaluating the Robustness of Learning Analytics Results Against Fake Learners

Alexandron, Giora; Ruipérez-Valiente, José A.; Lee, Sunbok; Pritchard, David E.

doi:10.1007/978-3-319-98572-5_6

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 11082)

Included in the following conference series: European Conference on Technology Enhanced Learning

Abstract

Massive Open Online Courses (MOOCs) collect large amounts of rich data. A primary objective of Learning Analytics (LA) research is to study these data in order to improve the pedagogy of interactive learning environments. Most studies assume that the data represent truthful and honest learning activity. However, previous studies showed that MOOCs can have large cohorts of users who break this assumption and achieve high performance through behaviors such as Cheating Using Multiple Accounts or unauthorized collaboration; we therefore denote them fake learners. Because of their aberrant behavior, fake learners can bias the results of LA models. The goal of this study is to evaluate the robustness of LA results when the data contain a considerable number of fake learners. Our methodology follows the rationale of ‘replication research’. We challenge the results reported in a well-known paper, one of the first LA/Pedagogic-Efficacy MOOC papers, by replicating its results with and without the fake learners (identified using machine learning algorithms). The results show that fake learners exhibit very different behavior compared to true learners. However, even though they are a significant portion of the student population (~15%), their effect on the results is not dramatic: it does not change the trends. We conclude that the LA study that we challenged was robust against fake learners. While these results carry an optimistic message about the trustworthiness of LA research, they rely on data from one MOOC. We believe that this issue should receive more attention within the LA research community, and that it can explain some ‘surprising’ research results in MOOCs.
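
The "with and without" comparison described in the abstract can be illustrated with a minimal sketch. It is not the authors' analysis: the metric (a Pearson correlation between an activity feature and an outcome), the field names `time_on_task`, `score` and `is_fake`, and the toy records are all hypothetical, and the fake-learner flag is assumed to have been produced beforehand by some detection step that is not shown here.

```python
from statistics import mean


def pearson(xs, ys):
    """Pearson correlation of two equal-length numeric sequences."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_y = sum((y - my) ** 2 for y in ys)
    return cov / (var_x * var_y) ** 0.5


def robustness_check(learners, feature, outcome, flag="is_fake"):
    """Compute the same statistic on all learners and on unflagged learners only.

    `flag` names a boolean field assumed to come from an external
    fake-learner detector (hypothetical; not part of this sketch).
    """
    def corr(rows):
        return pearson([r[feature] for r in rows], [r[outcome] for r in rows])

    with_fakes = corr(learners)
    without_fakes = corr([r for r in learners if not r[flag]])
    return {
        "all": with_fakes,
        "true_only": without_fakes,
        # The trend is considered unchanged if the sign of the correlation agrees.
        "same_sign": (with_fakes > 0) == (without_fakes > 0),
    }


# Synthetic toy records, for illustration only (not data from the study).
learners = [
    {"time_on_task": 10, "score": 0.40, "is_fake": False},
    {"time_on_task": 20, "score": 0.50, "is_fake": False},
    {"time_on_task": 30, "score": 0.70, "is_fake": False},
    {"time_on_task": 40, "score": 0.80, "is_fake": False},
    {"time_on_task": 50, "score": 0.90, "is_fake": False},
    # A flagged account: very little activity but a near-perfect score.
    {"time_on_task": 5, "score": 0.95, "is_fake": True},
]

result = robustness_check(learners, "time_on_task", "score")
print(result)
```

In this toy setting the flagged account weakens the correlation without reversing its sign, which is the kind of outcome the abstract describes as an effect that "does not change the trends". A real replication would apply the original study's own models in place of the single correlation used here.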

Author information

Authors and Affiliations

Weizmann Institute of Science, Rehovot, Israel
Giora Alexandron
Massachusetts Institute of Technology, Cambridge, MA, USA
José A. Ruipérez-Valiente & David E. Pritchard
University of Houston, Houston, TX, USA
Sunbok Lee

Corresponding author

Correspondence to Giora Alexandron.

Editor information

Editors and Affiliations

Graz University of Technology, Graz, Austria
Viktoria Pammer-Schindler
Pontificia Universidad Católica de Chile, Providencia, Santiago de Chile, Chile
Mar Pérez-Sanagustín
DIPF | Leibniz Institute for Research and Information in Education, Frankfurt, Germany
Hendrik Drachsler
RayCom BV, Utrecht, Utrecht, The Netherlands
Raymond Elferink
Open University Netherlands, Heerlen, The Netherlands
Maren Scheffel

Cite this paper

Alexandron, G., Ruipérez-Valiente, J.A., Lee, S., Pritchard, D.E. (2018). Evaluating the Robustness of Learning Analytics Results Against Fake Learners. In: Pammer-Schindler, V., Pérez-Sanagustín, M., Drachsler, H., Elferink, R., Scheffel, M. (eds) Lifelong Technology-Enhanced Learning. EC-TEL 2018. Lecture Notes in Computer Science, vol 11082. Springer, Cham. https://doi.org/10.1007/978-3-319-98572-5_6

DOI: https://doi.org/10.1007/978-3-319-98572-5_6
Published: 14 August 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-98571-8
Online ISBN: 978-3-319-98572-5
eBook Packages: Computer Science, Computer Science (R0)
