Abstract
Intelligent education is a significant application of artificial intelligence. One of the key research topics in intelligence education is cognitive diagnosis, which aims to gauge the level of proficiency among students on specific knowledge concepts (e.g., Geometry). To the best of our knowledge, most of the existing cognitive models primarily focus on improving diagnostic accuracy while rarely considering fairness issues; for instance, the diagnosis of students may be affected by various sensitive attributes (e.g., region). In this paper, we aim to explore fairness in cognitive diagnosis and answer two questions: (1) Are the results of existing cognitive diagnosis models affected by sensitive attributes? (2) If yes, how can we mitigate the impact of sensitive attributes to ensure fair diagnosis results? To this end, we first empirically reveal that several well-known cognitive diagnosis methods usually lead to unfair performances, and the trend of unfairness varies among different cognitive diagnosis models. Then, we make a theoretical analysis to explain the reasons behind this phenomenon. To resolve the unfairness problem in existing cognitive diagnosis models, we propose a general fairness-aware cognitive diagnosis framework, FairCD. Our fundamental principle involves eliminating the effect of sensitive attributes on student proficiency. To achieve this, we divide student proficiency in existing cognitive diagnosis models into two components: bias proficiency and fair proficiency. We design two orthogonal tasks for each of them to ensure that fairness in proficiency remains independent of sensitive attributes and take it as the final diagnosed result. Extensive experiments on the Program for International Student Assessment (PISA) dataset clearly show the effectiveness of our framework.
Similar content being viewed by others
References
Lord F. A theory of test scores. Psychometric Monographs, 1952, 7: 84
Liu Q, Huang Z, Yin Y, et al. EKT: exercise-aware knowledge tracing for student performance prediction. IEEE Trans Knowl Data Eng, 2019, 33: 100–115
de la Torre J. DINA model and parameter estimation: a didactic. J Educational Behaval Stat, 2009, 34: 115–130
Templin J L, Henson R A. Measurement of psychological disorders using cognitive diagnosis models. Psychological Methods, 2006, 11: 287–305
Leighton J P, Gierl M J, Hunka S M. The attribute hierarchy method for cognitive assessment: a variation on Tatsuoka’s rule-space approach. J Educational Measurement, 2004, 41: 205–237
Bi H, Ma H, Huang Z, et al. Quality meets diversity: a model-agnostic framework for computerized adaptive testing. In: Proceedings of IEEE International Conference on Data Mining (ICDM), 2020. 42–51
Huang Z, Liu Q, Zhai C, et al. Exploring multi-objective exercise recommendations in online education systems. In: Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019. 1261–1270
Reckase M D. Multidimensional Item Response Theory Models. Berlin: Springer, 2009. 79–112
von Davier M. The DINA model as a constrained general diagnostic model: two variants of a model equivalency. Brit J Math Statis, 2014, 67: 49–71
Wang F, Liu Q, Chen E, et al. Neural cognitive diagnosis for intelligent education systems. In: Proceedings of the AAAI Conference on Artificial Intelligence, 2020. 6153–6161
Kizilcec R F, Lee H. Algorithmic fairness in education. 2020. ArXiv:2007.05443
Pedreshi D, Ruggieri S, Turini F. Discrimination-aware data mining. In: Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2008. 560–568
Hardt M, Price E, Srebro N. Equality of opportunity in supervised learning. In: Proceedings of the 30th International Conference on Neural Information Processing Systems, 2016. 3323–3331
Bose A, Hamilton W. Compositional fairness constraints for graph embeddings. In: Proceedings of International Conference on Machine Learning, 2019. 715–724
Shao P Y, Wu L, Chen L, et al. FairCF: fairness-aware collaborative filtering. Sci China Inf Sci, 2022, 65: 222102
Wu L, Chen L, Shao P, et al. Learning fair representations for recommendation: a graph-based perspective. In: Proceedings of the Web Conference, 2021. 2198–2208
Yao S, Huang B. Beyond parity: fairness objectives for collaborative filtering. In: Proceedings of the 31st International Conference on Neural Information Processing Systems, 2017. 2925–2934
Lee M K, Rich K. Who is included in human perceptions of AI? Trust and perceived fairness around healthcare AI and cultural mistrust. In: Proceedings of the CHI Conference on Human Factors in Computing Systems, 2021. 1–14
Goodfellow I, Pouget-Abadie J, Mirza M, et al. Generative adversarial networks. Commun ACM, 2020, 63: 139–144
Gao W, Liu Q, Huang Z, et al. RCD: relation map driven cognitive diagnosis for intelligent education systems. In: Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021. 501–510
Rudner L M. Implementing the graduate management admission test computerized adaptive test. In: Proceedings of Elements of Adaptive Testing, 2009. 151–165
Mills C N. The GRE computer adaptive test: operational issues. In: Proceedings of Computerized Adaptive Testing: Theory and Practice, 2000
Zhuang Y, Liu Q, Huang Z, et al. A robust computerized adaptive testing approach in educational question retrieval. In: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2022. 416–426
Zhuang Y, Liu Q, Ning Y, et al. Efficiently measuring the cognitive ability of LLMs: an adaptive testing perspective. 2023. ArXiv:2306.10512
Gao W, Wang H, Liu Q, et al. Leveraging transferable knowledge concept graph embedding for cold-start cognitive diagnosis. In: Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023. 983–992
Lin Z Q, Chen H X. Recommendation over time: a probabilistic model of time-aware recommender systems. Sci China Inf Sci, 2019, 62: 212105
McKinley R, Kingston N. Exploring the use of IRT equating for the GRE subject test in mathematics. ETS Res Report Ser, 1987, 1987: 1–35
Liu Q. Towards a new generation of cognitive diagnosis. In: Proceedings of the 30th International Joint Conference on Artificial Intelligence, 2021. 4961–4964
Wang F, Liu Q, Chen E, et al. NeuralCD: a general framework for cognitive diagnosis. IEEE Trans Knowl Data Eng, 2023, 35: 8312–8327
Ghosh A, Raspat J, Lan A. Option tracing: beyond correctness analysis in knowledge tracing. In: Proceedings of International Conference on Artificial Intelligence in Education, 2021. 137–149
Cheng Y, Li M, Chen H, et al. Neural cognitive modeling based on the importance of knowledge point for student performance prediction. In: Proceedings of the 16th International Conference on Computer Science & Education, 2021. 495–499
Wu J W, Shen L W, Guo W N, et al. Code recommendation for Android development: how does it work and what can be improved?. Sci China Inf Sci, 2017, 60: 092111
Chen H H, Jin H, Cui X L. Hybrid followee recommendation in microblogging systems. Sci China Inf Sci, 2017, 60: 012102
He X, Liao L, Zhang H, et al. Neural collaborative filtering. In: Proceedings of the 26th International Conference on World Wide Web, 2017. 173–182
Huang H Y, Wang J Q, Fei C, et al. A probabilistic risk assessment framework considering lane-changing behavior interaction. Sci China Inf Sci, 2020, 63: 190203
Xiao S T, Shao Y X, Li Y W, et al. LECF: recommendation via learnable edge collaborative filtering. Sci China Inf Sci, 2022, 65: 112101
Hu H C, Guo Y F, Yi P, et al. Achieving fair service with a hybrid scheduling scheme for CICQ switches. Sci China Inf Sci, 2012, 55: 689–700
Ekstrand M D, Tian M, Azpiazu I M, et al. All the cool kids, how do they fit in? Popularity and demographic biases in recommender evaluation and effectiveness. In: Proceedings of Conference on Fairness, Accountability and Transparency, 2018. 172–186
Khademi A, Lee S, Foley D, et al. Fairness in algorithmic decision making: an excursion through the lens of causality. In: Proceedings of the Web Conference, 2019. 2907–2914
Liu J H, Yu Y, Bi H L, et al. Post quantum secure fair data trading with deterability based on machine learning. Sci China Inf Sci, 2022, 65: 170308
Dwork C, Hardt M, Pitassi T, et al. Fairness through awareness. In: Proceedings of the 3rd Innovations in Theoretical Computer Science Conference, 2012. 214–226
Zafar M B, Valera I, Rogriguez M G, et al. Fairness constraints: mechanisms for fair classification. In: Proceedings of Artificial Intelligence and Statistics, 2017. 962–970
Kang J, He J, Maciejewski R, et al. Inform: individual fairness on graph mining. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2020. 379–389
Calmon F, Wei D, Vinzamuri B, et al. Optimized pre-processing for discrimination prevention. In: Proceedings of Advances in Neural Information Processing Systems, 2017
Dong Y, Kang J, Tong H, et al. Individual fairness for graph neural networks: a ranking based approach. In: Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining, 2021. 300–310
Li Y, Chen H, Fu Z, et al. User-oriented fairness in recommendation. In: Proceedings of the Web Conference, 2021. 624–632
Simon F, Małgorzata K, Beatriz P. No more failures: ten steps to equity in education. OECD Publishing, 2007. doi: https://doi.org/10.1787/9789264032606-en
Rao Y S, Zhang J Z, Zou Y, et al. An advanced operating environment for mathematics education resources. Sci China Inf Sci, 2018, 61: 098102
Warren C J E. Brown v. Board of Education. United States Reports, 1954
Hutt S, Gardner M, Duckworth A L, et al. Evaluating fairness and generalizability in models predicting on-time graduation from college applications. In: Proceedings of the 12th International Conference on Educational Data Mining (EDM 2019), 2019
Hu Q, Rangwala H. Towards fair educational data mining: a case study on detecting at-risk students. In: Proceedings of the 13th International Conference on Educational Data Mining, 2020. 431–437
Yu R, Li Q, Fischer C, et al. Towards accurate and fair prediction of college success: evaluating different sources of student data. In: Proceedings of International Conference on Educational Data Mining (EDM 2020), 2020
Li C, Xing W, Leite W. Using fair AI to predict students’ math learning outcomes in an online platform. Interactive Learn Environ, 2022. doi: https://doi.org/10.1080/10494820.2022.2115076
Gómez E, Zhang C S, Boratto L, et al. The winner takes it all: geographic imbalance and provider (un) fairness in educational recommender systems. In: Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021. 1808–1812
Embretson S E. Item Response Theory. London: Psychology Press, 2013
DiBello L V, Roussos L A, Stout W. 31a review of cognitively diagnostic assessment and a summary of psychometric models. Handbook of Statistics, 2006, 26: 979–1030
Glorot X, Bengio Y. Understanding the difficulty of training deep feedforward neural networks. In: Proceedings of the 13th International Conference on Artificial Intelligence and Statistics, 2010. 249–256
Liu Q, Wu R, Chen E, et al. Fuzzy cognitive diagnosis for modelling examinee performance. ACM Trans Intell Syst Technol, 2018, 9: 1–26
Wu C, Wu F, Wang X, et al. Fairness-aware news recommendation with decomposed adversarial learning. In: Proceedings of the AAAI Conference on Artificial Intelligence, 2021. 4462–4469
Chen J, Li H, Ding W, et al. An educational system for personalized teacher recommendation in K-12 online classrooms. In: Proceedings of International Conference on Artificial Intelligence in Education, 2021. 104–108
Acknowledgements
This work was supported in part by National Key Research and Development Program of China (Grant No. 2021YFF0901003), National Natural Science Foundation of China (Grant Nos. 61922073, U20A20229), and University Synergy Innovation Program of Anhui Province (Grant No. GXXT-2022-042).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Zhang, Z., Wu, L., Liu, Q. et al. Understanding and improving fairness in cognitive diagnosis. Sci. China Inf. Sci. 67, 152106 (2024). https://doi.org/10.1007/s11432-022-3852-0
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11432-022-3852-0