skip to main content
10.1145/3027385.3029462acmotherconferencesArticle/Chapter ViewAbstractPublication PageslakConference Proceedingsconference-collections
poster

Mining knowledge components from many untagged questions

Published:13 March 2017Publication History

ABSTRACT

An ongoing study is being run to ensure that the McGraw-Hill Education LearnSmart platform teaches students as efficiently as possible. The first step in doing so is to identify what Knowledge Components (KCs) exist in the content; while the content is tagged by experts, these tags need to be re-calibrated periodically.

LearnSmart courses are organized into chapters corresponding to those found in a textbook; each chapter can have anywhere from about a hundred to a few thousand questions. The KC extraction algorithms proposed by Barnes [1] and Desmarais et al [3] are applied on a chapter-by-chapter basis. To assess the ability of each mined q matrix to describe the observed learning, the PFA model of Pavlik et al [4] is fitted to it and a cross-validated AUC is calculated. The models are assessed based on whether PFA's predictions of student correctness are accurate.

Early results show that both algorithms do a reasonable job of describing student progress, but q matrices with very different numbers of KCs fit observed data similarly well. Consequently, further consideration is required before automated extraction is practical in this context.

References

  1. T. Barnes. The q-matrix method: Mining student response data for knowledge. In Proceedings of the AAAI-2005 Workshop on Educational Data Mining, pages 39--46. Assocation for Advancement of Artificial Intelligence, Jul 2005.Google ScholarGoogle Scholar
  2. A. T. Corbett and J. R. Anderson. Knowledge tracing: Modeling the acquisition of procedural knowledge. User modeling and user-adapted interaction, 4(4):253--278, 1994.Google ScholarGoogle Scholar
  3. M. C. Desmarais, B. Beheshti, and R. Naceur. Item to skills mapping: Deriving a conjunctive q-matrix from data. In International Conference on Intelligent Tutoring Systems, pages 454--463. Springer Science & Business Media, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. P. I. Pavlik, H. Cen, and K. R. Koedinger. Performance factors analysis-a new alternative to knowledge tracing. In Proceedings of the 2009 Conference on Artificial Intelligence in Education: Building Learning Systems That Care: From Knowledge Representation to Affective Modelling, pages 531--538, Amsterdam, The Netherlands, The Netherlands, 2009. IOS Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel, P. Prettenhofer, R. Weiss, V. Dubourg, J. Vanderplas, A. Passos, D. Cournapeau, M. Brucher, M. Perrot, and E. Duchesnay. Scikit-learn: Machine learning in Python. Journal of Machine Learning Research, 12:2825--2830, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. K. K. Tatsuoka. Rule space: An approach for dealing with misconceptions based on item response theory. Journal of educational measurement, 20(4):345--354, 1983.Google ScholarGoogle Scholar
  7. Z. Zhang, T. Li, C. Ding, and X. Zhang. Binary matrix factorization with applications. In Seventh IEEE International Conference on Data Mining (ICDM 2007), pages 391--400. Institute of Electrical and Electronics Engineers, Oct 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Mining knowledge components from many untagged questions

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Other conferences
            LAK '17: Proceedings of the Seventh International Learning Analytics & Knowledge Conference
            March 2017
            631 pages
            ISBN:9781450348706
            DOI:10.1145/3027385

            Copyright © 2017 Owner/Author

            Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 13 March 2017

            Check for updates

            Qualifiers

            • poster

            Acceptance Rates

            LAK '17 Paper Acceptance Rate36of114submissions,32%Overall Acceptance Rate236of782submissions,30%

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader