Abstract:
This article explores the problems of assigning documents to a limited number of topics and automating the topic structuring of Russian educational texts. For this purpose, we compiled an original corpus of school textbooks on Social Science. We used the Latent Dirichlet Allocation model to select and compare topics in the textbooks of different grades. This approach allows the reconstruction of the matrix of topics for each textbook in the corpus. The research demonstrated the grade-ranked character of the topics in the text collection under study; in particular, topics show higher cohesion in high school. The research also offers an innovative methodology for quantitatively describing topic dynamics in the textbook collection. It allows visualization and comparison of the strategies different authors use to present educational topics. The results can benefit textbook writers as well as teachers and schoolchildren.
Date of Conference: 14-17 December 2020
Date Added to IEEE Xplore: 14 June 2021