Abstract
In this paper, we explore challenges in compiling a pedagogic resource like a textbook on a given topic from relevant Wikipedia articles, and present an approach towards assisting humans in this task. We present an algorithm that attempts to suggest the textbook structure from Wikipedia based on a set of seed concepts (chapters) provided by the user. We also conceptualize a decision support system where users can interact with the proposed structure and the corresponding Wikipedia content to improve its pedagogic value. The proposed algorithm is implemented and evaluated against the outline of online textbooks on five different subjects. We also propose a measure to quantify the pedagogic value of the suggested textbook structure.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
The term “concept” is loosely used to refer to a topic or idea. Here, we use this term interchangeably to correspond to either Wikipedia article titles or textbook topics.
- 2.
- 3.
- 4.
- 5.
- 6.
- 7.
A demonstration of the interface and a book compiled using the interface can be found at https://sites.google.com/site/compiletextbooks/.
References
Agrawal, R., Gollapudi, S., Kenthapadi, K., Srivastava, N., Velu, R.: Enriching textbooks through data mining. In: ACM DEV, p. 19 (2010)
Li, Y., Chenguang, Z.: A metric normalization of tree edit distance. Front. Comput. Sci. China 5(1), 119–125 (2011)
Pan, L., Li, C., Li, J., Tang, J.: Prerequisite relation learning for concepts in MOOCs. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, vol. 1, pp. 1447–1456 (2017)
Jain, P., Hitzler, P., Verma, K., Yeh, P. Z., Sheth, A.P.: Moving beyond sameAs with PLATO: partonomy detection for linked data. In: Proceedings of the 23rd ACM Conference on Hypertext and Social Media, pp. 33–42 (2012)
Milligan, G.W., Cooper, M.C.: An examination of procedures for determining the number of clusters in a data set. Psychometrika 50(2), 159–179 (1985)
Mateusz, P., Nikolaus, A.: Tree edit distance: robust and memory-efficient. Inf. Syst. 56, 157–173 (2016)
Page, L., Brin, S., Motwani, R., Winograd, T.: The PageRank citation ranking: bringing order to the web. Stanford InfoLab (1999)
Cilibrasi, R.L., Vitanyi, P.M.: The google similarity distance. IEEE Trans. Knowl. Data Eng. 19(3), 370–383 (2007)
Witten, I.H., Milne, D.N.: An effective, low-cost measure of semantic relatedness obtained from wikipedia links. In: Wikipedia and AI: An Evolving Synergy, pp. 25–30 (2008)
Liang, C., et al.: Bbookx: an automatic book creation framework. In: Proceedings of the 2015 ACM Symposium on Document Engineering, pp. 121–124 (2015)
Agrawal, R., Chakraborty, S., Gollapudi, S., Kannan, A., Kenthapadi, K.: Quality of textbooks: an empirical study. In: ACM Symposium on Computing for Development (2012)
Talukdar, P.P., Cohen, W.: Crowdsourced Comprehension: predicting prerequisite structure in wikipedia. In: 7th Workshop on Building Educational Applications Using NLP, pp. 307–315 (2012)
Mathew D., Eswaran, D., Chakraborti, S.: Towards creating pedagogic views from encyclopedic resources. In: 10th Workshop on Innovative Use of NLP for Building Educational Applications, pp. 190–195 (2015)
Liang, C., Wu, Z., Huang, W., Lee Giles, C.: Measuring prerequisite relations among concepts. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, EMNLP, pp. 1668–1674 (2015)
Agrawal, R., Golshan, B., Papalexakis, E.E.: Toward data-driven design of educational courses: a feasibility study. In: Proceedings of the 9th International Conference on Educational Data Mining, EDM, p. 6 (2016)
Wang, S., et al.: Using prerequisites to extract concept maps from textbooks. In: Proceedings of the 25th ACM International Conference on Information and Knowledge Management, CIKM, pp. 317–326 (2016)
Liang, C., Ye, J., Wu, W., Pursel, B., Giles, C.L.: Recovering concept prerequisite relations from university course dependencies. In: (2017) Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, pp. 4786–4791 (2016)
Levary, D., Eckmann, J., Moses, E., Tlusty, T.: Loops and self-reference in the construction of dictionaries. Phys. Rev. 2(3), 031018 (2012)
Agrawal, R., Golshan, B., Papalexakis, E.: Data-driven synthesis of study plans. Data Insights Laboratories (2015)
Negahban, S., Oh, S., Shah, D.: Rank centrality: ranking from pairwise comparisons. Oper. Res. 65(1), 266–287 (2016)
Jenks, G.F.: The data model concept in statistical mapping. Int. Yearb. Cartography 7, 186–190 (1967)
Acknowledgements
We thank Prof. Marti A. Hearst for the fruitful discussion and feedback, and the members of AIDB lab for their insightful comments. This work is partially funded by TCS Research Scholar Program, India.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
Mathew, D., Chakraborti, S. (2018). Towards Compiling Textbooks from Wikipedia. In: Mitrovic, T., Xue, B., Li, X. (eds) AI 2018: Advances in Artificial Intelligence. AI 2018. Lecture Notes in Computer Science(), vol 11320. Springer, Cham. https://doi.org/10.1007/978-3-030-03991-2_75
Download citation
DOI: https://doi.org/10.1007/978-3-030-03991-2_75
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-03990-5
Online ISBN: 978-3-030-03991-2
eBook Packages: Computer ScienceComputer Science (R0)