Towards Compiling Textbooks from Wikipedia

Mathew, Ditty; Chakraborti, Sutanu

doi:10.1007/978-3-030-03991-2_75

Ditty Mathew¹⁶ &
Sutanu Chakraborti¹⁶

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 11320))

Included in the following conference series:

Australasian Joint Conference on Artificial Intelligence

2287 Accesses
7 Altmetric

Abstract

In this paper, we explore challenges in compiling a pedagogic resource like a textbook on a given topic from relevant Wikipedia articles, and present an approach towards assisting humans in this task. We present an algorithm that attempts to suggest the textbook structure from Wikipedia based on a set of seed concepts (chapters) provided by the user. We also conceptualize a decision support system where users can interact with the proposed structure and the corresponding Wikipedia content to improve its pedagogic value. The proposed algorithm is implemented and evaluated against the outline of online textbooks on five different subjects. We also propose a measure to quantify the pedagogic value of the suggested textbook structure.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
The term “concept” is loosely used to refer to a topic or idea. Here, we use this term interchangeably to correspond to either Wikipedia article titles or textbook topics.
2.
https://en.wikipedia.org/w/index.php?title=Special:Book&bookcmd=book_creator.
3.
https://en.wikibooks.org/wiki/Wikibooks.
4.
http://www.tutorialspoint.com.
5.
https://openstax.org/details/books/precalculus.
6.
https://nlp.stanford.edu/IR-book/html/htmledition.
7.
A demonstration of the interface and a book compiled using the interface can be found at https://sites.google.com/site/compiletextbooks/.

References

Agrawal, R., Gollapudi, S., Kenthapadi, K., Srivastava, N., Velu, R.: Enriching textbooks through data mining. In: ACM DEV, p. 19 (2010)
Google Scholar
Li, Y., Chenguang, Z.: A metric normalization of tree edit distance. Front. Comput. Sci. China 5(1), 119–125 (2011)
Article MathSciNet Google Scholar
Pan, L., Li, C., Li, J., Tang, J.: Prerequisite relation learning for concepts in MOOCs. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, vol. 1, pp. 1447–1456 (2017)
Google Scholar
Jain, P., Hitzler, P., Verma, K., Yeh, P. Z., Sheth, A.P.: Moving beyond sameAs with PLATO: partonomy detection for linked data. In: Proceedings of the 23rd ACM Conference on Hypertext and Social Media, pp. 33–42 (2012)
Google Scholar
Milligan, G.W., Cooper, M.C.: An examination of procedures for determining the number of clusters in a data set. Psychometrika 50(2), 159–179 (1985)
Article Google Scholar
Mateusz, P., Nikolaus, A.: Tree edit distance: robust and memory-efficient. Inf. Syst. 56, 157–173 (2016)
Article Google Scholar
Page, L., Brin, S., Motwani, R., Winograd, T.: The PageRank citation ranking: bringing order to the web. Stanford InfoLab (1999)
Google Scholar
Cilibrasi, R.L., Vitanyi, P.M.: The google similarity distance. IEEE Trans. Knowl. Data Eng. 19(3), 370–383 (2007)
Article Google Scholar
Witten, I.H., Milne, D.N.: An effective, low-cost measure of semantic relatedness obtained from wikipedia links. In: Wikipedia and AI: An Evolving Synergy, pp. 25–30 (2008)
Google Scholar
Liang, C., et al.: Bbookx: an automatic book creation framework. In: Proceedings of the 2015 ACM Symposium on Document Engineering, pp. 121–124 (2015)
Google Scholar
Agrawal, R., Chakraborty, S., Gollapudi, S., Kannan, A., Kenthapadi, K.: Quality of textbooks: an empirical study. In: ACM Symposium on Computing for Development (2012)
Google Scholar
Talukdar, P.P., Cohen, W.: Crowdsourced Comprehension: predicting prerequisite structure in wikipedia. In: 7th Workshop on Building Educational Applications Using NLP, pp. 307–315 (2012)
Google Scholar
Mathew D., Eswaran, D., Chakraborti, S.: Towards creating pedagogic views from encyclopedic resources. In: 10th Workshop on Innovative Use of NLP for Building Educational Applications, pp. 190–195 (2015)
Google Scholar
Liang, C., Wu, Z., Huang, W., Lee Giles, C.: Measuring prerequisite relations among concepts. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, EMNLP, pp. 1668–1674 (2015)
Google Scholar
Agrawal, R., Golshan, B., Papalexakis, E.E.: Toward data-driven design of educational courses: a feasibility study. In: Proceedings of the 9th International Conference on Educational Data Mining, EDM, p. 6 (2016)
Google Scholar
Wang, S., et al.: Using prerequisites to extract concept maps from textbooks. In: Proceedings of the 25th ACM International Conference on Information and Knowledge Management, CIKM, pp. 317–326 (2016)
Google Scholar
Liang, C., Ye, J., Wu, W., Pursel, B., Giles, C.L.: Recovering concept prerequisite relations from university course dependencies. In: (2017) Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, pp. 4786–4791 (2016)
Google Scholar
Levary, D., Eckmann, J., Moses, E., Tlusty, T.: Loops and self-reference in the construction of dictionaries. Phys. Rev. 2(3), 031018 (2012)
Article Google Scholar
Agrawal, R., Golshan, B., Papalexakis, E.: Data-driven synthesis of study plans. Data Insights Laboratories (2015)
Google Scholar
Negahban, S., Oh, S., Shah, D.: Rank centrality: ranking from pairwise comparisons. Oper. Res. 65(1), 266–287 (2016)
Article MathSciNet Google Scholar
Jenks, G.F.: The data model concept in statistical mapping. Int. Yearb. Cartography 7, 186–190 (1967)
Google Scholar

Download references

Acknowledgements

We thank Prof. Marti A. Hearst for the fruitful discussion and feedback, and the members of AIDB lab for their insightful comments. This work is partially funded by TCS Research Scholar Program, India.

Author information

Authors and Affiliations

Artificial Intelligence and Databases Lab, Department of Computer Sciene and Engineering, Indian Institute of Technology Madras, Chennai, 600036, India
Ditty Mathew & Sutanu Chakraborti

Authors

Ditty Mathew
View author publications
You can also search for this author in PubMed Google Scholar
Sutanu Chakraborti
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ditty Mathew .

Editor information

Editors and Affiliations

University of Canterbury, Christchurch, New Zealand
Tanja Mitrovic
School of Engineering and Computer Science, Victoria University of Wellington, Wellington, New Zealand
Bing Xue
RMIT University, Melbourne, VIC, Australia
Xiaodong Li

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Mathew, D., Chakraborti, S. (2018). Towards Compiling Textbooks from Wikipedia. In: Mitrovic, T., Xue, B., Li, X. (eds) AI 2018: Advances in Artificial Intelligence. AI 2018. Lecture Notes in Computer Science(), vol 11320. Springer, Cham. https://doi.org/10.1007/978-3-030-03991-2_75

Download citation

DOI: https://doi.org/10.1007/978-3-030-03991-2_75
Published: 10 November 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-03990-5
Online ISBN: 978-3-030-03991-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics