Skip to main content
Log in

Enhanced metadata modelling and extraction methods to acquire contextual pedagogical information from e-learning contents for personalised learning systems

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

To make online learning systems (OLSs) effective, it is important to make sure that the learners get the learning objects (LOs) according to their pedagogical suitability and requirements. To assess the suitability of an LO, sufficient information of it is required to be available. These information can be specified as metadata of the document. But there is a dearth of metadata defined for educational documents. Existing standard metadata models like IEEE LOM and others are promising but lack in capturing some crucial learning and pedagogical aspects of LOs. In this paper, we propose a new metadata model that has extended the IEEE LOM to provide an extensive set of metadata for LOs. The proposed metadata seem adequate to describe the contextual learning and pedagogical information of any text and web document based LO. But only specifying the metadata is not sufficient; they need to be extracted from a learning content automatically so that these information can be used by the learners and the OLSs and the educational recommendation systems. Automated extraction of metadata from e-learning contents is a non-trivial task. In view of that, we have provided extraction mechanisms for each of the specified metadata, separately. The experimental results show that the proposed extraction methods are quite accurate in identifying and retrieving the different educational metadata. The statistical inferences of the automated and manual extractions are found to have substantial similarities for each of the extracted metadata element.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18
Fig. 19
Fig. 20
Fig. 21
Fig. 22
Fig. 23
Fig. 24
Fig. 25
Fig. 26
Fig. 27
Fig. 28
Fig. 29

Similar content being viewed by others

Notes

  1. https://www.dublincore.org/specifications/dublin-core/dces/

  2. https://www.imsglobal.org/metadata/index.html

  3. http://www.mandela.ac.za/cyberhunts/bloom.htm

  4. https://poi.apache.org/index.html

  5. https://pdfbox.apache.org/

  6. https://jsoup.org/

  7. https://jena.apache.org/

References

  1. Anderson LW, Krathwohl DR, Bloom BS (2001) In: Anderson DKLW (ed) A taxonomy for learning, teaching, and assessing : a revision of Bloom’s taxonomy of educational objectives. New York, Longman

  2. Aoki K (2000) Taxonomy of interactivity on the web. [Online]. Available: https://www.semanticscholar.org/paper/Taxonomy-of-Interactivity-on-the-Web-Aoki/ed396d88541685ec738d1b9f5ac859e377036aa7. Accessed 8 July 2020

  3. Dey AK, Poddar B, Pramanik PKD, Debnath NC, Aljahdali S, Choudhury P (2020) Real-time learner classification using cognitive score. In: Lee G, Jin Y (eds) Proceedings of 35th International Conference on Computers and Their Applications. CATA 2020. EPiC Series in Computing, vol 69, pp 264–276

  4. EF Education First (2020) 3000 most common words in English. EF Education First, January 2020. [Online]. Available: https://www.ef.com/in/englishresources/english-vocabulary/top-3000-words/. Accessed 14 October 2020

  5. Flesch R (2005) How to write plain english, 1 May 2005. [Online]. Available: https://web.archive.org/web/20160712094308/http://www.mang.canterbury.ac.nz/writing_guide/writing/flesch.shtml. Accessed 15 October 2018

  6. Flynn P, Zhou L, Maly K, Zeil S, Zubair M (2007) Automated template-based metadata extraction architecture. In: Goh DHL, Cao TH, Sølvberg IT, Rasmussen E (eds) Asian digital libraries. Looking back 10 years and forging new frontiers. ICADL 2007. Lecture Notes in Computer Science, vol 4822. Springer, Berlin, Heidelberg, pp 327–336

  7. IEEE Computer Society (2002) IEEE standard for learning object metadata. IEEE Std 1484.12.1–2002:1–40

  8. IEEE Computer Society (2002) 1484.12.1 IEEE standard for learning object. The Institute of Electrical and Electronics Engineers, Inc, New York

  9. Jebali B, Farhat R (2013) Ontology-based semantic metadata extraction approach. In: Proceedings of International Conference on Electrical Engineering and Software Applications, pp 1–5

  10. Jensen JF (1998) 'Interactivity': tracking a new concept in media and communication studies. Nordicom Rev 19(1):185–204

  11. Kawtrakul A, Yingsaeree C (2005) A unified framework for automatic metadata extraction from electronic document. In:Proceedings of International Advanced Digital Library Conference, pp 1–8

  12. Laurel BK (1986) Toward the design of a computer-based interactive fantasy system. In: Ph D Thesis. Ohio State University, Ohio

  13. Laurel B (2013) Computers as theatre. Addison-Wesley Longman Publishing

  14. Marinai S (2009) Metadata extraction from PDF papers for digital library ingest. In: Proceedings of International Conference on Document Analysis and Recognition, pp 251–255

  15. Mukhopadhyay M, Pal S, Nayyar A, Pramanik PKD, Dasgupta N, Choudhury P (2020) Facial emotion detection to assess learner’s state of mind in an online learning system. In: Proceedings of 5th International Conference on Intelligent Information Technology. ICIIT, vol 2020, pp 107–115

  16. Nam T-J, Park S, Verlinden J (2009) A model to conceptualize interactivity. Int J Interact Des Manuf (IJIDeM) 3:147–156

  17. Othman N, Amiruddinb MH (2010) Different perspectives of learning styles from VARK model. In: Procedia - social and behavioral sciences. Proceedings of International Conference on Learner Diversity, vol 7, pp 652–660

  18. Pal S, Pramanik PKD, Choudhury P (2019) A step towards smart learning: designing an interactive video-based M-learning system for educational institutes. Int J Web-Based Learn Teach Technol 14(4):26–48

  19. Pal S, Pramanik PKD, Majumdar T, Choudhury P (2019) A semi-automatic metadata extraction model and method for video-based e-learning contents. Educ Inf Technol 24(6):3243–3268

  20. Pal S, Mukhopadhyay M, Pramanik PKD, Choudhury P (2020) Assessing the learning difficulty of text-based learning materials. In: Satapathy S, Bhateja V, Nguyen B, Nguyen N, Le DN (eds) Frontiers in intelligent computing: theory and applications. Advances in Intelligent Systems and Computing, vol 1013. Springer, Singapore, pp 275–286

  21. Pramanik PKD, Choudhury P, Saha A (2017) Economical supercomputing thru smartphone crowd computing: an assessment of opportunities, benefits, deterrents, and applications from India’s perspective. In: Proceedings of 4th International Conference on Advanced Computing and Communication Systems. ICACCS, 2017, pp 1–7

  22. Pramanik PKD, Sinhababu N, Mukherjee B, Padmanaban S, Maity A, Upadhyaya BK, Holm-Nielsen JB, Choudhury P (2019) Power consumption analysis, measurement, management, and issues: a state-of-the-art review on smartphone battery and energy usage. IEEE Access 7(1):182113–182172

  23. Rafaeli S (1988) Interactivity: From new media to communication. In: Hawkins RP, Wiemann JM, Pingree S (eds) Advancing Communication Science: Merging Mass and Interpersonal Processes. Sage, Newbury Park, pp 110–134

  24. Rogers EM (1986) Communication technology. The New Media in Society, New York

  25. Rouse M (2014) Document metadata. TechTarget, August 2014. [Online]. Available: http://whatis.techtarget.com/definition/document-metadata. Accessed 3 July 2018

  26. Sedig K, Parsons P, Babanski A (2012) Towards a characterization of interactivity in visual analytics. J Multim Process Technol 3(1):12–28

  27. SkillSoft (2018) Expertise level. SkillSoft, 18 January 2018. [Online]. Available: https://skillsoftscustomercommunit.force.com/kb/s/article/Expertise-Level. Accessed 11 October 2020

  28. Steuer J (1995) Defining virtual reality: dimensions determining Telepresence. In: Biocca F, Levy MR (eds) Communication in the age of virtual reality. Lawrence Erlbaum Associates, Hillsdale

  29. Tang X, Zeng Q, Cui T, Wu Z (2010) Regular expression-based reference metadata extraction from the web. In: Proceedings of IEEE 2nd Symposium on Web Society, pp 346–350

  30. Yilmazel O, Finneran CM, Liddy ED (2004) MetaExtract: an NLP system to automatically assign metadata. In: Proceedings of Joint ACM/IEEE Conference on Digital Libraries, pp 241–242

  31. Zhang Y, Yu Z, Liu L, Guo J, Mao C (2012) Semi-supervised expert metadata extraction based on co-training style. In: Proceedings of 9th International Conference on Fuzzy Systems and Knowledge Discovery, pp 1344–1347

Download references

Data availability

Data sharing is not applicable to this article as no datasets were generated or analysed during the current study.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Pijush Kanti Dutta Pramanik.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Pal, S., Pramanik, P.K.D. & Choudhury, P. Enhanced metadata modelling and extraction methods to acquire contextual pedagogical information from e-learning contents for personalised learning systems. Multimed Tools Appl 80, 25309–25366 (2021). https://doi.org/10.1007/s11042-020-10380-z

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-020-10380-z

Keywords

Navigation