Abstract
To make online learning systems (OLSs) effective, it is important to make sure that the learners get the learning objects (LOs) according to their pedagogical suitability and requirements. To assess the suitability of an LO, sufficient information of it is required to be available. These information can be specified as metadata of the document. But there is a dearth of metadata defined for educational documents. Existing standard metadata models like IEEE LOM and others are promising but lack in capturing some crucial learning and pedagogical aspects of LOs. In this paper, we propose a new metadata model that has extended the IEEE LOM to provide an extensive set of metadata for LOs. The proposed metadata seem adequate to describe the contextual learning and pedagogical information of any text and web document based LO. But only specifying the metadata is not sufficient; they need to be extracted from a learning content automatically so that these information can be used by the learners and the OLSs and the educational recommendation systems. Automated extraction of metadata from e-learning contents is a non-trivial task. In view of that, we have provided extraction mechanisms for each of the specified metadata, separately. The experimental results show that the proposed extraction methods are quite accurate in identifying and retrieving the different educational metadata. The statistical inferences of the automated and manual extractions are found to have substantial similarities for each of the extracted metadata element.
Similar content being viewed by others
References
Anderson LW, Krathwohl DR, Bloom BS (2001) In: Anderson DKLW (ed) A taxonomy for learning, teaching, and assessing : a revision of Bloom’s taxonomy of educational objectives. New York, Longman
Aoki K (2000) Taxonomy of interactivity on the web. [Online]. Available: https://www.semanticscholar.org/paper/Taxonomy-of-Interactivity-on-the-Web-Aoki/ed396d88541685ec738d1b9f5ac859e377036aa7. Accessed 8 July 2020
Dey AK, Poddar B, Pramanik PKD, Debnath NC, Aljahdali S, Choudhury P (2020) Real-time learner classification using cognitive score. In: Lee G, Jin Y (eds) Proceedings of 35th International Conference on Computers and Their Applications. CATA 2020. EPiC Series in Computing, vol 69, pp 264–276
EF Education First (2020) 3000 most common words in English. EF Education First, January 2020. [Online]. Available: https://www.ef.com/in/englishresources/english-vocabulary/top-3000-words/. Accessed 14 October 2020
Flesch R (2005) How to write plain english, 1 May 2005. [Online]. Available: https://web.archive.org/web/20160712094308/http://www.mang.canterbury.ac.nz/writing_guide/writing/flesch.shtml. Accessed 15 October 2018
Flynn P, Zhou L, Maly K, Zeil S, Zubair M (2007) Automated template-based metadata extraction architecture. In: Goh DHL, Cao TH, Sølvberg IT, Rasmussen E (eds) Asian digital libraries. Looking back 10 years and forging new frontiers. ICADL 2007. Lecture Notes in Computer Science, vol 4822. Springer, Berlin, Heidelberg, pp 327–336
IEEE Computer Society (2002) IEEE standard for learning object metadata. IEEE Std 1484.12.1–2002:1–40
IEEE Computer Society (2002) 1484.12.1 IEEE standard for learning object. The Institute of Electrical and Electronics Engineers, Inc, New York
Jebali B, Farhat R (2013) Ontology-based semantic metadata extraction approach. In: Proceedings of International Conference on Electrical Engineering and Software Applications, pp 1–5
Jensen JF (1998) 'Interactivity': tracking a new concept in media and communication studies. Nordicom Rev 19(1):185–204
Kawtrakul A, Yingsaeree C (2005) A unified framework for automatic metadata extraction from electronic document. In:Proceedings of International Advanced Digital Library Conference, pp 1–8
Laurel BK (1986) Toward the design of a computer-based interactive fantasy system. In: Ph D Thesis. Ohio State University, Ohio
Laurel B (2013) Computers as theatre. Addison-Wesley Longman Publishing
Marinai S (2009) Metadata extraction from PDF papers for digital library ingest. In: Proceedings of International Conference on Document Analysis and Recognition, pp 251–255
Mukhopadhyay M, Pal S, Nayyar A, Pramanik PKD, Dasgupta N, Choudhury P (2020) Facial emotion detection to assess learner’s state of mind in an online learning system. In: Proceedings of 5th International Conference on Intelligent Information Technology. ICIIT, vol 2020, pp 107–115
Nam T-J, Park S, Verlinden J (2009) A model to conceptualize interactivity. Int J Interact Des Manuf (IJIDeM) 3:147–156
Othman N, Amiruddinb MH (2010) Different perspectives of learning styles from VARK model. In: Procedia - social and behavioral sciences. Proceedings of International Conference on Learner Diversity, vol 7, pp 652–660
Pal S, Pramanik PKD, Choudhury P (2019) A step towards smart learning: designing an interactive video-based M-learning system for educational institutes. Int J Web-Based Learn Teach Technol 14(4):26–48
Pal S, Pramanik PKD, Majumdar T, Choudhury P (2019) A semi-automatic metadata extraction model and method for video-based e-learning contents. Educ Inf Technol 24(6):3243–3268
Pal S, Mukhopadhyay M, Pramanik PKD, Choudhury P (2020) Assessing the learning difficulty of text-based learning materials. In: Satapathy S, Bhateja V, Nguyen B, Nguyen N, Le DN (eds) Frontiers in intelligent computing: theory and applications. Advances in Intelligent Systems and Computing, vol 1013. Springer, Singapore, pp 275–286
Pramanik PKD, Choudhury P, Saha A (2017) Economical supercomputing thru smartphone crowd computing: an assessment of opportunities, benefits, deterrents, and applications from India’s perspective. In: Proceedings of 4th International Conference on Advanced Computing and Communication Systems. ICACCS, 2017, pp 1–7
Pramanik PKD, Sinhababu N, Mukherjee B, Padmanaban S, Maity A, Upadhyaya BK, Holm-Nielsen JB, Choudhury P (2019) Power consumption analysis, measurement, management, and issues: a state-of-the-art review on smartphone battery and energy usage. IEEE Access 7(1):182113–182172
Rafaeli S (1988) Interactivity: From new media to communication. In: Hawkins RP, Wiemann JM, Pingree S (eds) Advancing Communication Science: Merging Mass and Interpersonal Processes. Sage, Newbury Park, pp 110–134
Rogers EM (1986) Communication technology. The New Media in Society, New York
Rouse M (2014) Document metadata. TechTarget, August 2014. [Online]. Available: http://whatis.techtarget.com/definition/document-metadata. Accessed 3 July 2018
Sedig K, Parsons P, Babanski A (2012) Towards a characterization of interactivity in visual analytics. J Multim Process Technol 3(1):12–28
SkillSoft (2018) Expertise level. SkillSoft, 18 January 2018. [Online]. Available: https://skillsoftscustomercommunit.force.com/kb/s/article/Expertise-Level. Accessed 11 October 2020
Steuer J (1995) Defining virtual reality: dimensions determining Telepresence. In: Biocca F, Levy MR (eds) Communication in the age of virtual reality. Lawrence Erlbaum Associates, Hillsdale
Tang X, Zeng Q, Cui T, Wu Z (2010) Regular expression-based reference metadata extraction from the web. In: Proceedings of IEEE 2nd Symposium on Web Society, pp 346–350
Yilmazel O, Finneran CM, Liddy ED (2004) MetaExtract: an NLP system to automatically assign metadata. In: Proceedings of Joint ACM/IEEE Conference on Digital Libraries, pp 241–242
Zhang Y, Yu Z, Liu L, Guo J, Mao C (2012) Semi-supervised expert metadata extraction based on co-training style. In: Proceedings of 9th International Conference on Fuzzy Systems and Knowledge Discovery, pp 1344–1347
Data availability
Data sharing is not applicable to this article as no datasets were generated or analysed during the current study.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Pal, S., Pramanik, P.K.D. & Choudhury, P. Enhanced metadata modelling and extraction methods to acquire contextual pedagogical information from e-learning contents for personalised learning systems. Multimed Tools Appl 80, 25309–25366 (2021). https://doi.org/10.1007/s11042-020-10380-z
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-020-10380-z