skip to main content
10.1145/2383276.2383328acmotherconferencesArticle/Chapter ViewAbstractPublication PagescompsystechConference Proceedingsconference-collections
research-article

Automatic generation of SCORM compliant metadata for portable document format files

Published: 22 June 2012 Publication History

Abstract

The Shareable Content Object Reference Model (SCORM) is a widely adopted collection of specifications for web-based e-learning to which most Learning Management Systems adhere. While it allows reusability of content, it requires extensive, slow and expensive metadata annotation, and this fact prevents many content producers from properly creating and using Learning Objects. We propose an automatic metadata generation procedure that allows to label specific Learning Objects (scientific papers) with general metadata compliant to the SCORM. As some metadata are intrinsically unrelated to structure while others are strictly connected to structure, two different techniques were developed: one based on vocabularies and the other based on structural features. Results show that, in the provided context and for the "general" metadata category, the accuracy of annotations is comparable to that of a human expert.

References

[1]
L.A. Alvarez G., D. P Espinoza P. and S. G. Bucaraya A. Empaquetamiento y Visualización de Objetos de Aprendizaje SCORM en LMSs de Código Abierto. Primera Conferencia Latinoamericana de Objetos de Aprendizaje, pp. 1--10, 2006.
[2]
P. Baumgartner, H. Häfele and K. Maier-Häfele. E-Learning Standards aus didaktischer Perspektive. In: Campus 2002: Die virtuelle Hochschule in der Konsolidierungsphase. G. Bachmann, O. Haefeli und M. Kindt. Münster, Waxmann. 18: pp. 277--286, 2002.
[3]
K. Bird, and the Jorum Team. Automated Metadata - A review of existing and potential metadata automation within Jorum and an overview of other automation systems. 31st March 2006, Version 1.0, Final, Signed off by JISC and Intrallect July 2006.
[4]
O. Bohl, J. Schellhase, R. Sengler and U. Winand. The Sharable Content Object Reference Model (SCORM) - A Critical Review. Proceedings of the International Conference on Computers in Education (ICCE'02), pp. 950--951, vol. 2, 2002.
[5]
S. Brin and L. Page. The anatomy of a large-scale hypertextual Web search engine. Computer Networks and ISDN Systems, vol. 30, pp. 1--7, 1998.
[6]
L.F.H. Edvardsen, I. T. Sølvberg, T. Aalberg and H. Trætteberg. Using Automatic Metadata Generation to reduce the knowledge and time requirements for making SCORM Learning Objects. Proceedings of the 3rd IEEE International Conference on Digital Ecosystems and Technologies, pp. 392--397, 2009.
[7]
G. Giuffrida, E. C. Shek, and J. Yang. Knowledge-Based Metadata Extraction from PostScript Files. Proceedings of the Fifth ACM Conference on Digital Libraries, pp. 77--84, 2000.
[8]
V. Gonçalves and E. Carrapatoso. Web Semântica e e-Learning juntos por uma boa causa. 8th International Symposium on Computers In Education, 1: pp. 1--10, 2010.
[9]
J. Greenberg. Metadata Extraction and Harvesting: A Comparison of Two Automatic Metadata Generation Applications. Journal of Internet Cataloging, 6(4): pp. 59--82, 2004.
[10]
C. Jenkins, and D. Inman. Server-side Automatic Metadata Generation using Qualified Dublin Core and RDF. Proceedings of International Conference on Digital Libraries: Research and Practice, pp. 262--271, 2000.
[11]
A. Kawtrakul and C. Yingsaeree. A Unified Framework for Automatic Metadata Extraction from Electronic Document. Proceedings of IADLC2005, pp. 71--77, 2005.
[12]
R. Klaus, M. Dyks. Rozwiązania e-edukacji w zarządzaniu kapitałem ludzkim. Komputerowo Zintegrowane Zarządzanie R. Knosala, Oficyna Wydawnicza PTZP,vol. 1, pp. 671--675, 2010.
[13]
E.D. Liddy, E. Allen, S. Harwell, S. Corieri, O. Yilmazel, N. E. Ozgencil, A. Diekema, N. J. McCracken, J. Silverstein, and S. A. Sutton. Automatic metadata generation and evaluation. Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp.401--402, 2002.
[14]
I. Madjarov, A. Betari, B. Shishedjiev, Z. Bakkoury. Une architecture orientée services pour la création et le cheminement d'objets pédagogiques de type questionnaire. Premier Congrès International Technologies Numériques de l'Information et de Communication Educatives - Expériences et Perspectives (TNICE-EP'2007), Marrakech, Maroc, 2--4 mai 2007.
[15]
C. Ramakrishnan, A. Patnia, E. Hovy and G. A. Burns. Layout-Aware Text Extraction from Full-text PDF of Scientific Articles. Source Code for Biology and Medicine, Vol. 7, No. 1, 2012.
[16]
K. Seymore, A. McCallum and R. Rosenfeld. Learning hidden Markov model structure for information extraction. Proceedings of Workshop on Machine Learning for Information Extraction, pp. 37--42, 1999.
[17]
A. Valdivieso, V. Preti. MOODLE PER L'APPRENDIMENTO LINGUISTICO: elementi critici per una integrazione di sistema. Conferenza nazionale italiana Moodlemoot, 2010.

Cited By

View all
  • (2024)Ferramentas de geração automática e semiautomática de metadadosAutomatic and semi-automatic metadata generation toolsHerramientas de generación de metadatos automáticas y semiautomáticasInformação & Informação10.5433/1981-8920.2024v29n1p6829:1(68-98)Online publication date: 11-Dec-2024
  • (2018)Practical use of medical terminology in curriculum mappingComputers in Biology and Medicine10.1016/j.compbiomed.2015.05.00663:C(74-82)Online publication date: 28-Dec-2018
  • (2013)Generation of description metadata for video filesProceedings of the 14th International Conference on Computer Systems and Technologies10.1145/2516775.2516795(262-269)Online publication date: 28-Jun-2013

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
CompSysTech '12: Proceedings of the 13th International Conference on Computer Systems and Technologies
June 2012
440 pages
ISBN:9781450311939
DOI:10.1145/2383276
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 22 June 2012

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. LOM
  2. SCORM
  3. automatic metadata generation
  4. portable document format
  5. vector space model

Qualifiers

  • Research-article

Conference

CompSysTech'12

Acceptance Rates

Overall Acceptance Rate 241 of 492 submissions, 49%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)2
  • Downloads (Last 6 weeks)0
Reflects downloads up to 15 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Ferramentas de geração automática e semiautomática de metadadosAutomatic and semi-automatic metadata generation toolsHerramientas de generación de metadatos automáticas y semiautomáticasInformação & Informação10.5433/1981-8920.2024v29n1p6829:1(68-98)Online publication date: 11-Dec-2024
  • (2018)Practical use of medical terminology in curriculum mappingComputers in Biology and Medicine10.1016/j.compbiomed.2015.05.00663:C(74-82)Online publication date: 28-Dec-2018
  • (2013)Generation of description metadata for video filesProceedings of the 14th International Conference on Computer Systems and Technologies10.1145/2516775.2516795(262-269)Online publication date: 28-Jun-2013

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media