Abstract
During the execution of business processes involving various organizations, Master Data is usually shared and exchanged. It is necessary to keep appropriate levels of quality in these Master Data, in order to prevent defects and failures in the business processes. A way to support the decision about the usage of data in business processes is to include information about the level of quality alongside the Master Data. ISO/TS 8000 parts 100 to 140, may support the provision of this kind of information in a usable manner. Specifically I8K, a reference implementation from academic sources of the aforementioned standard parts (ISO/TS 8000:100-140), may be used for this objective. Regrettably, I8K is not aimed to support the assessment of large Master Data volumes and does not reach the required efficiency in Big Data surroundings. This paper describe an extension of I8K to resolve those problems of efficiency in Big Data projects.
Keywords
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Mohanty, S., Jagadeesh, M., Srivatsa, H.: Big Data Imperatives. Apress, New York (2013)
Redman, T.C., Blanton, A.: Data Quality for the Information Age. Artech House Inc., London (1997)
Loshin, D.: Master Data Management. Morgan Kaufmann, San Francisco (2010)
ISO/TS: ISO 8000-100: Data Quality - Part 100: Master data: Exchange of charateristic data: Overview, ed. (2009)
ISO/TS: ISO 8000-110, Data quality - Part 110: Master data: Exchange of characteristic data: Syntax, semantic encoding, and conformance to data specification., ed. (2009)
ISO/TS: ISO/TS 8000-120, Data quality - Part 120: Master data: Exchange of characteristic data: Provenance, ed. (2009)
ISO/TS: ISO/TS 8000-130, Data quality — Part 130: Master data: Exchange of characteristic data: Accuracy, ed. (2009)
ISO/TS: ISO/TS 8000-140, Data quality — Part 140: Master data: Exchange of characteristic data: Completeness, ed. (2009)
Caballero, I., Bermejo, I., Parody, L., López, M.T.G., Gasca, R.M., Piattini, M.: SLA4DQ-I8K: Acuerdos a Nivel de Servicio para Calidad de Datos en Intercambios de Datos Maestros regulados por ISO 8000-1x0, JCIS (2014)
Caballero, I., Bermejo, I., López, M.T.G., Gasca, R.M., Piattini, M.: I8K: An Implementation Of ISO 8000-1X0. In: 17th International Conference on Information Quality (ICIQ) (2013)
Chen, C.P., Zhang, C.-Y.: Data-intensive applications, challenges, techniques and technologies: a survey on big data. Inf. Sci. 275, 314–347 (2014)
The Apache Software Foundation, Apache Hadoop, 04 May 2015. https://hadoop.apache.org
The Apache Software Foundation, Map Reduce (2015). https://hadoop.apache.org/docs/current/hadoop-mapreduce-client/hadoop-mapreduce-client-core/MapReduceTutorial.html
Borek, A., Parlikad, A.K., Webb, J., Woodall, P.: Total Information Risk Management: Maximizing The Value Of Data And Information Assets. Newnes, Oxford (2013)
Bermejo, I.: Bachellor dissertation thesis I8K: Arquitectura de Servicios para la Gestión de la Calidad de los Datos: Una implementación de ISO 8000:2009-100 (2013)
The Apache Software Foundation, HDFS, 11 March 2015. http://hadoop.apache.org/docs/r1.2.1/hdfs_design.html
Acknowledgements
This work has been partially funded by GEODAS-BC project (Ministerio de Economía y Competitividad y Fondo Europeo de Desarrollo Regional FEDER, TIN2012-37493-C03-01); SERENIDAD project (Consejería de Educación, Ciencia y Cultura de la Junta de Comunidades de Castilla La Mancha, y Fondo Europeo de Desarrollo Regional FEDER, PEII-2014-045-P); VILMA project (Consejería de Educación, Ciencia y Cultura de la Junta de Comunidades de Castilla La Mancha, y Fondo Europeo de Desarrollo Regional FEDER, PEII-2014-048-P); GLOBALIA project (Consejería de Educación, Ciencia y Cultura de la Junta de Comunidades de Castilla La Mancha, de la Junta de Comunidades de Castilla La Mancha, y Fondo Europeo de Desarrollo Regional FEDER, PEII-2014-038-P) and CGT – DESARROLLO GLOBAL DEL SOFTWARE (12 FEB 2014).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Rivas, B., Merino, J., Serrano, M., Caballero, I., Piattini, M. (2015). I8K|DQ-BigData: I8K Architecture Extension for Data Quality in Big Data. In: Jeusfeld, M., Karlapalem, K. (eds) Advances in Conceptual Modeling. ER 2015. Lecture Notes in Computer Science(), vol 9382. Springer, Cham. https://doi.org/10.1007/978-3-319-25747-1_17
Download citation
DOI: https://doi.org/10.1007/978-3-319-25747-1_17
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-25746-4
Online ISBN: 978-3-319-25747-1
eBook Packages: Computer ScienceComputer Science (R0)