Elsevier

Computers in Biology and Medicine

Volume 67, 1 December 2015, Pages 95-103
Computers in Biology and Medicine

Construction of quality-assured infant feeding process of care data repositories: definition and design (Part 1)

https://doi.org/10.1016/j.compbiomed.2015.09.024Get rights and content

Abstract

This is the first paper of a series of two regarding the construction of data quality (DQ) assured repositories for the reuse of information on infant feeding from birth until two years old. This first paper justifies the need for such repositories and describes the design of a process to construct them from Electronic Health Records (EHR). As a result, Part 1 proposes a computational process to obtain quality-assured datasets represented by a canonical structure extracted from raw data from multiple EHR. For this, 13 steps were defined to ensure the harmonization, standardization, completion, de-duplication, and consistency of the dataset content. Moreover, the quality of the input and output data for each of these steps is controlled according to eight DQ dimensions: predictive value, correctness, duplication, consistency, completeness, contextualization, temporal-stability and spatial-stability. The second paper of the series will describe the application of this computational process to construct the first quality-assured repository for the reuse of information on infant feeding in the perinatal period aimed at the monitoring of clinical activities and research.

Introduction

The World Health Organization defines breastfeeding as the natural and healthiest way of feeding infants and young children. Exclusive breastfeeding is recommended for the first six months of life, while an adequate complementary feeding is recommended up to two years or older [1]. Currently, breastfeeding is considered one of the main determinants of maternal and child health. Having related to the prevention of risk factors for chronic diseases of childhood and preventable burden of disease [2]. However, European data show that less than 20% of babies are fed following these recommendations [3].

Given the significant health implications of these low rates, the European strategy for the protection, promotion and support of breastfeeding [4] proposes, among other actions, the widespread deployment of the Baby Friendly Hospital Initiative (BFHI), using global and coordinated actions, along with the development of control mechanisms that may facilitate its implementation. Nevertheless, the absence of these control mechanisms is the main cause of the existing gap between the evidence and the daily clinical practice [5].

In the “Virgen del Castillo” Hospital, the international guidelines described above materialized through an intervention in the improvement of the quality of maternal and child care, which allowed its accreditation as a baby friendly hospital and produced significant improvements in breastfeeding rates [6]. However, the sustainability of such interventions depends on the continuous adaptation to the local context and its systematic inclusion in routine workflows [7]. Thus the improvement group design the process of care of infant feeding (PCIF) (PIEMCA08/13 project) extending the scope to primary care to match the clinical practice from birth until two years of age to the recommendations of the evidence.

For monitoring the clinical process and activating research studies on the PCIF, it was decided to use population data from the Electronic Health Records (EHR) through the construction of a quality-assured data repository. Hence, the aim of the present work is to design the process to construct repositories on birth and infant feeding based on population data from the EHR. The construction process has been implemented by means of 13 quality-assured procedures to assess, recover or filter clinical data to enable their reuse in the monitoring of the PCIF and to study relevant questions on infant feeding.

Section snippets

Background

The current deployment of health information systems and EHR is a source of valuable data for the systematic monitoring of clinical practice, both for management and research. However, unlike in other health care fields, to our knowledge there is no experience in the use of the data from EHR on infant feeding for the implementation and control of the BFHI, although evidence shows a relationship between the convenience of use population data for monitoring hospital practices and the feasible

Results

We propose the sequential pipeline described graphically in Fig. 3 that is composed of 13 procedures, and DQ assurance before and after each one. The procedures are designed to ensure the harmonization, standardization, completion, de-duplication, and consistency of the dataset content by correcting, discarding or evaluating the tolerance of the affected data, according to the functionality of each procedure. Fig. 3 shows the phases of the process, the procedures in each phase and the quality

Discussion

In this work we have proposed a computational process to extract quality-assured repositories represented by a canonical structure from raw data from multiple EHR systems. The process consists of 13 procedures for the harmonization, standardization, completion, de-duplication, and consistency of the data repository. A DQ framework controls the effect over the repository of each procedure that affects the data in order to ensure a total quality management of the repository construction.

Conclusion

In this first study of a series of two, we have proposed and designed the construction process of quality-assured infant feeding process of care data repositories from birth until the age of two years. For this, 13 steps for the harmonization, standardization, completion, de-duplication, and consistency of the dataset content were defined. Moreover, the quality of the input and output data for each step is controlled based on eight quality dimensions: predictive value, correctness, duplication,

Conflicts of interest statement

The authors do not have any conflict of interest.

Acknowledgements

This study has been partially funded by the IBIME research group and by the EMCA Programme (Foundation for Training and Healthcare Research in the Region of Murcia and Regional Health and Consumer Authority of Murcia, Project PIEMCA08-13). The authors thank Montserrat Robles (Director of ITACA institute), the codification service of the Hospital Virgen del Castillo and the management team of Gerencia Área de Salud V-Altiplano for their valuable collaboration in general aspects of this work.

References (47)

  • A. Cattaneo et al.

    Protection, promotion and support of breast-feeding in Europe: progress from 2002 to 2007

    Public Health Nutr.

    (2010)
  • EU Project on Promotion of Breastfeeding in Europe, Protection, promotion and support of breastfeeding in Europe: a...
  • R. Garcia-de-Leon-Gonzalez et al.

    Becoming baby-friendly in Spain: a quality-improvement process

    Acta Paediatr.

    (2011)
  • A. Scott et al.

    ProvenCare perinatal: a model for delivering evidence/guideline-based care for perinatal populations

    Jt. Comm. J. Qual. Patient Saf.

    (2011)
  • R. Pérez-Escamilla et al.

    Breastfeeding protection, promotion, and support in the United States: a time to nudge, a time to measure

    J. Hum. Lact.

    (2012)
  • N. Klazinga et al.

    Health services research related to performance indicators and benchmarking in Europe

    J. Health Serv. Res. Policy

    (2011)
  • World Health Organization, Innovative care for chronic conditions: building blocks for action, Geneva, World Health...
  • American Academy of Pediatrics

    Steering committee on quality improvement and management and committee on practice and ambulatory medicine. Principles for the development and use of quality measures

    Pediatrics

    (2008)
  • W.K. Yih et al.

    Active surveillance for adverse events: the experience of the Vaccine Safety Datalink project

    Pediatrics

    (2011)
  • J.F. Burnum

    The misinformation era: the fall of the medical record

    Ann. Int. Med.

    (1989)
  • C. Weng et al.

    Using EHRs to integrate research with patient care: promises and challenges

    J. Am. Med. Inform. Assoc.

    (2012)
  • N.G. Weiskopf et al.

    Methods and dimensions of electronic health record data quality assessment: enabling reuse for clinical research

    J. Am Med. Inform. Assoc.

    (2013)
  • R.Y. Wang et al.

    Beyond accuracy: what data quality means to data consumers

    J. Manag. Inform. Syst.

    (1996)
  • Cited by (7)

    • Robust estimation of infant feeding indicators by data quality assessment of longitudinal electronic health records from birth up to 18 months of life

      2021, Computer Methods and Programs in Biomedicine
      Citation Excerpt :

      The policy included a population monitoring from EHR data to assess not only the maintenance of BFHI-based quality criteria, but also the promotion of breastfeeding in school, work and social settings that was taking place. In previous works, the authors proposed a DQ assessment and assurance process for EHR data [33,34], including an application to perinatal data [35] based in 8 DQ dimensions: predictive value, correctness, duplication, consistency, completeness, contextualization, temporal-stability, and spatial-stability. Later, they designed a pilot project in Spain [36] that established the basis for reliable monitoring of infant feeding indicators, focusing on data standardization and data quality analysis, mainly of perinatal variables, without addressing the analysis of the longitudinal feeding register.

    • Construction of quality-assured infant feeding process of care data repositories: Construction of the perinatal repository (Part 2)

      2016, Computers in Biology and Medicine
      Citation Excerpt :

      Moreover, this quality-assured repository can be used as a “learning healthcare system” as proposed by the U.S. Institute of Medicine [21] and others that make use of routinely collected data in order to guide care. Based on the DQ procedures implemented in Part 1 [16], for creating our repository the next resources were required: (1) 70 person-hours for the definition and parameterization of DQ rules along with several consensus meetings; (2) the homogenization and stabilization of the raw data required 10 person-hours. This was the optimized using the P3 and P6 procedures; and (3) the semantic analysis required 30 person-hours.

    • Data Quality in health records: A literature review

      2021, Iberian Conference on Information Systems and Technologies, CISTI
    • Usages of the ContSys Standard: A Position Paper

      2021, Communications in Computer and Information Science
    View all citing articles on Scopus
    View full text