Skip to main content

Advertisement

Log in

Using Machine Learning Classifiers to Assist Healthcare-Related Decisions: Classification of Electronic Patient Records

  • Original Paper
  • Published:
Journal of Medical Systems Aims and scope Submit manuscript

Abstract

Surveillance Levels (SLs) are categories for medical patients (used in Brazil) that represent different types of medical recommendations. SLs are defined according to risk factors and the medical and developmental history of patients. Each SL is associated with specific educational and clinical measures. The objective of the present paper was to verify computer-aided, automatic assignment of SLs. The present paper proposes a computer-aided approach for automatic recommendation of SLs. The approach is based on the classification of information from patient electronic records. For this purpose, a software architecture composed of three layers was developed. The architecture is formed by a classification layer that includes a linguistic module and machine learning classification modules. The classification layer allows for the use of different classification methods, including the use of preprocessed, normalized language data drawn from the linguistic module. We report the verification and validation of the software architecture in a Brazilian pediatric healthcare institution. The results indicate that selection of attributes can have a great effect on the performance of the system. Nonetheless, our automatic recommendation of surveillance level can still benefit from improvements in processing procedures when the linguistic module is applied prior to classification. Results from our efforts can be applied to different types of medical systems. The results of systems supported by the framework presented in this paper may be used by healthcare and governmental institutions to improve healthcare services in terms of establishing preventive measures and alerting authorities about the possibility of an epidemic.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

References

  1. Salles, R. F., Análise de um programa de intervenção com bebês e famílias atendidas em unidades básicas de saúde—SUS [dissertation]. São Carlos (SP): Universidade Federal de São Carlos, 2001.

    Google Scholar 

  2. Panico, S. R. G., Canziani, M. L., and Guerchon, N., Políticas Públicas Municipais. In: Panico, S. R. G., (Ed.), Indicadores Nipe: Subsídios para Políticas Municipais de Saúde. São Carlos: NIPE; 1997.

    Google Scholar 

  3. Pollettini, J. T., Miranda, G. H. B., Goularte, R., Panico, S. R. G., Daneluzzi, J. C., and Macedo, A. A., Sistema de Informação Geográfica: uma Abordagem Integrada a Sistemas de Informação em Saúde. In: XII Congresso Brasileiro de Informática em Saúde (CBIS), 2010. Porto de Galinhas, PE. Anais do CBIS 2010,18–22 outubro de 2010.

  4. Dietterich, T. G., Limitations on inductive learning (extended abstract), 1997. web.engr.oregonstate.edu/~tgd/publications/ml89-limits.ps.gz, Visited Apr. 2012.

  5. Kohavi, R. A., Study of cross-validation and bootstrap for accuracy estimation and model selection. In: Int J Artif Intell Tool. 6(4):537–566, 1997.

    Article  Google Scholar 

  6. Opitz, D., and Maclin, R., Popular ensemble methods: An empirical study. 11:169–198, 1999.

  7. Schaffer, C., A conservation law for generalization performance. In: Cohen, W. W., and Hirsh, H., (Eds.), Proceedings of the Eleventh International Conference on Machine Learning. New Brunswick, New Jersey, 1994, p 259–265.

  8. Witten, I. H., and Frank, E., Data mining: practical machine learning tools and techniques. 2nd ed. San Francisco: Morgan Kaufmann, 2005.

    MATH  Google Scholar 

  9. A.R. F. F. (ARFF). Wiki article ARFF. http://www.cs.waikato.ac.nz/ml/weka/arff.html, Visited Feb. 2010.

  10. Porter, M., An algorithm for suffix stripping. Program. 14(3):130–7, 1980.

    Article  Google Scholar 

  11. Rocchio, J. J., Relevance feedback in information retrieval. In: Salton, G., (Ed.), The Smart Retrieval System—Experiments in Automatic Document Processing. Upper Saddle River: Prentice-Hall, Inc., 1971.

    Google Scholar 

  12. Pollettini, J. T., Nicolas, F. P., Panico, S. R. G., Daneluzzi, J. C., Tinós, R., Baranauskas, J. A., and Macedo, A. A., A software architecture-based framework supporting suggestion of medical surveillance level from classification of electronic patient records. In: Proc of the 12th IEEE International Conference on Computational Science and Engineering. Vancouver, Canada: IEEE Computer Society, 2009, p 166–173. doi:10.1109/CSE.2009.231

  13. Costa, T. M., Daneluzzi, J. C., Panico, S. R. G., Felipe, J. C., Projeto de Dados de Sistema para Integração Longitudinal de Informações e Procedimentos em Centros Médicos. In: Proceedings of XI Congresso Brasileiro de Informática em Saúde, 2008. Campos do Jordão (SP): CBIS, 2008. 6p.

  14. Costa, T. M., Ruiz, E. E. S., and Panico, S. R. G., Projeto interdisciplinar para a criação de uma ferramenta computacional que facilite a pesquisa acadêmica e o acompanhamento da saúde e do desenvolvimento de adolescentes em atenção básica à saúde. Relatório de Iniciação Científica apresentado à Fundação de Amparo ao Ensino e Pesquisa Aplicada do HCFMRP. Ribeirão Preto (SP), 2006.

  15. de Paula, D. S., Panico, S. R. G., Daneluzzi, J. C., Ruiz, E. E. S., Felipe, J. C., and Macedo, A. A., Sistema de Informação de Apoio ao Programa de Educação para Pais e Famílias. In: Proceedings of XI Congresso Brasileiro de Informática em Saúde, 2008. Campos do Jordão (SP): CBIS, 2008. 6p.

  16. Pollettini, J. T., Tinós, R., Panico, S. R. G., Daneluzzi, J. C., and Macedo, A. A., Classificação automática de pacientes para atendimento médico pediátrico multidisciplinar a partir do seu Grau de Vigilância. In: VIII Workshop de Informática Médica (evento paralelo ao Congresso da Sociedade Brasileira de Computação), 2008, Belém-Pará-Brazil. Anais do VIII Workshop de Informática Médica, 2008. p. 61–70.

  17. Pollettini, J. T., Tinós, R., Panico, S. R. G., Daneluzzi, J. C., and Macedo, A. A., Vigilância em atenção básica à saúde a partir do uso de relevance feedback para classificação de pacientes em diferentes níveis de cuidado em saúde. In: Workshop de Informática Médica (evento paralelo ao XXIX Congresso da Sociedade Brasileira de Computação), 2009, Bento Gonçalves. IX Workshop de Informática Médica (WIM 2009), 2009. p. 1945–1954.

  18. Demsar, J., Statistical comparisons of classifiers over multiple data sets. The Journal of Machine Learning Research (JMR). 7:1–30, 2006.

    MathSciNet  MATH  Google Scholar 

  19. Provost, F., and Domingos, P., Tree induction for probability-based ranking. Mach Learn. 52(3):199–215, 2003.

    Article  MATH  Google Scholar 

  20. Batista, G., Prati, R., and Monard, M., A study of the behavior of several methods for balancing machine learning training data. SIGKDD Explorations. 6:20–9, 2004.

    Article  Google Scholar 

  21. Zmiri, D., Shahar, Y., and Taieb-Maimon, M., Classification of patients by severity grades during triage in the emergency department using data mining methods. J Eval Clin Pract. 18(2):378–388, 2012. doi:10.1111/j.1365-2753.2010.01592.x. Available at: http://onlinelibrary.wiley.com/doi/10.1111/j.1365-2753.2010.01592.x/abstract, Visited Apr. 2012.

  22. Chacón, M., and Luci, O., Patients classification by risk using cluster analysis and genetic algorithms. In: Progress in Pattern Recognition, Speech and Image Analysis. Lecture Notes in Computer Science. 2905/2003:350–358, 2003. doi:10.1007/978-3-540-24586-5_43. Available at: http://www.springerlink.com/content/02m4u30ktkfrfvlm/, Visited Apr. 2012.

  23. Tranquitelli, A. M., and Padilha, K. G., Sistemas de classificação de pacientes como instrumentos de gestão em Unidades de Terapia Intensiva. Rev. Esc. Enferm. USP [online]. 41(1):141–146, 2007. ISSN 0080–6234. doi: http://dx.doi.org/10.1590/S0080-62342007000100019. Visited Apr. 2012.

  24. Dindo, D., Demartines, N., and Clavien, P. A., Classification of surgical complications: A new proposal with evaluation in a cohort of 6336 patients and results of a survey. Ann Surg. 240(2):205–213, 2004. doi:10.1097/01.sla.0000133083.54934.ae. Visited Apr. 2012.

    Article  Google Scholar 

Download references

Acknowledgments

The authors would like to thank FAPESP and RUSP for the financial support. The authors would also like to acknowledge Professor Evandro Ruiz, Gisele Miranda, Professor Joaquim Felipe, Renato Pedigoni, Professor Rudinei Goularte, Thiago da Costa, and everyone at Vila Lobato.

Conflict of interest statement

All authors declare that they have no conflicts of interest.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Alessandra A. Macedo.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Pollettini, J.T., Panico, S.R.G., Daneluzzi, J.C. et al. Using Machine Learning Classifiers to Assist Healthcare-Related Decisions: Classification of Electronic Patient Records. J Med Syst 36, 3861–3874 (2012). https://doi.org/10.1007/s10916-012-9859-6

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10916-012-9859-6

Keywords

Navigation