Abstract
The appropriate funding of hospital services may depend upon grouping hospital episodes into Diagnosis Related Groups (DRGs). DRGs rely on the quality of the clinical data held in administrative healthcare databases, mainly on accurate diagnosis and procedure codes. This work proposes a methodology based on unsupervised machine learning and statistical methods to generate alerts for suspected cases of up-coding and under-coding in healthcare administrative databases. The administrative database, with a DRG assigned to each hospital episode, was split into homogeneous patient subgroups by applying decision tree-based algorithms. The proportions of specific diagnosis and procedure codes were then compared within targeted subgroups to identify hospitals with abnormal distributions. Preliminary results indicate that the proposed methodology has the potential to automatically identify suspected cases of up-coding and under-coding, as well as other relevant types of discrepancies in coding practices. Nevertheless, additional evaluation from a medical perspective needs to be incorporated into the methodology.
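The comparison step described above can be illustrated with a minimal sketch. It assumes the episodes already belong to one homogeneous subgroup (for example, one leaf of a decision tree) and uses a two-proportion z-test of each hospital against all other hospitals pooled. The abstract does not name the statistical test, so the test, the function name `flag_hospitals`, and the threshold `z_threshold` are illustrative assumptions rather than the authors' method.

```python
import math
from collections import defaultdict


def flag_hospitals(episodes, code, z_threshold=3.0):
    """Flag hospitals whose share of `code` deviates from the other hospitals.

    episodes: iterable of (hospital_id, set_of_codes) pairs, all taken from
    a single patient subgroup (assumed to be built beforehand).
    Returns {hospital_id: (direction, z)} for hospitals with |z| >= z_threshold.
    Hypothetical helper: the z-test and threshold are assumptions.
    """
    totals = defaultdict(int)  # episodes per hospital
    hits = defaultdict(int)    # episodes per hospital that carry `code`
    for hospital, codes in episodes:
        totals[hospital] += 1
        if code in codes:
            hits[hospital] += 1

    n_all = sum(totals.values())
    x_all = sum(hits.values())
    alerts = {}
    for hospital, n in totals.items():
        n_rest = n_all - n
        if n_rest == 0:
            continue  # nothing to compare against
        x = hits[hospital]
        p = x / n
        p_rest = (x_all - x) / n_rest
        pooled = x_all / n_all
        # Standard error of the difference under the pooled proportion.
        se = math.sqrt(pooled * (1 - pooled) * (1 / n + 1 / n_rest))
        if se == 0:
            continue  # code present in all or in no episodes
        z = (p - p_rest) / se
        if abs(z) >= z_threshold:
            alerts[hospital] = ("high" if z > 0 else "low", round(z, 2))
    return alerts
```

A flagged hospital is only a candidate for review: a "high" proportion may point to up-coding and a "low" one to under-coding, but either can also reflect case-mix differences that the subgroup does not capture.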
Acknowledgments
Project NORTE-01-0145-FEDER-000016 (NanoSTIMA) is financed by the North Portugal Regional Operational Programme (NORTE 2020), under the PORTUGAL 2020 Partnership Agreement, and through the European Regional Development Fund (ERDF). The authors would also like to thank the Central Authority for Health Services, I.P. (ACSS) for providing access to the data.
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
About this paper
Cite this paper
Souza, J., Santos, J.V., Lopes, F., Viana, J., Freitas, A. (2018). Miscoding Alerts Within Hospital Datasets: An Unsupervised Machine Learning Approach. In: Rocha, Á., Adeli, H., Reis, L., Costanzo, S. (eds) Trends and Advances in Information Systems and Technologies. WorldCIST'18 2018. Advances in Intelligent Systems and Computing, vol 746. Springer, Cham. https://doi.org/10.1007/978-3-319-77712-2_115
DOI: https://doi.org/10.1007/978-3-319-77712-2_115
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-77711-5
Online ISBN: 978-3-319-77712-2
eBook Packages: Engineering, Engineering (R0)