Skip to main content
Log in

An abnormal surgical record recognition model with keywords combination patterns based on TextRank for medical insurance fraud detection

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Increasing insurance fraud has resulted in the loss of large amounts of money, making it difficult to expand insurance coverage and scale. This phenomenon is particularly acute in the field of health insurance. Medical insurance fraud is the falsification of medical records to obtain medical insurance funds or medical insurance benefits. Therefore, effective detection of health insurance fraud is of great importance for the rational use of health insurance funds. To address the frequent violations and frauds in health insurance, this paper proposes a keyword combination-based approach for health insurance fraud detection. First, a medical dictionary is built by TextRank to segment the surgical procedure text and extract the surgical keywords, then, the synonyms corresponding to each keyword are extracted from the electronic medical record data to form a keyword combination pattern as the final detection rule, and finally, a medical insurance fraud detection model is built on this basis. In this paper, data on acute myocardial infarction and unstable angina were selected for examination, with 1371 and 1787 cases, respectively. The performance of the model was evaluated by the coverage rate and compared experimentally with the TF-IDF and LDA algorithms. The experiments also prove the efficiency and advancedness of the algorithm in this paper. In the case of acute myocardial infarction, the method in this paper improved the coverage rate by 23.77% and 9.4% compared with the TF-IDF and LDA methods respectively. In the case of unstable angina, the coverage of the method in this paper was improved by 20.21% compared to both TF-IDF and LDA methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2

Similar content being viewed by others

Data Availability

The patient population data used to support the findings of this study have not been made available because the data are supplied by Cancer Hospital of Liaoning under license and so cannot be made freely available. Requests for access to these data should be made to the corresponding author.

References

  1. Chiu C-C, Tsai C-Y (2004) A web services-based collaborative scheme for credit card fraud detection. In: IEEE international conference on e-technology, e-commerce and e-service, 2004. EEE ’04. 2004, pp 177–181

  2. Ilango V et al (2020) A time-efficient model for detecting fraudulent health insurance claims using artificial neural networks. In: 2020 International Conference on System, Computation, Automation and Networking (ICSCAN), IEEE, pp 1–6

  3. Jingzhong W, Tongxiang Q (2015) Focused topic web crawler based on improved tf-idf alogorithm. J Comput Appl 35(10):2901–2904,2919

    Google Scholar 

  4. Li W, Zhao J (2016) Textrank algorithm by exploiting wikipedia for short text keywords extraction. In: 2016 3rd international conference on information science and control engineering (ICISCE), IEEE, pp 683–686

  5. Lielei C, Hui F (2018) Keyphrases automatic extraction from the abstracts of English scientific papers based on Scopus retrieval. Journal of Nanjing University (Natural Science), pp 604–611

  6. Liu J, Bier E, Wilson A, Honda T, Kumar S, Gilpin L, Guerra-Gomez J, Davies D (2015) Graph analysis for detecting fraud, waste, and abuse in healthcare data. In: AAAI’15 proceedings of the twenty-ninth AAAI conference on artificial intelligence, pp 3912–3919

  7. Lloyd JL, Wellman NS (2015) Older americans act nutrition programs: a community-based nutrition program helping older adults remain at home. J Nutr Gerontol Geriatr 34(2):90–109

    Article  Google Scholar 

  8. Matloob I, Khan S (2019) A framework for fraud detection in government supported national healthcare programs. In: 2019 11th international conference on electronics, computers and artificial intelligence (ECAI)

  9. Mihalcea R, Tarau P (2004) Textrank: bringing order into text. In: Proceedings of the 2004 conference on empirical methods in natural language processing, pp 404–411

  10. Pandey P, Saroliya A, Kumar R (2018) Analyses and detection of health insurance fraud using data mining and predictive modeling techniques

  11. Rosen-Zvi M, Griffiths T, Steyvers M, Smyth P (2004) The author-topic model for authors and documents. In: UAI ’04 Proceedings of the 20th conference on Uncertainty in artificial intelligence, pp 487–494

  12. Soleymani MH, Yaseri M, Farzadfar F, Mohammadpour A, Sharifi F, Kabir MJ (2018) Detecting medical prescriptions suspected of fraud using an unsupervised data mining algorithm. DARU 26(2):209–214

    Article  Google Scholar 

  13. Wei L, Ya P (2007) Performance comparison and analysis of several general text classification algorithms. J Hunan Univ(Nat Sci) 34(6):67–69

    Google Scholar 

  14. Wei X, Yi S, Yongzheng M (2016) Recommendation system for paper reviewing based on graph computing. Appl Res Comput 33(3):798–801

    Google Scholar 

  15. Xiao-ping J, Cheng-hua L, Wen X, Xin-fang Z (2011) Naive bayesian text classification algorithm in cloud computing environment. J Comput Appl 31(9):2551–2554,2566

    Google Scholar 

  16. Xuemei Y, Xuemin M, Jinchun X, Bo W (2019) Improved approach to tf-idf algorithm in text classification. Comput Eng Appl 55(2):104–109,161

    Google Scholar 

  17. Yong F, Hua L, Jiang Z, Chun-xiao Y (2010) Text classification algorithm based on adaptive chinese word segmentation and proximal svm. Comput Sci 37(1):251–254,293

    Google Scholar 

  18. Yuan L (2010) An analysis on medical insurance fraud researches in the domestic and overseas market. Insur Stud 12:115–122

    Google Scholar 

  19. Yuan L (2011) Research on word segmentation and feature selection of chinese text classification, master’s thesis, Jilin University

  20. Zhang H, Wang L (2018) Prescription fraud detection through statistic modeling. In: Proceedings of 2018 international conference on mathematics and artificial intelligence, ICMAI ’18. Association for Computing Machinery, New York, pp 85–89

  21. Zhuchen L, Hao C, Yanhua Y, Jie L (2018) Extracting keywords with textrank and weighted word positions. Data Anal Knowl Dis 2(9):74–79

    Google Scholar 

Download references

Acknowledgements

This work was supported by the Fundamental Research Funds for the Central Universities (N2016006), National Key R&D Program of China (2018YFC0830701), Shenyang Medical Imaging Processing Engineering Technology Research Center (17-134-8-00).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Xin Min.

Ethics declarations

Ethics approval and consent to participate

This article does not contain any studies with human participants or animals performed by any of the authors.

Conflict of Interests

There are no conflicts of interest declared.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Li, W., Ye, P., Yu, K. et al. An abnormal surgical record recognition model with keywords combination patterns based on TextRank for medical insurance fraud detection. Multimed Tools Appl 82, 30949–30963 (2023). https://doi.org/10.1007/s11042-023-14529-4

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-023-14529-4

Keywords

Navigation