An abnormal surgical record recognition model with keywords combination patterns based on TextRank for medical insurance fraud detection

Li, Wei; Ye, Panpan; Yu, Kun; Min, Xin; Xie, Weidong

doi:10.1007/s11042-023-14529-4

An abnormal surgical record recognition model with keywords combination patterns based on TextRank for medical insurance fraud detection

Published: 23 March 2023

Volume 82, pages 30949–30963, (2023)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

Wei Li¹,
Panpan Ye²,
Kun Yu³,
Xin Min ORCID: orcid.org/0000-0002-1996-7241² &
…
Weidong Xie²

210 Accesses
1 Citation
Explore all metrics

Abstract

Increasing insurance fraud has resulted in the loss of large amounts of money, making it difficult to expand insurance coverage and scale. This phenomenon is particularly acute in the field of health insurance. Medical insurance fraud is the falsification of medical records to obtain medical insurance funds or medical insurance benefits. Therefore, effective detection of health insurance fraud is of great importance for the rational use of health insurance funds. To address the frequent violations and frauds in health insurance, this paper proposes a keyword combination-based approach for health insurance fraud detection. First, a medical dictionary is built by TextRank to segment the surgical procedure text and extract the surgical keywords, then, the synonyms corresponding to each keyword are extracted from the electronic medical record data to form a keyword combination pattern as the final detection rule, and finally, a medical insurance fraud detection model is built on this basis. In this paper, data on acute myocardial infarction and unstable angina were selected for examination, with 1371 and 1787 cases, respectively. The performance of the model was evaluated by the coverage rate and compared experimentally with the TF-IDF and LDA algorithms. The experiments also prove the efficiency and advancedness of the algorithm in this paper. In the case of acute myocardial infarction, the method in this paper improved the coverage rate by 23.77% and 9.4% compared with the TF-IDF and LDA methods respectively. In the case of unstable angina, the coverage of the method in this paper was improved by 20.21% compared to both TF-IDF and LDA methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Associative Feature Information Extraction Using Text Mining from Health Big Data

Article 18 April 2018

Improvement of TextRank Based on Co-occurrence Word Pairs and Context Information

Keyword extraction and structuralization of medical reports

Article 03 April 2020

Data Availability

The patient population data used to support the findings of this study have not been made available because the data are supplied by Cancer Hospital of Liaoning under license and so cannot be made freely available. Requests for access to these data should be made to the corresponding author.

References

Chiu C-C, Tsai C-Y (2004) A web services-based collaborative scheme for credit card fraud detection. In: IEEE international conference on e-technology, e-commerce and e-service, 2004. EEE ’04. 2004, pp 177–181
Ilango V et al (2020) A time-efficient model for detecting fraudulent health insurance claims using artificial neural networks. In: 2020 International Conference on System, Computation, Automation and Networking (ICSCAN), IEEE, pp 1–6
Jingzhong W, Tongxiang Q (2015) Focused topic web crawler based on improved tf-idf alogorithm. J Comput Appl 35(10):2901–2904,2919
Google Scholar
Li W, Zhao J (2016) Textrank algorithm by exploiting wikipedia for short text keywords extraction. In: 2016 3rd international conference on information science and control engineering (ICISCE), IEEE, pp 683–686
Lielei C, Hui F (2018) Keyphrases automatic extraction from the abstracts of English scientific papers based on Scopus retrieval. Journal of Nanjing University (Natural Science), pp 604–611
Liu J, Bier E, Wilson A, Honda T, Kumar S, Gilpin L, Guerra-Gomez J, Davies D (2015) Graph analysis for detecting fraud, waste, and abuse in healthcare data. In: AAAI’15 proceedings of the twenty-ninth AAAI conference on artificial intelligence, pp 3912–3919
Lloyd JL, Wellman NS (2015) Older americans act nutrition programs: a community-based nutrition program helping older adults remain at home. J Nutr Gerontol Geriatr 34(2):90–109
Article Google Scholar
Matloob I, Khan S (2019) A framework for fraud detection in government supported national healthcare programs. In: 2019 11th international conference on electronics, computers and artificial intelligence (ECAI)
Mihalcea R, Tarau P (2004) Textrank: bringing order into text. In: Proceedings of the 2004 conference on empirical methods in natural language processing, pp 404–411
Pandey P, Saroliya A, Kumar R (2018) Analyses and detection of health insurance fraud using data mining and predictive modeling techniques
Rosen-Zvi M, Griffiths T, Steyvers M, Smyth P (2004) The author-topic model for authors and documents. In: UAI ’04 Proceedings of the 20th conference on Uncertainty in artificial intelligence, pp 487–494
Soleymani MH, Yaseri M, Farzadfar F, Mohammadpour A, Sharifi F, Kabir MJ (2018) Detecting medical prescriptions suspected of fraud using an unsupervised data mining algorithm. DARU 26(2):209–214
Article Google Scholar
Wei L, Ya P (2007) Performance comparison and analysis of several general text classification algorithms. J Hunan Univ(Nat Sci) 34(6):67–69
Google Scholar
Wei X, Yi S, Yongzheng M (2016) Recommendation system for paper reviewing based on graph computing. Appl Res Comput 33(3):798–801
Google Scholar
Xiao-ping J, Cheng-hua L, Wen X, Xin-fang Z (2011) Naive bayesian text classification algorithm in cloud computing environment. J Comput Appl 31(9):2551–2554,2566
Google Scholar
Xuemei Y, Xuemin M, Jinchun X, Bo W (2019) Improved approach to tf-idf algorithm in text classification. Comput Eng Appl 55(2):104–109,161
Google Scholar
Yong F, Hua L, Jiang Z, Chun-xiao Y (2010) Text classification algorithm based on adaptive chinese word segmentation and proximal svm. Comput Sci 37(1):251–254,293
Google Scholar
Yuan L (2010) An analysis on medical insurance fraud researches in the domestic and overseas market. Insur Stud 12:115–122
Google Scholar
Yuan L (2011) Research on word segmentation and feature selection of chinese text classification, master’s thesis, Jilin University
Zhang H, Wang L (2018) Prescription fraud detection through statistic modeling. In: Proceedings of 2018 international conference on mathematics and artificial intelligence, ICMAI ’18. Association for Computing Machinery, New York, pp 85–89
Zhuchen L, Hao C, Yanhua Y, Jie L (2018) Extracting keywords with textrank and weighted word positions. Data Anal Knowl Dis 2(9):74–79
Google Scholar

Download references

Acknowledgements

This work was supported by the Fundamental Research Funds for the Central Universities (N2016006), National Key R&D Program of China (2018YFC0830701), Shenyang Medical Imaging Processing Engineering Technology Research Center (17-134-8-00).

Author information

Authors and Affiliations

Key Laboratory of Intelligent Computing in Medical Image (MIIC), Northeastern University, Ministry of Education, Shenyang, China
Wei Li
School of Computer Science and Engineering, Northeastern University, Shenyang, China
Panpan Ye, Xin Min & Weidong Xie
Biomedical and Information Engineering School, Northeastern University, Shenyang, China
Kun Yu

Authors

Wei Li
View author publications
You can also search for this author in PubMed Google Scholar
Panpan Ye
View author publications
You can also search for this author in PubMed Google Scholar
Kun Yu
View author publications
You can also search for this author in PubMed Google Scholar
Xin Min
View author publications
You can also search for this author in PubMed Google Scholar
Weidong Xie
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Xin Min.

Ethics declarations

Ethics approval and consent to participate

This article does not contain any studies with human participants or animals performed by any of the authors.

Conflict of Interests

There are no conflicts of interest declared.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Li, W., Ye, P., Yu, K. et al. An abnormal surgical record recognition model with keywords combination patterns based on TextRank for medical insurance fraud detection. Multimed Tools Appl 82, 30949–30963 (2023). https://doi.org/10.1007/s11042-023-14529-4

Download citation

Received: 06 December 2020
Revised: 23 July 2021
Accepted: 31 January 2023
Published: 23 March 2023
Issue Date: August 2023
DOI: https://doi.org/10.1007/s11042-023-14529-4

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

An abnormal surgical record recognition model with keywords combination patterns based on TextRank for medical insurance fraud detection

Abstract

Access this article

Similar content being viewed by others

Associative Feature Information Extraction Using Text Mining from Health Big Data

Improvement of TextRank Based on Co-occurrence Word Pairs and Context Information

Keyword extraction and structuralization of medical reports

Data Availability

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Conflict of Interests

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

An abnormal surgical record recognition model with keywords combination patterns based on TextRank for medical insurance fraud detection

Abstract

Access this article

Similar content being viewed by others

Associative Feature Information Extraction Using Text Mining from Health Big Data

Improvement of TextRank Based on Co-occurrence Word Pairs and Context Information

Keyword extraction and structuralization of medical reports

Data Availability

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Conflict of Interests

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation