Abstract
Fraud, waste, abuse and error (FWAE) incidents lead to higher co-payments and premiums and other costs that can significantly impact the quality of care one receives. Curbing such incidents of overpayment in claims settlement is a major organizational goal for healthcare companies. As claims are settled by examining the combination of clinical codes assigned, the task at hand is to predict if a new claim would lead to overpayment. This prediction task can be solved by building a classification model that would accept a representation of the clinical codes (which form an ontology graph among themselves) and other feature vectors appearing in claims data. In this work, we learn the embedded representation of these clinical nodes and relations among them in the ontology graph (excerpts from Unified Medical Language System (UMLS)) by incorporating knowledge from the semantics of code descriptions and edge relations. We combine the Paragraph Vector (PV) model with translation-based models in a framework of multi-relational learning. We carry out intrinsic evaluations of these embedding models on different tasks. Finally, we apply this representation learning by detecting overpayment on claims in healthcare application and by computing the savings achieved in fraud prevention in healthcare.
Keywords
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsNotes
- 1.
we use knowledge graph and ontology graph interchangeably.
- 2.
- 3.
For more details see https://www.ncbi.nlm.nih.gov/books/NBK9684/.
References
(AMA), A.M.A.: CPT Process. Archived May 11, 2016 at the Wayback Machine (2016)
Bethesda (MD): UMLSReference Manual [Internet]. National Library of Medicine (US) (2009). https://www.ncbi.nlm.nih.gov/books/NBK9676/
Bordes, A., Glorot, X., Weston, J., Bengio, Y.: A semantic matching energy function for learning with multi-relational data - application to word-sense disambiguation. Mach. Learn. 94(2), 233–259 (2014)
Bordes, A., Usunier, N., García-Durán, A., Weston, J., Yakhnenko, O.: Translating embeddings for modeling multi-relational data. In: Proceedings of 27th NIPS, pp. 2787–2795 (2013)
Bordes, A., Weston, J., Collobert, R., Bengio, Y.: Learning structured embeddings of knowledge bases. In: Burgard, W., Roth, D. (eds.) Proceedings of the Twenty-Fifth AAAI 2011. AAAI Press (2011)
Chang, K.W., Yih, W.t., Meek, C.: Multi-relational latent semantic analysis. In: Proceedings of EMNLP, pp. 1602–1612 (2013)
Han, X., Liu, Z., Sun, M.: Joint representation learning of text and knowledge for knowledge graph completion. CoRR arXiv:1611.04125 (2016)
Jenatton, R., Roux, N.L., Bordes, A., Obozinski, G.: A latent factor model for highly multi-relational data. In: Proceedings of the 26th NIPS, pp. 3176–3184 (2012)
Ji, G., He, S., Xu, L., Liu, K., Zhao, J.: Knowledge graph embedding via dynamic mapping matrix. In: Proceedings of the 53rd ACL, vol. 1, pp. 687–696 (2015)
Le, Q.V., Mikolov, T.: Distributed representations of sentences and documents. In: Proceedings of the 31th ICML, pp. 1188–1196 (2014)
Lin, Y., Liu, Z., Sun, M., Liu, Y., Zhu, X.: Learning entity and relation embeddings for knowledge graph completion. In: Proceedings of AAAI (2015)
Ling, Y., An, Y., Liu, M., Hasan, S.A., Fan, Y., Hu, X.: Integrating extra knowledge into word embedding models for biomedical NLP tasks. In: IJCNN, pp. 968–975. IEEE (2017)
Luong, T., Socher, R., Manning, C.D.: Better word representations with recursive neural networks for morphology. In: Proceedings the 17th CoNLL, pp. 104–113 (2013)
Lv, X., Hou, L., Li, J., Liu, Z.: Differentiating concepts and instances for knowledge graph embedding. In: Proceedings of EMNLP, pp. 1971–1979 (2018)
Mai, G., Janowicz, K., Yan, B.: Combining text embedding and knowledge graph embedding techniques for academic search engines. In: Joint proceedings of the 4th Workshop SemDeep-4 and NLIWoD4 and QALD-9 co-located with 17th ISWC, vol. 2241, pp. 77–88. CEUR-WS.org (2018)
Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: 27th NIPS, Proceedings, pp. 3111–3119 (2013)
Socher, R., Chen, D., Manning, C.D., Ng, A.Y.: Reasoning with neural tensor networks for knowledge base completion. In: 27th NIPS, Proceedings, pp. 926–934 (2013)
Socher, R., Huang, E.H., Pennington, J., Ng, A.Y., Manning, C.D.: Dynamic pooling and unfolding recursive autoencoders for paraphrase detection. In: 25th NIPS, Proceedings, pp. 801–809 (2011)
Socher, R., Lin, C.C., Ng, A.Y., Manning, C.D.: Parsing natural scenes and natural language with recursive neural networks. In: Proceedings of the 28th ICML, pp. 129–136 (2011)
Sutskever, I., Salakhutdinov, R., Tenenbaum, J.B.: Modelling relational data using bayesian clustered tensor factorization. In: 23rd NIPS, Proceedings, pp. 1821–1828 (2009)
Wang, Q., Mao, Z., Wang, B., Guo, L.: Knowledge graph embedding: a survey of approaches and applications. IEEE Trans. Knowl. Data Eng. 29(12), 2724–2743 (2017)
Wang, Z., Zhang, J., Feng, J., Chen, Z.: Knowledge graph embedding by translating on hyperplanes. In: Proceedings of the 28th AAAI, pp. 1112–1119 (2014)
Weston, J., Bordes, A., Yakhnenko, O., Usunier, N.: Connecting language and knowledge bases with embedding models for relation extraction. In: Proceedings of the EMNLP, pp. 1366–1371 (2013)
Zhang, D., Yuang, B., Wang, D., Liu, R.: Joint semantic relevance learning with text data and graph knowledge. Technical report, CSLT TECHNICAL REPORT-20150023 (2015)
Zhang, Y., Yao, Q., Shao, Y., Chen, L.: NSCaching: simple and efficient negative sampling for knowledge graph embedding. In: 35th IEEE ICDE, pp. 614–625 (2019)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Roy, S., Kumar, A., Sengupta, A., Mattivi, R., Ahmed, S., Bridges, M. (2021). An Embedded Representation Learning of Relational Clinical Codes. In: Hacid, H., Kao, O., Mecella, M., Moha, N., Paik, Hy. (eds) Service-Oriented Computing. ICSOC 2021. Lecture Notes in Computer Science(), vol 13121. Springer, Cham. https://doi.org/10.1007/978-3-030-91431-8_66
Download citation
DOI: https://doi.org/10.1007/978-3-030-91431-8_66
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-91430-1
Online ISBN: 978-3-030-91431-8
eBook Packages: Computer ScienceComputer Science (R0)