An Embedded Representation Learning of Relational Clinical Codes

Roy, Suman; Kumar, Amit; Sengupta, Ayan; Mattivi, Riccardo; Ahmed, Selim; Bridges, Michael

doi:10.1007/978-3-030-91431-8_66

An Embedded Representation Learning of Relational Clinical Codes

Suman Roy¹³,
Amit Kumar¹³,
Ayan Sengupta¹³,
Riccardo Mattivi¹⁴,
Selim Ahmed¹⁴ &
…
Michael Bridges¹⁴

Conference paper
First Online: 18 November 2021

2883 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNPSE,volume 13121))

Abstract

Fraud, waste, abuse and error (FWAE) incidents lead to higher co-payments and premiums and other costs that can significantly impact the quality of care one receives. Curbing such incidents of overpayment in claims settlement is a major organizational goal for healthcare companies. As claims are settled by examining the combination of clinical codes assigned, the task at hand is to predict if a new claim would lead to overpayment. This prediction task can be solved by building a classification model that would accept a representation of the clinical codes (which form an ontology graph among themselves) and other feature vectors appearing in claims data. In this work, we learn the embedded representation of these clinical nodes and relations among them in the ontology graph (excerpts from Unified Medical Language System (UMLS)) by incorporating knowledge from the semantics of code descriptions and edge relations. We combine the Paragraph Vector (PV) model with translation-based models in a framework of multi-relational learning. We carry out intrinsic evaluations of these embedding models on different tasks. Finally, we apply this representation learning by detecting overpayment on claims in healthcare application and by computing the savings achieved in fraud prevention in healthcare.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 109.00; Price excludes VAT (USA)

Softcover Book: USD 139.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

1.
we use knowledge graph and ontology graph interchangeably.
2.
Also see https://www.nlm.nih.gov/research/umls/quickstart.html.
3.
For more details see https://www.ncbi.nlm.nih.gov/books/NBK9684/.

References

(AMA), A.M.A.: CPT Process. Archived May 11, 2016 at the Wayback Machine (2016)
Google Scholar
Bethesda (MD): UMLSReference Manual [Internet]. National Library of Medicine (US) (2009). https://www.ncbi.nlm.nih.gov/books/NBK9676/
Bordes, A., Glorot, X., Weston, J., Bengio, Y.: A semantic matching energy function for learning with multi-relational data - application to word-sense disambiguation. Mach. Learn. 94(2), 233–259 (2014)
Article MathSciNet Google Scholar
Bordes, A., Usunier, N., García-Durán, A., Weston, J., Yakhnenko, O.: Translating embeddings for modeling multi-relational data. In: Proceedings of 27th NIPS, pp. 2787–2795 (2013)
Google Scholar
Bordes, A., Weston, J., Collobert, R., Bengio, Y.: Learning structured embeddings of knowledge bases. In: Burgard, W., Roth, D. (eds.) Proceedings of the Twenty-Fifth AAAI 2011. AAAI Press (2011)
Google Scholar
Chang, K.W., Yih, W.t., Meek, C.: Multi-relational latent semantic analysis. In: Proceedings of EMNLP, pp. 1602–1612 (2013)
Google Scholar
Han, X., Liu, Z., Sun, M.: Joint representation learning of text and knowledge for knowledge graph completion. CoRR arXiv:1611.04125 (2016)
Jenatton, R., Roux, N.L., Bordes, A., Obozinski, G.: A latent factor model for highly multi-relational data. In: Proceedings of the 26th NIPS, pp. 3176–3184 (2012)
Google Scholar
Ji, G., He, S., Xu, L., Liu, K., Zhao, J.: Knowledge graph embedding via dynamic mapping matrix. In: Proceedings of the 53rd ACL, vol. 1, pp. 687–696 (2015)
Google Scholar
Le, Q.V., Mikolov, T.: Distributed representations of sentences and documents. In: Proceedings of the 31th ICML, pp. 1188–1196 (2014)
Google Scholar
Lin, Y., Liu, Z., Sun, M., Liu, Y., Zhu, X.: Learning entity and relation embeddings for knowledge graph completion. In: Proceedings of AAAI (2015)
Google Scholar
Ling, Y., An, Y., Liu, M., Hasan, S.A., Fan, Y., Hu, X.: Integrating extra knowledge into word embedding models for biomedical NLP tasks. In: IJCNN, pp. 968–975. IEEE (2017)
Google Scholar
Luong, T., Socher, R., Manning, C.D.: Better word representations with recursive neural networks for morphology. In: Proceedings the 17th CoNLL, pp. 104–113 (2013)
Google Scholar
Lv, X., Hou, L., Li, J., Liu, Z.: Differentiating concepts and instances for knowledge graph embedding. In: Proceedings of EMNLP, pp. 1971–1979 (2018)
Google Scholar
Mai, G., Janowicz, K., Yan, B.: Combining text embedding and knowledge graph embedding techniques for academic search engines. In: Joint proceedings of the 4th Workshop SemDeep-4 and NLIWoD4 and QALD-9 co-located with 17th ISWC, vol. 2241, pp. 77–88. CEUR-WS.org (2018)
Google Scholar
Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: 27th NIPS, Proceedings, pp. 3111–3119 (2013)
Google Scholar
Socher, R., Chen, D., Manning, C.D., Ng, A.Y.: Reasoning with neural tensor networks for knowledge base completion. In: 27th NIPS, Proceedings, pp. 926–934 (2013)
Google Scholar
Socher, R., Huang, E.H., Pennington, J., Ng, A.Y., Manning, C.D.: Dynamic pooling and unfolding recursive autoencoders for paraphrase detection. In: 25th NIPS, Proceedings, pp. 801–809 (2011)
Google Scholar
Socher, R., Lin, C.C., Ng, A.Y., Manning, C.D.: Parsing natural scenes and natural language with recursive neural networks. In: Proceedings of the 28th ICML, pp. 129–136 (2011)
Google Scholar
Sutskever, I., Salakhutdinov, R., Tenenbaum, J.B.: Modelling relational data using bayesian clustered tensor factorization. In: 23rd NIPS, Proceedings, pp. 1821–1828 (2009)
Google Scholar
Wang, Q., Mao, Z., Wang, B., Guo, L.: Knowledge graph embedding: a survey of approaches and applications. IEEE Trans. Knowl. Data Eng. 29(12), 2724–2743 (2017)
Article Google Scholar
Wang, Z., Zhang, J., Feng, J., Chen, Z.: Knowledge graph embedding by translating on hyperplanes. In: Proceedings of the 28th AAAI, pp. 1112–1119 (2014)
Google Scholar
Weston, J., Bordes, A., Yakhnenko, O., Usunier, N.: Connecting language and knowledge bases with embedding models for relation extraction. In: Proceedings of the EMNLP, pp. 1366–1371 (2013)
Google Scholar
Zhang, D., Yuang, B., Wang, D., Liu, R.: Joint semantic relevance learning with text data and graph knowledge. Technical report, CSLT TECHNICAL REPORT-20150023 (2015)
Google Scholar
Zhang, Y., Yao, Q., Shao, Y., Chen, L.: NSCaching: simple and efficient negative sampling for knowledge graph embedding. In: 35th IEEE ICDE, pp. 614–625 (2019)
Google Scholar

Download references

Author information

Authors and Affiliations

Optum Global Advantage (OGA), UnitedHealth Group, Bangalore, 560 103, India
Suman Roy, Amit Kumar & Ayan Sengupta
Optum Global Advantage (OGA), UnitedHealth Group, Dublin, Ireland
Riccardo Mattivi, Selim Ahmed & Michael Bridges

Authors

Suman Roy
View author publications
You can also search for this author in PubMed Google Scholar
Amit Kumar
View author publications
You can also search for this author in PubMed Google Scholar
Ayan Sengupta
View author publications
You can also search for this author in PubMed Google Scholar
Riccardo Mattivi
View author publications
You can also search for this author in PubMed Google Scholar
Selim Ahmed
View author publications
You can also search for this author in PubMed Google Scholar
Michael Bridges
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Suman Roy .

Editor information

Editors and Affiliations

Zayed University, Dubai, United Arab Emirates
Hakim Hacid
Technical University of Berlin, Berlin, Germany
Odej Kao
Informatica Automatica Gestio, Sapienza University of Rome, Rome, Italy
Massimo Mecella
Departement d'Informatique, University of Quebec, Montreal, QC, Canada
Naouel Moha
UNSW Sydney, Sydney, NSW, Australia
Hye-young Paik

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Roy, S., Kumar, A., Sengupta, A., Mattivi, R., Ahmed, S., Bridges, M. (2021). An Embedded Representation Learning of Relational Clinical Codes. In: Hacid, H., Kao, O., Mecella, M., Moha, N., Paik, Hy. (eds) Service-Oriented Computing. ICSOC 2021. Lecture Notes in Computer Science(), vol 13121. Springer, Cham. https://doi.org/10.1007/978-3-030-91431-8_66

Download citation

DOI: https://doi.org/10.1007/978-3-030-91431-8_66
Published: 18 November 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-91430-1
Online ISBN: 978-3-030-91431-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics