An Embedded Representation Learning of Relational Clinical Codes

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNPSE, volume 13121)

Abstract

Fraud, waste, abuse and error (FWAE) incidents lead to higher co-payments, premiums and other costs, which can significantly affect the quality of care a patient receives. Curbing such overpayment incidents in claims settlement is a major organizational goal for healthcare companies. Because claims are settled by examining the combination of clinical codes assigned, the task at hand is to predict whether a new claim would lead to overpayment. This prediction task can be solved by building a classification model that accepts a representation of the clinical codes (which form an ontology graph among themselves) together with other feature vectors appearing in claims data. In this work, we learn embedded representations of these clinical codes, and of the relations among them in the ontology graph (excerpts from the Unified Medical Language System (UMLS)), by incorporating knowledge from the semantics of code descriptions and edge relations. We combine the Paragraph Vector (PV) model with translation-based models in a multi-relational learning framework. We carry out intrinsic evaluations of these embedding models on different tasks. Finally, we apply the learned representations in a healthcare application to detect overpayment on claims and to compute the savings achieved in fraud prevention.
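
To make the phrase "translation-based models" concrete, here is a minimal sketch of a TransE-style score and margin ranking loss in plain Python. It is an illustration under stated assumptions, not the paper's implementation: the function names, the toy two-dimensional vectors and the margin value are all hypothetical, and the way the paper couples these scores with Paragraph Vector embeddings of code descriptions is not visible in this preview, so it is left out.

```python
import math

def transe_score(head, relation, tail):
    """Distance ||head + relation - tail||_2.

    In a translation-based model a relation acts as a vector offset,
    so a lower distance means a more plausible (head, relation, tail) triple.
    """
    return math.sqrt(sum((h + r - t) ** 2 for h, r, t in zip(head, relation, tail)))

def margin_loss(pos_score, neg_score, margin=1.0):
    """Margin ranking loss for one true triple and one corrupted triple.

    The loss is zero once the corrupted triple is at least `margin`
    farther away than the true triple.
    """
    return max(0.0, margin + pos_score - neg_score)

# Hypothetical toy embeddings (not taken from UMLS or from the paper).
code_a = [0.2, 0.1]         # embedding of one clinical code
is_child_of = [0.3, 0.4]    # embedding of an edge relation
code_b = [0.5, 0.5]         # code that truly stands in the relation to code_a
code_wrong = [0.5, -0.5]    # corrupted tail used as a negative sample

pos = transe_score(code_a, is_child_of, code_b)
neg = transe_score(code_a, is_child_of, code_wrong)
loss = margin_loss(pos, neg, margin=1.0)
```

Training would lower `loss` over many true and corrupted triples by gradient descent; the sketch only evaluates it once for a single pair.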

Notes

  1. We use knowledge graph and ontology graph interchangeably.

  2. Also see https://www.nlm.nih.gov/research/umls/quickstart.html.

  3. For more details see https://www.ncbi.nlm.nih.gov/books/NBK9684/.

Author information

Corresponding author

Correspondence to Suman Roy.

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Cite this paper

Roy, S., Kumar, A., Sengupta, A., Mattivi, R., Ahmed, S., Bridges, M. (2021). An Embedded Representation Learning of Relational Clinical Codes. In: Hacid, H., Kao, O., Mecella, M., Moha, N., Paik, Hy. (eds) Service-Oriented Computing. ICSOC 2021. Lecture Notes in Computer Science, vol 13121. Springer, Cham. https://doi.org/10.1007/978-3-030-91431-8_66

  • DOI: https://doi.org/10.1007/978-3-030-91431-8_66

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-91430-1

  • Online ISBN: 978-3-030-91431-8

  • eBook Packages: Computer Science, Computer Science (R0)
