Skip to main content

Configurable In-Database Similarity Search of Electronic Medical Records

  • Conference paper
  • First Online:
Web Information Systems and Applications (WISA 2021)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 12999))

Included in the following conference series:

Abstract

With the development of technology, Electronic Medical Re-cords (EMRs) are widely used for medical analysis through methods such as similarity search. Typical EMRs contain attributes of different data types including string, enumeration and numeric data, and are commonly stored in a database. However, many EMR similarity search algorithms neither separate different data types nor conduct search directly in the database. In addition, for researchers and doctors who need similarity search but do not have strong programming background, a user-friendly interface is missing. Therefore, we design a tool “SIR” to solve the aforementioned problems. SIR can conduct configurable similarity search in high dimensions (within 0.0931 and 0.7824 s respectively using its basic and advanced version), can be embedded directly in the database, and has an intuitive interface.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 89.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 119.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    SIR is user-friendly in the sense that no heavy coding is required, but users are supposed to have knowledge in patient similarity search and know how to make their own customized search settings, such as weight configurations.

  2. 2.

    Reference: https://deepai.org/machine-learning-glossary-and-terms/jaccard-index.

  3. 3.

    https://dev.mysql.com/doc/extending-mysql/5.7/en/udf-features.html.

References

  1. Boytsov, L., Naidan, B.: Engineering efficient and effective non-metric space library. In: Brisaboa, N., Pedreira, O., Zezula, P. (eds.) SISAP 2013. LNCS, vol. 8199, pp. 280–293. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-41062-8_28

    Chapter  Google Scholar 

  2. Celi, L.A., Charlton, P., Ghassemi, M.M., et al.: Secondary Analysis of Electronic Health Records. Springer, New York (2016)

    Google Scholar 

  3. Eteffa, K.F., Ansong, S., Li, C., Sheng, M., Zhang, Y., Xing, C.: Application of patient similarity in smart health: a case study in medical education. In: Ni, W., Wang, X., Song, W., Li, Y. (eds.) WISA 2019. LNCS, vol. 11817, pp. 714–719. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-30952-7_72

    Chapter  Google Scholar 

  4. Eteffa, K.F., Ansong, S., Li, C., Sheng, M., Zhang, Y., Xing, C.: An experimental study of time series based patient similarity with graphs. In: Wang, G., Lin, X., Hendler, J., Song, W., Xu, Z., Liu, G. (eds.) WISA 2020. LNCS, vol. 12432, pp. 467–474. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-60029-7_42

    Chapter  Google Scholar 

  5. Gravano, L., Ipeirotis, P.G., Jagadish, H.V., et al.: Approximate string joins in a database (almost) for free. VLDB 1, 491–500 (2001)

    Google Scholar 

  6. He, Z., Yang, J., Wang, Q., Li, J.: A method of electronic medical record similarity computation. In: Xing, C., Zhang, Y., Liang, Y. (eds.) ICSH 2016. LNCS, vol. 10219, pp. 182–191. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-59858-1_18

    Chapter  Google Scholar 

  7. Le, N., Wiley, M., Loza, A., et al.: Prediction of medical concepts in electronic health records: similar patient analysis. JMIR Med. Inform. 8(7), e16008 (2020)

    Article  Google Scholar 

  8. Malkov, Y.A., Yashunin, D.A.: Efficient and robust approximate nearest neighbor search using hierarchical navigable small world graphs. CoRR abs/1603.09320 (2016). http://arxiv.org/abs/1603.09320

  9. McGinn, C.A., Grenier, S., Duplantie, J., et al.: Comparison of user groups’ perspectives of barriers and facilitators to implementing electronic health records: a systematic review. BMC Med. 9(1), 1–10 (2011)

    Article  Google Scholar 

  10. Grand Viewer Research: Electronic health records market size, share and trends analysis report, 2021–2028, April 2021. https://www.grandviewresearch.com/industry-analysis/electronic-health-records-ehr-market

  11. Sherman, R.E., Anderson, S.A., Dal Pan, G.J., et al.: Real-world evidence–what is it and what can it tell us. N. Engl. J. Med. 375(23), 2293–2297 (2016)

    Article  Google Scholar 

  12. Sundari, P.S., Subaji, M., Karthikeyan, J.: A survey on effective similarity search models and techniques for big data processing in healthcare system. Res. J. Pharm. Technol. 10(8), 2677–2684 (2017)

    Article  Google Scholar 

  13. Tashkandi, A., Wiese, I., Wiese, L.: Efficient in-database patient similarity analysis for personalized medical decision support systems. Big Data Res. 13, 52–64 (2018)

    Article  Google Scholar 

Download references

Acknowledgement

This work was supported by National Key R&D Program of China (2020AAA0109603), State Key Laboratory of Computer Architecture (ICT, CAS) under Grant No. CARCHA202008.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yong Zhang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Wu, Y., Zhang, Y., Wu, J. (2021). Configurable In-Database Similarity Search of Electronic Medical Records. In: Xing, C., Fu, X., Zhang, Y., Zhang, G., Borjigin, C. (eds) Web Information Systems and Applications. WISA 2021. Lecture Notes in Computer Science(), vol 12999. Springer, Cham. https://doi.org/10.1007/978-3-030-87571-8_6

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-87571-8_6

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-87570-1

  • Online ISBN: 978-3-030-87571-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics