Skip to main content

Improving Medical Record Search Performance by Particle Swarm Optimization Based Data Fusion Techniques

  • Conference paper
  • First Online:
Web Information Systems and Applications (WISA 2021)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 12999))

Included in the following conference series:

Abstract

In this paper, we aim to improve the performance of electronic medical record search by fusing results from multiple search engines. We propose a particle swarm optimization-based data fusion method. To evaluate it effectiveness, experiments are carried out with two data sets from the TREC medical track in 2011 and 2012. We find that on average the proposed method outperforms the two traditional data fusion methods CombSum and CombMNZ and three different linear combination methods: multiple linear regression, learning to rank, and the genetic algorithm. An analysis is given to explain why the particle swarm optimization algorithm outperforms the others.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 89.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 119.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Amini, I., Martínez, D., Li, X., Sanderson, M.: Improving patient record search: a meta-data based approach. Inf. Process. Manage. 52(2), 258–272 (2016). https://doi.org/10.1016/j.ipm.2015.07.005

    Article  Google Scholar 

  2. Aslam, J.A., Montague, M.H.: Models for Metasearch. In: Proceedings of SIGIR, 2001, pp. 275-284 (2001). https://doi.org/10.1145/383952.384007

  3. Bartell, B.T., Cottrell, G.W., Belew, R.K.: Automatic combination of multiple ranked re-trieval systems. In: Proceedings of SIGIR, 1994, pp. 173-181 (1994). https://doi.org/10.1007/978-1-4471-2099-5_18

  4. Bhatt, M., Rahayu, J.W., Soni, S.P., Wouters, C.: Ontology driven semantic profiling and retrieval in medical information systems. J. Web Sem. 7(4), 317–331 (2009). https://doi.org/10.1016/j.websem.2009.05.004

    Article  Google Scholar 

  5. Bhogal, J., MacFarlane, A., Smith, P.: A review of ontology based query expansion. Inf. Process. Manage. 43(4), 866–886 (2007). https://doi.org/10.1016/j.ipm.2006.09.003

    Article  Google Scholar 

  6. Campbell, K.E., Das, A.K., Musen, M.A.: Research paper: a logical foundation for rep-resentation of clinical data. JAMIA 1(3), 218–232 (1994). https://doi.org/10.1136/jamia.1994.95236154

    Article  Google Scholar 

  7. Cao, Z., Qin, T., Liu, T., Tsai, M., Li, H.: Learning to rank: from pairwise approach to list-wise approach. In: Proceedings of ICML, 2007, pp. 129-136 (2007). https://doi.org/10.1145/1273496.1273513

  8. Cormack, G.V., Clarke, C.L.A., Büttcher, S.: Reciprocal rank fusion outperforms Condorcet and individual rank learning methods. In: Proceedings of SIGIR, 2009, pp. 758-759 (2009). https://doi.org/10.1145/1571941.1572114

  9. Díaz-Galiano, M.C., Martín-Valdivia, M., López, L.A.U.: Query expansion with a medical ontology to improve a multimodal information retrieval system. Comp. Bio. Med. 39(4), 396–403 (2009). https://doi.org/10.1016/j.compbiomed.2009.01.012

    Article  Google Scholar 

  10. Durao, F., Bayyapu, K., Xu, G., Dolog, P., Lage, R.: Expanding user’s query with tag-neighbors for effective medical information retrieval. Multimedia Tools Appl. 71(2), 905–929 (2012). https://doi.org/10.1007/s11042-012-1316-5

    Article  Google Scholar 

  11. Evans, D.A., Cimino, J.J., Hersh, W.R., Huff, S.M., Bell, D.S.: Position paper: toward a medical-concept representation language. JAMIA 1(3), 207–217 (1994). https://doi.org/10.1136/jamia.1994.95236153

    Article  Google Scholar 

  12. Farah, M., Vanderpooten, D.: An outranking approach for information retrieval. Inf. Retr. 11(4), 315–334 (2008). https://doi.org/10.1007/s10791-008-9046-z

    Article  Google Scholar 

  13. Fox, E.A., Shaw, J.A.: Combination of multiple searches. In: Proceedings of TREC, 1993, pp. 243-252 (1993)

    Google Scholar 

  14. Ghosh, K., Parui, S.K., Majumder, P.: Learning combination weights in data fusion using genetic algorithms. Inf. Process. Manage. 51(3), 306–328 (2015). https://doi.org/10.1016/j.ipm.2014.12.002

    Article  Google Scholar 

  15. William, M.G., Jim, J., Nancy, M.L., Dario, A.G.: StarTracker: an integrated, web-based clinical search engine. In: AMIA (2003)

    Google Scholar 

  16. David, A.H.: EMERSE: the electronic medical record search engine. In: AMIA (2006)

    Google Scholar 

  17. David, A.H., Qiaozhu, M., James, L., Ritu, K., Kai, Z.: Supporting information retrieval from electronic health records: a report of University of Michigan’s nine-year experience in developing and using the electronic medical record search engine (EMERSE). J. Biomed. Inf. 55, 290–300 (2015). https://doi.org/10.1016/j.jbi.2015.05.003

    Article  Google Scholar 

  18. He, Y., Hu, Q., Song, Y., He, L.: Estimating probability density of content types for promoting medical records search. In: Ferro, N., et al. (eds.) ECIR 2016. LNCS, vol. 9626, pp. 252–263. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-30671-1_19

    Chapter  Google Scholar 

  19. King, B., Wang, L., Provalov, I., Zhou, J.: Cengage learning at TREC 2011 medical track. In: Proceeding of TREC (2011)

    Google Scholar 

  20. Lee, J.H.: Combining multiple evidence from different properties of weighting schemes. In: Proceeding of SIGIR, 1995, pp. 180-188 (1995). https://doi.org/10.1145/215206.215358

  21. Montague, M.H., Aslam, J.A.: Condorcet fusion for improved retrieval. In: Proceeding of CIKM, 2002, pp. 538-548 (2002). https://doi.org/10.1145/584792.584881

  22. Natarajan, K., Stein, D.M., Jain, S., Elhadad, N.: An analysis of clinical queries in an elec-tronic health record search utility. I. J. Medical Informatics 79(7), 515–522 (2010). https://doi.org/10.1016/j.ijmedinf.2010.03.004

    Article  Google Scholar 

  23. Pedersen, M.E.H., Chipperfield, A.J.: Simplifying particle swarm optimization. Appl. Soft Comput. 10(2), 618–628 (2010). https://doi.org/10.1016/j.asoc.2009.08.029

    Article  Google Scholar 

  24. Prasath, R., Duane, A., O’Reilly, P.: Topic assisted fusion to re-rank texts for multi-faceted information retrieval. In: Banchs, R.E., Silvestri, F., Liu, T.-Y., Zhang, M., Gao, S., Lang, J. (eds.) AIRS 2013. LNCS, vol. 8281, pp. 97–108. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-45068-6_9

    Chapter  Google Scholar 

  25. Quantin, C., Jaquet-Chiffelle, D., Coatrieux, G., Benzenine, E., Allaert, F.: Medical record search engines, using pseudonymised patient identity: an alternative to centralised medical records. Int. J. Med. Inf. 80(2), e6–e11 (2011). https://doi.org/10.1016/j.ijmedinf.2010.10.003

    Article  Google Scholar 

  26. Scully, K.W., et al.: Development of an enterprise-wide clinical data repository: merging multiple legacy databases. In: AMIA (1997)

    Google Scholar 

  27. Soldaini, L., Yates, A., Yom-Tov, E., Frieder, O., Goharian, N.: Enhancing web search in the medical domain via query clarification. Inf. Retr. J. 19(1–2), 149–173 (2015). https://doi.org/10.1007/s10791-015-9258-y

    Article  Google Scholar 

  28. Vogt, C.C., Cottrell, G.W.: Fusion via a linear combination of scores. Inf. Retr. 1(3), 151–173 (1999). https://doi.org/10.1023/A:1009980820262

    Article  Google Scholar 

  29. Voorhees, E.M., Hersh, W.R.: Overview of the TREC 2012 medical records track. In: Proceeding of TREC (2012)

    Google Scholar 

  30. Wang, H., Zhang, Q., Yuan, J.: Semantically enhanced medical information retrieval system: a tensor factorization based approach. IEEE Access 5, 7584–7593 (2007). https://doi.org/10.1109/ACCESS.2017.2698142

    Article  Google Scholar 

  31. Wang, Y., Lu, K., Fang, H.: Learning2extract for medical domain retrieval. In: Proceeding of AIRS, 2017, pp. 45-57 (2017). https://doi.org/10.1007/978-3-319-70145-5_4

  32. Wei, F., Li, W., Liu, S.: iRANK: a rank-learn-combine framework for unsupervised en-semble ranking. JASIST 61(6), 1232–1243 (2011). https://doi.org/10.1002/asi.21296

    Article  Google Scholar 

  33. Wu, S.: Linear combination of component results in information retrieval. Data Knowl. Eng. 71(1), 114–126 (2012). https://doi.org/10.1016/j.datak.2011.08.003

    Article  Google Scholar 

  34. Shengli, W.: Data Fusion in Information Retrieval. Springer, Heidelberg (2012)

    MATH  Google Scholar 

  35. Wu, S.: The weighted Condorcet fusion in information retrieval. Inf. Process. Manage. 49(1), 108–122 (2012). https://doi.org/10.1016/j.ipm.2012.02.007

    Article  Google Scholar 

  36. Yan, W., Wang, Y., Huang, C., Wu, S.: Word embedding-based reformulation for long queries in information search. In: Wang, G., Lin, X., Hendler, J., Song, W., Xu, Z., Liu, G. (eds.) WISA 2020. LNCS, vol. 12432, pp. 202–214. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-60029-7_19

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shengli Wu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Xu, Q., Wu, S. (2021). Improving Medical Record Search Performance by Particle Swarm Optimization Based Data Fusion Techniques. In: Xing, C., Fu, X., Zhang, Y., Zhang, G., Borjigin, C. (eds) Web Information Systems and Applications. WISA 2021. Lecture Notes in Computer Science(), vol 12999. Springer, Cham. https://doi.org/10.1007/978-3-030-87571-8_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-87571-8_8

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-87570-1

  • Online ISBN: 978-3-030-87571-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics