Abstract
In this paper, we aim to improve the performance of electronic medical record search by fusing results from multiple search engines. We propose a particle swarm optimization-based data fusion method. To evaluate it effectiveness, experiments are carried out with two data sets from the TREC medical track in 2011 and 2012. We find that on average the proposed method outperforms the two traditional data fusion methods CombSum and CombMNZ and three different linear combination methods: multiple linear regression, learning to rank, and the genetic algorithm. An analysis is given to explain why the particle swarm optimization algorithm outperforms the others.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Amini, I., MartÃnez, D., Li, X., Sanderson, M.: Improving patient record search: a meta-data based approach. Inf. Process. Manage. 52(2), 258–272 (2016). https://doi.org/10.1016/j.ipm.2015.07.005
Aslam, J.A., Montague, M.H.: Models for Metasearch. In: Proceedings of SIGIR, 2001, pp. 275-284 (2001). https://doi.org/10.1145/383952.384007
Bartell, B.T., Cottrell, G.W., Belew, R.K.: Automatic combination of multiple ranked re-trieval systems. In: Proceedings of SIGIR, 1994, pp. 173-181 (1994). https://doi.org/10.1007/978-1-4471-2099-5_18
Bhatt, M., Rahayu, J.W., Soni, S.P., Wouters, C.: Ontology driven semantic profiling and retrieval in medical information systems. J. Web Sem. 7(4), 317–331 (2009). https://doi.org/10.1016/j.websem.2009.05.004
Bhogal, J., MacFarlane, A., Smith, P.: A review of ontology based query expansion. Inf. Process. Manage. 43(4), 866–886 (2007). https://doi.org/10.1016/j.ipm.2006.09.003
Campbell, K.E., Das, A.K., Musen, M.A.: Research paper: a logical foundation for rep-resentation of clinical data. JAMIA 1(3), 218–232 (1994). https://doi.org/10.1136/jamia.1994.95236154
Cao, Z., Qin, T., Liu, T., Tsai, M., Li, H.: Learning to rank: from pairwise approach to list-wise approach. In: Proceedings of ICML, 2007, pp. 129-136 (2007). https://doi.org/10.1145/1273496.1273513
Cormack, G.V., Clarke, C.L.A., Büttcher, S.: Reciprocal rank fusion outperforms Condorcet and individual rank learning methods. In: Proceedings of SIGIR, 2009, pp. 758-759 (2009). https://doi.org/10.1145/1571941.1572114
DÃaz-Galiano, M.C., MartÃn-Valdivia, M., López, L.A.U.: Query expansion with a medical ontology to improve a multimodal information retrieval system. Comp. Bio. Med. 39(4), 396–403 (2009). https://doi.org/10.1016/j.compbiomed.2009.01.012
Durao, F., Bayyapu, K., Xu, G., Dolog, P., Lage, R.: Expanding user’s query with tag-neighbors for effective medical information retrieval. Multimedia Tools Appl. 71(2), 905–929 (2012). https://doi.org/10.1007/s11042-012-1316-5
Evans, D.A., Cimino, J.J., Hersh, W.R., Huff, S.M., Bell, D.S.: Position paper: toward a medical-concept representation language. JAMIA 1(3), 207–217 (1994). https://doi.org/10.1136/jamia.1994.95236153
Farah, M., Vanderpooten, D.: An outranking approach for information retrieval. Inf. Retr. 11(4), 315–334 (2008). https://doi.org/10.1007/s10791-008-9046-z
Fox, E.A., Shaw, J.A.: Combination of multiple searches. In: Proceedings of TREC, 1993, pp. 243-252 (1993)
Ghosh, K., Parui, S.K., Majumder, P.: Learning combination weights in data fusion using genetic algorithms. Inf. Process. Manage. 51(3), 306–328 (2015). https://doi.org/10.1016/j.ipm.2014.12.002
William, M.G., Jim, J., Nancy, M.L., Dario, A.G.: StarTracker: an integrated, web-based clinical search engine. In: AMIA (2003)
David, A.H.: EMERSE: the electronic medical record search engine. In: AMIA (2006)
David, A.H., Qiaozhu, M., James, L., Ritu, K., Kai, Z.: Supporting information retrieval from electronic health records: a report of University of Michigan’s nine-year experience in developing and using the electronic medical record search engine (EMERSE). J. Biomed. Inf. 55, 290–300 (2015). https://doi.org/10.1016/j.jbi.2015.05.003
He, Y., Hu, Q., Song, Y., He, L.: Estimating probability density of content types for promoting medical records search. In: Ferro, N., et al. (eds.) ECIR 2016. LNCS, vol. 9626, pp. 252–263. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-30671-1_19
King, B., Wang, L., Provalov, I., Zhou, J.: Cengage learning at TREC 2011 medical track. In: Proceeding of TREC (2011)
Lee, J.H.: Combining multiple evidence from different properties of weighting schemes. In: Proceeding of SIGIR, 1995, pp. 180-188 (1995). https://doi.org/10.1145/215206.215358
Montague, M.H., Aslam, J.A.: Condorcet fusion for improved retrieval. In: Proceeding of CIKM, 2002, pp. 538-548 (2002). https://doi.org/10.1145/584792.584881
Natarajan, K., Stein, D.M., Jain, S., Elhadad, N.: An analysis of clinical queries in an elec-tronic health record search utility. I. J. Medical Informatics 79(7), 515–522 (2010). https://doi.org/10.1016/j.ijmedinf.2010.03.004
Pedersen, M.E.H., Chipperfield, A.J.: Simplifying particle swarm optimization. Appl. Soft Comput. 10(2), 618–628 (2010). https://doi.org/10.1016/j.asoc.2009.08.029
Prasath, R., Duane, A., O’Reilly, P.: Topic assisted fusion to re-rank texts for multi-faceted information retrieval. In: Banchs, R.E., Silvestri, F., Liu, T.-Y., Zhang, M., Gao, S., Lang, J. (eds.) AIRS 2013. LNCS, vol. 8281, pp. 97–108. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-45068-6_9
Quantin, C., Jaquet-Chiffelle, D., Coatrieux, G., Benzenine, E., Allaert, F.: Medical record search engines, using pseudonymised patient identity: an alternative to centralised medical records. Int. J. Med. Inf. 80(2), e6–e11 (2011). https://doi.org/10.1016/j.ijmedinf.2010.10.003
Scully, K.W., et al.: Development of an enterprise-wide clinical data repository: merging multiple legacy databases. In: AMIA (1997)
Soldaini, L., Yates, A., Yom-Tov, E., Frieder, O., Goharian, N.: Enhancing web search in the medical domain via query clarification. Inf. Retr. J. 19(1–2), 149–173 (2015). https://doi.org/10.1007/s10791-015-9258-y
Vogt, C.C., Cottrell, G.W.: Fusion via a linear combination of scores. Inf. Retr. 1(3), 151–173 (1999). https://doi.org/10.1023/A:1009980820262
Voorhees, E.M., Hersh, W.R.: Overview of the TREC 2012 medical records track. In: Proceeding of TREC (2012)
Wang, H., Zhang, Q., Yuan, J.: Semantically enhanced medical information retrieval system: a tensor factorization based approach. IEEE Access 5, 7584–7593 (2007). https://doi.org/10.1109/ACCESS.2017.2698142
Wang, Y., Lu, K., Fang, H.: Learning2extract for medical domain retrieval. In: Proceeding of AIRS, 2017, pp. 45-57 (2017). https://doi.org/10.1007/978-3-319-70145-5_4
Wei, F., Li, W., Liu, S.: iRANK: a rank-learn-combine framework for unsupervised en-semble ranking. JASIST 61(6), 1232–1243 (2011). https://doi.org/10.1002/asi.21296
Wu, S.: Linear combination of component results in information retrieval. Data Knowl. Eng. 71(1), 114–126 (2012). https://doi.org/10.1016/j.datak.2011.08.003
Shengli, W.: Data Fusion in Information Retrieval. Springer, Heidelberg (2012)
Wu, S.: The weighted Condorcet fusion in information retrieval. Inf. Process. Manage. 49(1), 108–122 (2012). https://doi.org/10.1016/j.ipm.2012.02.007
Yan, W., Wang, Y., Huang, C., Wu, S.: Word embedding-based reformulation for long queries in information search. In: Wang, G., Lin, X., Hendler, J., Song, W., Xu, Z., Liu, G. (eds.) WISA 2020. LNCS, vol. 12432, pp. 202–214. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-60029-7_19
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Xu, Q., Wu, S. (2021). Improving Medical Record Search Performance by Particle Swarm Optimization Based Data Fusion Techniques. In: Xing, C., Fu, X., Zhang, Y., Zhang, G., Borjigin, C. (eds) Web Information Systems and Applications. WISA 2021. Lecture Notes in Computer Science(), vol 12999. Springer, Cham. https://doi.org/10.1007/978-3-030-87571-8_8
Download citation
DOI: https://doi.org/10.1007/978-3-030-87571-8_8
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-87570-1
Online ISBN: 978-3-030-87571-8
eBook Packages: Computer ScienceComputer Science (R0)