Abstract
This paper presents a Web Information Retrieval System (WebIRS), which is designed to assist the healthcare professionals to obtain up-to-date medical knowledge and information via the World Wide Web (WWW). The system leverages the document classification and text summarization techniques to deliver the highly correlated medical information to the physicians. The system architecture of the proposed WebIRS is first discussed, and then a case study on an application of the proposed system in a Hong Kong medical organization is presented to illustrate the adoption process and a questionnaire is administrated to collect feedback on the operation and performance of WebIRS in comparison with conventional information retrieval in the WWW. A prototype system has been constructed and implemented on a trial basis in a medical organization. It has proven to be of benefit to healthcare professionals through its automatic functions in classification and summarizing the medical information that the physicians needed and interested. The results of the case study show that with the use of the proposed WebIRS, significant reduction of searching time and effort, with retrieval of highly relevant materials can be attained.










Similar content being viewed by others
References
Housel, T., and Bell, A. A., Measuring and managing knowledge. McGraw Hill, Irwin, 2001.
Bell, G. B., and Sethi, A., Matching records in a national medical patient index. Commun. ACM 44(9):83–88, 2001.
Ybarra, M. L., and Suman, M., Help seeking behavior and the Internet: a national survey. Int. J. Med. Inform. 75(1):29–41, 2006.
Gilmour, J. A., Scott, S. D., and Huntington, N., Nurses and Internet health information: a questionnaire survey. J. Adv. Nurs. 61(1):19–28, 2008.
McHugh, S. M., Corrigan, M., Morney, N., Sheikh, A., Lehane, E., and Hill, A. D. K., A quantitative assessment of changing trends in Internet usage for cancer information. World J. Surg. 35(2):253–257, 2011.
Rogers, S. N., Rozek, A., Aleyaasin, N., Promod, P., and Lowe, D., Internet use among head and neck cancer survivors in the North West of England. Br. J. Oral Maxillofac. Surg. 50(3):208–214, 2012.
Holzinger, A., Geierhofer, R., Modritscher, F., and Tatzl, R., Semantic information in medical information systems: utilization of text mining techniques to analyze medical diagnoses. J. Univ. Comput. Sci. 14(22):3781–3795, 2008.
Casebeer, L., Bennett, N., and Kristofco, R., Physician Internet medical information seeking and on-line continuing education use patterns. J. Contin. Educ. Health Prof. 22:33–42, 2002.
Jennings, N. R., and Wooldridge, M. J., Agent technology: foundations, applications, and markets. Springer, Berlin, 1998.
Manning, C. D., Raghavan, P., and Schütze, H., Introduction to information retrieval. Cambridge University Press, U.K., 2008.
Vakali, A., and Pallis, G., Web data management practices: emerging techniques and technologies. Idea Group Publishing, U.S.A., 2007.
Velasquez, J. D., and Palade, V., Adaptive web sites: a knowledge extraction from web data approach. Ios Press, Netherlands, 2008.
Tao, X., Li, Y., and Zhong, N., A knowledge-based model using ontologies for personalized web information gathering. Web Intell. Agent Syst. 8(3):235–254, 2010.
Kalichman, S. C., Weinhardt, L., and Benotsch, E., Internet access and internet use for health information among people living with HIV-AIDS. Patient Educ. Couns. 46(2):109–116, 2002.
Risch, N. A., Kwon, H. T., and Scarbrough, W., Minority primary care physicians’ knowledge, attitudes, and practices on eye health and preferred sources of information. J. Natl. Med. Assoc. 101(12):1247–1253, 2009.
Walczak, S., A multiagent architecture for developing medical information retrieval agents. J. Med. Syst. 27(5):479–498, 2003.
Craan, F., and Oleske, D. M., Medical information and the Internet: do you know what you are getting? J. Med. Syst. 26(6):511–518, 2002.
Ku, Y., Chiu, C., and Liou, B. H., “Applying text mining to assist people who inquire HIV/AIDS information from Internet”. In: Proceedings ISI 2008 Workshops. pp. 440–448, 2008.
Szulencki, P., “Number of pages on Internet according to Google”, available at: http://www.seoblogr.com/google/number-of-pages-on-internet-according-to-google (accessed 15 April 2011), 2008.
Antonio do Prado, H., and Ferneda, E., Emerging technologies of text mining: techniques and applications. Information Science Reference, U.S.A, 2008.
Berry, M. W., Survey of text mining: clustering, classification and retrieval. Springer, New York, 2004.
Song, M., and Wu, Y. F., Handbook of research on text and web mining technologies. Information Science Reference, U.S.A., 2009.
Han, J., and Kamber, M., Data mining: concept and techniques. Morgan Kaufmann, San Francisco, 2006.
Ting, S. L., Shum, C. C., Kwok, S. K., Tsang, A. H. C., and Lee, W. B., Data mining in biomedicine: current applications and further directions for research. J. Softw. Eng. Appl. 2(3):150–159, 2009.
Lu, W. H., Lin, R. S., Chan, Y. C., and Chen, K. H., Using Web resources to construct multilingual medical thesaurus for cross-language medical information retrieval. Decis. Support. Syst. 45(3):585–595, 2008.
Patil, S. B., and Kumaraswamy, Y. S., Intelligent and effective heart attack prediction system using data mining and artificial neural network. Eur. J. Sci. Res. 31(4):642–656, 2009.
Mostafa, J., and Lam, W., Automatic classification using supervised learning in a medical document filtering application. Inf. Process. Manag. 36(3):415–444, 2000.
Liu, Z., and Chu, W. W., Knowledge-based query expansion to support scenario-specific retrieval of medical free text. Inf. Retr. 10(2):173–202, 2007.
Elhadad, N., Kan, M. Y., Klavans, J. L., and McKeown, K. R., Customization in a unified framework for summarizing medical literature. Artif. Intell. Med. 33(2):179–198, 2005.
Lewis, D. D., “Naive Bayes at 40: the independence assumption in information retrieval”. In: Proceedings of the 10th European Conference on Machine Learning. pp. 4–15, 1998.
McCallum, A., and Nigam, K., “A comparison of event models for Naive Bayes text classification”. In: Proceedings of AAAI-98 Workshop Learning for Text Categorization. 1998.
Zhan, J. M., Loh, H. T., and Liu, Y., Gather customer concerns from online product reviews—a text summarization approach. Expert Syst. Appl. 36(2):2107–2115, 2009.
Lloret, E., Llorens, H., Moreda, P., Saquete, E., and Palomar, M., Text summarization contribution to semantic question answering: new approaches for finding answers on the web. Int. J. Intell. Syst. 26(12):1125–1152, 2011.
Fan, W., Wallace, L., Rich, S., and Zhang, Z., Tapping the power of text mining. Commun. ACM 49(9):76–82, 2006.
Cothey, V., Web-crawling reliability. J. Am. Soc. Inf. Sci. Technol. 55(14):1228–1238, 2004.
Olston, C., and Najork, M., Web crawling. Now Publishers Inc, Hanover, M.A., 2010.
Salton, G., Automatic text processing: the transformation, analysis, and retrieval of information by computer. Addison-Wesley, Reading, M.A., 1989.
Yang, Y., and Chute, C. G., An example-based mapping method for text categorization and retrieval. ACM Trans. Inf. Syst. 12(3):252–277, 1994.
Chakrabarti, S., Roy, S., and Soundalgekar, M. V., Fast and accurate text classification via multiple linear discriminant projection. Int. J. Very Large Data Bases 12(2):170–185, 2003.
Patwardhan, S., and Pedersen, T., Using WordNet-based context vectors to estimate the semantic relatedness of concepts. National Science Foundation Faculty, U.S.A., 2006.
Chen, J., Huang, H., Tian, S., and Qu, Y., Feature selection for text classification with Naïve Bayes. Expert Syst. Appl. 36(3):5432–5435, 2009.
Mladeni, D., and Grobelnik, M., Feature selection on hierarchy of web documents. Decis. Support. Syst. 35(1):45–87, 2003.
Shang, W., Huang, H., Zhu, H., Lin, Y., Qu, Y., and Wang, Z., A novel feature selection algorithm for text categorization. Expert Syst. Appl. 33(1):1–5, 2007.
Lan, M., Sung, S. Y., Low, H. B., and Tan, C. L., “A comparative study on term weighting schemes for text categorization”. In: Proceedings of International Joint Conference on Neural Networks (IJCNN-05). Vol. 1, pp. 546–551, 2005.
Aizawa, A., An information-theoretic perspective of tf-idf measures. Inf. Process. Manag. 39(1):45–65, 2003.
Radev, D. R., Jing, H. Y., and Tam, D., Centroid-based summarization of multiple documents. Inf. Process. Manag. 40(6):919–938, 2004.
Wu, H. C., Luk, Y. P., and Wong, K. F., Interpreting TF-IDF term weights as making relevance decisions. ACM Trans. Inf. Syst. 26(3):13–37, 2008.
Isa, D., Kallimani, V. P., and Lee, L. H., Using the self organizing map for clustering of text documents. Expert Syst. Appl. 36(5):9584–9591, 2009.
Lin, S. S., A document classification and retrieval system for R&D in semiconductor industry—a hybrid approach. Expert Syst. Appl. 36(3):4753–4764, 2009. Part 1.
Xhemali, D., Hinde, C. J., and Stone, R. G., Naive Bayes vs. decision trees vs. neural networks in the classification of training web pages. Int. J. Comput. Sci. Issues 4(1):16–23, 2009.
Ou, S., Khoo, S. G., and Goh, D. H., Automatic multidocument summarization of research abstracts: design and user evaluation. J. Am. Soc. Inf. Sci. Technol. 58(10):1419–1435, 2007.
Jacobsen, I., Booch, G., and Rumbaugh, J., The unified software development process. Addison Wesley, Boston, M.A., 1999.
Ting, J. S. L., Kwok, S. K., Tsang, A. H. C., Lee, W. B., and Yee, K. F., “Experiences sharing of implementing template-based electronic medical record system (TEMRS) in a Hong Kong medical organization”. J. Med. Syst. 35(6):1605–1615, 2011.
Acknowledgment
Acknowledgement is given to Dr. Peter Lo, Dr. Francis Liu, Dr. C.W. Lo and Miss Maggie Poon for their guidance on issues in clinical coding and medical knowledge in general. The authors would also like to express their sincere thanks to the Research Committee of the Hong Kong Polytechnic University for providing the financial support for this research work.
Conflict of interest statement
There are no potential conflicts of interest in this paper.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Ting, S.L., See-To, E.W.K. & Tse, Y.K. Web Information Retrieval for Health Professionals. J Med Syst 37, 9946 (2013). https://doi.org/10.1007/s10916-013-9946-3
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s10916-013-9946-3