A Dynamic and Semantically-Aware Technique for Document Clustering in Biomedical Literature

A Dynamic and Semantically-Aware Technique for Document Clustering in Biomedical Literature

Min Song, Xiaohua Hu, Illhoi Yoo, Eric Koppel
ISBN13: 9781609605377|ISBN10: 1609605373|EISBN13: 9781609605384
DOI: 10.4018/978-1-60960-537-7.ch013
Cite Chapter Cite Chapter

MLA

Song, Min, et al. "A Dynamic and Semantically-Aware Technique for Document Clustering in Biomedical Literature." Integrations of Data Warehousing, Data Mining and Database Technologies: Innovative Approaches, edited by David Taniar and Li Chen, IGI Global, 2011, pp. 307-319. https://doi.org/10.4018/978-1-60960-537-7.ch013

APA

Song, M., Hu, X., Yoo, I., & Koppel, E. (2011). A Dynamic and Semantically-Aware Technique for Document Clustering in Biomedical Literature. In D. Taniar & L. Chen (Eds.), Integrations of Data Warehousing, Data Mining and Database Technologies: Innovative Approaches (pp. 307-319). IGI Global. https://doi.org/10.4018/978-1-60960-537-7.ch013

Chicago

Song, Min, et al. "A Dynamic and Semantically-Aware Technique for Document Clustering in Biomedical Literature." In Integrations of Data Warehousing, Data Mining and Database Technologies: Innovative Approaches, edited by David Taniar and Li Chen, 307-319. Hershey, PA: IGI Global, 2011. https://doi.org/10.4018/978-1-60960-537-7.ch013

Export Reference

Mendeley
Favorite

Abstract

As an unsupervised learning process, document clustering has been used to improve information retrieval performance by grouping similar documents and to help text mining approaches by providing a high-quality input for them. In this paper, the authors propose a novel hybrid clustering technique that incorporates semantic smoothing of document models into a neural network framework. Recently, it has been reported that the semantic smoothing model enhances the retrieval quality in Information Retrieval (IR). Inspired by that, the authors developed and applied a context-sensitive semantic smoothing model to boost accuracy of clustering that is generated by a dynamic growing cell structure algorithm, a variation of the neural network technique. They evaluated the proposed technique on biomedical article sets from MEDLINE, the largest biomedical digital library in the world. Their experimental evaluations show that the proposed algorithm significantly improves the clustering quality over the traditional clustering techniques including k-means and self-organizing map (SOM).

Request Access

You do not own this content. Please login to recommend this title to your institution's librarian or purchase it from the IGI Global bookstore.