Overview

Authors:

Laith Mohammad Qasim Abualigah

1. Universiti Sains Malaysia, Penang, Malaysia

Presents a new method for solving the text document clustering problem and demonstrates that it can outperform other comparable methods
Covers the main text clustering preprocessing steps and the metaheuristics needed to address text document clustering problems
Proposes methods that can be applied to a broad range of text documents (e.g. newsgroup documents appearing on newswires, Internet web pages, and hospital information), modern applications (technical reports and university data), and the biomedical sciences (large biomedical datasets)

Part of the book series: Studies in Computational Intelligence (SCI, volume 816)



About this book

This book puts forward a new method for solving the text document (TD) clustering problem, developed in two main stages. (i) A new feature selection method based on a particle swarm optimization algorithm with a novel weighting scheme is proposed, together with a detailed dimension reduction technique, to obtain a new low-dimensional subset of more informative features. This subset is then used to improve the performance of the text clustering (TC) algorithm and reduce its computation time; the k-means clustering algorithm is used to evaluate the effectiveness of the obtained subsets. (ii) Four krill herd algorithms (KHAs), namely the (a) basic KHA, (b) modified KHA, (c) hybrid KHA, and (d) multi-objective hybrid KHA, are proposed to solve the TC problem; each algorithm represents an incremental improvement on its predecessor. For the evaluation process, seven benchmark text datasets with different characteristics and complexities are used.
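As a rough illustration of the first stage (weight the terms, keep a smaller subset of informative features, then judge that subset with k-means), here is a minimal Python sketch. It is not the book's method: the particle swarm search and its weighting scheme are replaced by plain TF-IDF and a simple "keep the m highest-scoring terms" filter, and the function names `tfidf`, `select_features`, and `kmeans`, as well as the toy documents, are assumptions made for this example only.

```python
import math
from collections import Counter


def tfidf(docs):
    """Return (vocabulary, rows) where each row is an L2-normalised TF-IDF vector."""
    tokenised = [d.lower().split() for d in docs]
    vocab = sorted({t for doc in tokenised for t in doc})
    df = Counter(t for doc in tokenised for t in set(doc))  # document frequency
    n = len(docs)
    rows = []
    for doc in tokenised:
        tf = Counter(doc)
        row = [tf[t] * math.log(n / df[t]) for t in vocab]
        norm = math.sqrt(sum(x * x for x in row)) or 1.0
        rows.append([x / norm for x in row])
    return vocab, rows


def select_features(vocab, rows, m):
    """Stand-in filter (NOT the PSO-based selection): keep the m terms
    with the largest total TF-IDF weight across all documents."""
    scores = [sum(r[j] for r in rows) for j in range(len(vocab))]
    ranked = sorted(range(len(vocab)), key=lambda j: -scores[j])
    keep = sorted(ranked[:m])
    return [vocab[j] for j in keep], [[r[j] for j in keep] for r in rows]


def kmeans(rows, k, iters=20):
    """Plain k-means with squared Euclidean distance.
    Centroids start at the first k rows so the result is deterministic."""
    centroids = [list(r) for r in rows[:k]]
    labels = [0] * len(rows)
    for _ in range(iters):
        labels = [
            min(range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(r, centroids[c])))
            for r in rows
        ]
        for c in range(k):
            members = [r for r, lab in zip(rows, labels) if lab == c]
            if members:  # leave an empty cluster's centroid unchanged
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels


docs = [
    "krill swarm ocean krill",
    "stock market trading stock",
    "ocean krill swarm food",
    "market trading price stock",
]
vocab, rows = tfidf(docs)
selected_vocab, selected_rows = select_features(vocab, rows, m=6)
print(selected_vocab)            # ['krill', 'market', 'ocean', 'stock', 'swarm', 'trading']
print(kmeans(selected_rows, 2))  # [0, 1, 0, 1]
```

On this toy input the two rarest terms are dropped and the four documents still separate into the two topics, which is the kind of check the k-means evaluation step performs on a reduced feature subset.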

Text document (TD) clustering is a new trend in text mining in which the TDs are separated into several coherent clusters, where all documents in the same cluster are similar. The findings presented here confirm that the proposed methods and algorithms delivered the best results in comparison with other, similar methods to be found in the literature.



Table of contents (6 chapters)

All chapters are by Laith Mohammad Qasim Abualigah.

Front Matter: pages i-xxvii
Introduction: pages 1-9
Krill Herd Algorithm: pages 11-19
Literature Review: pages 21-60
Proposed Methodology: pages 61-103
Experimental Results: pages 105-162
Conclusion and Future Work: pages 163-165

Reviews

“The book is well written, with high-quality tables and graphs. Each chapter ends with a collection of references, including the most recent work in the area. The book should be very useful for scholars who want to study the general field of text document clustering. It is also a good reference for those who work in text document clustering and use genetic algorithms.” (Xiannong Meng, Computing Reviews, May 10, 2019)


Bibliographic Information

Book Title: Feature Selection and Enhanced Krill Herd Algorithm for Text Document Clustering
Authors: Laith Mohammad Qasim Abualigah
Series Title: Studies in Computational Intelligence
DOI: https://doi.org/10.1007/978-3-030-10674-4
Publisher: Springer Cham
eBook Packages: Intelligent Technologies and Robotics, Intelligent Technologies and Robotics (R0)
Copyright Information: Springer Nature Switzerland AG 2019
Hardcover ISBN: 978-3-030-10673-7 (published 03 January 2019)
eBook ISBN: 978-3-030-10674-4 (published 18 December 2018)
Series ISSN: 1860-949X
Series E-ISSN: 1860-9503
Edition Number: 1
Number of Pages: XXVII, 165
Number of Illustrations: 2 b/w illustrations, 21 illustrations in colour
Topics: Computational Intelligence, Artificial Intelligence

