research-article

Knowledge Discovery for research Documents using Improved K-means Technique

Authors:
Sachin Shinde

Department of Computer Engineering, Flora institute of Technology, Pune, Maharashtra, India, +91-9767162757

Department of Computer Engineering, Flora institute of Technology, Pune, Maharashtra, India, +91-9767162757
View Profile

,
Bharat Tidke

Department of Computer Engineering, Flora institute of Technology, Pune, Maharashtra, India, +91-7507258506

Department of Computer Engineering, Flora institute of Technology, Pune, Maharashtra, India, +91-7507258506
View Profile

ICCCT '15: Proceedings of the Sixth International Conference on Computer and Communication Technology 2015September 2015Pages 15–19https://doi.org/10.1145/2818567.2818570

Published:25 September 2015Publication History

ICCCT '15: Proceedings of the Sixth International Conference on Computer and Communication Technology 2015

Pages 15–19

ABSTRACT

Clustering focuses to organize a collection of data items into clusters, such that items within a cluster are more "similar" to each other than they are to items in the other clusters. The k-means method is one of the most widely used clustering techniques for various applications. Applications like Searching, Retrieving as well as Reading research Documents are more Time consuming because we need more time for searching or reading single papers or document, so it is required that use enhanced search engine which is based on fastest reading algorithm which provides best output or results. So we are proposed Enhanced architecture with improved k-means algorithm, which proposes a method for making the algorithm more effective and efficient, so as to get better clustering with reduced complexity. It will search the base keyword or string of the content from the knowledge database. Proposed work uses the search engine based on clustering and text mining.

References

A. M. Fahim, A. M. Salem, F. A. Torkey and M. A. Ramadan, "An Efficient enhanced k-means clustering algorithm", journal of Zhejiang University, 10(7): 16261633, 2006.Google Scholar
K. A. Abdul Nazeer and M. P. Sebastian, "Improving the accuracy and e_ciency of the k-means clustering algorithm", in International Conference on Data Mining and Knowledge Engineering (ICDMKE), Proceedings of the World Congress on Engineering (WCE-2009), Vol 1, London, UK, July 2009.Google Scholar
Chen Zhang and Shixiong Xia, "K-means Clustering Algorithm with Improved Initial center", in Second International Workshop on Knowledge Discovery and Data Mining (WKDD), pp. 790--792, 2009. Google ScholarDigital Library
F. Yuan, Z. H. Meng, H. X. Zhangz, C. R. Dong, "A New Algorithm to Get the Initial Centroids", proceedings of the 3rd International Conference on Machine Learning and Cybernetics, pp. 26--29, August 2004.Google Scholar
Koheri Arai and Ali Ridho Barakbah, "Hierarchical K-means: an algorithm for Centroids initialization for k-means",department of information science and Electrical Engineering Poli technique in Surabaya, Faculty of Science and Engineering, Saga University, Vol. 36, No.1, 2007.Google Scholar
A. Bhattacharya and R. K. De, "Divisive Correlation Clustering Algorithm (DCCA) for grouping of genes: detecting varying patterns in expression pro_les", bioinformatics, Vol. 24, pp. 1359--1366, 2008. Google ScholarDigital Library
Jieming Zhou, J. G. and X. Chen, "An Enhancement of K means Clustering Algorithm", in Business Intelligence & Financial Engineering, BIFE ^a09. International Conference on, Beijing,2009.Google Scholar
Chakrabarti, S., "Mining The Web: Discovering knowledge from hypertext data", Part 2. 2003. Google ScholarDigital Library
Jieming Zhou, J. G. and X. Chen, "An Enhancement of K means Clustering Algorithm", in Business Intelligence and Financial Engineering, BIFE 09. International Conference on, Beijing,2009.Google Scholar
Sachin Shinde, Bharat Tidke, "improved k-mean algorithm for searching research papers", IJCSCN, ISSN 2249-5789, vol 4(6), pp.197--202 dec 2014.Google Scholar
Aukasz Machnik, "Documents Clustering techniques", in Annales UMCS Informatica Lublin-Polonia Sectio AI, p 401 411,2004. International Conference on, Beijing,2009.Google Scholar
Marijus Bernotas, Kazys Karklius, Remigijus Laurutis, Asta Slotkien A, "The Peculiarities Of The Text Document Representation, Using Ontology And Tagging-Based Clustering Technique", 124x Information Technology And Control, Vol.36, No.2,2007.Google Scholar
Anand M. Baswade, Prakash S. Nalwade, "Selection of Initial Centroids for k-Means Algorithm", in International Journal of Computer Science and Mobile Computing, Vol. 2, Issue. 7, pg.161 ^a 164, July 2013,Google Scholar
E. AlanCalvillo, Alejandro Padilla, Jaime Munoz "Searching Research papers using clustering and text minig" IEEE 2013.Google Scholar
Vishwanath Bijalwan, Vinay Kumar, Pinki Kumari and Jordan Pascual, "KNN based Machine Learning Approach for Text and Document Mining", in International Journal of Database Theory and Application, Vol.7, No.1, pp.61--70,(2014).Google Scholar

Index Terms

Knowledge Discovery for research Documents using Improved K-means Technique

Recommendations

Improved k- means clustering algorithm for two dimensional data
CCSEIT '12: Proceedings of the Second International Conference on Computational Science, Engineering and Information Technology

Clustering is a procedure of organizing the objects in groups whose member exhibits some kind of similarity. So a cluster is a collection of objects which are alike and are different from the objects belonging to other clusters. K-Means is one of ...
Read More
Clustering stability-based Evolutionary K-Means

Evolutionary K-Means (EKM), which combines K-Means and genetic algorithm, solves K-Means' initiation problem by selecting parameters automatically through the evolution of partitions. Currently, EKM algorithms usually choose silhouette index as cluster ...
Read More
Enhancing the K-means Clustering Algorithm by Using a O(n logn) Heuristic Method for Finding Better Initial Centroids
EAIT '11: Proceedings of the 2011 Second International Conference on Emerging Applications of Information Technology

With the advent of modern techniques for scientific data collection, large quantities of data are getting accumulated at various databases. Systematic data analysis methods are necessary to extract useful information from rapidly growing data banks. ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in

ICCCT '15: Proceedings of the Sixth International Conference on Computer and Communication Technology 2015
September 2015
481 pages
ISBN:9781450335522
DOI:10.1145/2818567

Copyright © 2015 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 25 September 2015
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
Clustering
Enhanced k-means algorithm
Text mining
k-means algorithm
Qualifiers
- research-article
- Research
- Refereed limited
Conference

Acceptance Rates
Overall Acceptance Rate33of124submissions,27%
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 0
  Total Citations
  View Citations
- 181
  Total Downloads
- Downloads (Last 12 months)2
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
This publication has not been cited yet

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Knowledge Discovery for research Documents using Improved K-means Technique

ICCCT '15: Proceedings of the Sixth International Conference on Computer and Communication Technology 2015

ABSTRACT

References

Cited By

Index Terms

Recommendations

Improved k- means clustering algorithm for two dimensional data

Clustering stability-based Evolutionary K-Means

Enhancing the K-means Clustering Algorithm by Using a O(n logn) Heuristic Method for Finding Better Initial Centroids

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Knowledge Discovery for research Documents using Improved K-means Technique

ICCCT '15: Proceedings of the Sixth International Conference on Computer and Communication Technology 2015

ABSTRACT

References

Cited By

Index Terms

Recommendations

Improved k- means clustering algorithm for two dimensional data

Clustering stability-based Evolutionary K-Means

Enhancing the K-means Clustering Algorithm by Using a O(n logn) Heuristic Method for Finding Better Initial Centroids

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media