ABSTRACT
Clustering focuses to organize a collection of data items into clusters, such that items within a cluster are more "similar" to each other than they are to items in the other clusters. The k-means method is one of the most widely used clustering techniques for various applications. Applications like Searching, Retrieving as well as Reading research Documents are more Time consuming because we need more time for searching or reading single papers or document, so it is required that use enhanced search engine which is based on fastest reading algorithm which provides best output or results. So we are proposed Enhanced architecture with improved k-means algorithm, which proposes a method for making the algorithm more effective and efficient, so as to get better clustering with reduced complexity. It will search the base keyword or string of the content from the knowledge database. Proposed work uses the search engine based on clustering and text mining.
- A. M. Fahim, A. M. Salem, F. A. Torkey and M. A. Ramadan, "An Efficient enhanced k-means clustering algorithm", journal of Zhejiang University, 10(7): 16261633, 2006.Google Scholar
- K. A. Abdul Nazeer and M. P. Sebastian, "Improving the accuracy and e_ciency of the k-means clustering algorithm", in International Conference on Data Mining and Knowledge Engineering (ICDMKE), Proceedings of the World Congress on Engineering (WCE-2009), Vol 1, London, UK, July 2009.Google Scholar
- Chen Zhang and Shixiong Xia, "K-means Clustering Algorithm with Improved Initial center", in Second International Workshop on Knowledge Discovery and Data Mining (WKDD), pp. 790--792, 2009. Google ScholarDigital Library
- F. Yuan, Z. H. Meng, H. X. Zhangz, C. R. Dong, "A New Algorithm to Get the Initial Centroids", proceedings of the 3rd International Conference on Machine Learning and Cybernetics, pp. 26--29, August 2004.Google Scholar
- Koheri Arai and Ali Ridho Barakbah, "Hierarchical K-means: an algorithm for Centroids initialization for k-means",department of information science and Electrical Engineering Poli technique in Surabaya, Faculty of Science and Engineering, Saga University, Vol. 36, No.1, 2007.Google Scholar
- A. Bhattacharya and R. K. De, "Divisive Correlation Clustering Algorithm (DCCA) for grouping of genes: detecting varying patterns in expression pro_les", bioinformatics, Vol. 24, pp. 1359--1366, 2008. Google ScholarDigital Library
- Jieming Zhou, J. G. and X. Chen, "An Enhancement of K means Clustering Algorithm", in Business Intelligence & Financial Engineering, BIFE ^a09. International Conference on, Beijing,2009.Google Scholar
- Chakrabarti, S., "Mining The Web: Discovering knowledge from hypertext data", Part 2. 2003. Google ScholarDigital Library
- Jieming Zhou, J. G. and X. Chen, "An Enhancement of K means Clustering Algorithm", in Business Intelligence and Financial Engineering, BIFE 09. International Conference on, Beijing,2009.Google Scholar
- Sachin Shinde, Bharat Tidke, "improved k-mean algorithm for searching research papers", IJCSCN, ISSN 2249-5789, vol 4(6), pp.197--202 dec 2014.Google Scholar
- Aukasz Machnik, "Documents Clustering techniques", in Annales UMCS Informatica Lublin-Polonia Sectio AI, p 401 411,2004. International Conference on, Beijing,2009.Google Scholar
- Marijus Bernotas, Kazys Karklius, Remigijus Laurutis, Asta Slotkien A, "The Peculiarities Of The Text Document Representation, Using Ontology And Tagging-Based Clustering Technique", 124x Information Technology And Control, Vol.36, No.2,2007.Google Scholar
- Anand M. Baswade, Prakash S. Nalwade, "Selection of Initial Centroids for k-Means Algorithm", in International Journal of Computer Science and Mobile Computing, Vol. 2, Issue. 7, pg.161 ^a 164, July 2013,Google Scholar
- E. AlanCalvillo, Alejandro Padilla, Jaime Munoz "Searching Research papers using clustering and text minig" IEEE 2013.Google Scholar
- Vishwanath Bijalwan, Vinay Kumar, Pinki Kumari and Jordan Pascual, "KNN based Machine Learning Approach for Text and Document Mining", in International Journal of Database Theory and Application, Vol.7, No.1, pp.61--70,(2014).Google Scholar
Index Terms
- Knowledge Discovery for research Documents using Improved K-means Technique
Recommendations
Improved k- means clustering algorithm for two dimensional data
CCSEIT '12: Proceedings of the Second International Conference on Computational Science, Engineering and Information TechnologyClustering is a procedure of organizing the objects in groups whose member exhibits some kind of similarity. So a cluster is a collection of objects which are alike and are different from the objects belonging to other clusters. K-Means is one of ...
Clustering stability-based Evolutionary K-Means
Evolutionary K-Means (EKM), which combines K-Means and genetic algorithm, solves K-Means' initiation problem by selecting parameters automatically through the evolution of partitions. Currently, EKM algorithms usually choose silhouette index as cluster ...
Enhancing the K-means Clustering Algorithm by Using a O(n logn) Heuristic Method for Finding Better Initial Centroids
EAIT '11: Proceedings of the 2011 Second International Conference on Emerging Applications of Information TechnologyWith the advent of modern techniques for scientific data collection, large quantities of data are getting accumulated at various databases. Systematic data analysis methods are necessary to extract useful information from rapidly growing data banks. ...
Comments