Abstract
Secondary data processing refines crawled information further by re-crawling it and categorizing it on the basis of structured data. It is a key research module of vertical search engines. This paper proposes an improved KNN algorithm for the categorization step. The algorithm improves the responsiveness and accuracy of vertical search by reducing time complexity and speeding up classification. Experiments showed that the improved algorithm is more feasible and robust when used for secondary data processing and word segmentation in vertical search engines.
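For orientation, the sketch below shows a plain, unimproved KNN text classifier of the kind the abstract takes as its starting point. It is not the improved algorithm proposed in the paper, whose details the abstract does not give. The cosine similarity over term-frequency vectors, the whitespace tokenizer, the function names `cosine` and `knn_classify`, and the toy `TRAINING` data are all assumptions made for illustration.

```python
import math
from collections import Counter


def cosine(a, b):
    """Cosine similarity between two sparse term-frequency vectors."""
    dot = sum(count * b[term] for term, count in a.items() if term in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def knn_classify(query, training, k=3):
    """Label `query` by majority vote among its k most similar training texts.

    `training` is a list of (text, label) pairs. Every training text is
    compared with the query, so the cost per query grows linearly with
    the size of the training set.
    """
    q = Counter(query.lower().split())
    scored = [
        (cosine(q, Counter(text.lower().split())), label)
        for text, label in training
    ]
    # Most similar first; Python's sort is stable, so ties keep input order.
    scored.sort(key=lambda pair: pair[0], reverse=True)
    votes = Counter(label for _, label in scored[:k])
    return votes.most_common(1)[0][0]


# Hypothetical toy data, not taken from the paper.
TRAINING = [
    ("cheap flights to paris", "travel"),
    ("hotel booking in rome", "travel"),
    ("train tickets to berlin", "travel"),
    ("laptop with fast processor", "electronics"),
    ("smartphone camera review", "electronics"),
    ("wireless headphones battery life", "electronics"),
]

if __name__ == "__main__":
    print(knn_classify("cheap hotel in paris", TRAINING, k=3))
```

Because this baseline compares each query with every training document, its per-query time is the kind of cost that a faster KNN variant would aim to reduce; how the paper achieves that is not stated in the abstract.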
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Jia, Y., Fan, H., Xia, G., Dong, X. (2011). An Improved KNN Algorithm for Vertical Search Engines. In: Gong, Z., Luo, X., Chen, J., Lei, J., Wang, F.L. (eds) Web Information Systems and Mining. WISM 2011. Lecture Notes in Computer Science, vol 6988. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23982-3_28
DOI: https://doi.org/10.1007/978-3-642-23982-3_28
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23981-6
Online ISBN: 978-3-642-23982-3
eBook Packages: Computer Science, Computer Science (R0)