Abstract
This paper presents an approach to finding associations between Web documents using collocated word pairs. Given two Web documents connected by a hyperlink, we attempt to identify the contextual association between the two pages by examining, from a statistical point of view, the collocations of word pairs they contain. Our preliminary experimental results show that the approach can extract fairly coherent word pairs from which associations between hyperlinked Web documents can be derived.
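The general idea of comparing adjacent word pairs across two linked pages can be illustrated with a minimal sketch. The abstract does not specify the statistical measure or the preprocessing used, so the raw-frequency threshold (`min_count`), the simple tokenizer, and the function names `adjacent_pairs` and `shared_frequent_pairs` below are all assumptions made for illustration, not the authors' method.

```python
import re
from collections import Counter


def adjacent_pairs(text):
    """Count pairs of adjacent words in lower-cased text.

    Assumption: words are runs of ASCII letters, and sentence
    boundaries are ignored, so a pair may span two sentences.
    """
    words = re.findall(r"[a-z]+", text.lower())
    return Counter(zip(words, words[1:]))


def shared_frequent_pairs(doc_a, doc_b, min_count=2):
    """Return word pairs occurring at least min_count times in both texts.

    Assumption: raw frequency stands in for whatever collocation
    statistic the paper actually applies.
    """
    pairs_a = adjacent_pairs(doc_a)
    pairs_b = adjacent_pairs(doc_b)
    shared = {
        pair: (pairs_a[pair], pairs_b[pair])
        for pair in pairs_a.keys() & pairs_b.keys()
        if pairs_a[pair] >= min_count and pairs_b[pair] >= min_count
    }
    return dict(sorted(shared.items()))


# Hypothetical text of a source page and the page it links to.
source_page = (
    "Web mining uses link analysis. Link analysis ranks pages. "
    "Web mining is broad."
)
target_page = (
    "Link analysis is central to web mining. "
    "Web mining and link analysis overlap."
)

result = shared_frequent_pairs(source_page, target_page)
for pair, counts in result.items():
    print(pair, counts)
```

Pairs that are frequent in both pages ("link analysis" and "web mining" in this toy input) are the kind of evidence the abstract describes as indicating a contextual association between hyperlinked documents. A fuller treatment would likely replace the raw counts with a proper collocation statistic and filter out stop words.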
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Yong-Jin Tee, J., Soon, LK., Ranaivo-Malançon, B. (2012). Finding Web Document Associations Using Frequent Pairs of Adjacent Words. In: Lukose, D., Ahmad, A.R., Suliman, A. (eds) Knowledge Technology. KTW 2011. Communications in Computer and Information Science, vol 295. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32826-8_39
DOI: https://doi.org/10.1007/978-3-642-32826-8_39
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-32825-1
Online ISBN: 978-3-642-32826-8
eBook Packages: Computer Science, Computer Science (R0)