Abstract
This paper proposes a method that quantifies the similarity between documents based on the level of relevance among terms in order to deliver a search that captures the meaning of documents. More specifically, this paper proposes a method that uses a concept-base to look for relevance among different terms and calculates the degree of association between documents using the Earth Mover’s Distance. When the proposed methods were subjected to comparison tests with other methods using the NTCIR3-WEB, they achieved good results.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Okumura, N., Yoshimura, E., Watabe, H., Kawaoka, T.: An Association Method Using Concept-Base. In: Apolloni, B., Howlett, R.J., Jain, L. (eds.) KES 2007, Part I. LNCS (LNAI), vol. 4692, pp. 604–611. Springer, Heidelberg (2007)
Rubner, Y., Tomasi, C., Guibas, L.: The earth mover’s distance as a metric for image retrieval. Int. J. Comput. Vision 40, 99–121 (2000)
Salton, G., Wong, A., Yang, C.S.: A vector space model for automatic indexing. Communications of the ACM 18(3), 613–620 (1975)
Robertson, S.E., Walker, S., Jones, S., Beaulieu, M., Gatford, M.: Okapi at TREC-3. In: Proceeding of the 3rd Text Retrieval Conference, pp. 109–126 (1995)
Miller, G.A.: “WordNet”: A lexical database for English”. Commun. ACM 38(11), 39–41 (1995)
Wan, X., Peng, Y.: The Earth Mover’s Distance as a Semantic Meature for Document Similality. In: Proc. of 14th ACM international conference on Information and knowledge management, pp. 301–302 (2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Watabe, H., Yoshimura, E., Tsuchiya, S. (2009). Degree of Association between Documents Using Association Mechanism. In: Velásquez, J.D., Ríos, S.A., Howlett, R.J., Jain, L.C. (eds) Knowledge-Based and Intelligent Information and Engineering Systems. KES 2009. Lecture Notes in Computer Science(), vol 5711. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04595-0_30
Download citation
DOI: https://doi.org/10.1007/978-3-642-04595-0_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04594-3
Online ISBN: 978-3-642-04595-0
eBook Packages: Computer ScienceComputer Science (R0)