Abstract
The experimental results show that the proposed simple weighting scheme helps retrieve a significant proportion of the relevant data after traversing only a small portion of a hierarchical peer-to-peer network in a depth-first manner. The results are supported by a real, large, highly heterogeneous test collection searched with very short, ambiguous queries. The observed efficiency and effectiveness suggest that the scheme could be implemented, for instance, in audio-video information retrieval systems, digital libraries, or personal archives.
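As a rough illustration of the traversal idea only, the following minimal sketch visits a hierarchical network depth-first, following the highest-weighted child first and stopping after a fixed number of visited nodes. It is not the weighting scheme evaluated in the paper: the `Node` class, the `weight` field (standing in for some query-dependent score), and the `budget` parameter are all assumptions made for this example.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """A peer or group in a hypothetical hierarchical network."""
    name: str
    weight: float  # assumed query-dependent score; how it is computed is not specified here
    docs: list = field(default_factory=list)
    children: list = field(default_factory=list)


def weighted_dfs(root, budget):
    """Visit at most `budget` nodes depth-first, best-weighted child first.

    Returns the names of the visited nodes and the documents collected from them.
    """
    visited, results = [], []
    stack = [root]
    while stack and len(visited) < budget:
        node = stack.pop()
        visited.append(node.name)
        results.extend(node.docs)
        # Push children in ascending weight order so the heaviest is popped next.
        stack.extend(sorted(node.children, key=lambda c: c.weight))
    return visited, results
```

With a small budget, only the branches that look most promising are explored, which is the kind of partial traversal the abstract refers to; the quality of the result depends entirely on how well the weights predict where relevant documents are.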
Copyright information
© 2007 Springer Berlin Heidelberg
About this paper
Cite this paper
Melucci, M., Poggiani, A. (2007). A Study of a Weighting Scheme for Information Retrieval in Hierarchical Peer-to-Peer Networks. In: Amati, G., Carpineto, C., Romano, G. (eds) Advances in Information Retrieval. ECIR 2007. Lecture Notes in Computer Science, vol 4425. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-71496-5_15
DOI: https://doi.org/10.1007/978-3-540-71496-5_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-71494-1
Online ISBN: 978-3-540-71496-5
eBook Packages: Computer Science, Computer Science (R0)