Abstract
In recent years, the data management community has shown significant interest in peer-to-peer (P2P) environments. However, almost all work so far has focused on exact query processing in P2P data systems, and the autonomy of peers has not been considered sufficiently. In addition, system cost is very high because shared data is published per document rather than per document set.
In this paper, abstract indices are presented to implement content-based approximate queries in P2P data systems. They can be used to search as few peers as possible while returning as many results satisfying users' queries as possible, and they preserve a high degree of peer autonomy. Abstract indices also have low system cost, improve query processing speed, and support very frequent updates.
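The abstract does not describe how abstract indices are constructed, so the following is only an illustrative sketch of the general idea of publishing one summary per document set and contacting only the best-matching peers. It assumes, purely for illustration, that each peer's summary is a bag-of-words term-frequency vector and that peers are ranked by cosine similarity; the names `summarize`, `cosine`, and `select_peers` and the sample data are hypothetical and are not taken from the paper.

```python
from collections import Counter
from math import sqrt


def summarize(documents):
    """Build one term-frequency summary for a peer's whole document set.

    Assumption: a plain bag-of-words summary stands in for whatever
    abstraction the paper actually uses.
    """
    summary = Counter()
    for text in documents:
        summary.update(text.lower().split())
    return summary


def cosine(a, b):
    """Cosine similarity between two sparse term-frequency vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def select_peers(query, peer_summaries, k):
    """Return the ids of the k peers whose summaries best match the query."""
    q = Counter(query.lower().split())
    ranked = sorted(
        peer_summaries,
        key=lambda peer: cosine(q, peer_summaries[peer]),
        reverse=True,
    )
    return ranked[:k]


# Hypothetical peers, each publishing a single summary of its document set.
peers = {
    "peer_a": summarize(["jazz piano trio", "jazz saxophone solo"]),
    "peer_b": summarize(["database query optimization",
                         "query processing in databases"]),
    "peer_c": summarize(["piano sonata", "string quartet"]),
}

# Only the single best-matching peer is contacted for this query.
chosen = select_peers("approximate query processing", peers, k=1)
```

In this sketch a peer republishes only its own summary when its documents change, which is one way a per-set summary could keep publishing cost low; whether the paper's indices work this way is not stated in the abstract.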
To verify the effectiveness of abstract indices, a simulator with 10,000 peers and over 3 million documents was built. The experimental results show that abstract indices work well in P2P data systems.
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wang, C., Li, J., Shi, S. (2004). An Approach to Content-Based Approximate Query Processing in Peer-to-Peer Data Systems. In: Li, M., Sun, XH., Deng, Qn., Ni, J. (eds) Grid and Cooperative Computing. GCC 2003. Lecture Notes in Computer Science, vol 3032. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24679-4_69
DOI: https://doi.org/10.1007/978-3-540-24679-4_69
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-21988-0
Online ISBN: 978-3-540-24679-4
eBook Packages: Springer Book Archive