Abstract
Peer-to-peer (P2P) systems, which offer distinct advantages such as scalability, robustness and low cost, have developed quickly in recent years. Resource search, that is, finding a peer that holds a desired resource, is a fundamental problem that most P2P applications must solve first. P2P systems can also be viewed as database systems, and query processing is one of the most important problems in P2P database systems. Even a rather small improvement to query processing algorithms can have a significant impact on the performance of a P2P database system. One of the most effective methods of reducing query processing cost is peer clustering: when peers with similar contents are clustered together, queries can be directed to the appropriate cluster and answered efficiently. In this paper, we present a new peer clustering algorithm for P2P database systems that is based on ant-based clustering, which we have adapted to the peer clustering problem. Performance evaluation shows that the algorithm clusters peers in P2P database systems with higher quality and accuracy than existing systems, has a relatively short running time, and converges quickly to an optimum. Moreover, our algorithm scales well as the problem size increases.
Notes
As explained in the rest of the paper, in ant clustering algorithms the acts of picking up and dropping an item are influenced by both stochastic and probabilistic factors.
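To make the probabilistic pick-up and drop behaviour concrete, here is a minimal sketch of the classic rule commonly attributed to Deneubourg et al. and Lumer and Faieta in the ant-clustering literature. It is not the adapted algorithm of this paper. The function names, the constants `k1`, `k2` and `alpha`, the neighbourhood size `cells`, and the use of Euclidean distance between feature vectors are all assumptions chosen for illustration.

```python
import math
import random


def similarity(item, neighbors, cells=9, alpha=1.0):
    """Local density f of items similar to `item` (assumed formulation).

    `neighbors` holds the feature vectors found in the ant's neighbourhood,
    `cells` is the number of grid cells in that neighbourhood, and `alpha`
    scales the dissimilarity. All three are illustrative assumptions.
    """
    total = sum(1.0 - math.dist(item, other) / alpha for other in neighbors)
    return max(0.0, total / cells)


def pick_probability(f, k1=0.1):
    """An unladen ant is likely to pick up an item in a sparse or dissimilar area."""
    return (k1 / (k1 + f)) ** 2


def drop_probability(f, k2=0.15):
    """A laden ant is likely to drop its item where similar items are dense."""
    return (f / (k2 + f)) ** 2


def decide(probability, rng=random):
    """Stochastic decision: act with the given probability."""
    return rng.random() < probability
```

With these definitions, an isolated item (`f = 0`) is always picked up and never dropped, while an item surrounded by similar items tends to stay where it is, which is what gradually forms clusters.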
Cite this article
Ebrahimi, M., Rouhani Rankoohi, S.M.T. An ant-based approach to cluster peers in P2P database systems. Knowl Inf Syst 43, 219–247 (2015). https://doi.org/10.1007/s10115-014-0743-1
DOI: https://doi.org/10.1007/s10115-014-0743-1