Abstract
In this paper, we propose a Focused Crawler for Cloud service Discovery (FC4DC). Its service oriented architecture ensures an easier modification for rapid updating and better performance. Furthermore, the proposed crawler guarantees saving the search time and a better exploitation of the provider offerings thanks to a dedicated Cloud service description ontology. We finally present some experiments to evaluate the proposed crawler and demonstrate its effectiveness and efficiency.


















Similar content being viewed by others
Explore related subjects
Discover the latest articles and news from researchers in related subjects, suggested using machine learning.Notes
References
Afify YM, Moawad IF, Badr NL, Tolba MF (2014) Concept recommendation system for cloud services advertisement. Springer, Cham, pp 57–66. https://doi.org/10.1007/978-3-319-13461-1_7
Buyya R, Yeo C, Venugopal S, Broberg J, Brandic I (2009) Cloud computing and emerging it platforms: vision, hype, and reality for delivering computing as the 5th utility. Future Gener Comput Syst 25(6):599–616. https://doi.org/10.1016/j.future.2008.12.001
Chakrabarti S, van den Berg M, Dom B (1999) Focused crawling: a new approach to topic-specific web resource discovery. Comput Netw 31(1116):1623–1640. https://doi.org/10.1016/S1389-1286(99)00052-3
Cormen TH, Leiserson CE, Rivest RL, Stein C (2009) Introduction to algorithms, 3rd edn. The MIT Press, Cambridge
Dastjerdi AV, Tabatabaei SGH, Buyya R (2010) An effective architecture for automated appliance management system applying ontology-based cloud discovery. In: 2010 10th IEEE/ACM international conference on cluster, cloud and grid computing, pp 104–112. https://doi.org/10.1109/CCGRID.2010.87
Doan A, Domingos P, Halevy A (2001) Reconciling schemas of disparate data sources: a machine-learning approach. SIGMOD Rec 30(2):509–520. https://doi.org/10.1145/376284.375731
Elgazzar K, Hassanein H, Martin P (2014) Daas: cloud-based mobile web service discovery. Pervasive Mob Comput 13:67–84. https://doi.org/10.1016/j.pmcj.2013.10.015
Elyasir AMH, Anbananthen KSM (2012) Focused web crawler. In: International conference on information and knowledge management, ICIKM 2012
Embley F, Tao C, Liddle S (2005) Automating the extraction of data from html tables with unknown structure. Data Knowl Eng 54(1):3–28. https://doi.org/10.1016/j.datak.2004.10.004
Gruber T (1995) Toward principles for the design of ontologies used for knowledge sharing. Int J Hum Comput Stud 43(5–6):907–928. https://doi.org/10.1006/ijhc.1995.1081
Hofer C, Karagiannis G (2011) Cloud computing services: taxonomy and comparison. J Internet Serv Appl 2(2):81–94
Javed B, Bloodsworth P, Rasool R, Munir K, Rana O (2016) Cloud market maker: an automated dynamic pricing marketplace for cloud users. Future Gener Comput Syst 54:52–67. https://doi.org/10.1016/j.future.2015.06.004
Komoda N (2006) Service oriented architecture (SOA) in industrial systems. In: IEEE international conference on industrial informatics, pp 1–5. https://doi.org/10.1109/INDIN.2006.275708
Lopresti D, Nagy G (2000) A tabular survey of automated table processing. In: Selected papers from the third international workshop on graphics recognition, recent advances. Springer, London, UK, UK, GREC’99, pp 93–120. URL http://dl.acm.org/citation.cfm?id=645438.652758
Macaas M, Guitart J (2016) Analysis of a trust model for SLA negotiation and enforcement in cloud markets. Future Gener Comput Syst 55:460–472. https://doi.org/10.1016/j.future.2015.03.011
Miller R, Haas L, Hernandez M (2000) Schema mapping as query discovery. In: Proceedings of the 26th international conference on very large data bases. Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, VLDB’00, pp 77–88. URL http://dl.acm.org/citation.cfm?id=645926.671677
Moscato F, Aversa R, Martino BD, Fortis T, Munteanu V (2011) An analysis of mosaic ontology for cloud resources annotation. In: Federated conference on computer science and information systems, FedCSIS, pp 973–980
Narasimhan B, Nichols R (2011) State of cloud applications and platforms: the cloud adopters’ view. Computer 44(3):24–28. https://doi.org/10.1109/MC.2011.66
Noor T, Sheng Q, Alfazi A, Ngu A, Law J (2013) CSCE: a crawler engine for cloud services discovery on the world wide web. In: ICWS, IEEE computer society, pp 443–450. URL http://dblp.uni-trier.de/db/conf/icws/icws2013.html#NoorSANL13
Olston C, Najork M (2010) Web crawling. Found Trends Inf Retr 4(3):175–246
Parhi M, Pattanayak B, Patra M (2015) A multi-agent-based framework for cloud service description and discovery using ontology. In: Jain LC, Patnaik S, Ichalkaranje N (eds) Intelligent computing, communication and devices, advances in intelligent systems and computing, vol 308. Springer India, New Delhi, pp 337–348. https://doi.org/10.1007/978-81-322-2012-1_35
Skiena S (2008) The algorithm design manual, 2nd edn. Springer, London
Sun C, He L, Wang Q, Willenborg R (2008) Simplifying service deployment with virtual appliances. In: IEEE international conference on services computing, SCC’08, vol 2, pp 265–272. https://doi.org/10.1109/SCC.2008.53
Sun L, Dong H, Hussain F, Hussain OK, Chang E (2014) Cloud service selection: state-of-the-art and future research directions. J Netw Comput Appl 45:134–150. https://doi.org/10.1016/j.jnca.2014.07.019
Wang W, Chen X, Zou Y, Wang H, Dai Z (2010) A focused crawler based on naive Bayes classifier. In: Third international symposium on intelligent information technology and security informatics, IITSI, pp 517–521. https://doi.org/10.1109/IITSI.2010.30
Wei Y, Blake M (2010) Service-oriented computing and cloud computing: challenges and opportunities. IEEE Internet Comput 14(6):72–75. https://doi.org/10.1109/MIC.2010.147
Wheal J, Yang Y (2015) Csrecommender: a cloud service searching and recommendation system. J Comput Commun 3(6):65–73. https://doi.org/10.4236/jcc.2015.36007
Zhang M, Ranjan R, Haller A, Georgakopoulos D, Menzel M, Nepal S (2012) An ontology based system for cloud infrastructure services discovery. CoRR abs/1212.0156. arXiv:1212.0156
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Boukadi, K., Rekik, M., Rekik, M. et al. FC4CD: a new SOA-based Focused Crawler for Cloud service Discovery. Computing 100, 1081–1107 (2018). https://doi.org/10.1007/s00607-018-0600-2
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00607-018-0600-2