Skip to main content

Advertisement

Log in

FC4CD: a new SOA-based Focused Crawler for Cloud service Discovery

  • Published:
Computing Aims and scope Submit manuscript

Abstract

In this paper, we propose a Focused Crawler for Cloud service Discovery (FC4DC). Its service oriented architecture ensures an easier modification for rapid updating and better performance. Furthermore, the proposed crawler guarantees saving the search time and a better exploitation of the provider offerings thanks to a dedicated Cloud service description ontology. We finally present some experiments to evaluate the proposed crawler and demonstrate its effectiveness and efficiency.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18

Similar content being viewed by others

Explore related subjects

Discover the latest articles and news from researchers in related subjects, suggested using machine learning.

Notes

  1. http://cloudharmony.com/.

  2. http://www.cloudorado.com/.

  3. http://www.nist.gov/.

  4. http://projects.spring.io/spring-framework/.

  5. http://pr.efactory.de/e-pagerank-algorithm.shtml.

  6. https://eclipse.org/ide/.

  7. http://protege.stanford.edu/.

  8. https://github.com/yasserg/crawler4j.

  9. http://jsoup.org/.

  10. https://jena.apache.org/.

References

  1. Afify YM, Moawad IF, Badr NL, Tolba MF (2014) Concept recommendation system for cloud services advertisement. Springer, Cham, pp 57–66. https://doi.org/10.1007/978-3-319-13461-1_7

    Google Scholar 

  2. Buyya R, Yeo C, Venugopal S, Broberg J, Brandic I (2009) Cloud computing and emerging it platforms: vision, hype, and reality for delivering computing as the 5th utility. Future Gener Comput Syst 25(6):599–616. https://doi.org/10.1016/j.future.2008.12.001

    Article  Google Scholar 

  3. Chakrabarti S, van den Berg M, Dom B (1999) Focused crawling: a new approach to topic-specific web resource discovery. Comput Netw 31(1116):1623–1640. https://doi.org/10.1016/S1389-1286(99)00052-3

    Article  Google Scholar 

  4. Cormen TH, Leiserson CE, Rivest RL, Stein C (2009) Introduction to algorithms, 3rd edn. The MIT Press, Cambridge

    MATH  Google Scholar 

  5. Dastjerdi AV, Tabatabaei SGH, Buyya R (2010) An effective architecture for automated appliance management system applying ontology-based cloud discovery. In: 2010 10th IEEE/ACM international conference on cluster, cloud and grid computing, pp 104–112. https://doi.org/10.1109/CCGRID.2010.87

  6. Doan A, Domingos P, Halevy A (2001) Reconciling schemas of disparate data sources: a machine-learning approach. SIGMOD Rec 30(2):509–520. https://doi.org/10.1145/376284.375731

    Article  Google Scholar 

  7. Elgazzar K, Hassanein H, Martin P (2014) Daas: cloud-based mobile web service discovery. Pervasive Mob Comput 13:67–84. https://doi.org/10.1016/j.pmcj.2013.10.015

    Article  Google Scholar 

  8. Elyasir AMH, Anbananthen KSM (2012) Focused web crawler. In: International conference on information and knowledge management, ICIKM 2012

  9. Embley F, Tao C, Liddle S (2005) Automating the extraction of data from html tables with unknown structure. Data Knowl Eng 54(1):3–28. https://doi.org/10.1016/j.datak.2004.10.004

    Article  Google Scholar 

  10. Gruber T (1995) Toward principles for the design of ontologies used for knowledge sharing. Int J Hum Comput Stud 43(5–6):907–928. https://doi.org/10.1006/ijhc.1995.1081

    Article  Google Scholar 

  11. Hofer C, Karagiannis G (2011) Cloud computing services: taxonomy and comparison. J Internet Serv Appl 2(2):81–94

    Article  Google Scholar 

  12. Javed B, Bloodsworth P, Rasool R, Munir K, Rana O (2016) Cloud market maker: an automated dynamic pricing marketplace for cloud users. Future Gener Comput Syst 54:52–67. https://doi.org/10.1016/j.future.2015.06.004

    Article  Google Scholar 

  13. Komoda N (2006) Service oriented architecture (SOA) in industrial systems. In: IEEE international conference on industrial informatics, pp 1–5. https://doi.org/10.1109/INDIN.2006.275708

  14. Lopresti D, Nagy G (2000) A tabular survey of automated table processing. In: Selected papers from the third international workshop on graphics recognition, recent advances. Springer, London, UK, UK, GREC’99, pp 93–120. URL http://dl.acm.org/citation.cfm?id=645438.652758

  15. Macaas M, Guitart J (2016) Analysis of a trust model for SLA negotiation and enforcement in cloud markets. Future Gener Comput Syst 55:460–472. https://doi.org/10.1016/j.future.2015.03.011

    Article  Google Scholar 

  16. Miller R, Haas L, Hernandez M (2000) Schema mapping as query discovery. In: Proceedings of the 26th international conference on very large data bases. Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, VLDB’00, pp 77–88. URL http://dl.acm.org/citation.cfm?id=645926.671677

  17. Moscato F, Aversa R, Martino BD, Fortis T, Munteanu V (2011) An analysis of mosaic ontology for cloud resources annotation. In: Federated conference on computer science and information systems, FedCSIS, pp 973–980

  18. Narasimhan B, Nichols R (2011) State of cloud applications and platforms: the cloud adopters’ view. Computer 44(3):24–28. https://doi.org/10.1109/MC.2011.66

    Article  Google Scholar 

  19. Noor T, Sheng Q, Alfazi A, Ngu A, Law J (2013) CSCE: a crawler engine for cloud services discovery on the world wide web. In: ICWS, IEEE computer society, pp 443–450. URL http://dblp.uni-trier.de/db/conf/icws/icws2013.html#NoorSANL13

  20. Olston C, Najork M (2010) Web crawling. Found Trends Inf Retr 4(3):175–246

    Article  MATH  Google Scholar 

  21. Parhi M, Pattanayak B, Patra M (2015) A multi-agent-based framework for cloud service description and discovery using ontology. In: Jain LC, Patnaik S, Ichalkaranje N (eds) Intelligent computing, communication and devices, advances in intelligent systems and computing, vol 308. Springer India, New Delhi, pp 337–348. https://doi.org/10.1007/978-81-322-2012-1_35

    Google Scholar 

  22. Skiena S (2008) The algorithm design manual, 2nd edn. Springer, London

    Book  MATH  Google Scholar 

  23. Sun C, He L, Wang Q, Willenborg R (2008) Simplifying service deployment with virtual appliances. In: IEEE international conference on services computing, SCC’08, vol 2, pp 265–272. https://doi.org/10.1109/SCC.2008.53

  24. Sun L, Dong H, Hussain F, Hussain OK, Chang E (2014) Cloud service selection: state-of-the-art and future research directions. J Netw Comput Appl 45:134–150. https://doi.org/10.1016/j.jnca.2014.07.019

    Article  Google Scholar 

  25. Wang W, Chen X, Zou Y, Wang H, Dai Z (2010) A focused crawler based on naive Bayes classifier. In: Third international symposium on intelligent information technology and security informatics, IITSI, pp 517–521. https://doi.org/10.1109/IITSI.2010.30

  26. Wei Y, Blake M (2010) Service-oriented computing and cloud computing: challenges and opportunities. IEEE Internet Comput 14(6):72–75. https://doi.org/10.1109/MIC.2010.147

    Article  Google Scholar 

  27. Wheal J, Yang Y (2015) Csrecommender: a cloud service searching and recommendation system. J Comput Commun 3(6):65–73. https://doi.org/10.4236/jcc.2015.36007

    Article  Google Scholar 

  28. Zhang M, Ranjan R, Haller A, Georgakopoulos D, Menzel M, Nepal S (2012) An ontology based system for cloud infrastructure services discovery. CoRR abs/1212.0156. arXiv:1212.0156

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mouna Rekik.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Boukadi, K., Rekik, M., Rekik, M. et al. FC4CD: a new SOA-based Focused Crawler for Cloud service Discovery. Computing 100, 1081–1107 (2018). https://doi.org/10.1007/s00607-018-0600-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00607-018-0600-2

Keywords

Mathematics Subject Classification