Skip to main content
Log in

Clustering Web services to facilitate service discovery

  • Regular Paper
  • Published:
Knowledge and Information Systems Aims and scope Submit manuscript

Abstract

Clustering Web services would greatly boost the ability of Web service search engine to retrieve relevant services. The performance of traditional Web service description language (WSDL)-based Web service clustering is not satisfied, due to the singleness of data source. Recently, Web service search engines such as Seekda! allow users to manually annotate Web services using tags, which describe functions of Web services or provide additional contextual and semantical information. In this paper, we cluster Web services by utilizing both WSDL documents and tags. To handle the clustering performance limitation caused by uneven tag distribution and noisy tags, we propose a hybrid Web service tag recommendation strategy, named WSTRec, which employs tag co-occurrence, tag mining, and semantic relevance measurement for tag recommendation. Extensive experiments are conducted based on our real-world dataset, which consists of 15,968 Web services. The experimental results demonstrate the effectiveness of our proposed service clustering and tag recommendation strategies. Specifically, compared with traditional WSDL-based Web service clustering approaches, the proposed approach produces gains in both precision and recall for up to 14 % in most cases.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

Notes

  1. http://webservices.seekda.com.

  2. http://www.premis.cz/PremisWS/MeteorologyWS.asmx?WSDL.

  3. http://rpc.test.sunnycars.com/CarRentalAgentService121/CarRentalAgentService.asmx?wsdl.

  4. http://www.w3.org/Submission/OWL-S/.

  5. http://www.w3.org/Submission/WSMO/.

  6. http://db.cs.washington.edu/woogle.html.

  7. http://ws.cdyne.com/WeatherWS/Weather.asmx?wsdl.

  8. http://gordan.gorodok.net/sharpcast.wsdl.

  9. STag Dataset 1.0, collected by Liang Chen (cliang@zju.edu.cn) and Johnny Jian (johnnyjian@gmail.com).

  10. http://ccnt.zju.edu.cn:8080.

References

  1. Al-Masri E, Mahmoud QH (2008) Investigating web services on the world wide web. In: Proceedings of international World Wide Web conference, pp 795–804

  2. Nayak, Richi (2008) Data mining in web service discovery and monitoring. Int J Web Serv Res 5(1):62–80

    Google Scholar 

  3. Lim SY, Song MH, Lee SJ (2004) The construction of domain ontology and its application to document retrieval. Lect Notes Comput Sci 3261:117–127

    Article  Google Scholar 

  4. Elgazzar K, Hassan AE, Martin P (2009) Clustering wsdl documents to bootstrap the discovery of web services. In: Proceedings of international conference on Web services, pp 147–154

  5. Liu W, Wong W (2009) Web service clustering using text mining techniques. Int J Agent-Oriented Softw Eng 3(1):6–26

    Article  Google Scholar 

  6. Lipczak M, Hu Y, Kollet Y, Milios E (2009) Tag sources for recommendation in collaborative tagging systems. ECML PKDD Discov Chall 497:157–172

    Google Scholar 

  7. Agrawal R, Srikant R (1994) Fast algorithms for mining association rules. In: Proceedings of international conference on very large data bases, pp 487–499

  8. Wu Z, Deng S, Li Y, Wu J (2009) Computing compatibility in dynamic service composition. Int J Knowl Inf Syst 19(1):107–129

    Article  MathSciNet  Google Scholar 

  9. Hu C, Zhu Y, Huai J, Liu Y, Ni LM (2007) S-club: an overlay-based efficient service discovery mechanism in crown grid. Int J Knowl Inf Syst 12(1):55–75

    Article  Google Scholar 

  10. Liang QA, Chung J-Y, Miller S (2007) Modeling semantics in composite web service requests by utility elicitation. Int J Knowl Inf Syst 13(3):367–394

    Article  Google Scholar 

  11. Wu J, Chen L, Xie Y, Zheng Z (2012) Titan: a system for effective web service discovery. In: Proceedings of international conference companion on World Wide Web, pp 441–444

  12. Agarwal S, Studer R (2006) Automatic matchmaking of web services. In: Proceedings of international conference on Web services, pp 45–54

  13. Klusch M, Fries B, Sycara K (2006) Automated semantic web service discovery with owls-mx. In: Proceedings of international conference on autonomous agents and multiagent systems, pp 915–922

  14. Benatallah B, Hacid M, Leger A, Rey C, Toumani F (2005) On automating web services discovery. Int J Very Large Data Bases 14(1):84–96

    Google Scholar 

  15. Zhang Y, Zheng Z, Lyu MR (2010) Wsexpress: a qos-aware search engine for web services. In: Proceedings of international conference on Web services, pp 91–98

  16. Dong X, Halevy A, Madhavan J, Nemes E, Zhang J (2004) Similarity search for web services. In: Proceedings of international conference on very large data bases, pp 372–383

  17. Hu S, Muthusamy V, Li G, Jacobsen HA (2008) Distributed automatic service composition in large-scale systems. In: Proceedings of distributed event-based systems conference, pp 233–244

  18. Liu F, Shi Y, Yu J, Wang T, Wu J (2010) Measuring similarity of web services based on wsdl. In: Proceedings of international conference on Web services, pp 155–162

  19. Ran S (2003) A model for web services discovery with qos. ACM Sigecom Exch 4(1):1–10

    Article  Google Scholar 

  20. Lara R, Corella MA, Castells P (2006) A flexible model for web service discovery. In: Proceedings of international conference on very large data bases

  21. Zheng Z, Ma H, Lyu MR, King I (2009) Wsrec: a collaborative filtering based web service recommender system. In: Proceedings of international conference on Web services, pp 437–444

  22. Zheng Z, Ma H, Lyu MR, King I (2011) Qos-aware web service recommendation by collaborative filtering. IEEE Trans Serv Comput 4(2):140–152

    Article  Google Scholar 

  23. Wu J, Chen L, Feng Y, Zheng Z, Zhou M, Wu Z (2013) Predicting quality of service for selection by neighborhood-based collaborative filtering. IEEE Trans Syst Man Cybern Part A 43(2):428–439

    Google Scholar 

  24. Hao Y, Junliang C, Xiangwu M, Bingyu Q (2007) Dynamically traveling web service clustering based on spatial and temporal aspects. Lect Notes Comput Sci 4802:348–357

    Article  Google Scholar 

  25. Platzer C, Rosenberg F, Dustdar S (2009) Web service clustering using multidimensional angles as proximity measures. ACM Trans Intern Technol 9(3):1–26

    Article  Google Scholar 

  26. Bianchini D, Antonellis VD, Pernici B, Plebani P (2006) Ontology-based methodology for e-service discovery. ACM J Inf Syst 31(4):361–380

    Article  Google Scholar 

  27. Sun P, Jiang C (2008) Using service clustering to facilitate process-oriented semantic web service discovery. Chin J Comput 31(8):1340–1353

    MathSciNet  Google Scholar 

  28. Pop CB, Chifu VR, Salomie I, Dinsoreanu M, David T, Acretoaie V (2010) Semantic web service clustering for efficient discovery using an ant-based method. Stud Comput Intell 315:23–33

    Article  Google Scholar 

  29. Dasgupta S, Bhat S, Lee Y (2010) Taxonomic clustering of web service for efficient discovery. In: Proceedings of international conference on information and, knowledge management, pp 1617–1620

  30. WeiLiu, Wong W (2008) Discovering homogeneous service communities through web service clustering. Serv Oriented Comput Agents Semant Eng 5006:69–82

  31. Chen L, Hu L, Wu J, Zheng Z, Ying J, Li Y, Deng S (2011) Wtcluster: utilizing tags for web service clustering. In: Proceedings of international conference on service oriented, computing, pp 204–218

  32. Porter MF (1980) An algorithm for suffix stripping. Program 14(3):130–137

    Article  Google Scholar 

  33. Church K, Gale WA (1995) Inverse document frequency (idf): a measure of deviations from poisson. In: Proceedings of the ACL 3rd workshop on very large corpora, pp 121–130

  34. Cilibrasi RL, Vitnyi PMB (2007) The google similarity distance. IEEE Trans Knowl Data Eng 19(3):370–383

    Article  Google Scholar 

  35. Jain A, Dubes R (1988) Algorithms for clustering data. Prentice Hall, New Jersey

    MATH  Google Scholar 

  36. MacQueen JB (1967) Some methods for classification and analysis of multivariate observations. In: Proceedings of the fifth symposium on math, statistics, and probability, pp 281–297

  37. Manning CD, Raghavan P, Schtze H (2008) Introduction to information retrieval. Cambridge University Press, Cambridge

    Book  MATH  Google Scholar 

Download references

Acknowledgments

This research was partially supported by the National Technology Support Program under the grant of 2011BAH16B04, the National Natural Science Foundation of China under the grant of No. 61173176, Science and Technology Program of Zhejiang Province under the grant of 2008C03007, National High-Tech Research and Development Plan of China under the Grant No. 2011AA010501, and a grant from the Research Grants Council of the Hong Kong Special Administrative Region, China (Project No. CUHK 415409).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jian Wu.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Wu, J., Chen, L., Zheng, Z. et al. Clustering Web services to facilitate service discovery. Knowl Inf Syst 38, 207–229 (2014). https://doi.org/10.1007/s10115-013-0623-0

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10115-013-0623-0

Keywords

Navigation