Skip to main content
Log in

Mining and clustering service goals for RESTful service discovery

  • Regular Paper
  • Published:
Knowledge and Information Systems Aims and scope Submit manuscript

Abstract

In recent years, RESTful services that are mainly described using short texts are becoming increasingly popular. The keyword-based discovery technology adopted by existing service registries usually suffers from low recall and is insufficient to retrieve accurate RESTful services according to users’ functional goals. Moreover, it is often difficult for users to specify queries that can precisely represent their requirements due to the lack of knowledge on their desired service functionalities. Toward these issues, we propose a RESTful service discovery approach by leveraging service goal (i.e., service functionality) knowledge mined from services’ textual descriptions. The approach first groups the available services into clusters using probabilistic topic models. Then, service goals are extracted from the textual descriptions of services and also clustered based on the topic modeling results of services. Based on service goal clusters, we design a mechanism to recommend semantically relevant service goals to help users refine their initial queries. Relevant services are retrieved by matching user selected service goals with those of candidate services. To improve the recall of the goal-based service discovery approach, we further propose a hybrid approach by integrating it with two existing service discovery approaches. A series of experiments conducted on real-world services crawled from a publicly accessible registry, ProgrammableWeb, demonstrate the effectiveness of the proposed approaches.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14

Similar content being viewed by others

Notes

  1. http://www.programmableweb.com/.

  2. http://wordnet.princeton.edu/.

  3. http://www.nltk.org/.

  4. http://jgibblda.sourceforge.net/.

  5. http://nlp.stanford.edu/software/lex-parser.shtml.

References

  1. Hu Y, Peng Q, Hu X et al (2015) Time aware and data sparsity tolerant web service recommendation based on improved collaborative filtering. IEEE Trans Serv Comput 8(5):782–794

    Article  Google Scholar 

  2. John D, Rajasree MS (2013) RESTDoc: describe, discover and compose RESTful semantic web services using annotated documentations. Int J Web Semant Technol 4(1):37–49

    Article  Google Scholar 

  3. Fielding RT (2000) Architectural styles and the design of network-based software architectures. PhD thesis, University of California, Irvine

  4. Maleshkova M, Pedrinaci C, Domingue J (2010) Investigating web APIs on the World Wide Web. In: Proceedings of the IEEE European conference on web services, pp 107–114

  5. Jiang W, Lee D, Hu S (2012) Large-scale longitudinal analysis of SOAP-based and RESTful web services. In: Proceedings of the IEEE international conference on web services, pp 218–225

  6. Wang J, Zhang N, Zeng C et al (2013) Towards services discovery based on service goal extraction and recommendation. In: Proceedings of the IEEE international conference on services computing, pp 65–72

  7. Wang Y, Stroulia E (2003) Flexible interface matching for web service discovery. In: Proceedings of the international conference on web information systems engineering, pp 147–156

  8. Plebani P, Pernici B (2009) URBE: web service retrieval based on similarity evaluation. IEEE Trans Knowl Data Eng 21(11):1629–1642

    Article  Google Scholar 

  9. Liu F, Shi Y, Yu J et al (2010) Measuring similarity of web services based on WSDL. In: Proceedings of the IEEE international conference on web services, pp 155–162

  10. Kokash N, Heuvel WJVD, D’Andrea V (2006) Leveraging web services discovery with customizable hybrid matching. In: Proceedings of the international conference on service-oriented computing, pp 522–528

  11. Paulraj D, Swamynathan S (2011) Content based service discovery in semantic web services using WordNet. In: Proceedings of the international conference on advanced computing, network and security, pp 48–56

  12. Paliwal AV, Shafiq B, Vaidya J et al (2012) Semantics-based automated service discovery. IEEE Trans Serv Comput 5(2):260–275

    Article  Google Scholar 

  13. Ma SP, Li CH, Tsai YY et al (2013) Web service discovery using lexical and semantic query expansion. In: Proceedings of the IEEE international conference on e-business engineering, pp 423–428

  14. Cong Z, Fernandez A, Billhardt H et al (2015) Service discovery acceleration with hierarchical clustering. Inf Syst Front 17(4):799–808

    Article  Google Scholar 

  15. Ma J, Zhang Y, He J (2008) Efficiently finding web services using a clustering semantic approach. In: Proceedings of the international workshop on context enabled source and service selection, integration and adaptation, pp 1–8

  16. Cassar G, Barnaghi P, Moessner K (2013) Probabilistic matchmaking methods for automated service discovery. IEEE Trans Serv Comput 7(4):654–666

    Article  Google Scholar 

  17. Li Z, He K, Wang J et al (2014) An on-demand services discovery approach based on topic clustering. J Internet Technol 15(4):543–555

    Google Scholar 

  18. Wang J, Gao P, Ma Y et al (2017) A web service discovery approach based on common topic groups extraction. IEEE Access 5:10193–10208

    Article  Google Scholar 

  19. Chen L, Hu L, Zheng Z, et al (2011) WTCluster: utilizing tags for web services clustering. In: Proceedings of the international conference on service-oriented computing, pp 204–218

  20. Klusch M, Fries B, Sycara K (2009) OWLS-MX: a hybrid semantic web service matchmaker for OWL-S services. Web Semant Sci Serv Agents World Wide Web 7(2):121–133

    Article  Google Scholar 

  21. Klusch M, Kapahnke P, Zinnikus I (2009) Hybrid adaptive web service selection with SAWSDL-MX and WSDL-analyzer. In: Proceedings of the European semantic web conference on the semantic web: research and applications, pp 550–564

  22. Klusch M, Kaufer F (2009) WSMO-MX: a hybrid semantic web service matchmaker. Web Intell Agent Syst 7(1):23–42

    Google Scholar 

  23. García JM, Ruiz D, Ruiz-Cortés A (2012) Improving semantic web services discovery using SPARQL-based repository filtering. Web Semant Sci Serv Agents World Wide Web 17(4):12–24

    Article  Google Scholar 

  24. Lampe U, Schulte S, Siebenhaar M, et al (2010) Adaptive matchmaking for RESTful services based on hRESTS and MicroWSMO. In: Proceedings of the workshop on emerging web services technology, pp 10–17

  25. Sellami S, Slaimi F, Boucelma O et al (2013) Flexible matchmaking for RESTful web services. In: Proceedings of the OTM conference on the move to meaningful internet systems, pp 542–554

  26. Roman D, Kopecký J, Vitvar T et al (2015) WSMO-Lite and hRESTS: lightweight semantic annotations for web services and RESTful APIs. Web Semant Sci Serv Agents World Wide Web 31:39–58

    Article  Google Scholar 

  27. Zhang N, He K, Wang J et al (2016) WSGM-SD: an approach to RESTful service discovery based on weighted service goal model. Chin J Electron 25(2):256–263

    Article  Google Scholar 

  28. Jung Y, Cho Y, Park YM et al (2013) Automatic tagging of functional-goals for goal-driven semantic service discovery. In: Proceedings of the IEEE international conference on semantic computing, pp 212–219

  29. Kopecky J, Gomadam K, Vitvar T (2008) hRESTS: an HTML microformat for describing RESTful web services. In: Proceedings of the IEEE international conference on web intelligence and intelligent agent technology, pp 619–625

  30. Liu X, Agarwal S, Ding C et al (2016) An LDA-SVM active learning framework for web service classification. In: Proceedings of the IEEE international conference on web services, pp 49–56

  31. Strohmaier M, Lux M, Granitzer M et al (2007) How do users express goals on the web? An exploration of intentional structures in web search. In: Proceedings of the international conference on web information systems engineering, pp 67–78

  32. Zhou TC, Lyu RT, King I et al (2014) Learning to suggest questions in social media. Knowl Inf Syst 43(2):389–416

    Article  Google Scholar 

  33. Miller GA (1995) WordNet: a lexical database for English. ACM Commun 38(11):39–41

    Article  Google Scholar 

  34. Blei DM, Ng AY, Jordan MI (2003) Latent Dirichlet allocation. J Mach Learn Res 3:993–1022

    MATH  Google Scholar 

  35. Wang H, Shi Y, Zhou X, et al (2010) Web service classification using support vector machine. In: Proceedings of the IEEE international conference on tools with artificial intelligence, pp 3–6

  36. Zhang J, Wang J, Hung P et al (2012) Leveraging incrementally enriched domain knowledge to enhance service categorization. Int J Web Serv Res 9(3):43–66

    Article  Google Scholar 

  37. Chen L, Wang Y, Yu Q et al (2013) WT-LDA: user tagging augmented LDA for web service clustering. In: Proceedings of the international conference on service-oriented computing, pp 162–176

  38. Crasso M, Zunino A, Campo M (2011) A survey of approaches to web service discovery in service-oriented architectures. J Database Manag 22(1):102–132

    Article  Google Scholar 

  39. Wang W, Barnaghi P, Bargiela A (2010) Probabilistic topic models for learning terminological ontologies. IEEE Trans Knowl Data Eng 22(7):1028–1040

    Article  Google Scholar 

  40. Bird S, Loper E, Klein E (2009) Natural language processing with Python. O’Reilly Media Inc., Sebastopol

    MATH  Google Scholar 

  41. Korenius T, Laurikkala J, Järvelin K, et al (2004) Stemming and lemmatization in the clustering of Finnish text documents. In: Proceedings of the ACM international conference on information and knowledge management, pp 625–633

  42. Yao L, Mimno D, Mccallum A (2009) Efficient methods for topic model inference on streaming document collections. In: Proceedings of the ACM international conference on knowledge discovery and data mining, pp 937–946

  43. Griffiths TL, Steyvers M (2004) Finding scientific topics. Proc Nat Acad Sci USA 101(Suppl 1):5228–5235

    Article  Google Scholar 

  44. Andrieu C, Freitas ND, Doucet A et al (2002) An introduction to MCMC for machine learning. Mach Learn 50(1):5–43

    MATH  Google Scholar 

  45. Zhang N, Wang J, Ma Y (2017) Mining domain knowledge on service goals from textual service descriptions. IEEE Trans Serv Comput. https://doi.org/10.1109/TSC.2017.2693147

    Article  Google Scholar 

  46. Rolland C, Souveyet C, Achour CB (1998) Guiding goal modeling using scenarios. IEEE Trans Softw Eng 24(12):1055–1071

    Article  Google Scholar 

  47. Stevenson M, Greenwood MA (2006) Comparing information extraction pattern models. In: Proceedings of the workshop on information extraction beyond the document. Association for Computational Linguistics, pp 12–19

  48. Santorini B (1990) Part-of-speech tagging guidelines for the Penn Treebank Project. Technical Reports (CIS), Paper 570

  49. Marnee MCD, Manning CD (2008) Stanford typed dependencies manual

  50. Manning CD, Raghavan P, Schütze H (2009) An introduction to information retrieval. Cambridge University Press, Cambridge

    MATH  Google Scholar 

  51. Salton G, Buckley C (1988) Term-weighting approaches in automatic text retrieval. Int J Inf Process Manag 24(5):513–523

    Article  Google Scholar 

  52. Wang J, Feng Z, Zhang J et al (2014) A unified RGPS-based approach supporting service-oriented process customization. Web Serv Foundations, pp 657–682

Download references

Acknowledgements

This research was supported by the National Basic Research Program of China (No. 2014CB340404), the National Key Research and Development Program of China (No. 2017YFB1400602), and the National Natural Science Foundation of China (Nos. 61672387, 61702378, 61402150, and 61562073), the Strategic Team-Building of Scientific and Technological Innovation in Hubei Province, and the Natural Science Foundation of Hubei Province of China (No. 2017CKB894).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jian Wang.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhang, N., Wang, J., He, K. et al. Mining and clustering service goals for RESTful service discovery. Knowl Inf Syst 58, 669–700 (2019). https://doi.org/10.1007/s10115-018-1171-4

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10115-018-1171-4

Keywords

Navigation