Abstract
In recent years, RESTful services that are mainly described using short texts are becoming increasingly popular. The keyword-based discovery technology adopted by existing service registries usually suffers from low recall and is insufficient to retrieve accurate RESTful services according to users’ functional goals. Moreover, it is often difficult for users to specify queries that can precisely represent their requirements due to the lack of knowledge on their desired service functionalities. Toward these issues, we propose a RESTful service discovery approach by leveraging service goal (i.e., service functionality) knowledge mined from services’ textual descriptions. The approach first groups the available services into clusters using probabilistic topic models. Then, service goals are extracted from the textual descriptions of services and also clustered based on the topic modeling results of services. Based on service goal clusters, we design a mechanism to recommend semantically relevant service goals to help users refine their initial queries. Relevant services are retrieved by matching user selected service goals with those of candidate services. To improve the recall of the goal-based service discovery approach, we further propose a hybrid approach by integrating it with two existing service discovery approaches. A series of experiments conducted on real-world services crawled from a publicly accessible registry, ProgrammableWeb, demonstrate the effectiveness of the proposed approaches.














Similar content being viewed by others
References
Hu Y, Peng Q, Hu X et al (2015) Time aware and data sparsity tolerant web service recommendation based on improved collaborative filtering. IEEE Trans Serv Comput 8(5):782–794
John D, Rajasree MS (2013) RESTDoc: describe, discover and compose RESTful semantic web services using annotated documentations. Int J Web Semant Technol 4(1):37–49
Fielding RT (2000) Architectural styles and the design of network-based software architectures. PhD thesis, University of California, Irvine
Maleshkova M, Pedrinaci C, Domingue J (2010) Investigating web APIs on the World Wide Web. In: Proceedings of the IEEE European conference on web services, pp 107–114
Jiang W, Lee D, Hu S (2012) Large-scale longitudinal analysis of SOAP-based and RESTful web services. In: Proceedings of the IEEE international conference on web services, pp 218–225
Wang J, Zhang N, Zeng C et al (2013) Towards services discovery based on service goal extraction and recommendation. In: Proceedings of the IEEE international conference on services computing, pp 65–72
Wang Y, Stroulia E (2003) Flexible interface matching for web service discovery. In: Proceedings of the international conference on web information systems engineering, pp 147–156
Plebani P, Pernici B (2009) URBE: web service retrieval based on similarity evaluation. IEEE Trans Knowl Data Eng 21(11):1629–1642
Liu F, Shi Y, Yu J et al (2010) Measuring similarity of web services based on WSDL. In: Proceedings of the IEEE international conference on web services, pp 155–162
Kokash N, Heuvel WJVD, D’Andrea V (2006) Leveraging web services discovery with customizable hybrid matching. In: Proceedings of the international conference on service-oriented computing, pp 522–528
Paulraj D, Swamynathan S (2011) Content based service discovery in semantic web services using WordNet. In: Proceedings of the international conference on advanced computing, network and security, pp 48–56
Paliwal AV, Shafiq B, Vaidya J et al (2012) Semantics-based automated service discovery. IEEE Trans Serv Comput 5(2):260–275
Ma SP, Li CH, Tsai YY et al (2013) Web service discovery using lexical and semantic query expansion. In: Proceedings of the IEEE international conference on e-business engineering, pp 423–428
Cong Z, Fernandez A, Billhardt H et al (2015) Service discovery acceleration with hierarchical clustering. Inf Syst Front 17(4):799–808
Ma J, Zhang Y, He J (2008) Efficiently finding web services using a clustering semantic approach. In: Proceedings of the international workshop on context enabled source and service selection, integration and adaptation, pp 1–8
Cassar G, Barnaghi P, Moessner K (2013) Probabilistic matchmaking methods for automated service discovery. IEEE Trans Serv Comput 7(4):654–666
Li Z, He K, Wang J et al (2014) An on-demand services discovery approach based on topic clustering. J Internet Technol 15(4):543–555
Wang J, Gao P, Ma Y et al (2017) A web service discovery approach based on common topic groups extraction. IEEE Access 5:10193–10208
Chen L, Hu L, Zheng Z, et al (2011) WTCluster: utilizing tags for web services clustering. In: Proceedings of the international conference on service-oriented computing, pp 204–218
Klusch M, Fries B, Sycara K (2009) OWLS-MX: a hybrid semantic web service matchmaker for OWL-S services. Web Semant Sci Serv Agents World Wide Web 7(2):121–133
Klusch M, Kapahnke P, Zinnikus I (2009) Hybrid adaptive web service selection with SAWSDL-MX and WSDL-analyzer. In: Proceedings of the European semantic web conference on the semantic web: research and applications, pp 550–564
Klusch M, Kaufer F (2009) WSMO-MX: a hybrid semantic web service matchmaker. Web Intell Agent Syst 7(1):23–42
García JM, Ruiz D, Ruiz-Cortés A (2012) Improving semantic web services discovery using SPARQL-based repository filtering. Web Semant Sci Serv Agents World Wide Web 17(4):12–24
Lampe U, Schulte S, Siebenhaar M, et al (2010) Adaptive matchmaking for RESTful services based on hRESTS and MicroWSMO. In: Proceedings of the workshop on emerging web services technology, pp 10–17
Sellami S, Slaimi F, Boucelma O et al (2013) Flexible matchmaking for RESTful web services. In: Proceedings of the OTM conference on the move to meaningful internet systems, pp 542–554
Roman D, Kopecký J, Vitvar T et al (2015) WSMO-Lite and hRESTS: lightweight semantic annotations for web services and RESTful APIs. Web Semant Sci Serv Agents World Wide Web 31:39–58
Zhang N, He K, Wang J et al (2016) WSGM-SD: an approach to RESTful service discovery based on weighted service goal model. Chin J Electron 25(2):256–263
Jung Y, Cho Y, Park YM et al (2013) Automatic tagging of functional-goals for goal-driven semantic service discovery. In: Proceedings of the IEEE international conference on semantic computing, pp 212–219
Kopecky J, Gomadam K, Vitvar T (2008) hRESTS: an HTML microformat for describing RESTful web services. In: Proceedings of the IEEE international conference on web intelligence and intelligent agent technology, pp 619–625
Liu X, Agarwal S, Ding C et al (2016) An LDA-SVM active learning framework for web service classification. In: Proceedings of the IEEE international conference on web services, pp 49–56
Strohmaier M, Lux M, Granitzer M et al (2007) How do users express goals on the web? An exploration of intentional structures in web search. In: Proceedings of the international conference on web information systems engineering, pp 67–78
Zhou TC, Lyu RT, King I et al (2014) Learning to suggest questions in social media. Knowl Inf Syst 43(2):389–416
Miller GA (1995) WordNet: a lexical database for English. ACM Commun 38(11):39–41
Blei DM, Ng AY, Jordan MI (2003) Latent Dirichlet allocation. J Mach Learn Res 3:993–1022
Wang H, Shi Y, Zhou X, et al (2010) Web service classification using support vector machine. In: Proceedings of the IEEE international conference on tools with artificial intelligence, pp 3–6
Zhang J, Wang J, Hung P et al (2012) Leveraging incrementally enriched domain knowledge to enhance service categorization. Int J Web Serv Res 9(3):43–66
Chen L, Wang Y, Yu Q et al (2013) WT-LDA: user tagging augmented LDA for web service clustering. In: Proceedings of the international conference on service-oriented computing, pp 162–176
Crasso M, Zunino A, Campo M (2011) A survey of approaches to web service discovery in service-oriented architectures. J Database Manag 22(1):102–132
Wang W, Barnaghi P, Bargiela A (2010) Probabilistic topic models for learning terminological ontologies. IEEE Trans Knowl Data Eng 22(7):1028–1040
Bird S, Loper E, Klein E (2009) Natural language processing with Python. O’Reilly Media Inc., Sebastopol
Korenius T, Laurikkala J, Järvelin K, et al (2004) Stemming and lemmatization in the clustering of Finnish text documents. In: Proceedings of the ACM international conference on information and knowledge management, pp 625–633
Yao L, Mimno D, Mccallum A (2009) Efficient methods for topic model inference on streaming document collections. In: Proceedings of the ACM international conference on knowledge discovery and data mining, pp 937–946
Griffiths TL, Steyvers M (2004) Finding scientific topics. Proc Nat Acad Sci USA 101(Suppl 1):5228–5235
Andrieu C, Freitas ND, Doucet A et al (2002) An introduction to MCMC for machine learning. Mach Learn 50(1):5–43
Zhang N, Wang J, Ma Y (2017) Mining domain knowledge on service goals from textual service descriptions. IEEE Trans Serv Comput. https://doi.org/10.1109/TSC.2017.2693147
Rolland C, Souveyet C, Achour CB (1998) Guiding goal modeling using scenarios. IEEE Trans Softw Eng 24(12):1055–1071
Stevenson M, Greenwood MA (2006) Comparing information extraction pattern models. In: Proceedings of the workshop on information extraction beyond the document. Association for Computational Linguistics, pp 12–19
Santorini B (1990) Part-of-speech tagging guidelines for the Penn Treebank Project. Technical Reports (CIS), Paper 570
Marnee MCD, Manning CD (2008) Stanford typed dependencies manual
Manning CD, Raghavan P, Schütze H (2009) An introduction to information retrieval. Cambridge University Press, Cambridge
Salton G, Buckley C (1988) Term-weighting approaches in automatic text retrieval. Int J Inf Process Manag 24(5):513–523
Wang J, Feng Z, Zhang J et al (2014) A unified RGPS-based approach supporting service-oriented process customization. Web Serv Foundations, pp 657–682
Acknowledgements
This research was supported by the National Basic Research Program of China (No. 2014CB340404), the National Key Research and Development Program of China (No. 2017YFB1400602), and the National Natural Science Foundation of China (Nos. 61672387, 61702378, 61402150, and 61562073), the Strategic Team-Building of Scientific and Technological Innovation in Hubei Province, and the Natural Science Foundation of Hubei Province of China (No. 2017CKB894).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Zhang, N., Wang, J., He, K. et al. Mining and clustering service goals for RESTful service discovery. Knowl Inf Syst 58, 669–700 (2019). https://doi.org/10.1007/s10115-018-1171-4
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10115-018-1171-4