Skip to main content

N-Gram Representation for Web Service Description Classification

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 11331))

Abstract

Despite increasing availability of Web Services (WS), their automatic processing (classification, grouping or composition) slows down because of the difficulty to read the WSDL service descriptions without related technical knowledge. Categorizing services for automatic service discovery and composition has become a challenging problem. The paper argues that n-gram representation of the data extracted from the different sections of the WSDL description (types, messages and operations) along with the weighing scheme can benefit the classification of services. Experiments are carried out with three different classifiers over available collections of WS descriptions. It is shown that such representations as word bigrams or letter trigrams extracted from WSDL Operations and Types service description features with TF-IDF as n-gram weighting scheme, can improve automatic WS classification.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Ratnasingam, P.: The importance of technology trust in web services security. Inf. Manag. Comput. Secur. 10(5), 255–260 (2002)

    Article  Google Scholar 

  2. Batra, S., Bawa, S.: Web service categorization using normalized similarity score. Int. J. Comput. Theory Eng. 2(1), 139–141 (2010)

    Article  Google Scholar 

  3. Balasubramanian, D.L., Murugaiyan, S.R., Sambasivam, G., Vengattaraman, T., Dhavachelvan, P.: Semantic web service clustering using concept lattice: multi agent based approach. Int. J. Eng. Technol. 5(5), 3699–3714 (2013)

    Google Scholar 

  4. Wang, H., Shi, Y., Zhou, X., Zhou, Q., Shao, S., Bouguettaya, A.: Web service classification using support vector machine. In: 22nd IEEE International Conference on Tools with Artificial Intelligence (ICTAI), vol. 1, pp. 3–6. IEEE Computer Society, Arras (2010)

    Google Scholar 

  5. Liang, Q., Li, P., Hung, P., Wu, X.: Clustering web services for automatic categorization. In: 2009 IEEE International Conference on Services Computing, SCC 2009, pp. 380–387. IEEE Computer Society, Bangalore (2009)

    Google Scholar 

  6. Yang, J., Zhou, X.: Semi-automatic web service classification using machine learning. Int. J. u-and e-Serv. Sci. Technol. 8(4), 339–348 (2015)

    Article  Google Scholar 

  7. Nisa, R., Qamar, U.: A text mining based approach for web service classification. Inf. Syst. e-Bus. Manag. 13(4), 751–768 (2015)

    Article  Google Scholar 

  8. Saha, S., Murthy, C.A., Pal, S.K.: Classification of web services using tensor space model and rough ensemble classifier. In: An, A., Matwin, S., Ras, Z.W., Slezak, D. (eds.) ISMIS. LNCS, vol. 4994, pp. 508–513. Springer, Heidelberg (2008). https://doi.org/10.1007/978-3-540-68123-6_55

    Chapter  Google Scholar 

  9. Sharma, S., Lather, J.S., Dave, M.: Semantic approach for Web service classification using machine learning and measures of semantic relatedness. Serv. Oriented Comput. Appl. 10(3), 221–231 (2016)

    Article  Google Scholar 

  10. Qamar, U., Niza, R., Bashir, S., Khan, F.H.: A majority vote based classifier ensemble for web service classification. Bus. Inf. Syst. Eng. 58(4), 249–259 (2016)

    Article  Google Scholar 

  11. WSDL. https://www.w3.org/TR/2007/REC-wsdl20-20070626/. Accessed 30 Mar 2018

  12. Sebastiani, F.: Machine learning in automated text categorization. ACM Comput. Surv. (CSUR) 34(1), 1–47 (2002)

    Article  MathSciNet  Google Scholar 

  13. Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. Inf. Process. Manag. 24(5), 513–523 (1988)

    Article  Google Scholar 

  14. Flores, E.: Reutilización de código fuente entre lenguajes de programación. Master’s thesis, Universidad Politécnica de Valencia, Valencia, España, February 2012

    Google Scholar 

  15. Hess, A., Johnston, E., Kushmerick, N.: Machine learning techniques for annotating semantic web services. Citeseer (2005)

    Google Scholar 

  16. Klusch, M., Fries, B., Sycara, K.: OWLS-MX: a hybrid semantic web service matchmaker for OWL-S services. Int. J. Web Semant. 7(2), 121–133 (2009)

    Article  Google Scholar 

  17. Miller, G.A., Beckwith, R., Fellbaum, C.D., Gross, D., Miller, K.: WordNet: an online lexical database. Int. J. Lexicogr. 3(4), 235–244 (1990)

    Article  Google Scholar 

  18. Frank, E., Hall, M.A., Witten, I.H.: Data Mining: Practical Machine Learning Tools and Techniques, 4th edn. Morgan Kaufmann, Los Altos (2016)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Christian Sánchez-Sánchez .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Sánchez-Sánchez, C., Sheremetov, L.B. (2019). N-Gram Representation for Web Service Description Classification. In: Nicosia, G., Pardalos, P., Giuffrida, G., Umeton, R., Sciacca, V. (eds) Machine Learning, Optimization, and Data Science. LOD 2018. Lecture Notes in Computer Science(), vol 11331. Springer, Cham. https://doi.org/10.1007/978-3-030-13709-0_38

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-13709-0_38

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-13708-3

  • Online ISBN: 978-3-030-13709-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics