Abstract
Service discovery plays an important role in service composition. In order to achieve better performance of service discovery, often, service classification should be in place to group available services into different classes. While having a powerful classifier at hand is essential for the task of classification, existing methods usually assume that the class-labels of services are available in prior, which is not true. Traditional clustering methods consume a great deal of time and resources in processing Web service data and result in poor performance, because of the high-dimensional and sparse characteristics of WSDL documents. In this paper, Latent Semantic Analysis (LSA) is combined with the Expectation-Maximization (EM) algorithm to compensate for the poor performance of a single learning model. The obtained class-labels are then used by the Support Vector Machine (SVM) classifier for further classification. We evaluate our approach based on real world WSDL files. The experimental results reveal the effectiveness of the proposed method in terms of accuracy and quality of service clustering and classification.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Dumais, S.T.: Latent Semantic Analysis. Annual Reiview of Information Science and Technology 38(1), 188–230 (2004)
Borman, S.: The Expectation Maximization Algorithm—A Short Tutorial (2009), http://www.seanborman.com/publications/EMalgorithm.pdf
Burges, C.J.C.: A Tutorial on Support Vector Machines for Pattern Recognition. Data Mining and Knowledge Discovery 2(2), 121–167 (1998)
Aversano, L., Bruno, M., et al.: Using Concept Lattices to Support Service Selection. International Journal of Web Services Research 3(4), 32–51 (2006)
Minh, H.Q., Niyogi, P., Yao, Y.: Mercer’s Theorem, Feature Maps, and Smoothing. In: Lugosi, G., Simon, H.U. (eds.) COLT 2006. LNCS (LNAI), vol. 4005, pp. 154–168. Springer, Heidelberg (2006)
Hsu, C.W., Lin, C.J.: A Comparison of Methods for Multi-class Support Vector Machines. IEEE Transactions on Neural Networks 13(2), 415–425 (2002)
Joachims, T.: A Statistical Learning Model of Text Classification for Support Vector Machines. In: Proc. of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 128–136 (2001)
Lewis, D.D., Li, F., Rose, T., Yang, Y.: RCV 1: A new benchmark collection for text categorization research. Journal of Machine Learning Research 5(3), 361–397 (2004)
Forman, G., Cohen, I.: Learning from Little: Comparison of Classifiers Given Little Training. In: Boulicaut, J.-F., Esposito, F., Giannotti, F., Pedreschi, D. (eds.) PKDD 2004. LNCS (LNAI), vol. 3202, pp. 161–172. Springer, Heidelberg (2004)
Debole, F., Sebastiani, F.: An analysis of the Relative Hardness of Reuters-21578 Subsets. Journal of the American Society for Information Science and Technology 56(6), 584–596 (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Luo, L., Li, L., Wang, Y. (2012). Learning to Classify Service Data with Latent Semantics. In: Li, T., et al. Rough Sets and Knowledge Technology. RSKT 2012. Lecture Notes in Computer Science(), vol 7414. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31900-6_35
Download citation
DOI: https://doi.org/10.1007/978-3-642-31900-6_35
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-31899-3
Online ISBN: 978-3-642-31900-6
eBook Packages: Computer ScienceComputer Science (R0)