Abstract
Ontology population has emerged as an increasingly important problem in semantic web services. In this paper, we propose a method using named entity recognition that extracts keywords from Web pages in order to populate a product ontology. The semantic classification determines meanings of terms and phrases by heuristic rules after the morphological analysis. In addition, our method classifies vocabularies into different semantic tags. Firstly, it records several lists of semantic tags to a history database. Then, we define some rules from the lists to extract a product name. Finally, the rules build and refine the product ontology semi-automatically. According to an evaluation, proposed method achieved 87.1% precision and 87.4% recall. Thus, it can suggest some instances, and it decreases cost of updating the ontology.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Elgedawy, I., Tari, Z., Winikoff, M.: Exact functional context matching for web services. In: Proceedings of the 2nd international conference on Service oriented computing (ICSOC 2004) (2004)
Kawamura, T., Ueno, K., Nagano, S., Hasegawa, T., Ohsuga, A.: Ubiquitous Service Finder - Discovery of Services semantically derived from metadata in Ubiquitous Computing. In: Gil, Y., Motta, E., Benjamins, V.R., Musen, M.A. (eds.) ISWC 2005. LNCS, vol. 3729. Springer, Heidelberg (2005)
Sasajima, M., Kitamura, Y., Naganuma, T., Kurakake, S., Mizoguchi, R.: Task Ontology-Based Framework for Modeling Users’ Activities for Mobile Service Navigation. In: Sure, Y., Domingue, J. (eds.) ESWC 2006. LNCS, vol. 4011, pp. 71–72. Springer, Heidelberg (2006)
Mizoguchi-Shimogori, Y., Nakamoto, T., Asakawa, K., Nagano, S., Inaba, M., Kawamura, T.: TV Navigation Agent for Measuring Semantic Similarity between Documents. In: Proceedings of 3rd International Workshop on Agents and Web Services in Distributed Environments (AWeSOMe 2007) (2007)
Cho, K., Kawamura, T.: BlogAlpha: Home Automation Robot using Ontology in Home Environment. In: Proceedings of Artificial Intelligence and Applications (AIA 2007) (2007)
Kawamura, T., Nagano, S., Inaba, M., Mizoguchi, Y.: Mobile Service for Reputation Extraction from Weblogs - Public Experiment and Evaluation. In: Proceedings of Twenty-Second Conference on Artificial Intelligence (AAAI 2007) (2007)
Punuru, J., Chen, J.: Learning for Semantic Classification of Conceptual Terms. In: IEEE International Conference on GRC 2007 (2007)
Liu, F., Zhao, J., Lv, B., Xu, B., Yu, H.: Product Named Entity Recognition Based on Hierarchical Hidden Markov Model. In: Proceedings of the Fourth SIGHAN Workshop on Chinese Language Processing (2005)
Protégé, http://protege.stanford.edu/
OntoGen, http://ontogen.ijs.si/
WordNet, http://wordnet.princeton.edu/
EDR Electronic Dictionary, http://www2.nict.go.jp/r/r312/EDR/
Noy, N.F., Doan, A., Halevy, A.Y.: Semantic Integration. AI Magazine 26, 7–10 (2005)
Wong, T., Lam, W., Chen, E.: Automatic Domain Ontology Generation from Web Sites. Journal of Integrated Design & Process Science archive 9(3), 29–38 (2005)
Tijerino, Y.A., Embley, D.W., Lonsdale, D.W., Nagy, G.: Ontology generation from tables. In: Proceedings of the Fourth International Conference on Web Information Systems Engineering, pp. 242–249 (2003)
Cohen, W.W., Hurst, M., Jensen, L.S.: A flexible learning system for wrapping tables and lists in HTML documents. In: Proceedings of the 11th international conference on World Wide Web, pp. 32–241 (2002)
Hearst, M.A.: Automatic acquisition of hyponyms from large text corpora. In: Proceedings of the Fourteenth International Conference on Computational Linguistics, Nantes, France, pp. 539–545 (July 1992)
Cimiano, P., Ladwig, G., Staab, S.: Gimme’ the context: context-driven automatic semantic annotation with C-PANKOW. In: Proceedings of the 14th international conference on World Wide Web May 10-14 (2005)
Pasca, M., Lin, D., Bigham, J., Lifchits, A., Jain, A.: Organizing and searching the world wide web of facts - step one: The one-million fact extraction challenge. In: Proceedings of the 21st National Conference on Artificial Intelligence (2006)
Muslea, I., Minton, S., Knoblock, C.A.: Hierarchical wrapper induction for semistructured information sources. Autonomous Agents and Multi-Agent Systems 4(1/2), 93–114 (2001)
Brin, S.: Extracting patterns and relations from the world wide web. In: Schek, H.-J., Saltor, F., Ramos, I., Alonso, G. (eds.) EDBT 1998. LNCS, vol. 1377, Springer, Heidelberg (1998)
Agichtein, E., Gravano, L.: Snowball: Extracting relations from large plaintext collections. In: Proceedings of the 5th ACM International Conference on Digital Libraries (2000)
Sakai, T., Saito, Y., Ichimura, Y., Koyama, M., Kokubu, T., Manabe, T.: ASKMi: A Japanese Question Answering System based on Semantic Role Analysis. In: RIAO 2004 Proceedings, pp. 215–231 (2004)
Frantzi, K., Ananiadou, S.: Extracting Nested Collocations. In: COLING 1996, pp. 41–46 (1996)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Inaba, M. et al. (2008). Pattern-Based Semantic Tagging for Ontology Population. In: Kowalczyk, R., Huhns, M., Klusch, M., Maamar, Z., Vo, Q.B. (eds) Service-Oriented Computing: Agents, Semantics, and Engineering. SOCASE 2008. Lecture Notes in Computer Science, vol 5006. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-79968-9_4
Download citation
DOI: https://doi.org/10.1007/978-3-540-79968-9_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-79967-2
Online ISBN: 978-3-540-79968-9
eBook Packages: Computer ScienceComputer Science (R0)