An Architecture to Support Web-Based Information Extraction Using Domain Ontologies

Sugumaran, Vijayan; Meziane, Farid

doi:10.1007/978-3-642-17449-0_1

Vijayan Sugumaran^9,10 &
Farid Meziane¹¹

Part of the book series: Lecture Notes in Business Information Processing ((LNBIP,volume 52))

Included in the following conference series:

Workshop on E-Business

1240 Accesses

Abstract

The web is the world’s most valuable information resource. However, a wide gap has emerged between the information available for software applications vis-à-vis human consumption. In response to this problem, new research initiatives have focused on extracting information available on the web with machine-processable semantics. Ontologies play a large role in information extraction, particularly in the context of semantic web, and applications should be able to find appropriate ontologies on the fly. However, existing tools do not adequately support information extraction and ontology selection. This research-in-progress paper presents the architecture for an information extraction system which relies on domain ontologies and lexical resources. We also provide an approach for easy identification of appropriate ontologies for a particular task.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Information Extraction Approaches: A Survey

An Approach to Web Information Processing

Kizomba: An Unsupervised Heuristic-Based Web Information Extractor

References

Alani, H., Brewster, C.: Ontology ranking based on the analysis of concept structures. In: Proceedings of the 3rd International Conference on Knowledge Capture, pp. 51–58 (2005)
Google Scholar
Aldea, et al.: An Ontology-Based Knowledge Management Platform. In: Proceedings of IJCAI 2003 Workshop on Information Integration on the Web (IIWeb 2003), Mexico, pp. 177–182 (2003)
Google Scholar
Burton-Jones, A., Storey, V.C., Sugumaran, V., Ahluwalia, P.: A Semiotic Metrics Suite for Assessing the Quality of Ontologies. Data and Knowledge Engineering 55(1), 84–102 (2005)
Article Google Scholar
Chaudhry, W., Meziane, F.: Information Extraction from Heterogeneous Sources Using Domain Ontologies. In: IEEE International Conference on Emerging Technologies, Islamabad, Pakistan, September 17-18, pp. 511–516 (2005)
Google Scholar
Ding, L., Finin, T., Joshi, A., Pan, R., Cost, R.S., Peng, Y., Reddivari, P., Doshi, V., Sachs, J.: Swoogle: a search and metadata engine for the semantic web. In: Proceedings of the 13th ACM Conference on Information and Knowledge Management, pp. 652–659 (2004)
Google Scholar
Hendler, J.: Agents and the Semantic Web. IEEE intelligent Systems 16(2), 30–37 (2001)
Article Google Scholar
Kushmerick, N., Thomas, B.: Adaptive Information Extraction: Core Technologies for Information Agents. In: Klusch, M., Bergamaschi, S., Edwards, P., Petta, P. (eds.) Intelligent Information Agents. LNCS (LNAI), vol. 2586, pp. 79–103. Springer, Heidelberg (2003)
Chapter Google Scholar
Lozano-Tello, A., Gómez-Pérez, A.: OntoMetric: A method to choose the appropriate ontology. Journal of Database Management 15(2) (April-June 2004)
Google Scholar
McDowell, L.K., Cafarella, M.: Ontology-Driven Information Extraction with OntoSyphon. In: Cruz, I., Decker, S., Allemang, D., Preist, C., Schwabe, D., Mika, P., Uschold, M., Aroyo, L.M. (eds.) ISWC 2006. LNCS, vol. 4273, pp. 428–444. Springer, Heidelberg (2006)
Chapter Google Scholar
Patel, C., Supekar, K., Lee, Y., Park, E.K.: OntoKhoj: a semantic web portal for ontology searching, ranking and classification. In: Proceedings of the 5th ACM International Workshop on Web Information and Data Management, pp. 58–61 (2003)
Google Scholar
Porzel, R., Malaka, R.: A Task-based Approach for Ontology Evaluation. In: ECAI Workshop on Ontology Learning and Population, Valencia, Spain (2004)
Google Scholar
Simon, H.: Sciences of the artificial. MIT Press, Cambridge (1981)
Google Scholar
Stephens, L.M., Huhns, M.N.: Consensus ontologies: reconciling the semantics of web pages and agents. IEEE Internet Computing 5(5), 92–95 (2001)
Article Google Scholar
Vallet, D., Fernández, M., Castells, P.: An Ontology-Based Information Retrieval Model. In: Gómez-Pérez, A., Euzenat, J. (eds.) ESWC 2005. LNCS, vol. 3532, pp. 455–470. Springer, Heidelberg (2005)
Chapter Google Scholar
Yildiz, B., Miksch, S.: Motivating Ontology-Driven Information Extraction. In: International Conference on Semantic Web and Digital Libraries (ICSD 2007), Bangalore, pp. 45–53 (2007)
Google Scholar
Zhang, Y., Vasconcelos, W., Sleeman, D.: Ontosearch: An ontology search engine. In: Proceedings of the 24th SGAI International Conference on Innovative Techniques and Applications of Artificial Intelligence, Cambridge, UK, December 13 – 15, pp. 58–69 (2004)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Decision and Information Sciences School of Business Administration, Oakland University, Rochester, MI, 48309, USA
Vijayan Sugumaran
Department of Service Systems Management and Engineering, Sogang University, 1 Shinsoo-Dong, Mapo-Gu, Seoul, 121-742, South Korea
Vijayan Sugumaran
School of Computing Science and Engineering, Newton Building, The University of Salford, Salford, M5 4WT, UK
Farid Meziane

Authors

Vijayan Sugumaran
View author publications
You can also search for this author in PubMed Google Scholar
Farid Meziane
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Management Science and Systems, State University of New York at Buffalo, 14260, Buffalo, New York, NY, USA
Raj Sharman
Department of Management Science and Systems, State University of New York at Buffalo, School of Management, 14260, Buffalo, New York, NY, USA
H. Raghav Rao
Department of Information Systems, Arizona State University, 601G, 874606, Tempe, AZ, USA
T. S. Raghu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sugumaran, V., Meziane, F. (2010). An Architecture to Support Web-Based Information Extraction Using Domain Ontologies. In: Sharman, R., Rao, H.R., Raghu, T.S. (eds) Exploring the Grand Challenges for Next Generation E-Business. WEB 2009. Lecture Notes in Business Information Processing, vol 52. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17449-0_1

Download citation

DOI: https://doi.org/10.1007/978-3-642-17449-0_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-17448-3
Online ISBN: 978-3-642-17449-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics