An Architecture to Support Web-Based Information Extraction Using Domain Ontologies

  • Conference paper
Exploring the Grand Challenges for Next Generation E-Business (WEB 2009)

Part of the book series: Lecture Notes in Business Information Processing (LNBIP, volume 52)

Abstract

The web is the world’s most valuable information resource. However, a wide gap has emerged between the information available for human consumption and the information that software applications can process. In response to this problem, new research initiatives have focused on extracting information from the web together with machine-processable semantics. Ontologies play a large role in information extraction, particularly in the context of the Semantic Web, and applications should be able to find appropriate ontologies on the fly. However, existing tools do not adequately support information extraction and ontology selection. This research-in-progress paper presents the architecture of an information extraction system that relies on domain ontologies and lexical resources. We also provide an approach for easily identifying appropriate ontologies for a particular task.

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Sugumaran, V., Meziane, F. (2010). An Architecture to Support Web-Based Information Extraction Using Domain Ontologies. In: Sharman, R., Rao, H.R., Raghu, T.S. (eds) Exploring the Grand Challenges for Next Generation E-Business. WEB 2009. Lecture Notes in Business Information Processing, vol 52. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17449-0_1

  • DOI: https://doi.org/10.1007/978-3-642-17449-0_1

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-17448-3

  • Online ISBN: 978-3-642-17449-0

  • eBook Packages: Computer Science, Computer Science (R0)
