Skip to main content

Abstract

To support the realisation of semantic web – as well as digital library, the semantic information content of web documents need to be specified in order to make the tangled information more accessible to search engines and other applications. A number of efforts to support the semantic representation of web documents have been proposed. One such effort is the semantic document modelling of which existing web documents are classified and organised to form a semantic document model representing the contents of respective web documents. In this paper we propose a tool meant to assist in constructing semantic document models using natural language analysis technique and a domain specific ontology. Together with users involvement and participation the tool gradually construct the semantic document model which is represented as XML.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Berners-Lee, T., Hendler, J., Lassila, O.: The Semantic Web. Scientific American, 35–43 (May 2001)

    Google Scholar 

  2. Brasethvik, T., Gulla, J.A.: Semantically accessing documents using conceptual model descriptions. In: ER Workshops, pp. 321–333 (1999)

    Google Scholar 

  3. Mattia, D., Luca, I., Danielle, N.: Knowledge representation techniques for information extraction on the web. In: Proceedings of the WebNet 1998 (1998)

    Google Scholar 

  4. Brasethvik, T., Gulla, J.A.: A Conceptual Modelling Approach to Semantic Document Retrieval. In: Advanced Information Systems Engineering, 14th International Conference, pp. 167–182 (2002)

    Google Scholar 

  5. Brasethvik, T., Gulla, J.A.: Natural language analysis for semantic document modeling. Data and Knowledge Engineering, 45–62 (2001)

    Google Scholar 

  6. Alani, H., Kim, S., Millard, D., Weal, M., Hall, W., Lewis, P., Shadbolt, N.: Automatic Ontology-Based Knowledge Extraction from Web Documents. IEEE Intelligent Systems 18(1), 14–21 (2003)

    Article  Google Scholar 

  7. Scott, M.: WordSmith tools, at http://www.liv.ac.uk/~ms2928/wordsmiy.htm (accessed: October 2002)

  8. Voutilainen, A.: A short introduction to the NP tool, http://www.lingsoft.fi/doc/nptool/intro (accessed: October 2002)

  9. Miller, G.: WordNet: a lexical database for English. Communications of the ACM 38(11), 39–41 (1996)

    Article  Google Scholar 

  10. Bannon, L., Bodker, S.: Constructing common information spaces. In: 5th European Conference on CSCW (1997)

    Google Scholar 

  11. Nelson, S.: Medical Subject Headings, at http://www.nlm.nih.gov/mesh/meshhome.html (accessed: 20 November 2002)

  12. Sekine, S.: Proteus Project – Apple Pie Parser (Corpus based Parser) (2002), http://nlp.cs.nyu.edu/app (accessed on September 15, 2002)

  13. Arens, Y., Chee, C.Y., Hsu, C., Knoblosk, A.: Retrieving and integrating data from multiple information sources. International Journal of Intelligent and Cooperative Information Systems 2(2), 127–158 (1993)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Noah, S.A. et al. (2003). A Domain Specific Ontology Driven to Semantic Document Modelling. In: Sembok, T.M.T., Zaman, H.B., Chen, H., Urs, S.R., Myaeng, SH. (eds) Digital Libraries: Technology and Management of Indigenous Knowledge for Global Access. ICADL 2003. Lecture Notes in Computer Science, vol 2911. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24594-0_70

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-24594-0_70

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-20608-8

  • Online ISBN: 978-3-540-24594-0

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics