Abstract
To support the realisation of semantic web – as well as digital library, the semantic information content of web documents need to be specified in order to make the tangled information more accessible to search engines and other applications. A number of efforts to support the semantic representation of web documents have been proposed. One such effort is the semantic document modelling of which existing web documents are classified and organised to form a semantic document model representing the contents of respective web documents. In this paper we propose a tool meant to assist in constructing semantic document models using natural language analysis technique and a domain specific ontology. Together with users involvement and participation the tool gradually construct the semantic document model which is represented as XML.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Berners-Lee, T., Hendler, J., Lassila, O.: The Semantic Web. Scientific American, 35–43 (May 2001)
Brasethvik, T., Gulla, J.A.: Semantically accessing documents using conceptual model descriptions. In: ER Workshops, pp. 321–333 (1999)
Mattia, D., Luca, I., Danielle, N.: Knowledge representation techniques for information extraction on the web. In: Proceedings of the WebNet 1998 (1998)
Brasethvik, T., Gulla, J.A.: A Conceptual Modelling Approach to Semantic Document Retrieval. In: Advanced Information Systems Engineering, 14th International Conference, pp. 167–182 (2002)
Brasethvik, T., Gulla, J.A.: Natural language analysis for semantic document modeling. Data and Knowledge Engineering, 45–62 (2001)
Alani, H., Kim, S., Millard, D., Weal, M., Hall, W., Lewis, P., Shadbolt, N.: Automatic Ontology-Based Knowledge Extraction from Web Documents. IEEE Intelligent Systems 18(1), 14–21 (2003)
Scott, M.: WordSmith tools, at http://www.liv.ac.uk/~ms2928/wordsmiy.htm (accessed: October 2002)
Voutilainen, A.: A short introduction to the NP tool, http://www.lingsoft.fi/doc/nptool/intro (accessed: October 2002)
Miller, G.: WordNet: a lexical database for English. Communications of the ACM 38(11), 39–41 (1996)
Bannon, L., Bodker, S.: Constructing common information spaces. In: 5th European Conference on CSCW (1997)
Nelson, S.: Medical Subject Headings, at http://www.nlm.nih.gov/mesh/meshhome.html (accessed: 20 November 2002)
Sekine, S.: Proteus Project – Apple Pie Parser (Corpus based Parser) (2002), http://nlp.cs.nyu.edu/app (accessed on September 15, 2002)
Arens, Y., Chee, C.Y., Hsu, C., Knoblosk, A.: Retrieving and integrating data from multiple information sources. International Journal of Intelligent and Cooperative Information Systems 2(2), 127–158 (1993)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Noah, S.A. et al. (2003). A Domain Specific Ontology Driven to Semantic Document Modelling. In: Sembok, T.M.T., Zaman, H.B., Chen, H., Urs, S.R., Myaeng, SH. (eds) Digital Libraries: Technology and Management of Indigenous Knowledge for Global Access. ICADL 2003. Lecture Notes in Computer Science, vol 2911. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24594-0_70
Download citation
DOI: https://doi.org/10.1007/978-3-540-24594-0_70
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20608-8
Online ISBN: 978-3-540-24594-0
eBook Packages: Springer Book Archive