Abstract
Internet content today is about 80% text-based. No matter static or dynamic, the information is encoded and presented as multilingual, unstructured natural language text pages. As the Semantic Web aims at turning Internet into a machine-understandable resource, it becomes important to consider the natural language content and to assess the feasibility and the innovation of the semantic-based approaches related to unstructured texts. This paper reports about work in progress, an experiment in semantic based annotation and explores scenarios for application of Semantic Web techniques to the textual pages in Internet.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Benjamins, R., Contreras, J.: The six challenges for the Semantic Web. White paper (2002), http://www.isoco.com/isococom/whitepapers/files/SemanticWeb-whitepaper-137.pdf
PROTEGE, http://protege.stanford.edu/
McGuinness, D., Fikes, R., Rice, J.: An environment for merging and testing large ontologies. In: Proceedings of KR 2000, pp. 483–493. Morgan Kaufmann, San Francisco (2000)
Sure, Y., Angele, J., Staab, S.: OntoEdit: Guiding Ontology Development by Methodology and Inferencing. In: Meersman, R., Tari, Z., et al. (eds.) CoopIS 2002, DOA 2002, and ODBASE 2002. LNCS, vol. 2519, Springer, Heidelberg (2002)
Noy, N., Musen, M.: PROMPT:Algorithm and Tool for Automated Ont. Merging and Alignment. In: Proc. Of the 17th National Conference on Artificial Intelligence (AAAI 2000), Austin, TX, pp. 450–455 (2000)
Doan, A., Madhavan, J., Domingos, P., Halevy, A.: Learing to Map between Ontologies on the Semantic Web. In: Proc. 11th Int. World Wide Web Conf., WWW 2002 (2002)
The Energy Data Collection (EDC) project: deep focus on hydra-headed metadata, www.digitalgovernment.org/news/stories/2002/images/metadatafinal.pdf
Hovy, E., Clavans, J.: Comparison of Manual and Automatic Inter-Ontology Alignment (2002), http://altamira.isi.edu/alignment
Vargas-Vera, M., Motta, E., Domingue, J., Shum, S.B., Lanzoni, M.: Knowledge Extraction by using an Ontology-based Annotation Tool. In: Proc. 1st Int. Conf. on Knowledge Capture (K-CAP 2001), Workshop on Knowledge Markup & Semantic Annotation, Victoria, B.C., Canada (2001)
SPIRIT, Spatially-Aware Information Retrieval on the Internet, IST FP5 project in Semantic Web, http://www.geo-spirit.org/
Sparck-Jones, Karen: What is the Role of NLP in Text Retrieval? In: Strzalkowski, T. (ed.) Natural Language Information Retrieval, pp. 1–24. Kluwer, Dordrecht (1999)
Kimani, S., Catarci, T., Cruz, I.: Web Rendering Systems: Techniques, Classification Criteria and Challenges. In: Chen, V. (ed.) Vizualizing the Semantic Web Geroimenko, pp. 63–89. Springer, Berlin (2002)
Domingue, J., Dzbor, M., Motta, E.: Semantic Layering with Magpie. In: Handbook on Ontologies, pp. 533–554 (2004)
TeSSIcircledR: Get more out of your unstructured medical documents. Language & Computing, White paper (April 2004), see www.landc.be
Natural Language Processing in Medical Coding. Language & Computing, White Paper (April 2004), www.landc.be
Handschuh, S., Staab, S., Ciravegna, F.: S-CREAM - Semi-Automatic Creation of Metadata. In: Semantic Authoring, Annotation and Markup Workshop, 15th European Conference on Artificial Intelligence (ECAI 2002), Lyon, France, pp. 27–33 (2002)
Ciravegna, F., Dingli, A., Petrelli, D., Wilks, Y.: Document Annotation via Adaptive Information Extraction. In: Poster at the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Tampere, Finland, August 11-15 (2002)
Vargas-Vera, M., Motta, E., Domingue, J., Lanzoni, M., Stutt, A., Ciravegna, F.: MnM: Ontology Driven Semi-Automatic and Automatic Support for Semantic Markup. In: Gómez-Pérez, A., Benjamins, V.R. (eds.) EKAW 2002. LNCS (LNAI), vol. 2473, p. 379. Springer, Heidelberg (2002)
ACM Computing Classification System, http://www.computer.org/mc/keywords/keywords.htm
CGI Taxonomy, http://www.siggraph.org/education/curriculum/projects/Taxonomy2001.htm
SeSDL Taxonomy, www.sesdl.scotcit.ac.uk/taxonomy/ed_tech.html
Nichols, D., Terry, A.: User’s Guide to Teknowledge Ontologies. Teknowledge Corp, ontology.teknowledge.com/Ontology_User_Guide.doc (December 2003)
Pease, A., Niles, I., Li, J.: The Suggested Upper Merged Ontology: A Large Ontology for the Semantic Web and its Applications. In: Working Notes of the AAAI- 2002 Workshop on Ontologies and the Semantic Web, Edmonton, Canada, July 28- August 1 (2002), http://projects.teknowledge.com/AAAI-2002/Pease.ps
Angelova, G., Boytcheva, S., Kalaydjiev, O., Trausan-Matu, S., Nakov, P., Strupchanska, A.: Adaptivity in Web-Based CALL. In: Proc. ECAI 2002, Lyon, France, July 2002, pp. 445–449 (2002), see the LARFLAST ontology at http://www.larflast.bas.bg
Dobrev, P., Toutanova, K.: CGWorld - Architecture and Features. In: Proc. of the 10th International Conference on Conceptual Structures: Integration and Interfaces, pp. 261–270, http://www.larflast.bas.bg:8080/CGWorld
Projects DB-MAT and DBR-MAT, 1992-1998 : knowledge-based machine aided translation, see http://nats-www.informatik.uni-hamburg.de/~dbrmat and http://www.lml.bas.bg/projects/dbr-mat
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Dobrev, P., Strupchanska, A., Angelova, G. (2004). Towards a Better Understanding of the Language Content in the Semantic Web. In: Bussler, C., Fensel, D. (eds) Artificial Intelligence: Methodology, Systems, and Applications. AIMSA 2004. Lecture Notes in Computer Science(), vol 3192. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30106-6_27
Download citation
DOI: https://doi.org/10.1007/978-3-540-30106-6_27
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22959-9
Online ISBN: 978-3-540-30106-6
eBook Packages: Springer Book Archive