Skip to main content

Towards a Better Understanding of the Language Content in the Semantic Web

  • Conference paper
Book cover Artificial Intelligence: Methodology, Systems, and Applications (AIMSA 2004)

Abstract

Internet content today is about 80% text-based. No matter static or dynamic, the information is encoded and presented as multilingual, unstructured natural language text pages. As the Semantic Web aims at turning Internet into a machine-understandable resource, it becomes important to consider the natural language content and to assess the feasibility and the innovation of the semantic-based approaches related to unstructured texts. This paper reports about work in progress, an experiment in semantic based annotation and explores scenarios for application of Semantic Web techniques to the textual pages in Internet.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Benjamins, R., Contreras, J.: The six challenges for the Semantic Web. White paper (2002), http://www.isoco.com/isococom/whitepapers/files/SemanticWeb-whitepaper-137.pdf

  2. PROTEGE, http://protege.stanford.edu/

  3. McGuinness, D., Fikes, R., Rice, J.: An environment for merging and testing large ontologies. In: Proceedings of KR 2000, pp. 483–493. Morgan Kaufmann, San Francisco (2000)

    Google Scholar 

  4. Sure, Y., Angele, J., Staab, S.: OntoEdit: Guiding Ontology Development by Methodology and Inferencing. In: Meersman, R., Tari, Z., et al. (eds.) CoopIS 2002, DOA 2002, and ODBASE 2002. LNCS, vol. 2519, Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  5. Noy, N., Musen, M.: PROMPT:Algorithm and Tool for Automated Ont. Merging and Alignment. In: Proc. Of the 17th National Conference on Artificial Intelligence (AAAI 2000), Austin, TX, pp. 450–455 (2000)

    Google Scholar 

  6. Doan, A., Madhavan, J., Domingos, P., Halevy, A.: Learing to Map between Ontologies on the Semantic Web. In: Proc. 11th Int. World Wide Web Conf., WWW 2002 (2002)

    Google Scholar 

  7. The Energy Data Collection (EDC) project: deep focus on hydra-headed metadata, www.digitalgovernment.org/news/stories/2002/images/metadatafinal.pdf

  8. Hovy, E., Clavans, J.: Comparison of Manual and Automatic Inter-Ontology Alignment (2002), http://altamira.isi.edu/alignment

  9. Vargas-Vera, M., Motta, E., Domingue, J., Shum, S.B., Lanzoni, M.: Knowledge Extraction by using an Ontology-based Annotation Tool. In: Proc. 1st Int. Conf. on Knowledge Capture (K-CAP 2001), Workshop on Knowledge Markup & Semantic Annotation, Victoria, B.C., Canada (2001)

    Google Scholar 

  10. SPIRIT, Spatially-Aware Information Retrieval on the Internet, IST FP5 project in Semantic Web, http://www.geo-spirit.org/

  11. Sparck-Jones, Karen: What is the Role of NLP in Text Retrieval? In: Strzalkowski, T. (ed.) Natural Language Information Retrieval, pp. 1–24. Kluwer, Dordrecht (1999)

    Google Scholar 

  12. Kimani, S., Catarci, T., Cruz, I.: Web Rendering Systems: Techniques, Classification Criteria and Challenges. In: Chen, V. (ed.) Vizualizing the Semantic Web Geroimenko, pp. 63–89. Springer, Berlin (2002)

    Google Scholar 

  13. Domingue, J., Dzbor, M., Motta, E.: Semantic Layering with Magpie. In: Handbook on Ontologies, pp. 533–554 (2004)

    Google Scholar 

  14. TeSSIcircledR: Get more out of your unstructured medical documents. Language & Computing, White paper (April 2004), see www.landc.be

  15. Natural Language Processing in Medical Coding. Language & Computing, White Paper (April 2004), www.landc.be

  16. http://www.w3.org/2001/Annotea/

  17. http://sourceforge.net/projects/ontomat

  18. Handschuh, S., Staab, S., Ciravegna, F.: S-CREAM - Semi-Automatic Creation of Metadata. In: Semantic Authoring, Annotation and Markup Workshop, 15th European Conference on Artificial Intelligence (ECAI 2002), Lyon, France, pp. 27–33 (2002)

    Google Scholar 

  19. Ciravegna, F., Dingli, A., Petrelli, D., Wilks, Y.: Document Annotation via Adaptive Information Extraction. In: Poster at the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Tampere, Finland, August 11-15 (2002)

    Google Scholar 

  20. Vargas-Vera, M., Motta, E., Domingue, J., Lanzoni, M., Stutt, A., Ciravegna, F.: MnM: Ontology Driven Semi-Automatic and Automatic Support for Semantic Markup. In: Gómez-Pérez, A., Benjamins, V.R. (eds.) EKAW 2002. LNCS (LNAI), vol. 2473, p. 379. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  21. ACM Computing Classification System, http://www.computer.org/mc/keywords/keywords.htm

  22. CGI Taxonomy, http://www.siggraph.org/education/curriculum/projects/Taxonomy2001.htm

  23. SeSDL Taxonomy, www.sesdl.scotcit.ac.uk/taxonomy/ed_tech.html

  24. Nichols, D., Terry, A.: User’s Guide to Teknowledge Ontologies. Teknowledge Corp, ontology.teknowledge.com/Ontology_User_Guide.doc (December 2003)

    Google Scholar 

  25. Pease, A., Niles, I., Li, J.: The Suggested Upper Merged Ontology: A Large Ontology for the Semantic Web and its Applications. In: Working Notes of the AAAI- 2002 Workshop on Ontologies and the Semantic Web, Edmonton, Canada, July 28- August 1 (2002), http://projects.teknowledge.com/AAAI-2002/Pease.ps

  26. Angelova, G., Boytcheva, S., Kalaydjiev, O., Trausan-Matu, S., Nakov, P., Strupchanska, A.: Adaptivity in Web-Based CALL. In: Proc. ECAI 2002, Lyon, France, July 2002, pp. 445–449 (2002), see the LARFLAST ontology at http://www.larflast.bas.bg

  27. Dobrev, P., Toutanova, K.: CGWorld - Architecture and Features. In: Proc. of the 10th International Conference on Conceptual Structures: Integration and Interfaces, pp. 261–270, http://www.larflast.bas.bg:8080/CGWorld

  28. JENA, http://jena.sourceforge.net

  29. Projects DB-MAT and DBR-MAT, 1992-1998 : knowledge-based machine aided translation, see http://nats-www.informatik.uni-hamburg.de/~dbrmat and http://www.lml.bas.bg/projects/dbr-mat

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Dobrev, P., Strupchanska, A., Angelova, G. (2004). Towards a Better Understanding of the Language Content in the Semantic Web. In: Bussler, C., Fensel, D. (eds) Artificial Intelligence: Methodology, Systems, and Applications. AIMSA 2004. Lecture Notes in Computer Science(), vol 3192. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30106-6_27

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-30106-6_27

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-22959-9

  • Online ISBN: 978-3-540-30106-6

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics