skip to main content
10.1145/1967486.1967570acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiiwasConference Proceedingsconference-collections
research-article

Manual and automatic semantic annotation of web documents: the FLERSA tool

Published:08 November 2010Publication History

ABSTRACT

In this paper, FLERSA (Flexible Range Semantic Annotation) is presented as a user-centred annotation tool for Web content. The tool has been developed over a CMS and allows both manual and automatic semantic annotations. For manual annotation, a new flexible range markup technique is used, based on the RDFa standard, to support the evolution of annotated Web documents more effectively than XPointer. For automatic annotation, a hybrid approach based on natural language processing (NLP) techniques (Vector-Space Model + n-grams) is used to determine the concepts that the content of a Web document deals with (from an ontology which provides a taxonomy), based on previous annotations that are used as training Corpus.

References

  1. Bemers-Lee, T., Hendler, J. and Lassila O. 2001. The Semantic Web. Scientific American (May 2001).Google ScholarGoogle Scholar
  2. Sheth, A. and Bertram, C. et al. 2002. Managing Semantic Content for the Web. IEEE Internet Computing (Jul-Aug. 2002), Volume: 6, Issue: 4, pp. 80--87. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Tse-Ming, T. and Han-Kuan, Y. et al. 2003. Ontology-Mediated Integration of Intranet Web Services. Computer Magazine (October 2003), Volume 36, Issue 10, pp. 63--71. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Webster's Online Dictionary. DOI= http://www.websters-online-dictionary.org/Google ScholarGoogle Scholar
  5. Maedche, A., Motik, B., Stojanovic, L., Studer, R. and Volz, R. 2003. Ontologies for Enterprise KM. IEEE Intell, Syst. 18 (2) (2003) 26--33. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Manola, F. and Millar, E. 2004. "RDF Primer". W3C Recommendation. DOI= http://www.w3.org/TR/rdf-primer/Google ScholarGoogle Scholar
  7. Adida, B. and Birbeck, M. 2008. RDFa primer: Bridging the human and data Webs. W3C Working Group Note.Google ScholarGoogle Scholar
  8. Bray, T., Paoli, J., Sperberge-McQueen, C. M., Maler, E. and Yergeau, F. 2008. Extensible Markup Language (XML) 1.0 (fifth edition). W3C Recommendation (26 November 2008).Google ScholarGoogle Scholar
  9. Koivunen, M., Prud'Hommeaux, E., and Swick, R. 2002. Annotea: an open RDF infrastructure for shared Web annotations. Computer Networks, Volume 39, Page 589.Google ScholarGoogle Scholar
  10. DeRose, S., Maler, E. and Daniel, R. Jr. 2001. XML Pointer Language (XPointer) Version 1.0 (February 2001).Google ScholarGoogle Scholar
  11. Kesselman, J., Robie, J., Champion, M., Sharpe, P. and Apparao, V. 2000. Document object model (dom) level 2 traversal and range specification. Technical report, W3C.Google ScholarGoogle Scholar
  12. Amaya Home Page. 2009. DOI= http://www.w3.org/Amava/Google ScholarGoogle Scholar
  13. Epiphany Home Page. 2010. DOI= http://projects.dfki.uni-kl.de/epiphany/Google ScholarGoogle Scholar
  14. Bonino, D., Bosca A., Corno F., Farinetti L., Pescarmona F. 2004. H-DOSE: an Holistic Distributed Open Semantic Elaboration Platform. SWAP2004.Google ScholarGoogle Scholar
  15. Popov, B., et al. 2004. KIM: a semantic platform for information extraction and retrieval. Nat. Lang. Eng. 10(3--4): p. 375--392. DOI= http://www.ontotext.com/kim/ Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Melita. DOI= http://www.aktors.org/technologies/melita/Google ScholarGoogle Scholar
  17. Uren, V., Cimiano, P., Iria, J., Handschuh, S., Vargas-Vera, M., Motta, E. and Ciravegna, F. 2006. Semantic annotation for knowledge management: Requirements and a survey of the state of the art. Journal Web Semantics. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Golbeck, J., Grove, M., Parsia, B., Kalyanpur, A. and Hendler, J., 2002. New tools for the Semantic Web. In Knowledge Engineering and Knowledge Management. Proceedings of 13th International Conference. EKAW'02. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Auer, S. 2005. Powl - A Web Based Platform for Collaborative Semantic Web Development. Proc. of 1st Workshop Scripting for the Semantic Web. SFSW'05.Google ScholarGoogle Scholar
  20. RDFa Distiller. DOI= http://www.w3.org/2007/08/pvRdfa/Google ScholarGoogle Scholar
  21. Handschuh, S., Staab, S. and Studer, R. 2003. Leveraging metadata creation for the Semantic Web with CREAM. Proceedings of the Annual German Conference on AI.Google ScholarGoogle Scholar
  22. Joomla. DOI= http://www.joomla.org/Google ScholarGoogle Scholar
  23. Mootools framework. DOI= http://www.mootools.net/Google ScholarGoogle Scholar
  24. Navarro-Galindo, J. L. and Samos, J. 2010. Flexible Range Semantic Annotations Based on RDFa. 27th British National Conference on Databases (BNCOD'10), LNCS 6121. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Navarro-Galindo, J. L. and Samos, J. 2010. Ontology Support and Management for Semantic Annotation. Research Report (LSI, Universidad de Granada).Google ScholarGoogle Scholar
  26. Salton, G. and Lesk, M. E. 1965. The SMART automatic document retrieval system. An illustration, Communications of the ACM, 8(6), 391--398. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Luhn, H. P. The automatic creation of literature abstracts. 1958. IBM journal of research and development, 2:159--165. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Sparck, K. A statistical interpretation of term specificity and its application in retrieval. 1972. Journal of documentation, 28(1):11--21.Google ScholarGoogle ScholarCross RefCross Ref
  29. Halevy, A., Franklin, M., Maier, D. 2006. Principles of dataspace systems. PODS'06. Proceedings of the Twenty-fifth ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, ACM, New York, pp. 1--9. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Manual and automatic semantic annotation of web documents: the FLERSA tool
              Index terms have been assigned to the content through auto-classification.

              Recommendations

              Comments

              Login options

              Check if you have access through your login credentials or your institution to get full access on this article.

              Sign in
              • Published in

                cover image ACM Other conferences
                iiWAS '10: Proceedings of the 12th International Conference on Information Integration and Web-based Applications & Services
                November 2010
                895 pages
                ISBN:9781450304214
                DOI:10.1145/1967486

                Copyright © 2010 ACM

                Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

                Publisher

                Association for Computing Machinery

                New York, NY, United States

                Publication History

                • Published: 8 November 2010

                Permissions

                Request permissions about this article.

                Request Permissions

                Check for updates

                Qualifiers

                • research-article

              PDF Format

              View or Download as a PDF file.

              PDF

              eReader

              View online with eReader.

              eReader