DOI: 10.1145/1967486.1967570
research-article

Manual and automatic semantic annotation of web documents: the FLERSA tool

Published: 08 November 2010

Abstract

This paper presents FLERSA (Flexible Range Semantic Annotation), a user-centred annotation tool for Web content. The tool is built on top of a CMS and supports both manual and automatic semantic annotation. Manual annotation uses a new flexible range markup technique, based on the RDFa standard, that supports the evolution of annotated Web documents more effectively than XPointer. Automatic annotation uses a hybrid approach based on natural language processing (NLP) techniques (Vector-Space Model + n-grams) to determine which concepts, drawn from an ontology that provides a taxonomy, the content of a Web document deals with; previous annotations serve as the training corpus.
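
The abstract does not say how the Vector-Space Model and n-grams are combined, so the following is only a minimal sketch of the general idea, not FLERSA's implementation. It assumes word unigrams and bigrams as terms, TF-IDF weighting, and cosine similarity against one vector per concept built from previously annotated texts. All function names, the two concepts, and the training texts are hypothetical.

```python
import math
import re
from collections import Counter


def ngram_terms(text, max_n=2):
    """Return lowercase word n-grams (n = 1..max_n) found in text."""
    words = re.findall(r"[a-z0-9]+", text.lower())
    terms = []
    for n in range(1, max_n + 1):
        for i in range(len(words) - n + 1):
            terms.append(" ".join(words[i:i + n]))
    return terms


def train(corpus):
    """Build TF-IDF vectors, one per concept.

    corpus maps a concept name to a list of texts already annotated
    with that concept (the training corpus).
    """
    counts = {
        concept: Counter(t for text in texts for t in ngram_terms(text))
        for concept, texts in corpus.items()
    }
    # Document frequency: in how many concepts does each term occur?
    df = Counter()
    for tf in counts.values():
        df.update(tf.keys())
    n = len(counts)
    # Smoothed IDF, so terms shared by every concept still get weight 1.
    idf = {t: math.log((1 + n) / (1 + d)) + 1.0 for t, d in df.items()}
    vectors = {
        concept: {t: f * idf[t] for t, f in tf.items()}
        for concept, tf in counts.items()
    }
    return idf, vectors


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def suggest_concepts(text, idf, vectors, top_k=3):
    """Rank concepts by similarity to text; returns (concept, score) pairs."""
    tf = Counter(ngram_terms(text))
    # Terms never seen in training carry no information and are dropped.
    query = {t: f * idf[t] for t, f in tf.items() if t in idf}
    scored = sorted(
        ((cosine(query, vec), concept) for concept, vec in vectors.items()),
        reverse=True,
    )
    return [(concept, score) for score, concept in scored[:top_k]]


# Hypothetical training corpus: concept -> previously annotated texts.
corpus = {
    "Sport": [
        "The football team won the match",
        "A late goal decided the final match",
    ],
    "Finance": [
        "The bank raised interest rates",
        "Stock markets fell as interest rates rose",
    ],
}
idf, vectors = train(corpus)
ranking = suggest_concepts("The team scored a late goal", idf, vectors)
print(ranking[0][0])  # best-matching concept for the new text
```

In this sketch the bigrams let multi-word expressions such as "late goal" count as single terms, which is one plausible reason to pair n-grams with a vector-space representation; a real system would also map the ranked names onto concepts in the ontology's taxonomy.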

Cited By

  • (2012) The RDFa Content Editor - From WYSIWYG to WYSIWYM. Proceedings of the 2012 IEEE 36th Annual Computer Software and Applications Conference, pp. 531-540. DOI: 10.1109/COMPSAC.2012.72. Online publication date: 16-Jul-2012
  • (2012) The FLERSA tool: adding semantics to a web content management system. International Journal of Web Information Systems 8:1, pp. 73-126. DOI: 10.1108/17440081211222609. Online publication date: 30-Mar-2012
  • (2012) A Hybrid Approach to Text Categorization Applied to Semantic Annotation. Database and Expert Systems Applications, pp. 39-47. DOI: 10.1007/978-3-642-32597-7_4. Online publication date: 2012

Published In

ACM Other conferences
iiWAS '10: Proceedings of the 12th International Conference on Information Integration and Web-based Applications & Services
November 2010
895 pages
ISBN:9781450304214
DOI:10.1145/1967486
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

  • IIWAS: International Organization for Information Integration
  • Web-b: Web-b

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. RDFa
  2. automation
  3. metadata
  4. semantic annotation
  5. semantic web

Qualifiers

  • Research-article
