Skip to main content

Creation, Population and Preprocessing of Experimental Data Sets for Evaluation of Applications for the Semantic Web

  • Conference paper
SOFSEM 2008: Theory and Practice of Computer Science (SOFSEM 2008)

Abstract

In this paper we describe the process of experimental ontology data set creation. Such a semantically enhanced data set is needed in experimental evaluation of applications for the Semantic Web. Our research focuses on various levels of the process of data set creation – data acquisition using wrappers, data preprocessing on the ontology instance level and adjustment of the ontology according to the nature of the evaluation step. Web application aimed at clustering of ontology instances is utilized in the process of experimental evaluation, serving both as an example of an application and visual presentation of the experimental data set to the user.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Pinto, H.S., Peralta, D.: Combining Ontology Engineering Subprocesses to Build a Time Ontology. In: K-CAP 2003, pp. 88–95. ACM Press, New York (2003)

    Chapter  Google Scholar 

  2. Čerešňa, M.: Interactive Learning of HTML Wrappers Using Attribute Classification. In: Proc. of the First Int. Workshop on Representation and Analysis of Web Space, Prague, Czech Republic, pp. 137–142 (2005)

    Google Scholar 

  3. Kushmerick, N.: Wrapper Induction: Efficiency and expressiveness. Artificial Intelligence 118(1), 15–68 (2000)

    Article  MATH  MathSciNet  Google Scholar 

  4. Simon, K., Lausen, G.: ViPER: Augmenting Automatic Information Extraction with Visual Perceptions. In: CIKM 2005, pp. 381–388. ACM Press, New York (2005)

    Chapter  Google Scholar 

  5. Weinstein, P.C., Birmingham, W.P.: Comparing Concepts in Differentiated Ontologies. In: KAW 1999 (1999)

    Google Scholar 

  6. Andrejko, A., Barla, M., Tvarožek, M.: Comparing Ontological Concepts to Evaulate Similarity. In: Návrat, P., et al. (eds.) Tools For Acquisition, Organisation and Presenting of Information and Knowledge, STU, pp. 71–78 (2006)

    Google Scholar 

  7. Rado, L.: Sharing of Research Results on Portal based on Semantic Web. Master’s thesis project report, Bieliková, M. (supervisor), Slovak University of Technology in Bratislava (2007)

    Google Scholar 

  8. Tvarožek, M., Bieliková, M.: Adaptive Faceted Browser for Navigation in Open Information Spaces. In: WWW 2007, pp. 1311–1312. ACM Press, New York (2007)

    Chapter  Google Scholar 

  9. Beckett, D.: Redland RDF Storage and Retrieval. In: SWAD-Europe Workshop on Semantic Web Storage and Retrieval (2004)

    Google Scholar 

  10. Hinton, G., Sejnowski, T.J.: Unsupervised Learning and Map Formation: Foundations of Neural Computation. MIT Press, Cambridge (1999)

    Google Scholar 

  11. Frivolt, G., Pok, O.: Comparison of Graph Clustering Approaches. In: Bieliková, M. (ed.) IIT.SRC 2006, Bratislava, Slovakia, pp. 168–175 (2006)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Viliam Geffert Juhani Karhumäki Alberto Bertoni Bart Preneel Pavol Návrat Mária Bieliková

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Frivolt, G., Suchal, J., Veselý, R., Vojtek, P., Vozár, O., Bieliková, M. (2008). Creation, Population and Preprocessing of Experimental Data Sets for Evaluation of Applications for the Semantic Web. In: Geffert, V., Karhumäki, J., Bertoni, A., Preneel, B., Návrat, P., Bieliková, M. (eds) SOFSEM 2008: Theory and Practice of Computer Science. SOFSEM 2008. Lecture Notes in Computer Science, vol 4910. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-77566-9_59

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-77566-9_59

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-77565-2

  • Online ISBN: 978-3-540-77566-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics