Digital Archives: Semantic Search and Retrieval

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 7947)

Abstract

In recent years, social media has become the main source of information about society’s feedback on the events that shape everyday life. The social web is where journalists look to find how people respond to the news they read, and it is also where politicians and political analysts look to find how societies feel about political decisions, politicians, events, and announced policies. This work reports on the design and evaluation of a search and retrieval interface for socially enriched web archives. It presents the end-user requirements regarding social content, as well as the approach to design and testing using a large collection of web documents.

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Spiliotopoulos, D., Tzoannos, E., Cabulea, C., Frey, D. (2013). Digital Archives: Semantic Search and Retrieval. In: Holzinger, A., Pasi, G. (eds) Human-Computer Interaction and Knowledge Discovery in Complex, Unstructured, Big Data. HCI-KDD 2013. Lecture Notes in Computer Science, vol 7947. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-39146-0_16

  • DOI: https://doi.org/10.1007/978-3-642-39146-0_16

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-39145-3

  • Online ISBN: 978-3-642-39146-0

  • eBook Packages: Computer Science, Computer Science (R0)
