PrEV: Preservation Explorer and Vault for Web 2.0 User-Generated Content

Cui, Anqi; Yang, Liner; Hou, Dejun; Kan, Min-Yen; Liu, Yiqun; Zhang, Min; Ma, Shaoping

doi:10.1007/978-3-642-33290-6_12

Anqi Cui¹⁹,
Liner Yang¹⁹,
Dejun Hou²⁰,
Min-Yen Kan²⁰,
Yiqun Liu¹⁹,
Min Zhang¹⁹ &
…
Shaoping Ma¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 7489))

Included in the following conference series:

International Conference on Theory and Practice of Digital Libraries

2333 Accesses

Abstract

We present the Preservation Explorer and Vault (PrEV) system, a city-centric multilingual digital library that archives and makes available Web 2.0 resources, and aims to store a comprehensive record of what urban lifestyle is like. To match the current state of the digital environment, a key architectural design choice in PrEV is to archive not only Web 1.0 web pages, but also Web 2.0 multilingual resources that include multimedia, real-time microblog content, as well as mobile application descriptions (e.g., iPhone app) in a collaborative manner. PrEV performs the preservation of such resources for posterity, and makes them available for programmatic retrieval by third party agents, and for exploration by scholars with its user interface.

This work was supported by Natural Science Foundation (60903107, 61073071), National High Technology Research and Development (863) Program (2011AA01A207) and the Research Fund for the Doctoral Program of Higher Education of China (20090002120005). This work has been done at the NUS–Tsinghua EXtreme search centre (NExT).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

e-Tourism: Governmental Planning and Management Mechanism

Yasmine’s Adventures: An Interactive Urban Experience Exploring the Sociocultural Potential of Digital Entertainment

Using Web Technologies to Implement a Modular Integrated System for Augmented Tourism Destinations

References

Adar, E., Dontcheva, M., Fogarty, J., Weld, D.: Zoetrope: Interacting with the ephemeral web. In: Proceedings of the 21st Annual ACM Symposium on User Interface Software and Technology, pp. 239–248. ACM (2008)
Google Scholar
Albertsen, K.: The paradigma web harvesting environment. In: Proceedings of the 3rd Workshop on Web Archives, pp. 49–62 (August 2003)
Google Scholar
Ball, A.: Web archiving. Tech. rep., Digital Curation Centre, UKOLN, University of Bath (March 2010)
Google Scholar
Campbell, L.E.: Recollection: Integrating Data through Access. In: Agosti, M., Borbinha, J., Kapidakis, S., Papatheodorou, C., Tsakonas, G. (eds.) ECDL 2009. LNCS, vol. 5714, pp. 396–397. Springer, Heidelberg (2009)
Chapter Google Scholar
Chang, H.: Enriched Content: Concept, Architecture, Implementation, and Applications. Ph.D. thesis, New York University (2003)
Google Scholar
Collins, C., Viegas, F., Wattenberg, M.: Parallel tag clouds to explore and analyze faceted text corpora. In: IEEE Symposium on Visual Analytics Science and Technology, VAST 2009, pp. 91–98. IEEE (2009)
Google Scholar
Dougherty, M., Meyer, E., Madsen, C., Van den Heuvel, C., Thomas, A., Wyatt, S.: Researcher engagement with web archives: State of the art (2010)
Google Scholar
Hallgrímsson, T.: The International Internet Preservation Consortium (IIPC). In: Conference of Directors of National Libraries (CDNL 2005), Oslo, Norway, pp. 14–18 (2005)
Google Scholar
Hockx-Yu, H.: The past issue of the web. In: Proceedings of the ACM WebSci Conference 2011, pp. 1–8 (2011)
Google Scholar
Hodge, G.: An information life-cycle approach: Best practices for digital archiving. Journal of Electronic Publishing 5(4) (2000)
Google Scholar
JaJa, J., Song, S.: Robust tools and services for long-term preservation of digital information. Library Trends 57(3) (2009)
Google Scholar
Jatowt, A., Kawai, Y., Tanaka, K.: Visualizing historical content of web pages. In: Proceedings of the 17th International Conference on World Wide Web, pp. 1221–1222. ACM (2008)
Google Scholar
Jatowt, A., Kawai, Y., Tanaka, K.: Page history explorer: Visualizing and comparing page histories. IEICE Transactions on Information and Systems 94(3), 564 (2011)
Article Google Scholar
Kahle, B.: Preserving the Internet. Scientific American 276(3), 82–83 (1997)
Article Google Scholar
Kwak, H., Lee, C., Park, H., Moon, S.: What is Twitter, a social network or a news media? In: Proceedings of the 19th International Conference on World Wide Web, pp. 591–600. ACM (2010)
Google Scholar
McCown, F., Nelson, M.: What happens when facebook is gone? In: Proceedings of the 9th ACM/IEEE-CS Joint Conference on Digital Libraries, pp. 251–254. ACM (2009)
Google Scholar
Nelson, M., McCown, F., Smith, J., Klein, M.: Using the web infrastructure to preserve web pages. International Journal on Digital Libraries 6(4), 327–349 (2007)
Article Google Scholar
Petrovic, S., Osborne, M., Lavrenko, V.: The Edinburgh Twitter corpus. In: Proceedings of the NAACL HLT 2010 Workshop on Computational Linguistics in a World of Social Media, pp. 25–26 (2010)
Google Scholar
Ronald Jantz, M., Mlis, M.: Digital archiving and preservation: Technologies and processes for a trusted repository. Journal of Archival Organization 4(1-2), 193–213 (2007)
Article Google Scholar
Seadle, M.: Selection for digital preservation. Library Hi Tech. 22(2), 119–121 (2004)
Article Google Scholar
Van de Sompel, H., Nelson, M., Sanderson, R., Balakireva, L., Ainsworth, S., Shankar, H.: Memento: Time travel for the web. Arxiv preprint arxiv: 0911.1112 (2009)
Google Scholar
Song, S.: Long-term information preservation and access. Ph.D. thesis, University of Maryland, College Park (2011)
Google Scholar
Thomas, A., Meyer, E., Dougherty, M., Van den Heuvel, C., Madsen, C., Wyatt, S.: Researcher engagement with web archives: Challenges and opportunities for investment (2010)
Google Scholar
Yan, H., Huang, L., Chen, C., Xie, Z.: A new data storage and service model of China web infomall. In: 8th European Conference on Research and Advanced Technologies for Digital Libraries The 4th International Web Archiving Workshop (IWAW 2004), Bath, UK (2004)
Google Scholar
Yang, J., Leskovec, J.: Patterns of temporal variation in online media. In: Proceedings of the fourth ACM International Conference on Web Search and Data Mining, pp. 177–186. ACM (2011)
Google Scholar

Download references

Author information

Authors and Affiliations

State Key Laboratory of Intelligent Technology and Systems, Tsinghua National Laboratory for Information Science and Technology, Department of Computer Science and Technology, Tsinghua University, Beijing, 100084, China
Anqi Cui, Liner Yang, Yiqun Liu, Min Zhang & Shaoping Ma
School of Computing, National University of Singapore, Singapore
Dejun Hou & Min-Yen Kan

Authors

Anqi Cui
View author publications
You can also search for this author in PubMed Google Scholar
Liner Yang
View author publications
You can also search for this author in PubMed Google Scholar
Dejun Hou
View author publications
You can also search for this author in PubMed Google Scholar
Min-Yen Kan
View author publications
You can also search for this author in PubMed Google Scholar
Yiqun Liu
View author publications
You can also search for this author in PubMed Google Scholar
Min Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Shaoping Ma
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Multimedia and Graphic Arts, Cyprus University of Technology, 3036, Limassol, Cyprus
Panayiotis Zaphiris & Fernando Loizides &
School of Informatics, City University of London, Northampton Square, EC1V 0HB, London, UK
George Buchanan
School of Library, Archival and Information Studies, Irving K. Barber Learning Centre, The University of British Columbia, V6T 1Z3, Vancouver, BC, Canada
Edie Rasmussen

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Cui, A. et al. (2012). PrEV: Preservation Explorer and Vault for Web 2.0 User-Generated Content. In: Zaphiris, P., Buchanan, G., Rasmussen, E., Loizides, F. (eds) Theory and Practice of Digital Libraries. TPDL 2012. Lecture Notes in Computer Science, vol 7489. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33290-6_12

Download citation

DOI: https://doi.org/10.1007/978-3-642-33290-6_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33289-0
Online ISBN: 978-3-642-33290-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

PrEV: Preservation Explorer and Vault for Web 2.0 User-Generated Content

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

e-Tourism: Governmental Planning and Management Mechanism

Yasmine’s Adventures: An Interactive Urban Experience Exploring the Sociocultural Potential of Digital Entertainment

Using Web Technologies to Implement a Modular Integrated System for Augmented Tourism Destinations

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

PrEV: Preservation Explorer and Vault for Web 2.0 User-Generated Content

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

e-Tourism: Governmental Planning and Management Mechanism

Yasmine’s Adventures: An Interactive Urban Experience Exploring the Sociocultural Potential of Digital Entertainment

Using Web Technologies to Implement a Modular Integrated System for Augmented Tourism Destinations

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation