course

Exploring the past of the web: alexandria & archive-it hackathon

Authors:
Avishek Anand

L3S Research Center, Hanover, Germany

L3S Research Center, Hanover, Germany
View Profile

,
Jefferson Bailey

Internet Archive, San Francisco, CA

Internet Archive, San Francisco, CA
View Profile

Authors Info & Claims

WebSci '16: Proceedings of the 8th ACM Conference on Web ScienceMay 2016Pages 14https://doi.org/10.1145/2908131.2908212

Published:22 May 2016Publication History

WebSci '16: Proceedings of the 8th ACM Conference on Web Science

Pages 14

ABSTRACT

The Web has pervaded all walks of life and has become an important corpus for studying the humanities, social sciences, and for use by computer scientists and other disciplines. Web archives collect, preserve, and provide ongoing access to ephemeral Web pages and hence encode traces of human thought, activity, and history. This makes them a valuable resource for analysis and study. However, there have been only few concerted efforts to bring together tools, platforms, storage, processing frameworks, and existing collections for mining and analysing Web archives.

Index Terms

Exploring the past of the web: alexandria & archive-it hackathon
1. Applied computing
  1. Computers in other domains
    1. Digital libraries and archives
2. Information systems
  1. Data management systems
    1. Information integration
      1. Extraction, transformation and loading

Recommendations

The past issue of the web
WebSci '11: Proceedings of the 3rd International Web Science Conference

This paper takes a critical look at the efforts since the mid-1990s in archiving and preserving websites by memory institutions around the world. It contains an overview of the approaches and practices to date, and a discussion of the various technical, ...
Read More
A browser for browsing the past web
WWW '06: Proceedings of the 15th international conference on World Wide Web

We describe a browser for the past web. It can retrieve data from multiple past web resources and features a passive browsing style based on change detection and presentation. The browser shows past pages one by one along a time line. The parts that ...
Read More
Search the past with the portuguese web archive
WWW '13 Companion: Proceedings of the 22nd International Conference on World Wide Web

The web was invented to quickly exchange data between scientists, but it became a crucial communication tool to connect the world. However, the web is extremely ephemeral. Most of the information published online becomes quickly unavailable and is lost ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
WebSci '16: Proceedings of the 8th ACM Conference on Web Science
May 2016
392 pages
ISBN:9781450342087
DOI:10.1145/2908131
General Chairs:
Wolfgang Nejdl
Leibniz University Hannover & L3S Research Center, Germany
,
Wendy Hall
University of Southampton, UK
,
Program Chairs:
Paolo Parigi
Stanford University
,
Steffen Staab
University of Koblenz, Germany
Copyright © 2016 Owner/Author
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 22 May 2016
Check for updates
Qualifiers
- course
Conference

Acceptance Rates
WebSci '16 Paper Acceptance Rate13of70submissions,19%Overall Acceptance Rate218of875submissions,25%
More
Upcoming Conference
Websci '24

Sponsor:

sigweb

16th ACM Web Science Conference

May 21 - 24, 2024

Stuttgart , Germany
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 0
  Total Citations
  View Citations
- 102
  Total Downloads
- Downloads (Last 12 months)1
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
This publication has not been cited yet

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Exploring the past of the web: alexandria & archive-it hackathon

WebSci '16: Proceedings of the 8th ACM Conference on Web Science

ABSTRACT

Cited By

Index Terms

Recommendations

The past issue of the web

A browser for browsing the past web

Search the past with the portuguese web archive