Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.5555/3200334.3200402acmconferencesArticle/Chapter ViewAbstractPublication PagesjcdlConference Proceedingsconference-collections
research-article

WAIL: collection-based personal web archiving

Published: 19 June 2017 Publication History

Abstract

Web Archiving Integration Layer (WAIL) is a desktop application written in Python that integrates Heritrix and OpenWayback. In this work we recreate and extend WAIL from the ground up to facilitate collection-based personal Web archiving. Our new iteration of the software, WAIL-Electron, leverages native Web technologies (e.g., JavaScript, Chromium) using Electron to open new potential for Web archiving by individuals in a stand-alone cross-platform native application. By replacing OpenWayback with PyWb, we provide a novel means for personal Web archivists to curate collections of their captures from their own personal computer rather than relying on an external archival Web service. As extended features we also provide the ability for a user to monitor and automatically archive Twitter users' feeds, even those requiring authentication, as well as provide a reference implementation for integrating a browser-based preservation tool into an OS native application.

References

[1]
Justin F. Brunelle, Mat Kelly, Michele C. Weigle, and Michael L. Nelson. 2016. The Impact of JavaScript on Archivability. International Journal on Digital Libraries 17, 2 (2016), 95--117.
[2]
Justin F. Brunelle, Michele C. Weigle, and Michael L. Nelson. 2016. Adapting the Hypercube Model to Archive Deferred Representations and Their Descendants. Technical Report arxiv:1601.05142.
[3]
ISO 28500. 2009. WARC (Web ARChive) file format. http://www.digitalpreservation.gov/formats/fdd/fdd000236.shtml. (August 2009).
[4]
Mat Kelly and Michele C. Weigle. 2012. WARCreate - Create Wayback-Consumable WARC Files from Any Webpage. In Proceedings of the ACM/IEEE Joint Conference on Digital Libraries (JCDL). 437--438.
[5]
Mat Kelly, Michele C. Weigle, and Michael L. Nelson. 2013. Making Enterprise-Level Archive Tools Accessible for Personal Web Archiving. Personal Digital Archiving. (February 2013).
[6]
Hunter Stern. 2011. Fetch Chain Processors. https://webarchive.jira.com/wiki/display/Heritrix/Fetch+Chain+Processors. (2011).

Cited By

View all
  • (2018)A Framework for Aggregating Private and Public Web ArchivesProceedings of the 18th ACM/IEEE on Joint Conference on Digital Libraries10.1145/3197026.3197045(273-282)Online publication date: 23-May-2018

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
JCDL '17: Proceedings of the 17th ACM/IEEE Joint Conference on Digital Libraries
June 2017
383 pages
ISBN:9781538638613

Sponsors

Publisher

IEEE Press

Publication History

Published: 19 June 2017

Check for updates

Author Tags

  1. browser-based preservation
  2. personal web archiving
  3. web archive collections

Qualifiers

  • Research-article

Conference

JCDL '17
Sponsor:

Acceptance Rates

Overall Acceptance Rate 415 of 1,482 submissions, 28%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 03 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2018)A Framework for Aggregating Private and Public Web ArchivesProceedings of the 18th ACM/IEEE on Joint Conference on Digital Libraries10.1145/3197026.3197045(273-282)Online publication date: 23-May-2018

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media