Provide a snapshot of the look and feel of a digital project at a particular moment in time. This snapshot enables future visitors to interact with the site even if it isn’t “live” anymore.
Use a web crawler (see list below) to create a .warc (web archive) file from a set of web pages. This .warc file contains the HTML, CSS, JavaScript, and media files (images, video, sound, etc) that were delivered to the browser at the time of capture.
Upload the zipped .warc file to an institutional repository for preservation.
WebRecorder: https://webrecorder.io
Webrecorder is a product of the Rhizome project and is funded by Mellon. Using the webrecorder interface, users can “record” each webpage and interaction they wish to capture. The product recently (2016) came out of beta and is an excellent solution for smaller websites or situations where the site relies heavily on interactive javascript or embedded elements to create the web experience.
Internet Archive: https://archive.org/web/
The Internet Archive is the main entity archiving the web and is the central place for finding archived versions of web domains. Including one’s site in the Internet Archive will make it available to visitors using tools such as the Internet Archive plugin for Firefox.
Heritrix: https://webarchive.jira.com/wiki/display/Heritrix
Web Archive Player: https://github.com/ikreymer/webarchiveplayer
Ask a Librarian | Hours & Directions | Mason Libraries Home
Copyright © George Mason University