Jupyter notebooks for web archives

Play along

Asking questions with web archives – introductory notebooks for historians

 

Project

  • a starting point for researchers
  • use existing APIs (Memento & CDX)
  • no special tools
  • complimenting projects like Archives Unleashed

AIMS

Jupyter

  • combines text and live code
  • use in your browser
  • run in the cloud (no software to install)
  • both tool and tutorial

Jupyter

text

CODE

RUNNING CODE

  • click on a cell
  • hit Shift+Enter

EDITING CODE

  • click on a cell
  • edit the contents
  • hit Shift+Enter to run

GLAM Workbench

GLAM Workbench

GLAM Workbench

GLAM Workbench

Works with these archives

Repositories uSED

But notebooks can be adapted to work with other Pywb, Open Wayback, and Memento compliant systems.

GLAM Workbench

View & download

RUN LIVE!

BINDER

  • builds a customised computing environment
  • opens notebook ready-to-run

BINDER LIMITS

  • inactive notebooks are closed
  • notebooks and data are not saved!
  • use download links in notebooks

MAIN THEMES

  • Types of data
  • Harvesting data & creating datasets
  • Change over time

TYPES OF DATA

Timemaps & Mementos

Harvesting data

Harvesting data

Harvesting data

Harvesting data

Harvesting data

CHANGE over time

CHANGE over time

APPMODE

  • hides all code cells
  • runs all code cells automatically
  • turns a notebook into an app

CHANGE over time

CHANGE over time

CHANGE over time

Suggestions? Problems?

Suggestions? Problems?

Jupyter notebooks for web archives

By Tim Sherratt

Jupyter notebooks for web archives

  • 2,302