Exploring GLAM data in the Humanities

What can you do with 200 million newspaper articles?

These slides

1994–1997

Seeing differently

WWI service records

WEST Ernest Robert : Service Number - 5091 : Place of Birth - Benambra VIC : Place of Enlistment - Sale VIC : Next of Kin - (Mother) WEST Mrs F M

Name: WEST Ernest Robert
Service Number: 5091
Place of Birth: Benambra VIC
Place of Enlistment: Sale VIC
Next of Kin: (Mother) WEST Mrs F M

structured data!

Kevin

me

https://www.realfaceofwhiteaustralia.net/faces/

See: Sherratt & Bagnall, 'The people inside' in Seeing the Past with Computers

The REAL FACE OF WHITE AUSTRALIA

ASKINg NEW Questions

OCRd text

image

metadata

2011

2023

2011

2023

2012

2023

Understanding Access

1915

2011

2014

2014

2021

Last Sunday

#redactionart

Creating pathways

GLAM data is...

  • collection metadata
  • text
  • images, sound, & video
  • 3d models
  • user generated
  • born digital
  • extracted & enriched
  • & more...

Australian government agencies over time

GLAM data

ReseArchers

GLAM WORKBENCH

  • tools
  • tutorials
  • examples
  • handy hacks
  • pre-harvested datasets
  • a work in progress...

The GLAM Workbench is...

  • Trove
  • DigitalNZ
  • Commonwealth Hansard
  • National Archives of Australia
  • National Museum of Australia
  • Te Papa
  • Open data portals
  • Web Archives
  • & more!

COLLECTIONS

Trove user tags

Web archives

gov.au subdomains

abc.net.au

National Archives of Australia

#redactionart

White Australia policy

DigitalNZ

open collections

Papers Past

Commonwealth Hansard

GLAM datasets

9.6 million names

Assembling data

Asking questions

Hacking heritage

Live documentation

  • 100+ Jupyter notebooks
  • Live code, text, images
  • Same code, different views
  • All in your browser!

The HOW

&

THE WHY

code

chart

text

Jupyter notebook

Getting started

National Museum of Australia

Explore APIs

  • Trove
  • DigitalNZ
  • National Museum of Australia
  • Te Papa
  • Museums Victoria
  • Web Archives

GLAM JUPYTER

PLACES TO START

&

PLACES TO GO

GLAM CSV Explorer

app

notebook

ZooM out

&

Zoom in

TROVE newspaper harvester

HarvestING DATA

READY-TO-GO Data!

PLATFORMS

&

ENVIRONMENTS

Jump in with Binder

one click to start

Reclaim Cloud

NECTAR Cloud

Docker IMAGES

A work in progress

DIY GLAM Workbench

a new GLAM Workbench repository!

BEST PRACTICES

get involved!

DATA LABS & Guides

The GLAM Workbench is...

  • Free!
  • Openly licensed to encourage reuse & modification
  • Shared through GitHub
  • Preserved in Zenodo

Thank you!

Exploring GLAM data in the humanities

By Tim Sherratt

Exploring GLAM data in the humanities

TDWG, October 2023

  • 799