Exploring digital collections with the GLAM Workbench

 

Image: State Library of Victoria, http://handle.slv.vic.gov.au/10381/342096

Tim Sherratt

@wragge

these slides

Collections as data?

SEARCH IS FAMILIAR

BUT What about the other 3m?

SAME search

DIFFERENT VIEW

MORE searches

different questions!

APIs

data dumps

CSVs

full text

images

collections

as

data

?

APIs

data dumps

CSVs

full text

images

collections

as

data

+

GLAM Workbench

Not just how, but why...

  • possibilities – why should I be interested?
  • starting points – can you give me an example I can use?
  • pathways – where do I go next?

628

CSV files from Australian GLAM organisations

harvested from government data portals

https://glam-workbench.github.io/glam-data-portals/

CSV FILES

GLAM CSV Explorer

GLAM Workbench

  • tools, tutorials, examples, hacks
  • live code
  • editable, reusable, hackable
  • openly licensed

is

GLAM Workbench

  • coding 101
  • finished or perfect

is NOT

GLAM Workbench

Powered

by

JUPYTER

GLAM Workbench

  • Computing in your browser
  • A computational narrative – combine text, images, code & more
  • A standard format – use on different platforms
  • See Introduction to Jupyter Notebooks

Jupyter Notebooks

Why JUPYTER?

Jupyter Notebooks

GLAM examples

  • used on multiple platforms
  • presented in different formats
  • changed by extensions
  • tailored to different users

Not just notebooks

JupyteR can be...

NOT JUST notebooks

  • create tutorials
  • develop tools
  • harvest data
  • share code snippets
  • document an API
  • visualise a dataset
  • hack an interface
  • move data around

Use Jupyter to...

Examples

WEB ARCHIVE APIS

DOCUMENT DATA Sources

National Museum of Australia

EXPLORE AN API

(collections in time & space)

EXPLORE AN API

(collections in time & space)

National Museum of Australia

EXPLORE AN API

(collections in time & space)

National Museum of Australia

AN API Example

Museums Victoria

PACKAGING DATA

(create collections of newspaper articles)

Trove newspaper harvester

National archives of australia

EXtract metadata

(no API? you can always try screen-scraping!)

EXtract metadata

(no API? you can always try screen-scraping!)

National archives of australia

DOWNLOAD TEXT

(a bit of API, a bit of screen-scraping)

Trove journals

DOWNLOAD IMAGES

(a bit of API, a bit of screen-scraping)

Trove journals

Text

TROVE newspapers

Visualise searches

(asking historical questions with search facets)

TROVE newspapers

Visualise searches

(asking historical questions with search facets)

TROVE newspapers

Visualise searches

(asking historical questions with search facets)

SEEING change

(screenshots over time)

WEB
ARCHIVES

SEEING change

(text in web pages)

WEB
ARCHIVES

SHOWING EVERYTHING

gov.au
subdomains

COUNTING WORDS

australian parliament

COUNTING WORDS

australian parliament

REMIXING WORDS

RECIPE
BOOKS from Trove

tracking text

WEB
ARCHIVES

FINDING FACES

STATE LIBRARY
NSW

FINDING GAPS

Sydney Stock exchange

FINDING GAPS

(70,000 digitised pages in 200 volumes)

Sydney Stock exchange

FINDING GAPS

(calendar view)

Sydney Stock exchange

RANDOM
TROVE
ITEMS

EXTEND APIs

(get a randomly selected newspaper article)

save a newspaper article as an image

Hack Interfaces

(add a new download option to Trove)

save a newspaper article as an image

Hack Interfaces

(the same notebook as an app)

words
from OCR

PLAY with collections

(creating scissors & paste messages)

One notebook, Multiple views!

In the GLAM Workbench

One notebook, Multiple views!

In the GLAM Workbench

} STATIC

} LIVE

One notebook, Multiple views!

POSSIBLE PATHWAYS

} STATIC

} LIVE

Novice

One notebook, Multiple views!

POSSIBLE PATHWAYS

} STATIC

} LIVE

Learning

One notebook, Multiple views!

POSSIBLE PATHWAYS

} STATIC

} LIVE

Experienced

Open notebooks LIVE in binder

BINDER

Binder In GLAM WorKbench

The magic of Binder

Binder In GLAM WorKbench

  • A single click to start
  • Batteries included (no software for the user to install)
  • Encourages experimentation
  • Just try it...

Fingers crossed

The future?

  • better pathways
  • systems for review & collaboration
  • embedding within research training
  • automated testing & building
  • more GLAM collections!

CONTACT ME

@wragge on Twitter

timsherratt.org

GLAM Workbench issues on GitHub

GLAM Workbench on OzGLAM Help

GLAM Workbench Gitter chat

GLAM Workbench to explore digital collections

By Tim Sherratt

GLAM Workbench to explore digital collections

  • 1,344