Exploring collection data through the GLAM Workbench

 

Image: State Library of Victoria, http://handle.slv.vic.gov.au/10381/342096

Tim Sherratt

@wragge

these slides

Collections as data?

SEARCH IS FAMILIAR

BUT What about the other 3m?

SAME search

DIFFERENT VIEW

MORE searches

different questions!

APIs

data dumps

CSVs

full text

images

collections

as

data

?

collections

as

data

+

GLAM Workbench

GLAM Workbench is

  • tools, tutorials, examples, hacks
  • live code
  • editable, reusable, hackable
  • openly licensed

GLAM Workbench is NOT

  • coding 101
  • finished or perfect

GLAM Workbench

Not just how, but why...

  • possibilities – why should I be interested?

  • starting points – can you give me an example I can use?

  • pathways – where do I go next?

Code

Discussion

Results

Jupyter notebooks

  • Computing in your browser
  • A computational narrative – combine text, images, code & more
  • A standard format – use on different platforms
  • See Introduction to Jupyter Notebooks

Jupyter Notebooks

Why JUPYTER?

Jupyter Notebooks

GLAM examples

  • used on multiple platforms
  • presented in different formats
  • changed by extensions
  • tailored to different users

Not just notebooks

Jupyter can be...

FINDING GLAM DATA

PACKAGING DATA

(create collections of newspaper articles)

Trove newspaper harvester

EXTRACT METADATA

(no API? you can always try screen-scraping!)
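A minimal sketch of what screen-scraping can look like, using only Python's standard library. The markup, column names, and the `scrape_table` helper are made up for illustration; a real results page will have its own structure, and this is not the code used in the Workbench notebooks.

```python
from html.parser import HTMLParser


class TableScraper(HTMLParser):
    """Collect the text of every table cell, grouped by row."""

    def __init__(self):
        super().__init__()
        self.rows = []      # completed rows
        self._row = None    # row currently being read
        self._cell = None   # text fragments of the current cell

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        # Only keep text that sits inside a table cell
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None


def scrape_table(html):
    """Return one dict per data row, keyed by the header row."""
    parser = TableScraper()
    parser.feed(html)
    if not parser.rows:
        return []
    header, *records = parser.rows
    return [dict(zip(header, record)) for record in records]


# Made-up markup for illustration only; real pages will differ.
SAMPLE_HTML = """
<table>
  <tr><th>Title</th><th>Date range</th></tr>
  <tr><td>Example file one</td><td>1901 - 1902</td></tr>
  <tr><td>Example file two</td><td>1915 - 1915</td></tr>
</table>
"""

items = scrape_table(SAMPLE_HTML)
```

Once the rows are dictionaries, they can be saved as a CSV or loaded into a dataframe like any other structured data.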

National Archives of Australia

National Archives of Australia

DOWNLOAD IMAGES

(a bit of API, a bit of screen-scraping)

Trove journals

ASKING different questions

TROVE newspapers

Visualise searches

(asking historical questions with search facets)
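One way to turn facet counts into a historical question: compare the number of matching articles per year with the total number of articles per year, so that growth in the collection is not mistaken for growth in interest. The sketch below assumes the two sets of year counts have already been collected from a search interface; the function name and the numbers are toy values for illustration, not real results.

```python
def proportion_by_year(matches, totals):
    """Share of each year's articles that match the search.

    Years with no articles at all are skipped to avoid dividing by zero.
    """
    return {
        year: matches.get(year, 0) / total
        for year, total in sorted(totals.items())
        if total > 0
    }


# Toy numbers for illustration only, not real search results.
matches = {1914: 50, 1915: 300, 1916: 120}
totals = {1914: 1000, 1915: 1200, 1916: 1000, 1917: 0}

shares = proportion_by_year(matches, totals)
```

Plotting `shares` rather than the raw counts gives a different view of the same search.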

DIGITALNZ

OPEN COLLECTIONS

(visualise usage conditions)

COUNTING WORDS

Australian Parliament
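Counting words needs very little code once the full text is in hand. A minimal sketch, assuming plain text input; the sample sentence, the stopword list, and the `count_words` helper are invented for illustration.

```python
import re
from collections import Counter


def count_words(text, stopwords=frozenset()):
    """Count lower-cased words, ignoring any listed stopwords."""
    words = re.findall(r"[a-z']+", text.lower())
    return Counter(word for word in words if word not in stopwords)


# Invented sample text, not taken from any real proceedings.
sample = "The honourable member asked the minister. The minister replied."
counts = count_words(sample, stopwords={"the"})
top_word = counts.most_common(1)
```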

SEEING change

(screenshots over time)
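To look at a page over time you first need one archived capture per period. A sketch of building those addresses, assuming the Internet Archive's Wayback Machine, which resolves a timestamped URL to the capture closest to that date. Other web archives use different patterns, the helper names are hypothetical, and taking the actual screenshots is a separate step not shown here.

```python
def wayback_url(url, year):
    """Address of the capture closest to 1 January of the given year."""
    timestamp = f"{year:04d}0101000000"  # YYYYMMDDhhmmss
    return f"https://web.archive.org/web/{timestamp}/{url}"


def yearly_capture_urls(url, start_year, end_year):
    """One Wayback address per year, inclusive of both end points."""
    return [wayback_url(url, year) for year in range(start_year, end_year + 1)]


# example.com is a placeholder, not a page discussed in these slides.
urls = yearly_capture_urls("http://example.com/", 2000, 2002)
```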

WEB ARCHIVES

tracking text

WEB ARCHIVES

HACKING HERITAGE

RANDOM DIGITALNZ ITEMS

EXTEND APIs

(get randomly selected collection items)
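If an API has no "random" option, one possible approach is to ask how many results a search has, pick a random position in that range, and request only the page that contains it. The sketch below shows just the arithmetic. The `page` and `per_page` names are placeholders for whatever paging parameters a given API uses, and this is not necessarily how the Workbench notebooks do it.

```python
import random


def random_page_request(total_results, per_page=1, rng=random):
    """Pick a random result and work out which page it sits on.

    Returns the 1-based page number, the page size, and the item's
    0-based position within that page.
    """
    if total_results < 1:
        raise ValueError("need at least one result to choose from")
    index = rng.randrange(total_results)  # 0-based position in the full result set
    return {
        "page": index // per_page + 1,
        "per_page": per_page,
        "position": index % per_page,
    }


# A seeded generator makes the choice repeatable while experimenting.
request = random_page_request(95, per_page=10, rng=random.Random(0))
```

Many APIs cap how deep paging can go, so for very large result sets it may be necessary to narrow the search first.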

Trove newspapers

FIND LANGUAGES

(non-English language newspapers)

words from OCR

PLAY with collections

(creating scissors & paste messages)

Bringing documentation alive

AN API Example

Museums Victoria

EXPLORE AN API

(collections in time & space)

National Museum of Australia

WEB ARCHIVE APIS

DOCUMENT DATA Sources

PATHWAYS

BUILDING CONFIDENCE

App

Notebook

BUILDING CONFIDENCE

Running jupyter notebooks

RUN LIVE in binder

BINDER

Binder in GLAM Workbench

The magic of Binder

Binder in GLAM Workbench

  • a single click to start
  • batteries included (no software for the user to install)
  • encourages experimentation
  • just try it...
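The single click is an ordinary link. A sketch of how a Binder launch link for a GitHub repository can be put together, assuming the public mybinder.org service and its `urlpath` option for opening a notebook in JupyterLab. The organisation, repository, and notebook names below are placeholders, not actual Workbench repositories.

```python
from urllib.parse import quote


def binder_url(org, repo, ref="HEAD", notebook=None):
    """Build a mybinder.org launch link for a GitHub repository.

    If a notebook path is given, the link opens it in JupyterLab.
    """
    url = f"https://mybinder.org/v2/gh/{org}/{repo}/{ref}"
    if notebook:
        # The path is passed as a query value, so slashes are encoded too
        url += "?urlpath=" + quote(f"lab/tree/{notebook}", safe="")
    return url


# Placeholder names for illustration only.
link = binder_url("example-org", "example-repo", notebook="index.ipynb")
```

Putting a link like this behind a button is all a reader needs to start a live session.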

MORE ways to run

RECLAIM CLOUD

  • one-click installation

  • persistent environments

  • low cost

MORE ways to run

DOCKER

  • run locally

  • persistent environments

  • free, but more knowledge needed

COPY!
EDIT!
RE-USE!

GLAM WORKBENCH IS OPEN!

  • free!
  • openly licensed to encourage reuse & modification
  • shared through GitHub
  • preserved in Zenodo
  • use it, share it, change it

CONTACT ME

@wragge on Twitter

timsherratt.org

GLAM Workbench issues on GitHub

GLAM Workbench on OzGLAM Help

Don't have a question? Why not ask about my t-shirt?...