Distant Reading, Distant Viewing
Activity with PixPlot and Princeton Slavic Collections
A Distant Viewing Toolkit for Image Collections
What is computer vision?
What does a computer "see," and how?
CGANs
Distant reading is the idea of processing the content of, or information about, a large number of textual items without reading the texts themselves. The “reading” is a form of data mining that allows information in or about the texts to be processed and analyzed.
...a methodological and theoretical framework for studying large collections of visual materials. Distant viewing is distinguished from other approaches by making explicit the interpretive nature of extracting semantic metadata from images.
Text
Ben Schmidt, machine reading the HathiTrust (14 million volumes, about as many as there are books in the Library of Congress)
Activity (10 minutes)
a) click here
b) explore the image clusters
c) what is the machine able to see very clearly?
d) could these clusters be useful in art historical research?
Video
vision models
JSON data for each frame
9 types of annotations
4 types of aggregators
BookNLP for Videos
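The split between frame-level JSON and aggregators can be pictured with a small sketch. The record layout below (a frame number plus a list of detected objects) is a hypothetical schema made up for illustration, not DVT's actual output format, and the function name frames_per_label is likewise an assumption. The aggregator simply counts how many frames each label appears in.

```python
import json
from collections import Counter

# Hypothetical per-frame annotations, one JSON object per frame.
# This schema is an assumption for illustration, not DVT's real output.
FRAMES_JSON = """
[
  {"frame": 0, "objects": ["person", "chair"]},
  {"frame": 1, "objects": ["person"]},
  {"frame": 2, "objects": ["person", "dog", "dog"]}
]
"""


def frames_per_label(frames):
    """Aggregate frame-level annotations: count frames containing each label."""
    counts = Counter()
    for record in frames:
        # set() so a label detected twice in one frame is counted once
        counts.update(set(record["objects"]))
    return counts


frames = json.loads(FRAMES_JSON)
summary = frames_per_label(frames)
for label, n in summary.most_common():
    print(f"{label}: {n} of {len(frames)} frames")
```

The point of the design is that annotation (what is in each frame) and aggregation (what that says about the whole video) stay separate, so the same frame data can feed several different summaries.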
I really wanted to use DVT for this workshop; however:
A journey...
IIIF Manifest
pre-trained models
CSV file
Palladio
Omeka
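The first steps of this journey, from IIIF manifest to CSV file, might look roughly like the sketch below. It assumes a IIIF Presentation API 2.x manifest, inlined here with placeholder URLs so it runs offline. The labels_for argument is a hypothetical stand-in for whichever pre-trained model supplies the labels, and the column names are illustrative only.

```python
import csv
import io

# A trimmed IIIF Presentation API 2.x manifest, inlined so the sketch runs
# offline. The URLs and labels are placeholders, not real collection items.
MANIFEST = {
    "@id": "https://example.org/iiif/manifest.json",
    "label": "Sample collection",
    "sequences": [{
        "canvases": [
            {"label": "Poster 1",
             "images": [{"resource": {
                 "@id": "https://example.org/iiif/img1/full/full/0/default.jpg"}}]},
            {"label": "Poster 2",
             "images": [{"resource": {
                 "@id": "https://example.org/iiif/img2/full/full/0/default.jpg"}}]},
        ]
    }],
}


def manifest_rows(manifest):
    """Yield (canvas label, image URL) pairs from a Presentation 2.x manifest."""
    for sequence in manifest.get("sequences", []):
        for canvas in sequence.get("canvases", []):
            for image in canvas.get("images", []):
                yield canvas.get("label", ""), image["resource"]["@id"]


def rows_to_csv(rows, labels_for=lambda url: []):
    """Write rows as CSV text; labels_for is a stand-in for a vision model."""
    buffer = io.StringIO()
    writer = csv.writer(buffer)
    writer.writerow(["title", "image_url", "labels"])
    for title, url in rows:
        writer.writerow([title, url, ";".join(labels_for(url))])
    return buffer.getvalue()


csv_text = rows_to_csv(manifest_rows(MANIFEST))
print(csv_text)
```

A table in this shape is the kind of file that can then be loaded into a tool such as Palladio for exploration.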
Custom research-tailored computer vision models
Open this Notebook and see what results you get for your images with Google Vision
Vision label detection on a full image collection
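Running label detection over a whole collection comes down to sending batches of image URLs and collecting the labels that come back. The sketch below only builds the JSON request body for the Cloud Vision REST method images:annotate and parses a response of the documented shape; it makes no network call. The sample response is invented for illustration, and the helper names and the 0.5 score threshold are assumptions, not part of the workshop notebook.

```python
import json

# Public REST endpoint for Cloud Vision. An API key or OAuth token is needed
# to actually call it, so this sketch only builds and parses the JSON.
ENDPOINT = "https://vision.googleapis.com/v1/images:annotate"


def build_label_request(image_urls, max_results=5):
    """Build one batch request body asking for LABEL_DETECTION on each URL."""
    return {
        "requests": [
            {
                "image": {"source": {"imageUri": url}},
                "features": [
                    {"type": "LABEL_DETECTION", "maxResults": max_results}
                ],
            }
            for url in image_urls
        ]
    }


def parse_labels(response, min_score=0.5):
    """Return, per image, the (description, score) pairs at or above min_score."""
    results = []
    for item in response.get("responses", []):
        labels = [
            (ann["description"], ann["score"])
            for ann in item.get("labelAnnotations", [])
            if ann["score"] >= min_score
        ]
        results.append(labels)
    return results


# Invented sample response in the documented shape; not real model output.
SAMPLE_RESPONSE = {
    "responses": [
        {"labelAnnotations": [
            {"description": "Poster", "score": 0.93},
            {"description": "Font", "score": 0.41},
        ]},
        {},  # an image for which no labels came back
    ]
}

body = build_label_request(["https://example.org/a.jpg", "https://example.org/b.jpg"])
print(json.dumps(body, indent=2))
print(parse_labels(SAMPLE_RESPONSE))
```

Filtering on the score is a research decision as much as a technical one: a low threshold surfaces more labels per image but also more noise.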
Custom research-tailored computer vision models
Google AutoML Vision