Applied AI Task Force
As a researcher who searches in Arabic, Persian, or Chinese, I want the catalog to understand my query regardless of romanization or diacritics, so I can find items I already know exist.
When a query returns no results, I assume that the library does not have an item.
Discovery
To meet federal requirements, all public video content must have captioning that is 99% accurate.
Captioning
A graduate student with dyslexia wants to load primary-source documents into text-to-speech software so they can access handwritten and historical materials.
A faculty member listens to a book from our collections on their drive to Princeton from Brooklyn
Text Recognition
- The quality of text from our current OCR tool (Tesseract) is low.
- Use vision-language models (VLMs) can significantly improve extracted text in Figgy
- We allow staff to request text improvement, which provides data on need and delivery
- Patrons get readable text
OCR and
Handwritten Text Recognition
Letter from Hugh Simm to Andrew Simm, October 9, 1778
Tesseract:
AV ee |
CHCA Ala, |
f Sa :
<a f . : LA
m ef
Sle ed
hgh
fea 2?
Qwen3-VL:
William Varney Dr.
1802
William Varney Dr.
June 1 — to 1 Cow @ 25 Dollars
Settled in full — Cost 15 Dollars
[Marginal note above, partially crossed out:]
Ct. by Cash 5 Dollars July 7
August 19 by Cash 5 Dollars
—
Calf Skin
1802
June 10 — to one Calf Skin from my father — Sent to Joseph
— 2 shillings to him
Cook Almy Daybook

Create a system to measure the relevance of search results.
Evaluate aspects of semantic search.
- Simple embeddings
- Multilingual embeddings
- NLP classification and expansion
Implement those that lead to improved results.
Discovery
Current AI solutions for automatic speech recognition (ASR) are ~95% accurate.
This is sufficient to facilitate discovery, but does not meet federal accessibility requirements.
Vendors provide 99% and continue to be the preferable way to meet this need.
Captioning

$11,600
- Hardware for production delivery and research/development.
- Small budget for renting the hardware first to verify its capacity before purchase.
- 2,500 images a month of cutting-edge HTR for our most complex content. ($50/mo ongoing)
Budget

Applied AI Task Force
By Andrew Janco
Applied AI Task Force
- 7

