A web interface to manage annotations and CNN model training with kraken.
Technologies for transforming scans, images, PDFs, audio, and other media into machine-readable data.
Clean, evaluate, curate, and document.
Methods and research practices for the study of large document collections, and how ML can augment existing research practices.
CNN
CNN
ViT
OCR+HTR
OCR
VDU
Text