https://slides.com/staceymaples/allstaff-mrm
Machines Reading Maps (MRM) is a collaborative project of
The project is funded by the
at the David Rumsey Map Center at Stanford Libraries
Machines Reading Maps is a project to create a generalizeable ML pipeline that uses human collaboration to:
Process printed text on scanned maps
Enrich the printed text
Convert the printed text to structured data
in order to make scanned historical map content easily searchable with support for complex queries
Uses TExt Spotting TRansformers (TESTR), a generic end-to-end text spotting framework using Transformers for text detection and recognition in the wild.
TESTR is particularly effective when dealing with curved text-boxes where special cares are needed for the adaptation of the traditional bounding-box representations.
The text spotting problem typically consists of two sub-tasks:
The main difficulty in text spotting is contributed by multiple factors including large variations in font, size, style, color, shape, occlusion, distortion, and layout for natural scene images.
Human Annotations
SynthMap+
SynthText Training Data
results