1. A Human-Annotated Dataset of Scanned Images and OCR Texts from Medieval Documents Creator: Novotný, Vít, Seidlová, Kristýna, Vrabcová, Tereza, and Horák, Aleš Publisher: Masaryk University, Brno Type: image and corpus Subject: ocr, optical character recognition, language identification, image super-resolution, sr, and Medieval Language: German, Czech, Latin, and English Description: This is an open dataset of scanned images and OCR texts from 19th and 20th century letterpress reprints of documents from the Hussite era. The dataset contains human annotations for layout analysis, OCR evaluation, and language identification. Rights: Public Domain Dedication (CC Zero), http://creativecommons.org/publicdomain/zero/1.0/, and PUB