Historical archival records present many challenges for OCR
systems to correctly encode their content, due to visual complexity,
e.g. mixed printed text and handwritten annotations, paper
degradation and faded ink. This paper addresses the problem of
automatic identification and separation of handwritten and printed
text in historical archival documents, including the creation of an
artificial pixel-level annotated dataset and the presentation of a
new FCN-based model trained on historical data. Initial test
results indicate 18% IoU performance improvement on recognition
of printed pixels and 10% IoU performance improvement on
recognition of handwritten pixels in synthesised data when
compared to the state-of-the-art trained on modern documents.
Furthermore, an extrinsic OCR-based evaluation on the printed
layer extracted from real historical documents shows 26%
performance increase.
%0 Conference Paper
%1 vafaie2022handwritten
%A Vafaie, Mahsa
%A Bruns, Oleksandra
%A Pilz, Nastasja
%A Waitelonis, Jörg
%A Sack, Harald
%D 2022
%B Archiving Conference
%K archive fiziseown handwritten historical ise text_identification
%P 15--20
%R 10.2352/issn.2168-3204.2022.19.1.04
%T Handwritten and Printed Text Identification in Historical Archival Documents
%U https://library.imaging.org/admin/apis/public/api/ist/website/downloadArticle/archiving/19/1/4
%V 19
%X Historical archival records present many challenges for OCR
systems to correctly encode their content, due to visual complexity,
e.g. mixed printed text and handwritten annotations, paper
degradation and faded ink. This paper addresses the problem of
automatic identification and separation of handwritten and printed
text in historical archival documents, including the creation of an
artificial pixel-level annotated dataset and the presentation of a
new FCN-based model trained on historical data. Initial test
results indicate 18% IoU performance improvement on recognition
of printed pixels and 10% IoU performance improvement on
recognition of handwritten pixels in synthesised data when
compared to the state-of-the-art trained on modern documents.
Furthermore, an extrinsic OCR-based evaluation on the printed
layer extracted from real historical documents shows 26%
performance increase.
Conference paper (IS&T Archiving Conference 2022). Venue moved from the
ignored-for-@inproceedings `journal` field into the required `booktitle`;
DOI stored bare (no resolver prefix); author accent written as a BibTeX
special character for classic-BibTeX sorting compatibility.
@inproceedings{vafaie2022handwritten,
  abstract  = {Historical archival records present many challenges for OCR
systems to correctly encode their content, due to visual complexity,
e.g. mixed printed text and handwritten annotations, paper
degradation and faded ink. This paper addresses the problem of
automatic identification and separation of handwritten and printed
text in historical archival documents, including the creation of an
artificial pixel-level annotated dataset and the presentation of a
new FCN-based model trained on historical data. Initial test
results indicate 18% IoU performance improvement on recognition
of printed pixels and 10% IoU performance improvement on
recognition of handwritten pixels in synthesised data when
compared to the state-of-the-art trained on modern documents.
Furthermore, an extrinsic OCR-based evaluation on the printed
layer extracted from real historical documents shows 26%
performance increase.},
  added-at  = {2022-09-08T11:26:59.000+0200},
  author    = {Vafaie, Mahsa and Bruns, Oleksandra and Pilz, Nastasja and Waitelonis, J{\"o}rg and Sack, Harald},
  biburl    = {https://www.bibsonomy.org/bibtex/254a5e95821085e85ed33ccbf6bde0ecf/vivienvetter},
  booktitle = {Archiving Conference},
  doi       = {10.2352/issn.2168-3204.2022.19.1.04},
  interhash = {48544809ccb52d8509a757115c76b4ad},
  intrahash = {54a5e95821085e85ed33ccbf6bde0ecf},
  keywords  = {archive fiziseown handwritten historical ise text_identification},
  language  = {English},
  pages     = {15--20},
  timestamp = {2022-09-08T12:10:02.000+0200},
  title     = {Handwritten and Printed Text Identification in Historical Archival Documents},
  url       = {https://library.imaging.org/admin/apis/public/api/ist/website/downloadArticle/archiving/19/1/4},
  volume    = {19},
  year      = {2022}
}