copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Improving OCR Accuracy on Early Printed Books by utilizing Cross Fold Training and Voting

C. Reul, U. Springmann, C. Wick, and F. Puppe. (2017)cite arxiv:1711.09670.

Abstract

In this paper we introduce a method that significantly reduces the character error rates for OCR text obtained from OCRopus models trained on early printed books. The method uses a combination of cross fold training and confidence based voting. After allocating the available ground truth in different subsets several training processes are performed, each resulting in a specific OCR model. The OCR text generated by these models then gets voted to determine the final output by taking the recognized characters, their alternatives, and the confidence values assigned to each character into consideration. Experiments on seven early printed books show that the proposed method outperforms the standard approach considerably by reducing the amount of errors by up to 50% and more.

Description

Improving OCR Accuracy on Early Printed Books by utilizing Cross Fold Training and Voting

Links and resources

BibTeX key: reul2017ocrvoting
entry type: misc
year: 2017
url: http://arxiv.org/abs/1711.09670
note: cite arxiv:1711.09670

@chwick's tags highlighted

Cite this publication

search on

Meta data

Last update 6 years ago
Created 6 years ago

Comments and Reviews
(0)

There is no review or comment yet. You can write one!

BibSonomy

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Improving OCR Accuracy on Early Printed Books by utilizing Cross Fold Training and Voting

Abstract

Description

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews
(0)

BibSonomy

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Improving OCR Accuracy on Early Printed Books by utilizing Cross Fold Training and Voting

Abstract

Description

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews (0)

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Improving OCR Accuracy on Early Printed Books by utilizing Cross Fold Training and Voting

Comments and Reviews
(0)