Improving OCR Accuracy on Early Printed Books using Deep Convolutional Networks

02/27/2018
by   Christoph Wick, et al.
0

This paper proposes a combination of a convolutional and a LSTM network to improve the accuracy of OCR on early printed books. While the standard model of line based OCR uses a single LSTM layer, we utilize a CNN- and Pooling-Layer combination in advance of an LSTM layer. Due to the higher amount of trainable parameters the performance of the network relies on a high amount of training examples to unleash its power. Hereby, the error is reduced by a factor of up to 44 voting mechanism to achieve character error rates (CER) below 0.5 runtime of the deep model for training and prediction of a book behaves very similar to a shallow network.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset