"a decade ago was that there was a serious lack of uniformity." ... "I had hoped that the situation in the C++ world would have improved since then, unfortunately it appears that I may be sadly dissapointed."
document analysis and OCR system, featuring pluggable layout analysis, pluggable character recognition, statistical natural language modeling, and multilingual capabilities. The OCRopus engine is based on two research projects: a high-performance handwriting recognizer developed in the mid-1990s and deployed by the US Census Bureau, and novel high-performance layout analysis methods. OCRopus development is sponsored by Google and is initially intended for high-throughput, high-volume document conversion efforts.