Draft report: Document Analysis with Transducers

Abstract:
We present a general architecture for building Automatic Document Analysis Systems. The architecture is a succession of modules, each transforming graphs that describe lower-level hypotheses about a document into graphs that describe higher-level hypotheses. It generalizes techniques used in Neural Networks, Optical Character Recognition, Natural Language Processing and Speech Recognition.
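
To make the idea of a succession of graph-transforming modules concrete, the following is a minimal sketch in Python, not the system described in this report. All names in it (Arc, recognize, spell, run_pipeline), the toy lookup table, the lexicon and the costs are assumptions introduced purely for illustration. A graph is taken to be a list of arcs, each carrying one hypothesis and a cost, and each module maps a graph of lower-level hypotheses to a graph of higher-level ones.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Sequence, Tuple


@dataclass(frozen=True)
class Arc:
    src: int      # start node
    dst: int      # end node (assumed > src, so the graph is acyclic)
    label: str    # hypothesis carried by this arc
    cost: float   # penalty attached to the hypothesis; lower is better


Graph = List[Arc]
Transducer = Callable[[Graph], Graph]


def recognize(graph: Graph) -> Graph:
    """Hypothetical module: replace each segment arc by character hypotheses."""
    # Toy lookup table standing in for a real recognizer (assumption).
    guesses: Dict[str, List[Tuple[str, float]]] = {
        "seg_a": [("c", 0.2), ("e", 0.9)],
        "seg_b": [("a", 0.1), ("o", 0.7)],
        "seg_c": [("t", 0.3), ("l", 0.8)],
    }
    return [Arc(a.src, a.dst, ch, a.cost + c)
            for a in graph
            for ch, c in guesses.get(a.label, [])]


def spell(graph: Graph, lexicon: Sequence[str] = ("cat", "eat")) -> Graph:
    """Hypothetical module: turn character paths into word hypotheses."""
    if not graph:
        return []
    start = min(a.src for a in graph)
    end = max(a.dst for a in graph)
    words: Graph = []

    def walk(node: int, prefix: str, cost: float) -> None:
        # Enumerate every path from start to end, summing arc costs.
        if node == end:
            if prefix in lexicon:
                words.append(Arc(start, end, prefix, cost))
            return
        for a in graph:
            if a.src == node:
                walk(a.dst, prefix + a.label, cost + a.cost)

    walk(start, "", 0.0)
    return words


def run_pipeline(graph: Graph, modules: Sequence[Transducer]) -> Graph:
    """Apply the modules in order; each output graph feeds the next module."""
    for module in modules:
        graph = module(graph)
    return graph


# Lowest level: three image segments in left-to-right order.
segments = [Arc(0, 1, "seg_a", 0.0), Arc(1, 2, "seg_b", 0.0), Arc(2, 3, "seg_c", 0.0)]
words = run_pipeline(segments, [recognize, spell])
best = min(words, key=lambda a: a.cost)  # lowest-cost word hypothesis
```

In this toy run the final graph holds two word hypotheses, "cat" and "eat", and the lowest-cost one is "cat". The exhaustive path enumeration in spell is chosen only for brevity; it stands in for whatever search a real module would use.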