Document image matching is a key technique for document image registration and retrieval. In this paper, we propose a new matching method based on the document component block list (CBL). A document image is first parsed into a number of component blocks, defined as non-adherent rectangular areas of substantial document content. These blocks are then organized as a list, on which several matching operations are defined. The template image most similar to the query document image is selected as the matching result. The method makes effective use of both the local information in each page component block and the global information in the page layout. We evaluate the method on a large-scale database of document template images. It achieves good matching accuracy and is robust to image distortion, filled-in text, and noise.
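
The pipeline described above (parse a page into rectangular blocks, hold them in a list, score the list against each template, pick the best) can be illustrated with a minimal sketch. The abstract does not specify the paper's matching operations, so everything below is an assumption made for illustration: the `Block` class, the use of intersection over union (IoU) as the per-block similarity, the greedy one-to-one pairing, and the names `list_similarity` and `best_template` are all hypothetical and are not the authors' CBL operations. Block extraction from the image is also assumed to have happened already, with coordinates normalized to the page size.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass(frozen=True)
class Block:
    """Axis-aligned rectangle in page-normalized coordinates (0..1). Hypothetical type."""
    x0: float
    y0: float
    x1: float
    y1: float

    def area(self) -> float:
        return max(0.0, self.x1 - self.x0) * max(0.0, self.y1 - self.y0)


def iou(a: Block, b: Block) -> float:
    """Intersection over union of two blocks (assumed local similarity measure)."""
    iw = min(a.x1, b.x1) - max(a.x0, b.x0)
    ih = min(a.y1, b.y1) - max(a.y0, b.y0)
    if iw <= 0 or ih <= 0:
        return 0.0
    inter = iw * ih
    return inter / (a.area() + b.area() - inter)


def list_similarity(query: List[Block], template: List[Block]) -> float:
    """Greedy one-to-one block pairing, averaged over the longer list.

    Dividing by the longer list penalizes unmatched blocks, which is a
    crude stand-in for comparing the global page layout.
    """
    if not query or not template:
        return 0.0
    pairs = sorted(
        ((iou(q, t), qi, ti)
         for qi, q in enumerate(query)
         for ti, t in enumerate(template)),
        reverse=True,
    )
    used_q, used_t, total = set(), set(), 0.0
    for score, qi, ti in pairs:
        if score == 0.0:
            break  # remaining pairs do not overlap at all
        if qi in used_q or ti in used_t:
            continue  # each block may be paired only once
        used_q.add(qi)
        used_t.add(ti)
        total += score
    return total / max(len(query), len(template))


def best_template(query: List[Block],
                  templates: Dict[str, List[Block]]) -> Tuple[str, float]:
    """Return (name, score) of the most similar template; raises ValueError if none given."""
    scored = ((name, list_similarity(query, blocks))
              for name, blocks in templates.items())
    return max(scored, key=lambda pair: pair[1])


if __name__ == "__main__":
    form_a = [Block(0.1, 0.1, 0.9, 0.2), Block(0.1, 0.3, 0.5, 0.8)]
    form_b = [Block(0.5, 0.5, 0.9, 0.9)]
    # A query page whose second block is slightly shifted relative to form_a.
    query = [Block(0.1, 0.1, 0.9, 0.2), Block(0.12, 0.3, 0.5, 0.8)]
    print(best_template(query, {"form_a": form_a, "form_b": form_b}))
```

Greedy pairing is used here only because it is short. An optimal assignment or a distortion-tolerant alignment step would be natural alternatives, and the abstract does not say which approach, if any of these, the paper adopts.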
