Xerox: Copying Pages From Bound Books Without Damaging The Spine

Monday, December 06, 2004

Press release from the issuing company

ROCHESTER, N.Y., Dec. 02, 2004 -- If you've ever copied pages from a book, you're familiar with the problem - dark, distorted words where the page is bound into the book. To correct it, most of us use the "brute force" method for getting readable copies - pushing the book to flatten it against the glass scanning surface, called the platen.
Xerox Corporation researchers think there is a better way. They have proposed a simple and elegant solution - a mathematical formula that could be incorporated in the software of common scanners to eliminate the problem. It could not only save the spines of books but also produce copies that are easier to read.
The innovative research was presented by Beilei Xu, a scientist at the company's Webster, N.Y., laboratory, at the 5th International Conference on Imaging Science and Hard Copy in Xi'an, China.
According to Xu and Robert P. Loce, another Xerox researcher who worked on the project, when a book page is not in uniform, intimate contact with the scanning surface there are actually two distinct problems. The variation in illumination causes some portions of the copy to be darker than others, and the variation in distance from the scanning surface causes letters or objects farther from the surface to look warped.
There have been some specialized and costly attempts to correct for this. At one time, for instance, Xerox sold a copier with an angled edge and articulated cover so people could copy pages without cracking books all the way open. And another solution is dedicated book scanners with height sensors, so the book lies face up, and scanning takes place from above it.
Instead of redesigning the machine to accommodate books, however, Xu and Loce decided to build the capability into the software. Scanners capture images by shining light onto the image; that light is reflected off the image, then measured by sensors, which "read" the lightness or darkness of the image, pixel by pixel.
"Normally the light only provides information on the reflectance of the original document," Xu said. "In our application, we use the sensed light to also determine the distance of the book from the platen for each pixel on the page."
Because parts of the page farther from the platen appear darker, they can determine the variation in distance and mathematically compensate for it, eliminating from the image the darker portion where the page is bound into the book. However, the image will still have distorted words at the center. The second step in their process is to apply a de-warping factor, again based on how far the page is from the platen, to compensate for the geometric distortion of the image.
The patented process could readily enable books to be scanned with high-quality results on almost any inexpensive, conventional scanner, according to Loce. The software is designed to work with the methodology that is commonly used in the consumer market scanners and copiers, an area where Xerox does not currently compete. However, the technology is available for license, and the theory behind it could be adapted in the future for the high-speed scanners used in Xerox products.