OCR - optical character recognition

OCR is a method to recognize text in images. We work with the ISO 8859-1 (8-bit) standard to ensure that all Danish characters are supported without problems.

​

This helps to reduce the workload and the handling of cases as quickly and efficiently refine its search through the digital archive.

​

OCR method that is used by Dansk Scanning A/S, is a special type that has made us certified as one of four approved OCR suppliers in the EU. One of the requirements of being selected as a supplier for digitization within the EU is that our OCR system:

​

​

RECOGNIZES A MINIMUM OF 997 OUT OF 1000 CAPITALS - 99,7%

​

It is important to point out that when the OCR is carried out by a computer, the search is based on many different fonts. This helps to recognize both new printed documents and old fonts from typewriters. Handwritten documents and special text in italics may be hard to identify and therefore can result with OCR on handwritten text can be challenging.