DjVu

Share Your DjVuPresentation at the Inter-Agency Meeting on Knowledge-Sharing and Information Management, 26-28 June 2002, Paris

Installation instructions

Click on the link below to download the Windows 95/98/2000/ME/NT/XP DjVu plug-in for Netscape and MS Internet Explorer. You will receive an executable file "DjVuWebBrowserPlugIn_xx.exe". Save this file, quit your browser, then run this executable file to install the plug-in. Downloaded executable file can be deleted after installation. You can then restart your browser and proceed to test its proper installation. The DjVu browser plug-in is supported on for Netscape 4.7 and IE 5.0 or greater.

DjVu (pronounced "déjà vu") is a new image compression technology developed since 1996 at AT&T Labs to solve the problem of getting scanned documents onto a Web site. DjVu allows the distribution on the Internet of very high resolution images of scanned documents, digital documents, and photographs. DjVu allows content developers to scan high-resolution color pages of books, magazines, catalogs, manuals, newspapers, historical or ancient documents, and make them available on the Web.

DjVu typically achieves compression ratios about 5 to 10 times better than existing methods such as JPEG and GIF for color documents, and 3 to 8 times than TIFF for black and white documents. Scanned pages at 300 DPI in full color can be compressed down to 30 to 100KB files from 25MB. Black-and-white pages at 300 DPI typically occupy 5 to 30KB when compressed. This puts the size of high-quality scanned pages within the realm of an average HTML page (which is typically around 50KB). For color document images that contain both text and pictures, DjVu files are typically 5 to 10 times smaller than JPEG at similar quality. For black-and-white pages, DjVu files are typically 10 to 20 times smaller than JPEG and five times smaller than GIF. DjVu files are also about 3 to 8 times smaller than black and white PDF files produced from scanned documents (scanned documents in color are impractical in PDF). In addition to scanned documents, DjVu can also be applied to documents produced electronically in formats such as Adobe's PostScript or PDF. In that case, the file sizes are between 15 to 20KB per page at 300 DPI.

The DjVu plug-in is available for standard Web browsers on various platforms. The DjVu plug-in allows for easy panning and zooming of document images. A unique on the fly decompression technology allows images that normally require 25MB of RAM to be decompressed to require only 2MB of RAM.

Conventional image viewing software decompresses images in their entirety before displaying them. This is impractical for high-resolution document images since they typically go beyond the memory capacity of many PCs, causing excessive disk swapping. DjVu, on the other hand, never decompresses the entire image, but instead keeps the image in memory in a compact form, and decompresses the piece displayed on the screen in real time as the user views the image. Images as large as 2,500 pixels by 3,300 pixels (a standard page image at 300 DPI) can be downloaded and displayed on very low-end PCs.

The DjVu format is progressive. Users get an initial version of the page very quickly, and the visual quality of the page progressively improves as more bits arrive. For example, the text of a typical magazine page would appear in just three seconds over a 56Kbps modem connection. In another second or two, the first versions of the pictures and backgrounds will appear. Then, after a few more seconds, the final full-quality version of the page is completed. One of the main technologies behind DjVu is the ability to separate an image into a background layer (i.e., paper texture and pictures) and foreground layer (text and line drawings). Traditional image compression techniques are fine for simple photographs, but they drastically degrade sharp color transitions between adjacent highly contrasted areas - which is why they render type so poorly. By separating the text from the backgrounds, DjVu can keep the text at high resolution (thereby preserving the sharp edges and maximizing legibility), while at the same time compressing the backgrounds and pictures at lower resolution with a wavelet-based compression technique.

DjVu is used by many commercial and non-commercial web sites on the Web today.

For more details on the technical aspects of DjVu, check out the DjVu Zone.

Useful keyboard shortcuts while viewing a DjVu document

<space>, <backspace>: next page, previous page.

<page-down>: for continuous reading (move down on document by one screenfull, flipping page when at the bottom).