The pagenumber element describes the page that the record is associated with. Most image files have a single page, but formats such as TIFF support multiple pages.

The image element contains information about the image.

The imagedata element contains the image data, base-64 encoded. The format attribute is deprecated and HPE recommends using the format element instead.

The width and height elements provide the size of the image.

The pixelAspectRatio element describes the shape of the pixels that make up the image. For example, a value of 1:1 means that the pixels are square.

The format element describes the format of the image data contained in the imagedata element. For images in the Image_1 track, this value is always PNG.

The compressionQuality element describes the amount of compression that is applied to the data in the imagedata element. For images in the Image_1 track, this value is always 100 (indicating maximum quality and no compression).
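As an illustration of how these elements might be read together, the following Python sketch parses one record and decodes the image data. The element names come from the descriptions above, but the nesting shown in SAMPLE_RECORD, the sample values, and the read_image helper are assumptions made for this example, not the documented output of Media Server.

```python
import base64
import xml.etree.ElementTree as ET
from fractions import Fraction

# Hypothetical record: element names follow the descriptions above, but the
# nesting and the sample values are assumptions made for illustration.
SAMPLE_RECORD = """
<record>
  <pagenumber>1</pagenumber>
  <image>
    <imagedata>{data}</imagedata>
    <width>2</width>
    <height>1</height>
    <pixelAspectRatio>1:1</pixelAspectRatio>
    <format>PNG</format>
    <compressionQuality>100</compressionQuality>
  </image>
</record>
""".format(data=base64.b64encode(b"\x89PNG placeholder bytes").decode("ascii"))


def read_image(record_xml):
    """Return the decoded image bytes and basic properties from one record."""
    record = ET.fromstring(record_xml.strip())
    image = record.find("image")
    # The ratio is written as "numerator:denominator", for example "1:1".
    num, den = image.findtext("pixelAspectRatio").split(":")
    return {
        "page": int(record.findtext("pagenumber")),
        # The imagedata element holds base-64 text; decode it to raw bytes.
        "data": base64.b64decode(image.findtext("imagedata")),
        "width": int(image.findtext("width")),
        "height": int(image.findtext("height")),
        "pixel_aspect_ratio": Fraction(int(num), int(den)),
        "format": image.findtext("format"),
        "compression_quality": int(image.findtext("compressionQuality")),
    }


info = read_image(SAMPLE_RECORD)
print(info["page"], info["format"], info["width"], info["height"])
```

The decoded bytes in info["data"] could then be written to a file whose extension matches the value of the format element.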

If you ingest a document such as a PDF file, the output might also include the text extracted from text elements.

The pagetext element contains information about associated text elements. If the ingested media was a PDF file, each record represents a page. If the ingested media was another type of document, each record represents an embedded image and the text that follows it, up to the next embedded image.

Each element named element describes a single text element and contains the following data:

The text element contains the text of the text element that is being described.

The region element provides the position of the text element on the page.

NOTE: The region information is accurate only if the ingested document was an Adobe PDF file.

The angle element provides the orientation of the text.
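The following Python sketch shows one way the text elements could be collected and placed in reading order. The nesting shown in SAMPLE_PAGETEXT, the children of the region element (left, top, width, and height), and both helper functions are assumptions made for this example. Because the region information is accurate only for Adobe PDF files, ordering by position is meaningful only for that case.

```python
import xml.etree.ElementTree as ET

# Hypothetical fragment: element names follow the descriptions above, but the
# nesting and the children of <region> (left, top, width, height) are assumed.
SAMPLE_PAGETEXT = """
<pagetext>
  <element>
    <text>Second line</text>
    <region><left>10</left><top>40</top><width>80</width><height>12</height></region>
    <angle>0</angle>
  </element>
  <element>
    <text>First line</text>
    <region><left>10</left><top>20</top><width>70</width><height>12</height></region>
    <angle>0</angle>
  </element>
</pagetext>
"""


def read_text_elements(pagetext_xml):
    """Return one dictionary per text element found in the pagetext fragment."""
    root = ET.fromstring(pagetext_xml.strip())
    items = []
    for element in root.findall("element"):
        region = element.find("region")
        items.append({
            "text": element.findtext("text"),
            "left": int(region.findtext("left")),
            "top": int(region.findtext("top")),
            "angle": float(element.findtext("angle")),
        })
    return items


def in_reading_order(items):
    """Sort text elements top to bottom, then left to right."""
    return sorted(items, key=lambda item: (item["top"], item["left"]))


lines = [item["text"] for item in in_reading_order(read_text_elements(SAMPLE_PAGETEXT))]
print("\n".join(lines))
```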

Information about text elements is used by the OCR analysis engine, which automatically combines the text elements with the text extracted from images, to produce a complete transcript of the text that appears on the page.

Source Information

The image ingest engine produces a proxy track, named taskName.proxy, where taskName is the name of your ingest task. The proxy track contains information about the ingested source. The engine produces one record in this track for each page in the ingested image or document.

The metadata element contains any metadata that Media Server was able to extract from the source. The information present in this element varies based on the format of the source file and the information present in the source.