Classification and Redaction

The Data Loss Prevention (DLP) API lets you understand and manage sensitive
data. It can classify text and image-based information, redact sensitive data
from text files, and classify data you have stored in Google Cloud Storage or
Google Cloud Datastore.
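
To make the two operations concrete, here is a minimal local sketch of what
"classify" and "redact" mean for text. It does not call the DLP API. The
detector names, the regular expressions, and the Finding fields are
assumptions chosen for illustration; the real service has its own detectors
and response format.

```python
import re
from dataclasses import dataclass

# Hypothetical detectors for illustration only. They are not the DLP API's
# detectors and are far less accurate than a production classifier.
PATTERNS = {
    "EMAIL_ADDRESS": re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+"),
    "US_SSN_LIKE": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
}


@dataclass
class Finding:
    info_type: str  # which detector matched
    quote: str      # the matched text
    start: int      # character offset where the match begins
    end: int        # character offset just past the match


def classify(text):
    """Return every detector match in the text, ordered by position."""
    findings = []
    for name, pattern in PATTERNS.items():
        for match in pattern.finditer(text):
            findings.append(
                Finding(name, match.group(), match.start(), match.end())
            )
    return sorted(findings, key=lambda f: f.start)


def redact(text, findings, mask="[REDACTED]"):
    """Replace each finding's span with a mask, skipping overlapping spans."""
    pieces = []
    cursor = 0
    for finding in sorted(findings, key=lambda f: f.start):
        if finding.start < cursor:
            continue  # overlaps a span that was already masked
        pieces.append(text[cursor:finding.start])
        pieces.append(mask)
        cursor = finding.end
    pieces.append(text[cursor:])
    return "".join(pieces)
```

Classification only reports what was found and where; redaction uses those
same offsets to produce a copy of the text with the sensitive spans masked.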

Image Classification

We use Optical Character Recognition (OCR) to extract text from an image prior
to classification. As with text classification, we return findings, and each
finding also includes a bounding box showing where the text was found.

Note: Automatic image rotation is not currently supported. Images must be
right-side up to be classified.
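
The shape of an image finding can be sketched as a text finding plus a
bounding box. The example below is a local illustration, not the DLP API: it
assumes some OCR step has already produced words with pixel boxes, and the
class names, field names, and the phone-number pattern are all hypothetical.

```python
import re
from dataclasses import dataclass


@dataclass
class BoundingBox:
    left: int    # x of the top-left corner, in pixels
    top: int     # y of the top-left corner, in pixels
    width: int
    height: int


@dataclass
class OcrWord:
    text: str         # text recognized by a (not shown) OCR step
    box: BoundingBox  # where that text sits in the image


@dataclass
class ImageFinding:
    info_type: str
    quote: str
    box: BoundingBox  # the addition relative to a plain text finding


# Hypothetical detector used only for this sketch.
PHONE_LIKE = re.compile(r"^\d{3}-\d{3}-\d{4}$")


def classify_ocr_words(words):
    """Classify OCR output, carrying each word's box into its finding."""
    return [
        ImageFinding("PHONE_LIKE", word.text, word.box)
        for word in words
        if PHONE_LIKE.match(word.text)
    ]
```

Because classification runs on the OCR output, anything OCR cannot read (for
example, text in a rotated image) never reaches the classifier.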

Storage Classification

Storage classification scans textual data stored in Google Cloud Storage or
Google Cloud Datastore. Instead of streaming the textual data into the API, you
specify the storage location (a Google Cloud Storage bucket or a Datastore
kind) in your API call. The results of the scan are kept in temporary internal
storage for 30 days and are linked to the project that made the API call. Any
authenticated and authorized API caller in that project can read the results.
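
The rules above can be modeled in a few lines. This is a sketch of the
described behavior, not the API's request or response format: the dictionary
keys, class name, and method names are hypothetical, and authentication and
authorization are not modeled beyond matching the project.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone

# Retention period stated in the text above.
RESULT_RETENTION = timedelta(days=30)


def storage_location(bucket=None, kind=None):
    """Describe where to scan: exactly one of a bucket or a Datastore kind.

    The returned keys are hypothetical, not the API's field names.
    """
    if (bucket is None) == (kind is None):
        raise ValueError("specify exactly one of bucket or kind")
    if bucket is not None:
        return {"cloud_storage_bucket": bucket}
    return {"datastore_kind": kind}


@dataclass
class StoredScanResult:
    project_id: str    # project that made the API call
    location: dict     # what was scanned
    created: datetime  # when the results were stored

    def expires(self):
        return self.created + RESULT_RETENTION

    def readable_by(self, caller_project, now):
        """True if the caller is in the same project and results remain."""
        return caller_project == self.project_id and now < self.expires()
```

The request carries only a pointer to the data, so the scan reads from storage
directly rather than receiving the content from the caller.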

Note: Storage Classification is not currently file-type aware. Files and
documents will be treated as text or raw bytes.
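
One plausible consequence of scanning without file-type awareness, which the
note does not spell out, is that sensitive text wrapped in an encoding may not
be detected. The sketch below illustrates the idea locally with a hypothetical
byte-level email pattern; it says nothing about how the service itself scans.

```python
import base64
import re

# Hypothetical detector applied directly to raw bytes.
EMAIL = re.compile(rb"[\w.+-]+@[\w-]+\.[\w.-]+")


def scan_raw_bytes(data):
    """Search the bytes as-is, with no decoding or format parsing."""
    return [match.group().decode("ascii") for match in EMAIL.finditer(data)]


plain = b"owner: jane@example.com\n"

# The same content as it might appear inside a container format that stores
# payloads base64-encoded. The base64 alphabet has no "@" character, so a
# byte-level scan cannot match the address unless it decodes the payload.
encoded = base64.b64encode(plain)

print(scan_raw_bytes(plain))
print(scan_raw_bytes(encoded))
```

If your files use such encodings, a reasonable precaution is to decode them to
plain text before storing or scanning them.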