Deduplication Database Simulator

Deduplication Database (DDB) Simulator tool is used to estimate the performance of the disk where you plan to host the DDB. You can use this tool to determine whether the disk, where you are planning to host the DDB, is performing optimally. This tool simulates the deduplication operation by querying and inserting a large number of records into the DDB.

Use this tool to get the following estimates about the performance of the disk:

The Query & Insert average time for a given application data size.

The maximum size of application data that can be backed up using the DDB for a given time limit.

Procedure

On the MediaAgent, from the Start menu, right-click the Command Prompt and then click Run as administrator.

On the User Account Control dialog box, click Yes or Continue depending upon the operating system version.

From the command prompt, go to the software_installation_path/Base and execute the following command:

sidb_sim.jar

On the SIDB Simulator dialog box, select the appropriate options:

Under Choose Type of Simulation and continue, choose one of the following options:

To find the maximum size of the application data that can be backed up using the DDB for a given access time limit for each record, select Determine max data handled by the dedup database for given threshold query response time.

To find the average query response time for each record for a given application data size, select Determine the average query response time for given application data size.

In the Directory to use for Simulation box, enter the path where the DDB will be hosted.

Select the Use existing DB check box.

Click Continue.

Under Parameters:

Specify appropriate simulation parameters:

Parameter

Description

Time limit

The access time limit for each record in microsecond.

Default: 1000 microseconds.

Data Size

The application data size in GB.

Default: 100 GB.

Dedup ratio

The expected deduplication ratio.

Default: 5.

Block Size

The deduplication data block size in KB.

Default: 128 KB.

Iterations for Sampling Query time

Number of iterations performed to determine the query and insert time spent at a particular application data size.

Default: 10000 iterations.

Number of connections

Number of jobs (such as regular backup job, auxiliary copy jobs, recovery jobs, pruning and so on) that are accessing the DDB.

Default: 10.

When specified, the simulation will run till all connections reaches the time limit threshold of 1000 microseconds.

To clean the simulation data after the simulation process is complete, select the Clean db files after simulation check box.

Click Next.

Click Start to initiate the simulation process.

Results

The Simulation Report tab displays the progress of the simulation process.

A graph of Query Time against Data Size is displayed. Click Maximize to see the details in the graph.

Each data point in the graph is calculated from the average of the last n iterations; where n is the number of Iterations for Sampling Query.

The Results tab displays the following results based on the options selected for the simulation process.

The maximum size of the application data that can be backed up using the DDB for the given access time limit.

The estimated average transaction time for an insert or query in the DDB based on the size of the application data that is backed up.