In Advanced eDiscovery, set the Analyze options prior to running Analyze.

Set Analyze options

Open Prepare > Analyze > Setup. The following window is displayed.

Near-duplicates and email threads: Select this check box to run near-duplicate and email thread analysis. It is selected by default.

Document similarity: Enter the near-duplicates threshold value, or accept the default of 65%.
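
To give a feel for what a similarity threshold does, here is a generic sketch. It is not the similarity algorithm used by Advanced eDiscovery, which this article does not describe. It assumes Python's standard difflib.SequenceMatcher as a stand-in similarity measure, and the function name is_near_duplicate is hypothetical.

```python
from difflib import SequenceMatcher

# The 65% default of the Document similarity setting, expressed as a fraction.
DEFAULT_THRESHOLD = 0.65


def is_near_duplicate(text_a, text_b, threshold=DEFAULT_THRESHOLD):
    """Return True if two texts are at least `threshold` similar.

    Assumption: SequenceMatcher.ratio() stands in for the product's
    own similarity score, which is not documented here.
    """
    ratio = SequenceMatcher(None, text_a, text_b).ratio()
    return ratio >= threshold
```

In this sketch, raising the threshold means fewer document pairs count as near-duplicates, and lowering it means more pairs do.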

Themes: Select this check box to process all files and assign themes to them. By default, this check box is not selected. If you select it, set the following options for Themes processing.

Max number of themes: Enter or select a value for the number of themes to create. The default is 200.

Note: Increasing the number of themes affects performance, as well as the ability of a theme to generalize. The higher the number of themes, the more granular they are. For example, if a set of 50 themes includes a theme such as “Basketball, Spurs, Clippers, Lakers”, a set of 300 themes may include separate themes: “Spurs”, “Clippers”, “Lakers”. If you were not aware of the theme “Basketball” and use this feature for Early Case Assessment (ECA), seeing the theme “Basketball” could be useful. But if the processing created too many themes, you may never see the word “Basketball” and may not realize that “Spurs” and “Clippers” are basketball themes worth reviewing, rather than items that go on boots or are used for hair.

Suggested themes: You can suggest theme words to control Themes processing. Advanced eDiscovery will focus on these suggested words and try to create one or more relevant themes, based on the “Max number of themes” setting.

For example, if the suggested word is “computer” and you specified “2” as the “Max number of themes”, Advanced eDiscovery will try to generate two themes that relate to the word “computer”. The two themes might be “computer software” and “computer hardware”.

To view, add, or edit suggested themes, click Modify.

In the Suggested themes panel, click the Add icon to add a theme. In the Add suggested theme panel, add the words, separated by commas.

In Number of themes, select a value to determine the number of themes Advanced eDiscovery will try to generate for these words (default is 1 theme).

Click Save, and then close the dialog.

Note: The total number of themes includes suggested themes. The number of suggested themes cannot exceed the total number of themes. If there are many suggested themes relative to the total, the system detects only a few “novel” themes, because most of the themes are dedicated to suggested themes.
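
The arithmetic behind this note can be sketched as follows. This is a minimal illustration, not product code: the function name novel_theme_budget and its parameters are hypothetical, and it assumes that every requested suggested theme is actually generated.

```python
def novel_theme_budget(max_themes, suggested_counts):
    """Return how many themes remain for novel (non-suggested) themes.

    max_themes: the "Max number of themes" value.
    suggested_counts: the "Number of themes" value chosen for each
        suggested theme entry (hypothetical representation).
    """
    total_suggested = sum(suggested_counts)
    if total_suggested > max_themes:
        # Mirrors the rule that suggested themes cannot exceed the total.
        raise ValueError("Suggested themes cannot exceed the total number of themes")
    return max_themes - total_suggested


# Default of 200 themes, with three suggested entries of 1, 2, and 1 themes.
print(novel_theme_budget(200, [1, 2, 1]))  # prints 196
```

With a small total, such as 5 themes and suggested entries that add up to 4, only one theme would be left for novel detection.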

Mode: From the drop-down list, select a Themes option:

Create and apply model: Calculates a themes model from a segment of the files and then distributes the files among the themes.

Create model: Calculates a themes model from a segment of the files. The Apply process of dividing files is done separately at another time.

Apply model: This option is shown only if a model was created previously and has not yet been applied. It divides the files based on the themes.