Running Job Basics

Run Job

When you are ready to test your recipe against the entire dataset, click Run Job in the Transformer page. In the Run Job page, you specify the output formats and any compression to apply. For this basic walkthrough, compression is unneeded unless you are working with a large dataset.

Tip: Optionally, you can disable generating a visual profile of your results. While the visual profile is very useful for examining issues in your recipe and iterating, it is a resource-intensive process. If you are working with large datasets that do not require additional debugging, you can consider disabling the profiling of your results. For more information, see Overview of Visual Profiling.

The job is queued up for processing. After the job has completed, you can view the results of successful executions through the Dataset Details page. Click View Results to open the job in the Job Results page.

Tip: You can also download the results directly and view the recipe used to execute the job. See Dataset Details Page.

Iterate

In the Job Results page, you can review the effects of the transformation recipe across the entire dataset. Statistics and data histograms provide overall visibility into the quality of your transformation recipe.

Figure: Job Results page

You can continue working on your sample and the related recipe, generating jobs when you think you are done, until you have generated the appropriate dataset.