QUESTION 31Note: This question is part of a series of questions that present the same scenario. For your convenience, the scenario is repeated in each question. Each question presents a different goal and answer choices, but the text of the scenario is exactly the same in each question in this series.Start of repeated scenarioYou are migrating an existing on-premises data warehouse named LocalDW to Microsoft Azure. You will use an Azure SQL data warehouse named AzureDW for data storage and an Azure Data Factory named AzureDF for extract, transformation, and load (ETL) functions.For each table in LocalDW, you create a table in AzureDW.On the on-premises network, you have a Data Management Gateway.Some source data is stored in Azure Blob storage. Some source data is stored on an on- premises Microsoft SQL Server instance. The instance has a table named Table1.After data is processed by using AzureDF, the data must be archived and accessible forever. The archived data must meet a Service Level Agreement (SLA) for availability of 99 percent. If an Azure region fails, the archived data must be available for reading always.End of repeated scenario.You need to connect AzureDF to the storage account.What should you create?

QUESTION 32Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution.After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.You are monitoring user queries to a Microsoft Azure SQL data warehouse that has six compute nodes.You discover that compute node utilization is uneven. The rows_processed column from sys.dm_pdw_workers shows a significant variation in the number of rows being moved among the distributions for the same table for the same query.You need to ensure that the load is distributed evenly across the compute nodes.Solution: You add a clustered columnstore index.Does this meet the goal?

A. YesB. No

Answer: B

QUESTION 33You have a Microsoft Azure subscription that contains an Azure Data Factory pipeline.You have an RSS feed that is published on a public website.You need to configure the RSS feed as a data source for the pipeline.Which type of linked service should you use?

A. webB. ODataC. Azure SearchD. Azure Data Lake Store

Answer: A

QUESTION 34You have sensor devices that report data to Microsoft Azure Stream Analytics. Each sensor reports data several times per second.You need to create a live dashboard in Microsoft Power BI that shows the performance of the sensor devices. The solution must minimize lag when visualizing the data.Which function should you use for the time-series data element?

A. LAGB. SlidingWindowC. System.TimeStampD. TumblingWindow

Answer: D

QUESTION 35You have a Microsoft Azure SQL data warehouse that has 10 compute nodes.You need to export 10 TB of data from a data warehouse table to several new flat files in Azure Blob storage. The solution must maximize the use of the available compute nodes.What should you do?

QUESTION 36You plan to use Microsoft Azure Event Hubs to ingest sensor data.You plan to use Azure Stream Analytics to analyze the data in real time and to send the output directly to Azure Data Lake Store.You need to write events to the Data Lake Store in batches.What should you use?

QUESTION 37Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution.After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.You have a table named Table1 that contains 3 billion rows. Table1 contains data from the last 36 months.At the end of every month, the oldest month of data is removed based on a column named DateTime.You need to minimize how long it takes to remove the oldest month of data.Solution: You implement a columnstore index on the DateTime column.Does this meet the goal?

A. YesB. No

Answer: A

QUESTION 38Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution.After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.You have a table named Table1 that contains 3 billion rows. Table1 contains data from the last 36 months.At the end of every month, the oldest month of data is removed based on a column named DateTime.You need to minimize how long it takes to remove the oldest month of data.Solution: You implement round robin for table distribution.Does this meet the goal?

A. YesB. No

Answer: B

QUESTION 39You are using a Microsoft Azure Stream Analytics query language.You are outputting data from an input click stream.You need to ensure that when you consecutively receive two rows from the same IP address within one minute, only the first row is outputted.Which functions should you use in the WHERE statement?

A. Last and HoppingWindowB. Last and SlidingWindowC. LAG and HoppingWindowD. LAG and Duration

Answer: B

QUESTION 40You are designing a solution that will use Apache HBase on Microsoft Azure HDInsight.You need to design the row keys for the database to ensure that client traffic is directed over all of the nodes in the cluster.What are two possible techniques that you can use? Each correct answer presents a complete solution. NOTE: Each correct selection is worth one point.

A. paddingB. trimmingC. hashingD. salting

Answer: AC

QUESTION 41You have a Microsoft Azure Data Factory pipeline.You discover that the pipeline fails to execute because data is missing.You need to rerun the failure in the pipeline.Which cmdlet should you use?