NEW QUESTION 1You have an Azure Machine Learning environment. You are evaluating whether to use R code or Python. Which three actions can you perform by using both R code and Python in the Machine Learning environment? (Each correct answer presents a complete solution. Choose three.)

A. Preprocess, cleanse, and group data.B. Score a training model.C. Create visualizations.D. Create an untrained model that can be used with the Train Model module.E. Implement feature ranking.

Answer: ABC

NEW QUESTION 2Note: This question is part of a series of questions that use the same scenario. For your convenience, the scenario is repeated in each question. Each question presents a different goal and answer choices, but the text of the scenario is exactly the same in each question in this series.You plan to create a predictive analytics solution for credit risk assessment and fraud prediction in Azure Machine Learning. The Machine Learning workspace for the solution will be shared with other users in your organization. You will add assets to projects and conduct experiments in the workspace. The experiments will be used for training models that will be published to provide scoring from web services. The experiment for fraud prediction will use Machine Learning modules and APIs to train the models and will predict probabilities in an Apache Hadoop ecosystem. You need to alter the list of columns that will be used for predicting fraud for an input web service endpoint. The columns from the original data source must be retained while running the Machine Learning experiment. Which module should you add after the web service input module and before the prediction module?

A. Edit MetadataB. Import DataC. SMOTED. Select Columns in Dataset

Answer: D

NEW QUESTION 3Note: This question is part of a series of questions that use the same or similar answer choices. An answer choice may be correct for more than one question in the series. Each question is independent of the other questions in this series. Information and details provided in a question apply only to that question.You need to remove rows that have an empty value in a specific column. The solution must use a native module. Which module should you use?

NEW QUESTION 5Note: This question is part of a series of questions that use the same or similar answer choices. An answer choice may be correct for more than one question in the series. Each question is independent of the other questions in this series. Information and details provided in a question apply only to that question.You have a non-tabular file that is saved in Azure Blob storage. You need to download the file locally, access the data in the file, and then format the data as a dataset. Which module should you use?

NEW QUESTION 6You are performing exploratory analysis of files that are encoded in a complex proprietary format. The format requires disk intensive access to several dependent files in HDFS. You need to build an Azure Machine Learning model by using a canopy clustering algorithm. You must ensure that changes to proprietary file formats can be maintained by using the least amount of effort. Which Machine Learning library should you use?

A. MicrosoftMLB. Scikit-learnC. SparkRD. Mahout

Answer: D

NEW QUESTION 7You plan to use the Data Science Virtual Machine for development, but you are unfamiliar with R scripts. You need to generate R code for an experiment. Which IDE should you use?

NEW QUESTION 8You are building an Azure Machine Learning workflow by using Azure Machine Learning Studio. You create an Azure notebook that supports the Microsoft Cognitive Toolkit. You need to ensure that the stochastic gradient descent (SGD) configuration maximizes the samples per second and supports parallel modeling that is managed by a parameter server. Which SGD algorithm should you use?

NEW QUESTION 9You are building an Azure Machine Learning experiment. You need to transform a string column that has 47 distinct values into a binary indicator column. The solution must use the One-vs-All Multiclass model. Which module should you use?

NEW QUESTION 10You are analyzing taxi trips in New York City. You leverage the Azure Data Factory to create data pipelines and to orchestrate data movement. You plan to develop a predictive model for 170 million rows (37 GB) of raw data in Apache Hive by using Microsoft R Server to identify which factors contribute to the passenger tipping behavior. All of the platforms that are used for the analysis are the same. Each worker node has eight processor cores and 26 GB of memory. Which type of Azure HDInsight cluster should you use to produce results as quickly as possible?