QUESTION 23Note: This question is part of a series of questions that use the same or similar answer choices. An answer choice may be correct for more than one question in the series. Each question is independent of the other questions in this series. Information and details provided in a question apply only to that question.You build a model that uses xyz regression.You need to estimate a model that predicts a binary variable.Which function should you use?

QUESTION 24You have one-class support vector machines (SVMs).You have a large dataset, but you do not have enough training time to fully test the model.What is an alternative method to validate the model?

QUESTION 26You are running a large logistic regression for 1,000 feature variables by using the LoisticRegression() function in the MicrosoftML package. All of the predictor variables are numeric.Currently, you specify the input variables separately by using the following formula.Outcome ~ Feature000 + Feature001 + Feature002 + … + Feature999You discover that it takes 20 minutes to estimate each model.You need to reduce the amount of time required to estimate each model without losing any information in the predictors.What should you do?

A. Use stepControl() to perform stepwise regression to limit the number of variables that contribute to the model.B. Use selectFeatures() to select the features that provide the most information about the outcome variable.C. Use princomp() on the correlation matrix of Features, and then use only the first 100 principle components to reduce the number of input variables.D. Use concat() to create a single array variable named Features, and then specify a new formula named Outcome ~ Features.

Answer: D

QUESTION 27You have a Microsoft SQL Server instance that has R Services (In-Database) installed. The server has a comma-separated values (CSV) file stored in the local file system.For analytic purposes, you need to read the CSV file into a database table in the SQL Server instance.You connect to the SQL Server instance by using SQL Server Management Studio.What should you use from sp_execute_external_script?

A. RxSqlServerData and specify the CSV file path in the connection stringB. rxDataStep and specify the CSV file path as the inFile argumentC. rxImportToXdf and specify specify the CSV file as the inputD. read.csv and specify the CSV file path as the parameter

Answer: D

QUESTION 28Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution.After you answer a question in this sections, you will NOT be able to return to it. As a result, these questions will not appear in the review screen. You use dplyrXdf, and you discover that after you exit the session, the output files that were created were deleted.You need to prevent the files from being deleted.Solution: You use dplyrXdf with the outFile parameter and specify a path other than the working directory for dplyrXdf.Does this meet the goal?

QUESTION 29Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution.After you answer a question in this sections, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.You have a Microsoft SQL Server instance that has R Services (In-Database) installed.You need to monitor the R jobs that are sent to SQL Server.Solution: You call a function from the RevoPemaR package.Does this meet the goal?

QUESTION 30Note: This question is part of a series of questions that use the same or similar answer choices. An answer choice may be correct for more than one question in the series. Each question is independent of the other questions in this series. Information and details provided in a question apply only to that question.You need to generate a residual based on two columns. The solution must build a trend indicator.Which function should you use?