QUESTION 19
You have a dataset that has multiple blocks and only numeric variables. You are computing in a local compute context. You plan to lag a variable named x to create a new variable named x_lagged by using a transform function. You will create a new element in the output of the function. You need to minimize the number of missing values. Which three actions should you perform? Each correct answer presents part of the solution.
NOTE: Each correct selection is worth one point.

A. Assign a value to the first value of x_lagged in the current block.
B. Use rxSet to store the last value of x_lagged in the current block.
C. Use rxSet to store the last value of x in the current block.
D. Use rxGet to retrieve the first value of x in the next block to be processed.
E. Use rxGet to retrieve a value stored in processing of the prior block.

Answer: ACE
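The block-by-block lag described in this question can be sketched outside of R. The following is a minimal Python illustration, not RevoScaleR code: the function name `lag_blocks` and the variable `carried` are hypothetical, and `carried` only plays the role of a value stored with rxSet in one block and read back with rxGet in the next. It assumes each block is a plain list of numbers processed in order.

```python
from typing import Iterable, Iterator, List, Optional


def lag_blocks(blocks: Iterable[List[float]]) -> Iterator[List[Optional[float]]]:
    """Lag x by one row across blocks, carrying state between blocks.

    Hypothetical helper for illustration only; it mimics the
    store-then-retrieve pattern, not the RevoScaleR API itself.
    """
    carried: Optional[float] = None  # stands in for the value saved by the prior block
    for block in blocks:
        if not block:
            yield []
            continue
        # The first lagged value of this block is the last x of the prior block.
        lagged = [carried] + block[:-1]
        # Save the last x of this block for the next block to pick up.
        carried = block[-1]
        yield lagged


blocks = [[1.0, 2.0, 3.0], [4.0, 5.0], [6.0]]
lagged_blocks = list(lag_blocks(blocks))
missing = sum(v is None for b in lagged_blocks for v in b)
print(lagged_blocks)
print("missing values:", missing)
```

With the carried value in place, only the very first row of the first block has no predecessor, so the result contains a single missing value instead of one per block.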

QUESTION 20
You have an Apache Hadoop Hive data warehouse. RevoScaleR is not installed. You need to sort the data according to the variables in the dataset. What should you do?

A. Connect to the database by using an ODBC connection, and then use the rxSort function.
B. Create a table in the ORC file format.
C. Connect to the database by using an ODBC connection, and then use the rxDataStep function.
D. Execute a Hive query that sorts the data, and then reads the results.

Answer: D

QUESTION 21
Note: This question is part of a series of questions that use the same or similar answer choices. An answer choice may be correct for more than one question in the series. Each question is independent of the other questions in this series. Information and details provided in a question apply only to that question.
You need to get all of the deciles for a variable in a data frame. What should you use?

A. the Describe package
B. the rxHistogram function
C. the rxSummary function
D. the rxQuantile function
E. the rxCube function
F. the summary function
G. the rxCrossTabs function
H. the ggplot2 package

Answer: D
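To make "all of the deciles" concrete, here is a small Python sketch using only the standard library. It is an illustration of what deciles are, not of rxQuantile itself; the sample data and the choice of the "inclusive" interpolation method are assumptions, and other quantile definitions can give slightly different cut points.

```python
import statistics

# Hypothetical sample: the integers 1 through 100.
values = list(range(1, 101))

# n=10 splits the data into ten equal-probability groups,
# which yields the nine interior cut points (10%, 20%, ..., 90%).
cuts = statistics.quantiles(values, n=10, method="inclusive")

# Adding the minimum and maximum gives the 0% and 100% points as well.
deciles = [min(values)] + cuts + [max(values)]

for pct, d in zip(range(0, 101, 10), deciles):
    print(f"{pct:3d}%  {d}")
```

The fifth interior cut point is the median, which is a quick sanity check on any decile calculation.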

QUESTION 22
You need to run a large data tree model by using rxDForest. The model must use cross validation. Which rxDForest option should you use?

QUESTION 23
Note: This question is part of a series of questions that use the same or similar answer choices. An answer choice may be correct for more than one question in the series. Each question is independent of the other questions in this series. Information and details provided in a question apply only to that question.
You build a model that uses xyz regression. You need to estimate a model that predicts a binary variable. Which function should you use?

QUESTION 24
You have one-class support vector machines (SVMs). You have a large dataset, but you do not have enough training time to fully test the model. What is an alternative method to validate the model?