70-773 Premium Bundle

70-773 Premium Bundle

Analyzing Big Data with Microsoft R (beta) Certification Exam

4.5 
(45135 ratings)
0 QuestionsPractice Tests
0 PDFPrint version
November 21, 2024Last update

Microsoft 70-773 Free Practice Questions

Master the 70-773 Dumps Questions content and be ready for exam day success quickly with this 70-773 Braindumps. We guarantee it!We make it a reality and give you real 70-773 Dumps Questions in our Microsoft 70-773 braindumps. Latest 100% VALID 70-773 Exam Questions and Answers at below page. You can use our Microsoft 70-773 braindumps and pass your exam.

Check 70-773 free dumps before getting the full version:

NEW QUESTION 1
You have a dataset that has a character variable. You need to create a bag of counts of n-grams. Which function should you use?

  • A. featurizeText0
  • B. categoricalHash0
  • C. concat0
  • D. selcctFeatures0
  • E. categorical0

Answer: A

Explanation: featurizeText: Produces a bag of counts of sequences of consecutive words, called n-grams, from a given
corpus of text. It offers language detection, tokenization, stopwords removing, text normalization and
feature generation.

NEW QUESTION 2
Note: This question is part of a series of questions that use the same scenario. For your convenience, the scenario is repeated in each question. Each question presents a different goal and answer choices, but the text of the scenario is exactly the same in each question in this series.
Start of repeated scenario
You are developing a Microsoft R Open solution that will leverage the computing power of the database server for some of your datasets.
You are performing feature engineering and data preparation for the datasets. The following is a sample of the dataset.
70-773 dumps exhibit
End of repeated scenario
You need to analyze the dataset without the missing values. The solution must not remove the missing values from the dataset.
Which R code segment should you use?
70-773 dumps exhibit

  • A. Option A
  • B. Option B
  • C. Option C
  • D. Option D

Answer: A

NEW QUESTION 3
You are running a large logistic regression for 1,000 feature variables by using the logisticRegression0 function in the MicrosoftML package. All of the predictor variables are numeric.
Currently, you specify the input variables separately by using the following formula.
70-773 dumps exhibit
You discover that it takes 20 minutes to estimate each model.
You need to reduce the amount of time required to estimate each model without losing any information in the predictors.
What should you do?

  • A. Use stepControl0 to perform stepwise regression to limit the number of variables that contribute to the model.
  • B. Use selectFeatures0 to select the features that provide the most information about the outcome variable.
  • C. Use princomp0 on the correlation matrix of Features, and then use only the first 100 principle components to reduce the number of input variables.
  • D. Use concat0 to create a single array variable named Features, and then specify a newformula named Outcome - Features.

Answer: B

NEW QUESTION 4
Note: This question is part of a series of Questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, whale others might not have a correct solution-After you answer a question in this section, you will NOT be able to return to it- As a result, these questions will not appear in the review screen.
You use dplyrXdf and you discover that after you exit the session, the output files that were created were deleted. You need to prevent the files from being deleted.
Solution: You use dplyrXdf with the outFile parameter and specify a path other than the working directory for dplyrXdf.
Does this meet the goal?

  • A. Yes
  • B. No

Answer: A

NEW QUESTION 5
You are planning the compute contexts for your environment. You need to execute rx-function calls in parallel.
What are three possible compute contexts that you can use to achieve this goal? Each correct answer presents a complete solution.
NOTE: Each correct selection is worth one point.

  • A. local parallel
  • B. Spark
  • C. local sequential
  • D. Map Reduce
  • E. SQL

Answer: ABC

Explanation: https://docs.microsoft.com/en-us/azure/hdinsight/hdinsight-hadoop-r-server-compute-contexts

NEW QUESTION 6
HOTSPOT
Note: This question is part of a series of questions that use the same scenario. For your convenience, the scenario is repeated in each question. Each question presents a different goal and answer choices, but the text of the scenario is exactly the same in each question in this series.
Start of repeated scenario
You are developing a Microsoft R Open solution that will leverage the computing power of the database server for some of your datasets.
You are performing feature engineering and data preparation for the datasets. The following is a sample of the dataset.
70-773 dumps exhibit
End of repeated scenario
You need to sort the data from the dataset sample and to remove duplicates by using wkswork1.
Which R code segment should you use? to answer, select the appropriate options in the
answer area.
Note: Each correct selection is worth one point.
70-773 dumps exhibit

    Answer:

    Explanation: 70-773 dumps exhibit

    NEW QUESTION 7
    You have a slow Map Reduce job.
    You need to optimize the job to control the number of mapper and runner tasks. Which function should you use?

    • A. RxComputeContext
    • B. RxHadoopMR
    • C. rxExec
    • D. RxLocalParallel

    Answer: B

    NEW QUESTION 8
    You need to build a model that looks at the probability of an outcome. You must regulate between L1 and L2.
    Which classification method should you use?

    • A. Two-Class Neural Network
    • B. Two-Class Support Vector Machine
    • C. Two-Class Decision Forest
    • D. Two-Class Logistic Regression

    Answer: A

    NEW QUESTION 9
    DRAG DROP
    You are using rxPredict for a logistic regression model.
    You need to obtain prediction standard errors and confidence intervals. Which R code segment should you use?
    To answer, drag the .impropriate values to the correct targets. Each value may be used once, more than once, or not .You may need to drag the split bar between panes or scroll to view content.
    NOTE: Each correct selection is worth one point.
    70-773 dumps exhibit

      Answer:

      Explanation: 70-773 dumps exhibit

      NEW QUESTION 10
      Note: This question Is part of a series of questions that use the same or similar answer choice. An answer choice may be correct for more than one question in the series. Each question is independent of the other questions in this series.
      Information and details provided In a question apply only to that question. You build a model that uses xyz regression.
      You need to estimate a model that predicts a binary variable.
      Which function should you use?

      • A. rxPredict
      • B. rxLogit
      • C. Summary
      • D. rxLinMod
      • E. rxTweedie
      • F. stepAic
      • G. rxTransform
      • H. rxDataStep

      Answer: B

      Explanation: https://docs.microsoft.com/en-us/r-server/r/how-to-revoscaler-logistic- regression

      NEW QUESTION 11
      You need to use the ScaleR distributed processing in an Apache Hadoop environment. Which data source should you use?

      • A. Microsoft SQL Server database
      • B. XDF data files
      • C. ODBC data
      • D. Teradata database

      Answer: B

      NEW QUESTION 12
      You have a dataset that has multiple blocks and only numeric variables. You are computing in a local compute context.
      You plan to lag a variable named x to create a new variable named x_lagged by using a transform function. You will create a new element in the output of the function.
      You need to minimize the number of missing values.
      Which three actions should you perform? Each correct answer presents part of the solution.
      NOTE: Each correct selection is worth one point.

      • A. Assign a value to the first value of x_lagged in the current block.
      • B. Use rxSet to store the last value of x_lagged in the current block.
      • C. Use rxSet to store the last value of x in the current block.
      • D. Use rxGet to retrieve the first value of x in the next block to be processed.
      • E. Use rxGet to retrieve a value stored in processing of the prior block.

      Answer: ACD

      NEW QUESTION 13
      Note: This Question is part of a series of Questions that use the same or similar answer choices. An answer choice may be correct than one question in the series. Each question is independent of the other questions in this series. Information and details provided in a question apply only to that question.
      You have a data source that is larger than memory.
      You need to visualize the distribution of the values for a variable in the data source. What should you use?

      • A. the Describe package
      • B. the rxHistogram function
      • C. the rxSummary function
      • D. the rxQuantile function
      • E. the rxCube function
      • F. the summary function
      • G. the rxCrossTabs function
      • H. the ggplot2 package

      Answer: B

      NEW QUESTION 14
      HOTSPOT
      Note: This question is part of a series of questions that use the same scenario. For your convenience, the scenario is repeated in each question. Each question presents a different goal and answer choices, but the text of the scenario is exactly the same in each question in this series.
      Start of repeated scenario
      You are developing a Microsoft R Open solution that will leverage the computing power of the database server for some of your datasets.
      You are performing feature engineering and data preparation for the datasets. The following is a sample of the dataset.
      70-773 dumps exhibit
      End of repeated scenario
      You plan to score some data to create data features to address empty rows. You have the following R code.
      70-773 dumps exhibit
      You need to transform the data and overwrite the current dataset.
      Which R code segment should you use? To answer, select the appropriate options in the
      answer area.
      NOTE: Each correct selection is worth one point.
      70-773 dumps exhibit

        Answer:

        Explanation: 70-773 dumps exhibit

        NEW QUESTION 15
        Note: This question is part of a series of questions that use the same or similar answer choices. An answer choice may be correct for more than one question in the series. Each question is independent of the other questions in this series. Information and details provided in a question apply only to that question.
        You need to calculate a measure of central tendency and variability for the variables in a dataset that is grouped by using another categorical variable.
        What should you use?

        • A. the Describe package
        • B. the rxHistogram function
        • C. the rxSummary function
        • D. the rxQuantile function
        • E. the rxCube function
        • F. the summary function
        • G. the rxCrossTabs function
        • H. the ggplot2 package

        Answer: C

        NEW QUESTION 16
        You plan to read data from an Oracle database table and to store the data in the file system for later processing by dplyrXdf, The size of the data is larger than the memory on the server to used for modelling.
        You need to ensure that the data can be processed by dplyrXdf in the least amount of time possible.
        How should you transfer the data from the Oracle database?

        • A. Use the RODBC library, connect to the Oracle database server by using odbcConnec
        • B. and then use rxDataStep to export the data to a comma-separated values (CSV) file.
        • C. Define a data source to the Oracle database server by using RxOdbcData, and then use rxlmport to save the data to an XDF file.
        • D. Use the RODBC library, connect to the Oracle database server by using odbcConnec
        • E. and then use rxSplit to save the data to multiple comma-separated values (CSV) files.

        Answer: C

        NEW QUESTION 17
        You have following regression forest.
        70-773 dumps exhibit
        Which variable contributes the most to the dependent variable?

        • A. stack.loss
        • B. Water.Temp
        • C. Air.Flow
        • D. Acid.Conc

        Answer: A

        NEW QUESTION 18
        You are running a parallel function that uses the following R code segment. (Line numbers are included for reference only.)
        70-773 dumps exhibit
        You need to complete the R code. The solution must support chunking. Which function should insert at line 02?

        • A. rxBTrees
        • B. rxExec
        • C. rxDForest
        • D. rxDTree

        Answer: C

        Recommend!! Get the Full 70-773 dumps in VCE and PDF From Surepassexam, Welcome to Download: https://www.surepassexam.com/70-773-exam-dumps.html (New 39 Q&As Version)


        START 70-773 EXAM