True or false: the CRISP-DM Process Methodology is a linear process.
A. True
B. False
What will occur if the deployment data has a very different range than the data that was used in modeling?
A. The model will underpredict.
B. The model will overpredict.
C. The model may overpredict or underpredict.
D. The model nugget will automatically nullify those cases that are out of range.
Examine the Data Audit output shown below.
What action will occur in a generated Outlier and Extreme Supernode for HOLCOST?
A. Nine records will be set to the system missing (null) value for HOLCOST.
B. Ten records will be deleted.
C. Ten records will be set to the system missing (null) value for HOLCOST.
D. Nine records will be deleted.
Which fields are created by this Derive dialog?
A. Three fields representing the difference between Travel-1 and each of the other Travel fields
B. No operation will be performed because the expression is invalid.
C. Four fields representing the difference in weeks between AcctEst and each of the Travel fields
D. A field representing the difference between AcctEst and the global @FIELD value
True or false: running Auto Dataprep results in data transformation nodes placed in a Supernode on the canvas.
A. True
B. False
Which node can be used to impute (estimate) missing values?
A. Data Audit node
B. Balance node
C. Filler node
D. Reclassify node
True or false: auto checking for invalid values can be done on the Type tab in any Source node.
A. True
B. False
Which of the following types of nodes will have data flowing both in and out, when used in a stream?
A. Record Ops
B. Graphs
C. Export
D. Sources
True or false: given the information in the Data Audit Quality table below, the generated Filter node will exclude only the fields logwire, logequi, logtoll, and logcard.
A. True
B. False
An online retailer wants to identify groups of customers based on components of their buying behavior, such as types of products purchased, volume of purchases, and frequency of purchases. What type of model would be used?
A. Association model
B. Segmentation model
C. Classification model
D. Sequence model
True or false: only Terminal nodes (Graphs, Modeling, Output, Export) have a Run button as displayed in the graphic.
A. True
B. False
Which method would be used on the Merge node in order to combine a file containing 100 products and a file containing 50 suppliers and retain only the matching records?
A. Inner join
B. Anti-join
C. Full outer join
D. Partial outer join
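Study note: the join methods named in the options above can be compared with a minimal sketch in plain Python. This is not SPSS Modeler code. The tiny `products` and `suppliers` tables and the helper function names are hypothetical, chosen only to show which records each method keeps.

```python
# Hypothetical miniature tables (not from the exam):
# products maps product id -> supplier key; suppliers maps supplier key -> name.
products = {"P1": "S1", "P2": "S2", "P3": "S9"}          # S9 has no supplier record
suppliers = {"S1": "Acme", "S2": "Bolt", "S3": "Core"}   # S3 supplies no listed product


def inner_join(products, suppliers):
    """Keep only records whose key appears in both tables."""
    return [(p, s, suppliers[s]) for p, s in products.items() if s in suppliers]


def anti_join(products, suppliers):
    """Keep only first-table records whose key has no match in the second table."""
    return [(p, s) for p, s in products.items() if s not in suppliers]


def partial_outer_join(products, suppliers):
    """Keep every first-table record; fill unmatched supplier names with None."""
    return [(p, s, suppliers.get(s)) for p, s in products.items()]


def full_outer_join(products, suppliers):
    """Keep every record from both tables, padding the missing side with None."""
    rows = partial_outer_join(products, suppliers)
    used_keys = set(products.values())
    rows += [(None, s, name) for s, name in suppliers.items() if s not in used_keys]
    return rows
```

With these assumed tables, the four functions return 2, 1, 3, and 4 records respectively, which shows how the choice of method changes what survives the merge.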
True or false: association models require all fields used to be defined with the role of Both.
A. True
B. False
True or false: the Auto Classifier node estimates and compares predictive models for continuous target fields.
A. True
B. False
A prison system has historical data on prison inmates and wants to find what factors are related to recidivism (return to prison). What type of model would be used?
A. Segmentation model
B. Classification model
C. Association model
D. Anomaly model