Certbus > EMC > EMC Specialist > E20-026 > E20-026 Online Practice Questions and Answers

E20-026 Online Practice Questions and Answers

Questions 4

In data visualization, what is used to focus the audience on a key part of a chart?

A. Emphasis colors

B. Detailed text

C. Pastel colors

D. A data table

Browse 163 Q&As
Questions 5

Which word or phrase completes the statement? Data-ink ratio is to data visualization as __________ .

A. Confusion matrix is to classifier

B. Data scientist is to big data

C. Seasonality is to ARIMA

D. K-means is to Naive Bayes

Browse 163 Q&As
Questions 6

Consider a database with 4 transactions:

Transaction 1: {cheese, bread, milk} Transaction 2: {soda, bread, milk} Transaction 3: {cheese, bread} Transaction 4: {cheese, soda, juice}

You decide to run the association rules algorithm where minimum support is 50%. Which rule has a confidence equal to 25%?

A. {cheese} => {bread}

B. {juice} => {cheese}

C. {milk} => {soda}

D. {soda} => {milk}

Browse 163 Q&As
Questions 7

Under which circumstance do you need to implement N-fold cross-validation after creating a regression model?

A. There is not enough data to create a test set.

B. The data is unformatted.

C. There are missing values in the data.

D. There are categorical variables in the model.

Browse 163 Q&As
Questions 8

A disk drive manufacturer has a defect rate of less than 1.0% with 98% confidence. A quality assurance team samples 1000 disk drives and finds 14 defective units. Which action should the team recommend?

A. The manufacturing process should be inspected for problems.

B. A larger sample size should be taken to determine if the plant is functioning properly

C. A smaller sample size should be taken to determine if the plant is functioning properly

D. The manufacturing process is functioning properly and no further action is required.

Browse 163 Q&As
Questions 9

What is the primary bottleneck in text classification?

A. The availablilty of tagged training data.

B. The ability to parse unstructured text data.

C. The high dimensionality of text data.

D. The fact that text corpora are dynamic.

Browse 163 Q&As
Questions 10

Your customer provided you with 2, 000 unlabeled records and asked you to separate them into three groups. What is the correct analytical method to use?

A. K-means clustering

B. Linear regression

C. Naive Bayesian classification

D. Logistic regression

Browse 163 Q&As
Questions 11

You are performing a market basket analysis using the Apriori algorithm. Which measure is a ratio describing the how many more times two items are present together than would be expected if those two items are statistically independent?

A. Lift

B. Leverage

C. Support

D. Confidence

Browse 163 Q&As
Questions 12

Which word or phrase completes the statement? Structured data is to OLAP data as quasi-structured data is to____

A. Clickstream data

B. XML data

C. Text documents

D. Image files

Browse 163 Q&As
Questions 13

Imagine you are trying to hire a Data Scientist for your team. In addition to technical ability and quantitative background, which additional essential trait would you look for in people applying for this position?

A. Communication skill

B. Scientific background

C. Domain expertise

D. Well Organized

Browse 163 Q&As
Questions 14

You have run the association rules algorithm on your data set, and the two rules {banana, apple} => {grape} and {apple, orange}=> {grape} have been found to be relevant. What else must be true?

A. {grape,apple,orange} must be a frequent itemset.

B. {banana,apple,grape,orange} must be a frequent itemset.

C. {grape} => {banana,apple} must be a relevant rule.

D. {banana,apple} => {orange} must be a relevant rule.

Browse 163 Q&As
Questions 15

What is a property of window functions in SQL commands?

A. They can be used to calculate moving averages over various intervals.

B. They group rows into a single output row.

C. They can be used between the keywords FROM and WHERE in a SELECT command.

D. They don't require ordering of data within a window.

Browse 163 Q&As
Questions 16

Refer to the Exhibit.

In the Exhibit. For effective visualization, what is the chart's primary flaw?

A. The use of 3 dimensions.

B. The slanting of axis labels.

C. The location of the legend.

D. The order of the columns.

Browse 163 Q&As
Questions 17

Refer to the exhibit.

You ran a linear regression, and the final output is seen in the exhibit. Based only on the information in the

exhibit and an acceptable confidence level of 95%, how would you interpret the interaction of variable D

with the dependent variable?

A. In this model,Variable D is not significantly interacting with the dependent variable

B. For every 1 unit increase in variable D,holding all other variables constant,we can expect the dependent variable to increase by 10.23 units

C. For every 1 unit increase in variable D,holding all other variables constant,we can expect the dependent variable to be multiplied by 10.23 units

D. Variable D is more significant than variables A,B,and C.

Browse 163 Q&As
Questions 18

Refer to the exhibit Consider the training data set shown in the exhibit. What are the classification (Y = 0 or 1) and the probability of the classification for the tuple X(1, 0, 0) using Naive Bayesian classifier?

A. Classification Y = 0,Probability = 4/54

B. Classification Y = 1,Probability = 4/54

C. Classification Y = 0,Probability = 1/54

D. Classification Y = 1,Probability = 1/54

Browse 163 Q&As
Exam Code: E20-026
Exam Name: Enterprise Storage Networking Specialist Exam
Last Update: Mar 21, 2024
Questions: 163 Q&As

PDF

$45.99

VCE

$49.99

PDF + VCE

$59.99