Machine Learning MCQ Multiple Choice Questions Answers

Q: 2. Selecting data so as to assure that each class is properly represented in both the training and test set.

Correct Answer is: stratification

Q: 4. Supervised learning and unsupervised clustering both require at least one

Correct Answer is: hidden attribute.

Q: 5. Supervised learning differs from unsupervised clustering in that supervised learning requires

Correct Answer is: input attributes to be categorical.

Q: 6. Suppose your model is overfitting. Which of the following is NOT a valid way to try and reduce the overfitting?

Correct Answer is: Improve the optimisation algorithm being used for error minimisation.

Q: 7. The adjusted multiple coefficient of determination accounts for

Correct Answer is: none of the above

Q: 8. The average positive difference between computed and desired outcome values.

Correct Answer is: mean positive error

Q: 9. The average squared difference between classifier predicted output and actual output.

Correct Answer is: mean squared error

Q: 10. The correlation between the number of years an employee has worked for a company and the salary of the employee is 0.75. What can be said about employee salary and years worked?

Correct Answer is: Individuals that have worked for the company the longest have higher salaries.

Q: 11. The correlation coefficient for two real-valued attributes is 0.85. What does this value tell you?

Correct Answer is: As the value of one attribute decreases the value of the second attribute increases.

Q: 12. The leaf nodes of a model tree are

Correct Answer is: linear regression equations.

1. Regression trees are often used to model ........... data.

linear
nonlinear
categorical
symmetrical

2. Selecting data so as to assure that each class is properly represented in both the training and test set.

cross validation
stratification
verification
bootstrapping

3. Simple regression assumes a ........... relationship between the input attribute and output attribute.

linear
quadratic
reciprocal
inverse

4. Supervised learning and unsupervised clustering both require at least one

hidden attribute.
output attribute.
input attribute.
categorical attribute.

5. Supervised learning differs from unsupervised clustering in that supervised learning requires

at least one input attribute.
input attributes to be categorical.
at least one output attribute.
ouput attriubutes to be categorical.

6. Suppose your model is overfitting. Which of the following is NOT a valid way to try and reduce the overfitting?

Increase the amount of training data.
Improve the optimisation algorithm being used for error minimisation.
Decrease the model complexity.
Reduce the noise in the training data.

7. The adjusted multiple coefficient of determination accounts for

the number of dependent variables in the model
the number of independent variables in the model
unusually large predictors
none of the above

8. The average positive difference between computed and desired outcome values.

root mean squared error
mean squared error
mean absolute error
mean positive error

9. The average squared difference between classifier predicted output and actual output.

mean squared error
root mean squared error
mean absolute error
mean relative error

10. The correlation between the number of years an employee has worked for a company and the salary of the employee is 0.75. What can be said about employee salary and years worked?

There is no relationship between salary and years worked.
Individuals that have worked for the company the longest have higher salaries.
Individuals that have worked for the company the longest have lower salaries.
The majority of employees have been with the company a long time.

11. The correlation coefficient for two real-valued attributes is 0.85. What does this value tell you?

The attributes are not linearly related.
As the value of one attribute increases the value of the second attribute also increases.
As the value of one attribute decreases the value of the second attribute increases.
The attributes show a curvilinear relationship.

12. The leaf nodes of a model tree are

averages of numeric output attribute values.
nonlinear regression equations.
linear regression equations.
sums of numeric output attribute values.

13. The multiple coefficient of determination is computed by

dividing SSR by SST
dividing SST by SSR
dividing SST by SSE
none of the above

14. The process of forming general concept definitions from examples of concepts to be learned.

Deduction
abduction
induction
conjunction

15. The standard error is defined as the square root of this computation.

The sample variance divided by the total number of sample instances.
The population variance divided by the total number of sample instances.
The sample variance divided by the sample mean.
The population variance divided by the sample mean.

16. This clustering algorithm initially assumes that each data instance represents a single cluster.

agglomerative clustering
conceptual clustering
K-Means clustering
expectation maximization

17. This clustering algorithm merges and splits nodes to help modify nonoptimal partitions.

agglomerative clustering
expectation maximization
conceptual clustering
K-Means clustering

18. This supervised learning technique can process both numeric and categorical input attributes.

linear regression
Bayes classifier
logistic regression
backpropagation learning

19. This technique associates a conditional probability value with each data instance.

linear regression
logistic regression
simple regression
multiple linear regression

20. This unsupervised clustering algorithm terminates when mean values computed for the current iteration of the algorithm are identical to the computed mean values for the previous iteration.

agglomerative clustering
conceptual clustering
K-Means clustering
expectation maximization

21. When doing least-squares regression with regularisation (assuming that the optimisation can be done exactly), increasing the value of the regularisation parameter (Lambda)

will never decrease the training error.
will never increase the training error.
will never decrease the testing error.
will never increase the testing error.

22. Which is not true about Gradient of a continuous and differentiable function

is zero at a minimum
is non-zero at a maximum
is zero at a saddle point
decreases as you get closer to the minimum

23. Which of the following is a common use of unsupervised clustering?

detect outliers
determine a best set of input attributes for supervised learning
evaluate the likely performance of a supervised learner model
determine if meaningful relationships can be found in a dataset

24. Which of the following is not an advantage of Grid search

It can be applied to non-differentiable functions.
It can be applied to non-continuous functions.
It is easy to implement.
It runs reasonably fast for multiple linear regression.

25. Which of the following points would Bayesians and frequentists disagree on?

The use of a non-Gaussian noise model in probabilistic regression.
The use of probabilistic modelling for regression.
The use of prior distributions on the parameters in a probabilistic model.
The use of class priors in Gaussian Discriminant Analysis

26. Which of the following sentence is FALSE regarding regression?

It relates inputs to outputs.
It is used for prediction.
It may be used for interpretation.
It discovers causal relationships.

27. Which statement about outliers is true?

Outliers should be identified and removed from a dataset.
Outliers should be part of the training dataset but should not be present in the test data.
Outliers should be part of the test dataset but should not be present in the training data.
The nature of the problem determines how outliers are used.

28. Which statement is true about neural network and linear regression models?

Both models require input attributes to be numeric.
Both models require numeric attributes to range between 0 and 1.
The output of both models is a categorical attribute value.
Both techniques build models whose output is determined by a linear sum of weighted input attribute values.

29. Which statement is true about prediction problems?

The output attribute must be categorical.
The output attribute must be numeric.
The resultant model is designed to determine future outcomes.
The resultant model is designed to classify current behavior.

30. With Bayes classifier, missing data items are

treated as equal compares.
treated as unequal compares.
replaced with a default value.
ignored.

Machine Learning MCQ Multiple Choice Questions Answers for Practice

1. Regression trees are often used to model ........... data.

2. Selecting data so as to assure that each class is properly represented in both the training and test set.

3. Simple regression assumes a ........... relationship between the input attribute and output attribute.

4. Supervised learning and unsupervised clustering both require at least one

5. Supervised learning differs from unsupervised clustering in that supervised learning requires

6. Suppose your model is overfitting. Which of the following is NOT a valid way to try and reduce the overfitting?

7. The adjusted multiple coefficient of determination accounts for

8. The average positive difference between computed and desired outcome values.

9. The average squared difference between classifier predicted output and actual output.

10. The correlation between the number of years an employee has worked for a company and the salary of the employee is 0.75. What can be said about employee salary and years worked?

11. The correlation coefficient for two real-valued attributes is 0.85. What does this value tell you?

12. The leaf nodes of a model tree are

13. The multiple coefficient of determination is computed by

14. The process of forming general concept definitions from examples of concepts to be learned.

15. The standard error is defined as the square root of this computation.

16. This clustering algorithm initially assumes that each data instance represents a single cluster.

17. This clustering algorithm merges and splits nodes to help modify nonoptimal partitions.

18. This supervised learning technique can process both numeric and categorical input attributes.

19. This technique associates a conditional probability value with each data instance.

20. This unsupervised clustering algorithm terminates when mean values computed for the current iteration of the algorithm are identical to the computed mean values for the previous iteration.

21. When doing least-squares regression with regularisation (assuming that the optimisation can be done exactly), increasing the value of the regularisation parameter (Lambda)

22. Which is not true about Gradient of a continuous and differentiable function

23. Which of the following is a common use of unsupervised clustering?

24. Which of the following is not an advantage of Grid search

25. Which of the following points would Bayesians and frequentists disagree on?

26. Which of the following sentence is FALSE regarding regression?

27. Which statement about outliers is true?

28. Which statement is true about neural network and linear regression models?

29. Which statement is true about prediction problems?

30. With Bayes classifier, missing data items are

Looking to post on our portal?

Machine Learning MCQ Multiple Choice Questions Answers for Practice

1. Regression trees are often used to model ........... data.

2. Selecting data so as to assure that each class is properly represented in both the training and test set.

3. Simple regression assumes a ........... relationship between the input attribute and output attribute.

4. Supervised learning and unsupervised clustering both require at least one

5. Supervised learning differs from unsupervised clustering in that supervised learning requires

6. Suppose your model is overfitting. Which of the following is NOT a valid way to try and reduce the overfitting?

7. The adjusted multiple coefficient of determination accounts for

8. The average positive difference between computed and desired outcome values.

9. The average squared difference between classifier predicted output and actual output.

10. The correlation between the number of years an employee has worked for a company and the salary of the employee is 0.75. What can be said about employee salary and years worked?

11. The correlation coefficient for two real-valued attributes is 0.85. What does this value tell you?

12. The leaf nodes of a model tree are

13. The multiple coefficient of determination is computed by

14. The process of forming general concept definitions from examples of concepts to be learned.

15. The standard error is defined as the square root of this computation.

16. This clustering algorithm initially assumes that each data instance represents a single cluster.

17. This clustering algorithm merges and splits nodes to help modify nonoptimal partitions.

18. This supervised learning technique can process both numeric and categorical input attributes.

19. This technique associates a conditional probability value with each data instance.

20. This unsupervised clustering algorithm terminates when mean values computed for the current iteration of the algorithm are identical to the computed mean values for the previous iteration.

21. When doing least-squares regression with regularisation (assuming that the optimisation can be done exactly), increasing the value of the regularisation parameter (Lambda)

22. Which is not true about Gradient of a continuous and differentiable function

23. Which of the following is a common use of unsupervised clustering?

24. Which of the following is not an advantage of Grid search

25. Which of the following points would Bayesians and frequentists disagree on?

26. Which of the following sentence is FALSE regarding regression?

27. Which statement about outliers is true?

28. Which statement is true about neural network and linear regression models?

29. Which statement is true about prediction problems?

30. With Bayes classifier, missing data items are

Login at Quizforum