## Machine Learning Quiz Question with Answer

31. You observe the following while fitting a linear regression to the data: as you increase the amount of training data, the test error decreases and the training error increases. The training error is quite low (almost what you expect it to be), while the test error is much higher than the training error. What do you think is the main reason behind this behavior? Choose the most probable option.

1. High variance
2. High model bias
3. High estimation bias
4. None of the above

32. A measure of goodness of fit for the estimated regression equation is the

1. multiple coefficient of determination
2. mean square due to error
3. mean square due to regression
4. none of the above
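
For readers who want to see one of the quantities named in question 32 computed by hand, here is a minimal sketch of the coefficient of determination, using the standard definition R² = 1 − SSE/SST. The function name `r_squared` and the sample numbers are made up for illustration and do not come from the quiz.

```python
def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SSE / SST."""
    mean_y = sum(y_true) / len(y_true)
    # SSE: squared error left unexplained by the predictions
    sse = sum((y - p) ** 2 for y, p in zip(y_true, y_pred))
    # SST: total squared variation of the observed values around their mean
    sst = sum((y - mean_y) ** 2 for y in y_true)
    return 1.0 - sse / sst


# Hypothetical observed values and predictions from some fitted regression.
y_true = [3.0, 5.0, 7.0, 9.0]
y_pred = [2.5, 5.5, 6.5, 9.5]
r2 = r_squared(y_true, y_pred)
print(round(r2, 4))
```

A value close to 1 means the predictions account for most of the variation in the observed values; predictions no better than the mean give a value near 0.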

33. A nearest neighbor approach is best used

1. with large-sized datasets.
2. when irrelevant attributes have been removed from the data.
3. when a generalized model of the data is desirable.
4. when an explanation of what has been found is of primary importance.

34. A regression model in which more than one independent variable is used to predict the dependent variable is called

1. a simple linear regression model
2. a multiple regression model
3. an independent model
4. none of the above

35. A term used to describe the case when the independent variables in a multiple regression model are correlated is

1. regression
2. correlation
3. multicollinearity
4. none of the above
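
The situation described in question 35, independent variables that are correlated with one another, can be checked numerically. The sketch below computes the Pearson correlation between pairs of predictors; the helper `pearson` and the columns `x1`, `x2`, `x3` are hypothetical and serve only as an illustration.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)


# Hypothetical predictor columns: x2 is roughly 2 * x1, x3 is only weakly related to x1.
x1 = [1.0, 2.0, 3.0, 4.0, 5.0]
x2 = [2.1, 3.9, 6.2, 7.8, 10.1]
x3 = [5.0, 1.0, 4.0, 2.0, 3.0]

r12 = pearson(x1, x2)  # close to 1: x1 and x2 carry nearly the same information
r13 = pearson(x1, x3)  # much weaker relationship
print(round(r12, 3), round(r13, 3))
```

When two predictors are almost perfectly correlated, as `x1` and `x2` are here, a regression has little basis for separating their individual effects, so the estimated coefficients tend to be unstable.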

36. Adding more basis functions in a linear model... (pick the most probable option)

1. Decreases model bias
2. Decreases estimation bias
3. Decreases variance
4. Doesn't affect bias and variance

37. Another name for an output attribute.

1. predictive variable
2. independent variable
3. estimated variable
4. dependent variable

38. Bootstrapping allows us to

1. choose the same training instance several times.
2. choose the same test set instance several times.
3. build models with alternative subsets of the training data several times.
4. test a model with alternative subsets of the test data several times.
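
As background for question 38, the sampling step usually meant by bootstrapping is drawing from a dataset with replacement. The sketch below is a minimal illustration under that assumption; `bootstrap_sample`, the seed, and the toy `training` list are hypothetical names and values chosen for the example.

```python
import random


def bootstrap_sample(data, rng):
    """Draw len(data) items from data with replacement."""
    return [rng.choice(data) for _ in data]


rng = random.Random(0)       # fixed seed so the run is repeatable
training = list(range(10))   # stand-in for ten training instances
sample = bootstrap_sample(training, rng)

# Instances that were never drawn are often called "out-of-bag".
out_of_bag = [x for x in training if x not in sample]
print(sample)
print(out_of_bag)
```

Because each draw is made with replacement, an instance may appear more than once in `sample` while other instances are left out entirely.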

39. Choose the options that are correct regarding machine learning (ML) and artificial intelligence (AI): (A) ML is an alternate way of programming intelligent machines. (B) ML and AI have very different goals. (C) ML is a set of techniques that turns a dataset into software. (D) AI is software that can emulate the human mind.

1. (A), (B), (D)
2. (A), (C), (D)
3. (B), (C), (D)
4. All are correct

40. Classification problems are distinguished from estimation problems in that

1. classification problems require the output attribute to be numeric.
2. classification problems require the output attribute to be categorical.
3. classification problems do not allow an output attribute.
4. classification problems are designed to predict future outcomes.