Question 63:
You work as a machine learning specialist for a clothing manufacturer. You have built a linear regression model using SageMaker’s built-in linear learner algorithm to predict sales for a given year. Your training dataset observations are based on several features such as marketing dollars spent, number of active stores, traffic per store, online traffic to the company website, overall market indicators, etc. You have decided to use the k-fold method of cross-validation to assess how the results of your model will generalize beyond your training data. Which of these will indicate that you don’t have biased training data?
Answer options:
A.The variance of the estimate increases as you increase k. B.You shouldn’t have to worry about bias because your error function removes bias in the data. C.Every k-fold cross-validation round increases the training error rate. D.Every k-fold cross-validation round has a very similar error rate to the rate of all the other rounds. E.You would not normally use k-fold with linear regression models.