ExamQuestions.com

Register
Login
AWS Certified Machine Learning Specialty Exam Questions

Amazon

AWS Certified Machine Learning Specialty

236 / 258

Question 236:

You are a machine learning specialist at an online car retailer. Your machine learning team has been tasked with building models to predict car sales and customer conversion rates. The dataset you are using has a large number of features, over 1,000. Your team plans to use linear models, such as linear regression and logistic regression, in a SageMaker Studio environment. When your team performs exploratory data analysis in their SageMaker Studio jupyter notebooks, they notice that many features are highly correlated with each other. Your tech lead has indicated that this may make your models unstable.
Which option would help you reduce the impact of having such a large number of features?

Answer options:

A.Use dot product on the highly correlated features.
B.Use Principal Component Analysis (PCA) to create a new feature space
C.One-hot-encode the highly correlated features.
D.Use TF-IDF encoding to reduce the impact of the highly correlated features.