Question 135:
You work for the city planning department of a major metropolitan city in the United States. You are on the city’s machine learning team where you are responsible for creating a model that assists in the resource planning for police officers in the city. Each day the city has to assign police officers to each precinct according to varying parameters. You have data from the past several years for your city and other US cities of similar makeup. You are in the process of deciding which algorithm to use for your police officer allocation model. Your goal is to predict the police officer allocation size for a given shift based on your dataset features. Your city dataset has the following features: Infrastructure average age Square feet Citizens Precincts Residences Population density Police officers Before you select an algorithm, you need to perform feature selection and dimensionality reduction of your features. You only want to select features that are relevant to your training dataset, i.e., dimensionality reduction. This process will help you prevent overfitting and increase computation efficiency through simplification of the feature set. You have chosen to use visualization techniques to decide which of your 7 features are the most important or most relevant, in other words, which of your 7 features are needed to train your model properly. Which visualization techniques are the best to use for this purpose? (Choose TWO)
Answer options:
A.Cat plot B.Swarm plot C.Pairs plot D.Covariance matrix E.Entropy matrix