Question 55:
You are building a machine learning model to use your web server logs to predict which users are most likely to buy a given product. Using your company’s unstructured web server log data stored in S3, you want to get your data into CSV format and load it into another S3 bucket so that you can use it for your machine learning algorithm. Which of the following architectures will be the most efficient way to achieve this?
Answer options:
A.Load the log data into a Redshift cluster; use the UNLOAD Redshift command with a select statement to unload the data in CSV format to S3; SageMaker model uses the data to produce product purchase predictions. B.Use a built-in classifier in an AWS Glue crawler that crawls the web server logs and outputs the log data to CSV format on your ML S3 bucket; SageMaker model uses the data to produce product purchase predictions. C.Use AWS Schema Conversion tool to convert your web log data to CSV format and output it to your ML S3 bucket; run your SageMaker model on the new data to produce product purchase predictions. D.Use AWS Snowball Edge and its lambda function capability to convert and then move the web log to S3 in CSV format; run your SageMaker model on the new data to produce product purchase predictions.