Question 12:
You work for a startup e-commerce site that sells various consumer products. Your company has just launched its e-commerce website. The site provides the capability for your users to rate their purchases and the products they have purchased from your e-commerce site. You would like to use the review data to build a recommender machine learning model. Since your e-commerce site is very new, you don’t yet have a very large review dataset to use for your recommendation model. You have decided to use the Amazon Customer Reviews dataset from the AWS website as a first data source for your machine learning model. Since your website sells similar products to the products sold on Amazon, you will use the Amazon Customer Reviews dataset as the basis for your initial training runs of your model. Once you have enough data from your own e-commerce site, you’ll use that data. Your goal is to perform sentiment analysis on the review dataset to create your own dataset that will be the source used for your recommender machine learning model. Which set of AWS services would you use to build your data pipeline to produce your sentiment dataset for use by your SageMaker model?
Answer options:
A.S3 -> AWS Glue ETL -> Comprehend -> S3 -> SageMaker B.S3 -> AWS Glue ETL -> Comprehend -> S3 -> Athena -> QuickSite -> SageMaker C.S3 -> Kinesis Data Firehose -> Comprehend -> S3 -> SageMaker D.S3 -> Kinesis Data Firehose -> Lambda -> S3 -> SageMaker