Question 161:
You work for a global consulting company as a machine learning specialist. You work with a team of data scientists that continually create datasets for your consultancy’s analysis and trend prediction work using machine learning. You have been assigned the job of creating a data repository to store the large amount of training data generated by your data scientists for use in your machine learning models. You do not know how many new datasets your data scientists will create each day, so your solution must scale automatically, and your management team wants the storage solution to be cost-effective. Also, the data scientists and machine learning specialists must be able to query the data using SQL. Which option is the best solution to meet your requirements?
Answer options:
A.Have your data scientists store their new datasets in DynamoDB using global tables. B.Have your data scientists store their new datasets as tables in a Redshift cluster using RA3 nodes with managed storage and Redshift Spectrum. C.Have your data scientists store their new datasets as files in an EFS attached to EC2 instances instance. D.Have your data scientists store their new datasets as files in S3.