Question 67:
You work as a machine learning specialist for a polling research company. You have national polling data for the last 10 presidential elections that you have engineered, randomized, partitioned into various training and test datasets, and stored on S3. You have selected a SageMaker built-in algorithm to use for your model. Your training datasets are very large. As you repeatedly run your training job with different large datasets, you find your training takes a very long time. How can you improve the performance of your training runs? (Select TWO)
Answer options:
A.Use the protobuf recordIO format. B.Convert your data to XML and use file mode to load your data to the EBS training instance volumes. C.Use pipe mode to stream the training data directly to your EBS training instance volumes. D.Convert your data to CSV and use file mode to load your data to the EBS training instance volumes. E.Change your Elastic Inference accelerator type to a larger instance type.