Correct Answers: D and E
Option A is incorrect. The most commonly used data source for SageMaker is an S3 bucket. However, you can also use Athena, EMR, and Redshift as data sources for SageMaker.
Option B is incorrect. The most commonly used data source for SageMaker is an S3 bucket. However, you can also use Athena, EMR, and Redshift as data sources for SageMaker.
Option C is incorrect. The most commonly used data source for SageMaker is an S3 bucket. However, you can also use Athena, EMR, and Redshift as data sources for SageMaker.
Option D is correct. DynamoDB is not a viable data source for ingesting data into your machine learning jupyter notebook environment.
Option E is correct. RDS is not a viable data source for ingesting data into your machine learning jupyter notebook environment.
References:
Please see the Amazon SageMaker Examples Read the Docs Data Ingestion guide titled Get started with data ingestion (https://sagemaker-examples.readthedocs.io/en/latest/ingest_data/index.html),
The Amazon SageMaker Examples Read the Docs Data Ingestion guide titled Ingest data with Athena (https://sagemaker-examples.readthedocs.io/en/latest/ingest_data/02_Ingest_data_with_Athena_v1.html),
The Amazon SageMaker Examples Read the Docs Data Ingestion guide titled Ingest Data with EMR (https://sagemaker-examples.readthedocs.io/en/latest/ingest_data/04_Ingest_data_with_EMR.html),
The Amazon SageMaker Examples Read the Docs Data Ingestion guide titled Ingest data with Redshift (https://sagemaker-examples.readthedocs.io/en/latest/ingest_data/03_Ingest_data_with_Redshift_v3.html)