Question 17:
You work for an energy company that buys and sells energy to customers. To get the best prices for their energy customers, your company trades financial energy derivative futures contracts. The trading of these future contracts requires accurate forecasting of energy prices. You need to build a model that compares spot prices (current commodity price) to future commodity prices (the price that a commodity can be bought or sold in the future). Your model needs to assist your company’s futures traders in hedging against future energy price changes based on current price predictions. To source the model with appropriate data, you need to gather and process the energy price data automatically. The data pipeline requires two sources of data: Historic energy spot prices Energy consumption and production rates Based on the company analysts’ requirements, you have decided you need multiple years of historical data. You also realize you’ll need to update the data feed daily as the market prices change. You can gather the required data through APIs from data provider vendor systems. Your company’s traders require a forecast from your model multiple times per day to help them form their trading strategy. So your pipeline needs to call the data provider APIs multiple times per day. Your data-ingestion pipeline needs to take the data from the API calls, perform preprocessing, and then store the data in an S3 data lake from which your forecasting model will access the data. Your data-ingestion pipeline has three main steps: Data ingestion Data storage Inference generation Assuming you have written a lambda function that interacts with the data provider APIs and stores the data in CSV format, which of the following python libraries are the best option to perform the data preprocessing to transform the data by changing raw feature vectors into a format best suited for a SageMaker batch transform job to generate your forecast?
Answer options:
A.matplotlib and plotly B.boto3 and moto C.pandas and scikit-learn D.NLTK and scrapy