Question 106:
MindPyramid Limited is a multinational information technology and outsourcing company headquartered in Vizag, India and New Jersey, USA. Founded in 2003, the company employs approximately 2000 employees. The company offers consulting services in cloud computing, big data and analytics. They offer services to major cloud providers including AWS. The team is working with one of the major clients having their infrastructure build on AWS. Currently the client is having lot of performance issues and wants to understand the design best practices from MindPyramid team.Please suggest the best practices in terms of loading data into tables. Select 4 options.
Answer options:
A.Use copy command here to load multiple files from Amazon S3, Amazon EMR, Amazon DynamoDB, or multiple data sources on remote hosts B.for optimal parallelism, split your load data files so that the files are about equal size, between 1 MB and 1 GB after compression C.The number of files should be a multiple of the (number of slices+ X) in your cluster. X indicates the number of leader nodes D.Manage data consistency using a manifest file to load data and address eventual consistency issues E.In order to reduce the need for VACCUM, load data in the sort key order of the table F.Load the data in sequential blocks according to sort order to eliminate the need to vacuum. G.Staging tables benefits inserts and updates but create performance issues when upserts are performed