1) In your project, your team has successfully migrated the company data warehouse to AWS Redshift. After signing new business deals, a third party will be securely sending files directly to S3, and the data in these files needs to be ingested into Redshift. What is the most efficient and best-automated way to introduce this change into the ETL pipeline? (Choose 2)
1. Use Lambda (AWS Redshift Database Loader).
2. Procure a new 3rd-party tool that integrates with S3 and Redshift and provides powerful scheduling capabilities.
3. Use Data Pipeline.
4. Run a cron job on a t2.micro instance that executes Linux shell scripts.
Correct Answer: Use Lambda (AWS Redshift Database Loader); Use Data Pipeline.
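For context, the Lambda-based approach reacts to S3 object-created events and issues a Redshift COPY for each new file. Below is a minimal sketch of that pattern using the Redshift Data API; the cluster, database, table, and IAM role names are hypothetical placeholders, and this is a simplification of what the AWS Lambda Redshift Database Loader actually does.

    import boto3

    redshift_data = boto3.client("redshift-data")

    def handler(event, context):
        # One record per newly created S3 object (S3 event notification).
        for record in event["Records"]:
            bucket = record["s3"]["bucket"]["name"]
            key = record["s3"]["object"]["key"]
            # Issue a COPY for the new file via the Redshift Data API.
            redshift_data.execute_statement(
                ClusterIdentifier="my-cluster",  # hypothetical
                Database="warehouse",            # hypothetical
                DbUser="loader",                 # hypothetical
                Sql=(
                    "COPY analytics.partner_files "  # hypothetical table
                    f"FROM 's3://{bucket}/{key}' "
                    "IAM_ROLE 'arn:aws:iam::123456789012:role/RedshiftCopyRole' "
                    "FORMAT AS CSV;"
                ),
            )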
2) Your company is launching an IoT device that will send data to AWS. All the data generated by the millions of devices your company is going to sell will be stored in DynamoDB for use by the Engineering team. Each customer’s data, however, will only be stored in DynamoDB for 30 days. A mobile application will be used to control the IoT device, and easy user sign-up and sign-in to the mobile application are requirements. Which option would be the best choice for their mobile application?
Use an Amazon Cognito identity pool.
Since everyone uses Facebook, Amazon, and Google, keep it simple and use all three.
Use a SAML identity provider.
Correct Answer: Use an Amazon Cognito identity pool.
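As an illustration, an identity pool lets the mobile app exchange an identity for temporary, scoped AWS credentials without managing IAM users per customer. A minimal boto3 sketch follows; the identity pool ID is a hypothetical placeholder, and the call assumes the pool allows unauthenticated identities (a real app would pass provider tokens in the Logins parameter).

    import boto3

    cognito = boto3.client("cognito-identity", region_name="us-east-1")

    # Get (or create) a Cognito identity from the identity pool.
    identity = cognito.get_id(
        IdentityPoolId="us-east-1:00000000-0000-0000-0000-000000000000"  # hypothetical
    )

    # Exchange the identity for short-lived AWS credentials.
    creds = cognito.get_credentials_for_identity(IdentityId=identity["IdentityId"])
    print(creds["Credentials"]["AccessKeyId"])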
3) If Kinesis Firehose experiences data delivery issues to S3, it will retry delivery to S3 for a period of __.
7 hours
7 days
24 hours
Correct Answer: 24 hours
4) Which of the following AWS IoT components transforms messages and routes them to different AWS services?
Device Gateway
Rules Engine
Device Shadow
Correct Answer: Rules Engine
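To make this concrete, a topic rule pairs a SQL statement (which selects and transforms message fields) with one or more actions that route the result to other AWS services. Here is a minimal sketch that forwards filtered telemetry to a Kinesis Firehose stream; the rule name, topic filter, role ARN, and stream name are all hypothetical.

    import boto3

    iot = boto3.client("iot")

    iot.create_topic_rule(
        ruleName="forward_hot_readings",  # hypothetical
        topicRulePayload={
            # The Rules Engine evaluates this SQL against incoming MQTT messages.
            "sql": "SELECT temperature, device_id "
                   "FROM 'devices/+/telemetry' WHERE temperature > 40",
            "actions": [
                {
                    "firehose": {  # route matching messages to a Firehose stream
                        "roleArn": "arn:aws:iam::123456789012:role/IotToFirehose",
                        "deliveryStreamName": "telemetry-stream",  # hypothetical
                    }
                }
            ],
        },
    )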
5) True or False: Data Pipeline does not integrate with on-premise servers.
True
False
Correct Answer: False
6) Which service does Kinesis Firehose not load streaming data into?
DynamoDB
Redshift
Elasticsearch
Correct Answer: DynamoDB
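For reference, a Firehose delivery stream is created with a destination configuration for a supported target such as S3 (or Redshift or Elasticsearch); there is no DynamoDB destination. A minimal sketch of an S3 delivery stream is below, with hypothetical names throughout.

    import boto3

    firehose = boto3.client("firehose")

    firehose.create_delivery_stream(
        DeliveryStreamName="clickstream-to-s3",  # hypothetical
        DeliveryStreamType="DirectPut",
        ExtendedS3DestinationConfiguration={
            "RoleArn": "arn:aws:iam::123456789012:role/FirehoseDeliveryRole",
            "BucketARN": "arn:aws:s3:::my-landing-bucket",  # hypothetical bucket
        },
    )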
7) For which of the following AWS services can you not create a rule action in AWS IoT? (Choose 2)
Kinesis Firehose
Redshift
CloudWatch
Aurora
Correct Answer: Redshift; Aurora.
8) For an unknown reason, data delivery from Kinesis Firehose to your Redshift cluster has failed. Kinesis Firehose retries the data delivery every 5 minutes for a maximum period of 60 minutes; however, none of the retries deliver the data to Redshift. Kinesis Firehose skips the files and moves on to the next batch of files in S3. How can you ensure that the undelivered data is eventually loaded into Redshift?
Check the STL_LOAD_ERRORS table in Redshift, find the files that failed to load, and manually load the data in those files using the COPY command.
Create a Lambda function that reads the manifest after the retries have completed and runs the COPY command to automatically load these files into Redshift.
Skipped files are delivered to your S3 bucket as a manifest file in an errors folder. Run the COPY command manually to load the skipped files after you have determined why they failed to load.
Correct Answer: Skipped files are delivered to your S3 bucket as a manifest file in an errors folder. Run the COPY command manually to load the skipped files after you have determined why they failed to load.
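To illustrate the manual recovery step, the COPY command can take the error manifest as its source; the MANIFEST keyword tells Redshift the S3 object is a list of files to load rather than data itself. A minimal sketch using the Redshift Data API is below; the cluster, table, bucket path, and role names are hypothetical.

    import boto3

    redshift_data = boto3.client("redshift-data")

    redshift_data.execute_statement(
        ClusterIdentifier="my-cluster",  # hypothetical
        Database="warehouse",            # hypothetical
        DbUser="loader",                 # hypothetical
        Sql=(
            "COPY analytics.events "                            # hypothetical table
            "FROM 's3://my-firehose-bucket/errors/manifest' "   # hypothetical path
            "IAM_ROLE 'arn:aws:iam::123456789012:role/RedshiftCopyRole' "
            "MANIFEST;"  # treat the S3 object as a manifest listing the skipped files
        ),
    )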
9) Regarding SQS, which of the following are true? (Choose 3)
A queue can only be created in limited regions, and you should check the SQS website to see which are supported.
Messages can be sent and read simultaneously.
A queue can be created in any region.
Messages can be retained in queues for up to 7 days.
Messages can be retained in queues for up to 14 days.
Correct Answer: Messages can be sent and read simultaneously; A queue can be created in any region; Messages can be retained in queues for up to 14 days.
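As a quick demonstration of the retention limit, the MessageRetentionPeriod queue attribute accepts values up to 1,209,600 seconds (14 days). A minimal sketch follows; the queue name and message body are hypothetical.

    import boto3

    sqs = boto3.client("sqs")

    # 1,209,600 seconds = 14 days, the SQS maximum retention period.
    queue = sqs.create_queue(
        QueueName="orders",  # hypothetical
        Attributes={"MessageRetentionPeriod": "1209600"},
    )

    # Producers and consumers can operate on the queue at the same time.
    sqs.send_message(QueueUrl=queue["QueueUrl"], MessageBody="order-123")
    messages = sqs.receive_message(QueueUrl=queue["QueueUrl"])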
10) What are the main uses of Kinesis Data Streams? (Choose 2)
They can undertake the loading of streamed data directly into data stores
They can provide long term storage of data
They can carry out real-time reporting and analysis of streamed data
They can accept data as soon as it has been produced, without the need for batching
Correct Answer: They can carry out real-time reporting and analysis of streamed data; They can accept data as soon as it has been produced, without the need for batching.
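To show the no-batching point, a producer can put each record onto the stream the moment it is generated. A minimal sketch is below; the stream name and record contents are hypothetical.

    import json

    import boto3

    kinesis = boto3.client("kinesis")

    # Push each record the moment it is produced; no batching required.
    reading = {"device_id": "sensor-42", "temperature": 21.7}
    kinesis.put_record(
        StreamName="telemetry-stream",             # hypothetical
        Data=json.dumps(reading).encode("utf-8"),
        PartitionKey=reading["device_id"],         # determines the shard
    )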