AWS Interview Question-10

How do you upload a file larger than 100 megabytes to Amazon S3?
  • Amazon S3 supports storing objects or files of up to 5 terabytes.
  • A single PUT request can upload at most 5 gigabytes, and for any file larger than 100 megabytes AWS recommends the Multipart upload functionality.
  • With Multipart upload, a large file is split into multiple parts and S3 assembles them into one object once every part has arrived.
  • Each part is uploaded independently, so parts can be sent in parallel to reduce the upload time, and a failed part can be retried without restarting the whole upload.
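
The splitting step can be sketched in plain Python. This is only an illustration of how a file is divided into parts, not an AWS SDK call: `plan_parts` and its parameters are hypothetical names, and the two limits used (5 MiB minimum part size except for the last part, at most 10,000 parts) are the documented S3 multipart limits. In practice an SDK helper such as boto3's `upload_file` performs the multipart upload for you.

```python
MIB = 1024 * 1024
MIN_PART_SIZE = 5 * MIB   # S3 minimum size for every part except the last
MAX_PARTS = 10_000        # S3 maximum number of parts in one upload


def plan_parts(file_size, part_size=8 * MIB):
    """Return a list of (part_number, first_byte, last_byte) tuples.

    Hypothetical helper: it only plans byte ranges, it uploads nothing.
    """
    if file_size <= 0:
        raise ValueError("file_size must be positive")
    if part_size < MIN_PART_SIZE:
        raise ValueError("part_size is below the 5 MiB minimum")
    parts = []
    offset = 0
    while offset < file_size:
        # The last part may be shorter than part_size.
        last_byte = min(offset + part_size, file_size) - 1
        parts.append((len(parts) + 1, offset, last_byte))
        offset = last_byte + 1
    if len(parts) > MAX_PARTS:
        raise ValueError("too many parts; choose a larger part_size")
    return parts


# A 200 MiB file with 64 MiB parts: three full parts and one 8 MiB part.
parts = plan_parts(200 * MIB, part_size=64 * MIB)
for number, first_byte, last_byte in parts:
    print(number, first_byte, last_byte)
```

Each planned range can then be uploaded on its own, which is what makes parallel transfer and per-part retries possible.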

What are the types of Load Balancer?

Ans: Elastic Load Balancing originally offered two types, Classic LB and Application LB. ALB supports content-based routing. AWS has since added Network LB and Gateway LB.

What are the different types of Cloud Computing by service model?

Ans: IAAS (Infrastructure As A Service), PAAS (Platform As A Service), SAAS (Software As A Service)

Can an AMI be shared?

Ans: Yes. A developer can create an AMI and share it with other developers for their use. A shared AMI comes packed with the components you need, and you can customize it to suit your needs. Because you are not the owner of a shared AMI, using one always carries some risk.

Which component of a Redshift cluster, if down, renders the cluster unavailable?

Answer: Leader Node. The Leader Node in an Amazon Redshift cluster manages all external and internal communication. It is responsible for preparing query execution plans whenever a query is submitted to the cluster. The Leader Node distributes data to the slices, and allocates parts of a user query or other database operation to the slices. Slices work in parallel to perform the operations.

What is a Hypervisor?

Ans: A Hypervisor is software that enables Virtualization. It pools physical hardware resources into a platform that is delivered virtually to one or more users. EC2 originally ran on the XEN Hypervisor; newer instance types run on the KVM-based Nitro Hypervisor.

How is Encryption done in S3?

  • In Transit: SSL/TLS
  • At Rest, Server-Side Encryption:
      • S3 Managed Keys (SSE-S3)
      • AWS Key Management Service Managed Keys (SSE-KMS)
      • Customer Provided Keys (SSE-C)
  • At Rest, Client-Side Encryption

What is Auto Scaling?

Ans: Auto Scaling automatically adds or removes instances to match demand, for example by launching duplicate instances during heavy business hours. Scale-IN and Scale-OUT are the two directions of Scaling. Scale-IN: Reducing the number of instances. Scale-OUT: Increasing the number of instances by duplicating.
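
The scale-in and scale-out decision can be sketched as a small function. This is a simplified illustration, not the AWS Auto Scaling API: `desired_capacity`, the CPU thresholds, and the group size limits are all assumed values chosen for the example.

```python
def desired_capacity(current, cpu_percent, min_size=1, max_size=10,
                     scale_out_above=70.0, scale_in_below=30.0):
    """Return the instance count a simple threshold policy would ask for.

    Hypothetical policy: add one instance when average CPU is high,
    remove one when it is low, and always stay within the group limits.
    """
    if cpu_percent > scale_out_above:
        target = current + 1      # scale-out: add an instance
    elif cpu_percent < scale_in_below:
        target = current - 1      # scale-in: remove an instance
    else:
        target = current          # load is in the comfortable band
    return max(min_size, min(max_size, target))


print(desired_capacity(2, 85.0))   # busy hours: scale out
print(desired_capacity(2, 10.0))   # quiet hours: scale in
print(desired_capacity(1, 10.0))   # already at the minimum size
```

The clamp at the end reflects that a group never shrinks below its minimum size or grows past its maximum, whatever the load.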

Which SQL expression can be used in Redshift to specify a result when there are multiple conditions?

Answer: CASE expression
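
A minimal sketch of a searched CASE expression follows. It runs against an in-memory SQLite database from Python's standard library rather than Redshift, so that it needs no cluster; the `orders` table and its rows are made up for illustration. The CASE syntax shown is standard SQL, which Redshift also accepts.

```python
import sqlite3

# In-memory SQLite stands in for Redshift here (an assumption of this sketch).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER, amount REAL)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?)",
    [(1, 20.0), (2, 150.0), (3, 900.0)],  # made-up sample rows
)

# Conditions are checked top to bottom; the first match supplies the result.
rows = conn.execute(
    """
    SELECT id,
           CASE
               WHEN amount < 100 THEN 'small'
               WHEN amount < 500 THEN 'medium'
               ELSE 'large'
           END AS order_size
    FROM orders
    ORDER BY id
    """
).fetchall()

print(rows)
conn.close()
```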

What is a Key Pair and what is it used for?

Ans: We can use a Key Pair to log in to an EC2 Instance in a secure way. Key Pairs are specific to a region, so when instances are spread across regions we need to create a Key Pair in each region.

What is AMI?

Ans: AMI stands for Amazon Machine Image. It is a template containing the software configuration needed to launch an instance, for example the Operating System, DB Server, Application Server, etc.

You have to create an Amazon Machine Learning model to predict how many inches of snow will fall in an area based on the historical snowfall data. What type of modeling will you use?

Answer: Regression, because the value to predict (inches of snow) is a number.
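
To show what a regression model does, here is a minimal least-squares line fit in plain Python. It is not Amazon Machine Learning and not a realistic snowfall model: `fit_line` is a hypothetical helper, and the numbers are made up so that the arithmetic is easy to follow.

```python
def fit_line(xs, ys):
    """Fit y = slope * x + intercept by ordinary least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Made-up history: storm duration in hours -> inches of snow.
hours = [1.0, 2.0, 3.0, 4.0]
inches = [3.0, 5.0, 7.0, 9.0]

slope, intercept = fit_line(hours, inches)
predicted = slope * 5.0 + intercept   # numeric prediction for a 5-hour storm
print(slope, intercept, predicted)
```

The output of the model is a number on a continuous scale, which is what separates regression from classification.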

What is the difference between Stopping and Terminating an Instance?
  • When you STOP an instance it is a normal shutdown. The EBS volumes attached to that instance remain attached and you can start the instance again later.
  • When you TERMINATE an instance it gets deleted and you cannot start that instance again later. By default the root EBS volume is deleted as well, while other attached EBS volumes are kept unless they are set to delete on termination.
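
What is a Shard in AWS Kinesis?

Ans: It is a uniquely identified sequence of data records in a stream. A stream is made up of one or more shards, and each shard provides a fixed unit of capacity.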
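
How a record ends up in a particular shard can be sketched as follows. Kinesis maps each record's partition key to a 128-bit integer with an MD5 hash and places the record in the shard whose hash key range contains that value. The helper names and the even split of the hash space are assumptions of this sketch; a real stream reports its actual shard ranges.

```python
import hashlib

HASH_SPACE = 2 ** 128   # MD5 produces 128-bit values


def even_shard_ranges(shard_count):
    """Split the hash space into contiguous ranges, one per shard (assumed even)."""
    size = HASH_SPACE // shard_count
    ranges = []
    for i in range(shard_count):
        start = i * size
        # The last shard absorbs any remainder so the whole space is covered.
        end = HASH_SPACE - 1 if i == shard_count - 1 else start + size - 1
        ranges.append((start, end))
    return ranges


def shard_for_key(partition_key, ranges):
    """Return the index of the shard whose range holds the key's MD5 hash."""
    digest = hashlib.md5(partition_key.encode("utf-8")).digest()
    hash_key = int.from_bytes(digest, "big")
    for index, (start, end) in enumerate(ranges):
        if start <= hash_key <= end:
            return index
    raise ValueError("hash key outside all shard ranges")


ranges = even_shard_ranges(4)
for key in ["sensor-1", "sensor-2", "sensor-3"]:   # hypothetical partition keys
    print(key, "-> shard", shard_for_key(key, ranges))
```

Records that share a partition key always hash to the same shard, which is why ordering is preserved per key.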

When you launch a standby Relational Database Service instance, will it be available in the same Availability Zone?

Ans: No. The purpose of having a standby RDS instance is to survive an infrastructure failure, so in a Multi-AZ deployment the standby is kept in a different Availability Zone, which has independent infrastructure.

How will you load streaming data and establish scalable private connections to on-premises data centers? Which services will you use for that?

Answer: Direct Connect and Kinesis Firehose

  • AWS Direct Connect makes it easy to establish a dedicated network connection from your premises to AWS.
  • Using AWS Direct Connect, you can establish private connectivity between AWS and your data center.
  • Amazon Kinesis Firehose is the easiest way to load streaming data into AWS. It can capture and automatically load streaming data into Amazon S3 and Amazon Redshift.

What is the difference between Amazon RDS, DynamoDB and Redshift?

Ans: RDS is a managed relational database service meant for structured data. DynamoDB is a NoSQL service meant for key-value and document data with a flexible schema. Redshift is a data warehouse product used for data analysis.

Which library is used by the Spark Streaming tool to consume data from Amazon Kinesis?

Answer: Amazon Kinesis Client Library (KCL). The Kinesis Producer Library is used for writing data to a stream, not for consuming it.

There is a requirement to run SQL queries, including complex ones, across different backend data sources such as Redshift, MySQL, Hive on EMR, S3, and PostgreSQL. How can we use Presto in this case?

Answer: Presto is a high-performance, distributed SQL query engine for big data. Its architecture allows users to query a variety of data sources such as Hadoop, AWS S3, Alluxio, MySQL, Cassandra, Kafka, MongoDB and Teradata, so a single Presto query can combine data from several of these backends.

What are Lifecycle Hooks?

Ans: Lifecycle Hooks are used in Auto Scaling. Lifecycle hooks enable you to perform custom actions by pausing instances as an Auto Scaling group launches or terminates them. Each Auto Scaling group can have multiple lifecycle hooks.

We need to perform ad-hoc SQL queries on structured data in a project. The data comes in constantly at a high velocity, so which services should we use?

Answer: EMR + Redshift

What is S3?

Ans: S3 stands for Simple Storage Service, with a simple web service interface to store and retrieve any amount of data from anywhere on the web.

Suppose you have to load a lot of data once a week from your on-premises data center to AWS Redshift. Which AWS-managed cloud data migration tool can be used for this data transfer in a simple, fast, and secure way?

Answer: Direct Connect

What is AWS Lambda?

Ans: Lambda is an event-driven platform. It is a compute service that runs code in response to events and automatically manages the compute resources required by that code.

In S3 how many buckets can be created?

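Ans: The bucket limit applies per AWS account, not per region. The long-standing default is 100 buckets per account; it is a soft quota that can be raised on request, so check the current S3 quotas for your account.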

Which technology does AWS Athena use for partitioning data?

Answer: Hive. Athena follows Hive-style partitioning.

What is CloudFront?

Ans: Amazon CloudFront is a service that speeds up the transfer of your static and dynamic web content, such as HTML files, image files, etc. CloudFront delivers your content through worldwide data centers named Edge Locations.

Give a brief description of the S3 service in AWS.

Ans: S3 is the Simple Storage Service from Amazon. You can move your files TO and FROM S3. It is like FTP storage. You can keep your SNAPSHOTS in S3. You can also ENCRYPT your sensitive data in S3.

Explain Regions and Availability Zones in EC2?

Ans: Amazon hosts EC2 in various locations around the world. These locations are called REGIONS. For example, in Asia, Mumbai is one region and Singapore is another region. Each region is composed of isolated locations known as AVAILABILITY ZONES. Each Region is independent of the others, but the Availability Zones within a Region are linked through low-latency links.