AWS Data Engineering Training

Live Online & Classroom Enterprise Training

This course equips learners with the skills to design, build, and manage scalable data pipelines and analytics solutions using Amazon Web Services. It focuses on data ingestion, transformation, storage, orchestration, and performance optimization using modern AWS data services.

Looking for a private batch ?

REQUEST A CALLBACK

Enterprise Reporting
Lifetime Access
CloudLabs
24x7 Support
Real-time code analysis and feedback

What is AWS Data Engineering Course about?

AWS Data Engineering training provides hands-on knowledge to work with cloud-based data platforms. Learners gain practical experience in building end-to-end data pipelines, managing big data workloads, and implementing secure, cost-efficient architectures. The course covers key services such as Amazon S3, Glue, Redshift, Athena, Lambda, and more to support real-world data engineering use cases.

What are the objectives of AWS Data Engineering Course ?

Understand core data engineering concepts on AWS
Build scalable data pipelines using AWS services
Perform data transformation and orchestration
Implement data security and governance practices
Optimize performance and cost of data solutions

Who is AWS Data Engineering Course for?

Aspiring data engineers
Data analysts transitioning to engineering roles
Software developers working with data systems
Cloud professionals working on AWS
BI professionals handling large datasets

What are the prerequisites for AWS Data Engineering Course?

Prerequisites:

Basic understanding of databases and SQL
Fundamental knowledge of cloud computing
Familiarity with Python or scripting basics
Understanding of data formats (CSV, JSON, Parquet)
Basic knowledge of Linux or command line

Learning Path:

Fundamentals of data engineering
AWS core services for data workloads
Building ETL pipelines using AWS Glue
Data warehousing and analytics on AWS
Monitoring, optimization, and best practices

Related Courses:

AWS Cloud Practitioner Essentials
Python for Data Analysis
Big Data Fundamentals
Data Warehousing Concepts

Available Training Modes

Live Online Training

3 Days

Course Outline Expand All

Expand All

Module 1: AWS in Big Data introduction

Introduction to Cloud Computing

Cloud Computing Deployments Models

Amazon Web Services Cloud Platform

The Cloud Computing Difference

AWS Cloud Economics

AWS Virtuous Cycle

AWS Cloud Architecture Design Principles

Why AWS for Big Data - Reasons

Why AWS for Big Data - Challenges

Databases in AWS

Relational vs Non-Relational Databases

Data Warehousing in AWS

Services for Collecting, Processing, Storing, and Analyzing Big Data

Amazon Redshift

Amazon Kinesis

Amazon EMR

Amazon DynamoDB

Amazon Machine Learning

AWS Lambda

Amazon Elasticsearch Service

Amazon EC2 (big data analytics software on EC2 instances)

Amazon Redshift

Amazon Kinesis

Amazon EMR

Amazon DynamoDB

Amazon Machine Learning

AWS Lambda

Amazon Elasticsearch Service

Amazon EC2 (big data analytics software on EC2 instances)

Key Takeaway

Knowledge Checks

Lesson End Project

Module 2: Collection

Objectives

Amazon Kinesis Fundamentals

Loading Data into Kinesis Stream

Kinesis Data Stream High-Level Architecture

Kinesis Stream Core Concepts

Kinesis Stream Emitting Data to AWS Services

Kinesis Connector Library

Kinesis Firehose

Transferring Data Using Lambda

Amazon SQS

IoT and Big Data

IoT Framework

AWS Data Pipeline

AWS Data Pipeline Components

Key Takeaway

Knowledge Checks

Lesson End Project

Module 3: Storage

Objectives

Introduction to AWS Big Data Storage Services

Amazon Glacier

Glacier and Big Data

DynamoDB Introduction

The Architecture of the DynamoDB Table

DynamoDB in AWS Ecosystem

DynamoDB Partitions

Data Distribution

Local Secondary Index (LSI) **

Global Secondary Index (GSI) **

DynamoDB GSI vs LSI

DynamoDB Stream

Cross-Region Replication in DynamoDB

Partition Key Selection

Snowball & AWS Big Data

AWS DMS

AWS Aurora in Big Data

Key Takeaway

Knowledge Checks

Lesson End Project

Module 4: Processing I

Objectives

Introduction to AWS Big Data Processing Services

Amazon Elastic MapReduce (EMR)

Apache Hadoop

EMR Architecture

Storage Options

EMR File Storage and Compression

Supported File Format and File Size

Single-AZ Concept

EMR Operations

EMR Releases

AWS Cluster

Launching a Cluster

Advanced EMR Setting Option

Choosing Instance Type

Number of Instances

Monitoring EMR

Resizing of Cluster

Using Hue with EMR

Setup Hue for LDAP

Hive on EMR

Hive Use Cases

Key Takeaway

Knowledge Checks

Lesson End Project

Module 5: Processing II

HBase with EMR

HBase Use Cases

Comparison of HBase with Redshift and DynamoDB

HBase Architecture HBase on S3

HBase and EMRFS

HBase Integration

HCatalog

Presto with EMR

Advantages of Presto

Presto Architecture

Spark with EMR

Spark Use Cases

Spark Components

Spark Integration With EMR

AWS Lambda in AWS Big Data Ecosystem

Limitations of Lambda

Lambda and Kinesis Stream

Lambda and Redshift

Key Takeaway

Knowledge Checks

Lesson End Project

Module 6: Analysis I

Objectives

Introduction to AWS Big Data Analysis Services

RedShift

RedShift Architecture

RedShift in the AWS Ecosystem

Columnar Databases

RedShift Table Design

RedShift Workload Management

RedShift Loading Data

RedShift Maintenance and Operations

Key Takeaway

Knowledge Checks

Lesson End Project

Module 7: Analysis II

Machine Learning

Machine Learning - Use Cases

Algorithms

Amazon SageMaker

Elasticsearch

Amazon Elasticsearch Service

Loading of Data into Elasticsearch

Logstash

Kibana

RStudio

Characteristics

Athena

Presto and Hive

Integration with AWS Glue

Comparison of Athena with Other AWS Services

Lab Run Query on S3 Using Serverless Athena

Key Takeaway

Knowledge Checks

Lesson End Project

Module 8: Visualisation

Objectives

Introduction to AWS Big Data Visualization Services

Amazon QuickSight

Amazon QuickSight - Use Cases

LAB Create an Analysis with a Single Visual Using Sample Data

Working with Data

Assisted Practice: TBD

QuickSight Visualization

Big Data Visualization

Apache Zeppelin

Jupyter Notebook

Comparison Between Notebooks

D3.js (Data-Driven Documents)

MicroStrategy

Key Takeaway

Knowledge Checks

Lesson End Project

Module 9: Security

Objectives

Introduction to AWS Big Data Security Services

EMR Security

Roles

Private Subnet

Encryption At Rest and In Transit

RedShift Security

KMS Overview

SloudHSM

Limit Data Access

STS and Cross Account Access

Cloud Trail

Key Takeaway

Knowledge Checks

Lesson End Project

Who is the instructor for this training?

The trainer for this AWS Data Engineering Training has extensive experience in this domain, including years of experience training & mentoring professionals.

Reviews

My outlook on training changed completely after attending SpringPeople BPC training. The content, the trainer and infrastructure at SpringPeople were top notch and perfectly in tune with the industry requirements. Regardless to say, training is now something that I look forward to to. Kudos to everyone at SpringPeople!

Shweta Priya

Sony

I attended the 3-day AngularJs training at SpringPeople. The trainer was an industry veteran with vast experience in the subject. Notably, the hands-on training, and the Q&A session stood out. Overall, I found SpringPeople a great place to learn with excellent facilities and great trainers. Would recommend SpringPeople to my colleagues and friends.

Swati Singh

I attended the training on API Design for Mulesoft. The sessions were well planned and value-laden. I benefited immensely from the hands-on experience enabled through virtual labs. I would like to specifically commend the efficiency of the support team who were always available to resolve my concerns.

Nikhil Kohli

Stryker

I attended the jQuery training batch, conducted by Mr. Vijay, an SME who did a thorough coverage of all the essentials. He took us through concepts such as jQuery animations, event handlers, plugins, and jQuery-UI by small programs, very easily. The sessions were useful and well structured. By the end of the training, I was well equipped to develop a SPA on Product Management System. Overall, the learning experience at SpringPeople was great!

Heena Rajan

Mindtree