Data Warehousing on AWS

Online

3 days

German

Download PDF

€ 1.990,-

plus VAT.

€ 2.368,10

incl. VAT.

Booking number

36407

Venue

Online

€ 1.990,-

plus VAT.

€ 2.368,10

incl. VAT.

Booking number

36407

Venue

Online

Become a certified
Machine Learning Engineer

This course is part of the certified Master Class "Machine Learning Engineer". If you book the entire Master Class, you save over 15 percent compared to booking this individual module.

To the Master Class

In-house training

In-house training just for your employees - exclusive and effective.

Inquiries

In cooperation with

In this course you will learn how to develop a cloud-based data warehousing solution with Amazon Redshift.

Contents

This course introduces you to the concepts, strategies, and best practices for designing a cloud-based data warehousing solution using Amazon Redshift, the petabyte-scale data warehouse in AWS. We will demonstrate how to collect, store, and prepare data for the data warehouse using other AWS services such as Amazon DynamoDB, Amazon EMR, Amazon Kinesis Firehose, and Amazon Simple Storage Service (Amazon S3). We will also explain how you can use business intelligence (BI) tools to analyze your data.

This course is additionally enriched with content from the Building Data Analytics Solutions using Amazon Redshift course

Module 1: Introduction to Data Warehousing

Relational databases
Data warehousing concepts
The overlap between data warehousing and big data
Overview of data management in AWS
Practical Lab 1: Introduction to Amazon Redshift

Module 2: Introduction to Amazon Redshift

Conceptual overview
Use cases from practice
Interactive demo 1: Tour of the Amazon Redshift console
Practical Lab 2: Starting an Amazon Redshift Cluster
RA3 Nodes and AQUA architecture
Amazon Redshift ML

Module 3: Starting clusters

Structure of the cluster
Connecting to the cluster
Access control
Database security
Load data
Practical Lab 3: Optimizing database schemas
Optional Lab: Starting an Amazon Redshift Cluster

Module 4: Designing the database schema

Schemas and data types
Column-by-column compression
Styles of data distribution
Methods for data sorting
Practical Lab 3: Optimizing database schemas

Module 5: Identification of data sources

Overview of data sources
Amazon S3
Amazon DynamoDB
Amazon EMR
Amazon Kinesis data fire pants
AWS Lambda database loader for Amazon Redshift
Redshift Data API
SUPER Data Type
Interactive Demo 2: Connecting your Amazon Redshift cluster to a Jupyter notebook with Data API
Interactive demo 3: Analyzing semi-structured data with the SUPER data type
Practical Lab 4: Loading real-time data into an Amazon Redshift database

Module 6: Loading data

Preparing data
Data warehousing on AWS
Loading data with COPY
Maintaining tables
Simultaneous write operations
Troubleshooting for charging problems
Practical exercise 5: Loading data with the COPY command

Module 7: Writing queries and performance tuning

Amazon Redshift SQL
User-defined functions (UDFs)
Factors that influence query performance
The EXPLAIN command and query plans
Workload management (WLM)
Interactive Demo 4: Applying mixed workload management on Amazon Redshift
Practical Lab 6: Configuring workload management

Module 8: Amazon Redshift Spectrum

Amazon Redshift Spectrum
Configuring data for Amazon Redshift Spectrum
Amazon Redshift Spectrum queries
Data transformation
Data sharing
Exercise Lab 2: Data analysis with Amazon Redshift Spectrum
Exercise Lab 3: Data transformation and retrieval in Amazon Redshift
Practical Lab 7: Using Amazon Redshift Spectrum

Module 9: Maintenance of clusters

Audit logging
Performance monitoring
Events and notifications
Practical exercises 8: Auditing and monitoring clusters
Resizing clusters
Backing up and restoring clusters
Resource labeling and limits and restrictions
Practical Lab 9: Saving, restoring and resizing clusters
Optional: Data analysis and visualization

Module 10: Analyzing and visualizing data

Performance of visualizations
Creating dashboards
Amazon QuickSight editions and functions

Your benefit

Evaluation of the relationship between Amazon Redshift and other big data systems
Evaluate case studies of data warehouse workloads and cover case studies that demonstrate the implementation of AWS data and analytics services as part of the data warehousing solution
Select an Amazon Redshift node type in the appropriate size for your data needs
Understand security features suitable for Amazon Redshift, such as encryption, IAM permissions and database permissions
Commission an Amazon Redshift cluster and use components and functions to implement a data warehouse in the cloud
Use other AWS and analytics services such as Amazon DynamoDB, Amazon EMR, Amazon Kinesis Firehose and Amazon S3 to contribute to the data warehousing solution
Evaluate approaches and methods for designing data warehouses
Determine data sources and evaluate requirements that affect the data warehouse concept
Design data warehouse with regard to the effective use of compression, data distribution and sorting methods
Load and unload data and perform data maintenance tasks
Compose queries and evaluate query plans to optimize query performance
Configure the resource allocation database, such as query queue storage, and define the criteria to assign specific query types to your configured query queues for better processing
Check, monitor and receive event notifications about activities in the data warehouse using functions and services such as Amazon Redshift Database Audit Logging, Amazon CloudTrail, Amazon CloudWatch and Amazon Simple Notification Service (Amazon SNS)
Prepare operational tasks, such as resizing the Amazon Redshift cluster and using snapshots to backup and restore clusters
Use a BI application to perform data analysis and visualization tasks on your data

trainer

Milo Fels

Methods

Instructor-led training, practical exercises, demos and group exercises

Final examination

Recommended for

This course is aimed at the following job roles:

Data analytics

We recommend that attendees on this course have the following prerequisites:

Familiarity with relational databases and database design concepts

Data Warehousing on AWS

Do you have questions about training?