pds-it
['Product detail page','no']
Amazon Web Services / AWS Data Analytics
The illustrations were created in cooperation between humans and artificial intelligence. They show a future in which technology is omnipresent, but people remain at the center.
AI-generated illustration

Data Warehousing on AWS

Online
3 days
German
Download PDF
€ 1.990,-
plus VAT.
€ 2.368,10
incl. VAT.
Booking number
36407
Venue
Online
1 appointment
€ 1.990,-
plus VAT.
€ 2.368,10
incl. VAT.
Booking number
36407
Venue
Online
1 appointment
Become a certified
Machine Learning Engineer
This course is part of the certified Master Class "Machine Learning Engineer". If you book the entire Master Class, you save over 15 percent compared to booking this individual module.
To the Master Class
In-house training
In-house training for your Employees only - exclusive and effective.
Inquiries
In cooperation with
In this course you will learn how to develop a cloud-based data warehousing solution with Amazon Redshift.
Contents

This course introduces you to the concepts, strategies, and best practices for designing a cloud-based data warehousing solution using Amazon Redshift, the petabyte-scale data warehouse in AWS. We will demonstrate how to collect, store, and prepare data for the data warehouse using other AWS services such as Amazon DynamoDB, Amazon EMR, Amazon Kinesis Firehose, and Amazon Simple Storage Service (Amazon S3). We will also explain how you can use business intelligence (BI) tools to analyze your data.

This course is additionally enriched with content from the Building Data Analytics Solutions using Amazon Redshift course

 

Module 1: Introduction to Data Warehousing

  • Relational databases
  • Data warehousing concepts
  • The overlap between data warehousing and big data
  • Overview of data management in AWS
  • Practical Lab 1: Introduction to Amazon Redshift

 

Module 2: Introduction to Amazon Redshift

  • Conceptual overview
  • Use cases from practice
  • Interactive demo 1: Tour of the Amazon Redshift console
  • Practical Lab 2: Starting an Amazon Redshift Cluster
  • RA3 Nodes and AQUA architecture
  • Amazon Redshift ML

 

Module 3: Starting clusters

  • Structure of the cluster
  • Connecting to the cluster
  • Access control
  • Database security
  • Load data
  • Practical Lab 3: Optimizing database schemas
  • Optional Lab: Starting an Amazon Redshift Cluster

 

Module 4: Designing the database schema

  • Schemas and data types
  • Column-by-column compression
  • Styles of data distribution
  • Methods for data sorting
  • Practical Lab 3: Optimizing database schemas

 

Module 5: Identification of data sources

  • Overview of data sources
  • Amazon S3
  • Amazon DynamoDB
  • Amazon EMR
  • Amazon Kinesis data fire pants
  • AWS Lambda database loader for Amazon Redshift
  • Redshift Data API
  • SUPER Data Type
  • Interactive Demo 2: Connecting your Amazon Redshift cluster to a Jupyter notebook with Data API
  • Interactive demo 3: Analyzing semi-structured data with the SUPER data type
  • Practical Lab 4: Loading real-time data into an Amazon Redshift database

 

Module 6: Loading data

  • Preparing data
  • Data warehousing on AWS
  • Loading data with COPY
  • Maintaining tables
  • Simultaneous write operations
  • Troubleshooting for charging problems
  • Practical exercise 5: Loading data with the COPY command

 

Module 7: Writing queries and performance tuning

  • Amazon Redshift SQL
  • User-defined functions (UDFs)
  • Factors that influence query performance
  • The EXPLAIN command and query plans
  • Workload management (WLM)
  • Interactive Demo 4: Applying mixed workload management on Amazon Redshift
  • Practical Lab 6: Configuring workload management

 

Module 8: Amazon Redshift Spectrum

  • Amazon Redshift Spectrum
  • Configuring data for Amazon Redshift Spectrum
  • Amazon Redshift Spectrum queries
  • Data transformation
  • Data sharing
  • Exercise Lab 2: Data analysis with Amazon Redshift Spectrum
  • Exercise Lab 3: Data transformation and retrieval in Amazon Redshift
  • Practical Lab 7: Using Amazon Redshift Spectrum

 

Module 9: Maintenance of clusters

  • Audit logging
  • Performance monitoring
  • Events and notifications
  • Practical exercises 8: Auditing and monitoring clusters
  • Resizing clusters
  • Backing up and restoring clusters
  • Resource labeling and limits and restrictions
  • Practical Lab 9: Saving, restoring and resizing clusters
  • Optional: Data analysis and visualization

 

Module 10: Analyzing and visualizing data

  • Performance of visualizations
  • Creating dashboards
  • Amazon QuickSight editions and functions
Your benefit
  • Evaluation of the relationship between Amazon Redshift and other big data systems
  • Evaluate case studies of data warehouse workloads and cover case studies that demonstrate the implementation of AWS data and analytics services as part of the data warehousing solution
  • Select an Amazon Redshift node type in the appropriate size for your data needs
  • Understand security features suitable for Amazon Redshift, such as encryption, IAM permissions and database permissions
  • Commission an Amazon Redshift cluster and use components and functions to implement a data warehouse in the cloud
  • Use other AWS and analytics services such as Amazon DynamoDB, Amazon EMR, Amazon Kinesis Firehose and Amazon S3 to contribute to the data warehousing solution
  • Evaluate approaches and methods for designing data warehouses
  • Determine data sources and evaluate requirements that affect the data warehouse concept
  • Design data warehouse with regard to the effective use of compression, data distribution and sorting methods
  • Load and unload data and perform data maintenance tasks
  • Compose queries and evaluate query plans to optimize query performance
  • Configure the resource allocation database, such as query queue storage, and define the criteria to assign specific query types to your configured query queues for better processing
  • Check, monitor and receive event notifications about activities in the data warehouse using functions and services such as Amazon Redshift Database Audit Logging, Amazon CloudTrail, Amazon CloudWatch and Amazon Simple Notification Service (Amazon SNS)
  • Prepare operational tasks, such as resizing the Amazon Redshift cluster and using snapshots to backup and restore clusters
  • Use a BI application to perform data analysis and visualization tasks on your data
trainer
Milo Fels
Methods

Instructor-led training, practical exercises, demos and group exercises

Final examination
Recommended for

This course is aimed at the following job roles:

  • Data analytics

 

We recommend that participants in this course have the following prerequisites:

Familiarity with relational databases and database design concepts

Start dates and details

Form of learning

Learning form

20.10.2025
Online
Places free
Implementation secured
Online
Places free
Implementation secured

The training is carried out in cooperation with an authorized training partner.

The latter collects and processes data under its own responsibility. Please take note of the corresponding privacy policy.

Do you have questions about training?
Call us on +49 761 595 33900 or write to us at service@haufe-akademie.de or use the contact form.