Further training

Data Scientist with Python

Data pipelines with machine learning algorithms and Python - the online training with certificate of completion

This continuing education is held in German.
Being able to process and analyze data automatically and in real time and derive insights from it is one of the key requirements of companies. Building the data pipelines for this is the task of data scientists - a professional field that is currently in high demand and offers great opportunities. This certified online training course enables you to set up data mining processes, apply machine learning algorithms, create predictive models and implement them productively in automated workflows. The course uses the Python programming language with its leading machine learning libraries. This online course is designed so that you can learn flexibly and at your own pace. You can expect videos, interactive graphics, texts and lots of practical exercises with extensive data sets and coding tasks. Experienced data analysts are on hand as mentors to answer your questions.

The online training course has been tested and approved by the State Central Agency for Distance Learning (ZFU) in Cologne under the number 73597.

Contents

Further training in accordance with AI Regulation Art. 4 for the obligation to provide evidence of AI competence

1. basics of data analytics with Python

  • Working with the Data Lab.
  • Basics and concepts in Python.
  • Presentation of the tools pandas, matplotlib and seaborn.
  • Database queries with SQL Alchemy.

2. linear algebra

  • Mathematical background.
  • Basic concepts of linear algebra.
  • Calculation with vectors and matrices.
  • Use of the Python library numpy.

3. probability distribution

  • Statistics in data science algorithms.
  • Discrete and continuous distributions.
  • Versioning of code in Git.

4. supervised learning (regression)

  • Use linear regression.
  • Use of the Python package sklearn.
  • Understanding regression models.
  • Evaluation of the forecasts.
  • Bias variance trade-off and regularization.
  • Measurement of model quality.

5. supervised learning (classification)

  • Concepts of supervised learning.
  • Introduction to classification algorithms.
  • The k-Nearest Neighbors algorithm.
  • Assessment of the classification performance.
  • Optimization of the parameters.
  • Division of the data into training and evaluation sets.

6. unsupervised learning (clustering)

  • Concepts of Unsupervised Learning.
  • The k-Means algorithm.
  • Evaluation of performance metrics.
  • Alternatives to k-means clustering.

7. unsupervised learning (dimension reduction)

  • dimensions in the data analysis.
  • Principal Component Analysis (PCA).
  • Generate uncorrelated features from original data.
  • Introduction to feature engineering.

8. identify and exclude outliers

  • Methods for detecting outliers.
  • Criteria of unusual data points.
  • Robust measurements and reduction of influences due to outliers.

9. collect and merge data

  • Read data from web pages and PDF documents.
  • Use of regular expressions.
  • Structure text data before processing.

10. logistic regression

  • Concepts of logistic regression.
  • Performance metrics for evaluation.
  • Use non-numerical data in models.

11. decision trees and random forests

  • The concept of Decision Trees.
  • Combine several styles to create ensembles.
  • Methods for improving the prediction quality.

12. support vector machines

  • Use of Support Vector Machines (SVM).
  • Introduction to Natural Language Processing (NLP).
  • Text classification with bag-of-words models.

13. neural networks

  • Basics of artificial neural networks (ANN).
  • Basics of deep learning.
  • Deeper understanding of the layers in KNN.

14. visualization and model interpretation

  • Derive and illustrate how models work.
  • Methods for interpretation and visualization.
  • Apply model-agnostic methods.

15. use distributed databases

  • Use the Python package PySpark.
  • Read data from distributed databases.
  • Basics of big data analysis.
  • Using machine learning algorithms in distributed systems.

16. exercise project

  • Work independently on a comprehensive exercise project.
  • Solve prediction problem using a larger data set.
  • Preparation for the final project.

17. final project

  • Independent analysis of the data project.
  • Presentation of results and 1:1 feedback meeting with mentor team.
  • Receipt of the certificate for Data Scientist with Python.

How do you learn in the course?

This online course offers you a particularly practice-oriented learning concept with comprehensive self-study units and a team of mentors who are available to you at all times. A new chapter is unlocked for you every week. With a time budget of around 6 hours per week, you are sure to reach your goal in 17 weeks. This is how you learn in the course:

Placement test: In an onboarding meeting at the start of the course, you and the mentor team will determine what knowledge you already have and which parts of the course you should pay particular attention to. This will prepare you optimally for learning in the self-study units.

Data Lab: In the course's learning environment, you can expect videos, interactive graphics, text and, above all, lots of practical exercises with comprehensive datasets and coding tasks. You carry these out directly in the browser - without any installation or configuration effort and with direct success control.

Mentor team: Your learning coaches are available to answer any questions you may have. They are experienced data analysts who will be happy to help you - via chat, audio or video call.

Webinars: Once a week, you have the opportunity to take part in webinars and immerse yourself in selected specialist data analysis topics.

Career coaching: What professional goals are you pursuing with your further education and how can you achieve them? A team of mentors is ready to help you achieve your career goals.

Final project: In your own data project, you will work independently through the entire data pipeline and answer typical questions. At the end, you will present your project in a 1-to-1 feedback session with your mentor team.

Certificate: After the final project, you will receive your official certificate as a Data Scientist with Python.

This online training is provided by our partner StackFuel GmbH. StackFuel specializes in training courses on data literacy, data science and AI.

Your benefit

In this practice-oriented training course, you will learn how to carry out data analyses with large data sets independently .

You will learn how to use Python competently, how to use the programming language for data evaluation and how to create effective visualizations.

You will learn how to connect different data sources, filter data in them and merge them.

You will learn about machine learning methods, algorithms and technologies and how to use them with Python packages.

You will learn everything you need to know about the use of deep learning and create an artificial neural network with multiple layers.

After the training, you will be able to examine company data, visualize it in a meaningful way and make it interactively accessible in dynamic dashboards.

The technical entry hurdles are minimized by the use of Jupyter notebooks , with which you can carry out the programming exercises directly in the browser.

Recommended for

Anyone looking for comprehensive training on machine learning and data pipelines. Basic knowledge of Python is required. The training is also suitable for career changers.

Final examination

In your own data project, you will work independently through the entire data pipeline and answer typical questions. At the end, you will present your project in a 1-to-1 feedback session with your mentor team.

Open Badges - Show what you can do digitally too.

Open Badges are recognized, digital certificates of participation. These verifiable credentials are the current standard for integration in career networks such as LinkedIn.

With them, you digitally demonstrate the competences you possess. After successful completion, you will receive an Open Badge from us.

Read more

Further recommendations for "Data Scientist with Python"

View into the product

Here you can get impressions of the training as well as information about the training topic.

Articles, interviews or whitepapers on the topic

Data Scientist: Salary, tasks and skills

Structuring large amounts of data and transforming it into useful information - that is the main task of a data scientist. As a specialist for data and data references, he or she creates meaningful forecasts for the future from mere figures and provides the company with recommendations for action. Data Scientist: Key Facts Education Master's degree in Data Science, Computer Science or Mathematics Professional experience an advantage [...]

Learn more here

Insight into the Datalab

You can view 3 pictures of the event.

Articles, interviews or whitepapers on the topic

Data Scientist: Salary, tasks and skills

Structuring large amounts of data and transforming it into useful information - that is the main task of a data scientist. As a specialist for data and data references, he or she creates meaningful forecasts for the future from mere figures and provides the company with recommendations for action. Data Scientist: Key Facts Education Master's degree in Data Science, Computer Science or Mathematics Professional experience an advantage [...]

Learn more here

Insight into the Datalab

You can view 3 pictures of the event.

Digital learning for individuals
Booking number
30354
€ 4.500,- plus VAT
18 weeks (6 ...
Online
4 Events
German
Start dates

Can also be booked as English-language training:
Data Scientist with Python

To this product

Future Jobs Classes

Get ready for the jobs of the future and develop into a data analyst.

In cooperation with

Start dates and details

  Select time period
0 events
28.07.2025
Booking number: 30354
€ 4.500,- plus VAT.
€ 5.355,- incl. VAT.
Details
18 weeks (6 hours/week)
08.09.2025
Booking number: 30354
€ 4.500,- plus VAT.
€ 5.355,- incl. VAT.
Details
18 weeks (6 hours/week)
20.10.2025
Booking number: 30354
€ 4.500,- plus VAT.
€ 5.355,- incl. VAT.
Details
18 weeks (6 hours/week)
01.12.2025
Booking number: 30354
€ 4.500,- plus VAT.
€ 5.355,- incl. VAT.
Details
18 weeks (6 hours/week)
Sufficient places are still free.
Don't wait too long to book.
Fully booked.
Training is guaranteed to take place
Booking number: 30354
€ 4.500,- plus VAT.
€ 5.355,- incl. VAT.
Details
18 weeks (6 hours/week)
Booking number: 30354
€ 4.500,- plus VAT.
€ 5.355,- incl. VAT.
Details
18 weeks (6 hours/week)
Please note: We use third-party tools for selected events. Personal data of the participant will be passed on to them for the implementation of the training offer. You can find more information in our privacy policy.

About us - The Haufe Akademie

Your optimizer, innovator and companion since 1978 -
Your professional partner for professional development and seminars, training courses and topical conferences.

Whether on site, live online or in-house - our customised solutions, our claim to the highest level of consulting expertise and training tailored to your needs simplify the acquisition of skills for the working world of the future and sustainably facilitate professional development.

A wide range of seminars, individual coaching and our flexible formats support HR managers and decision-makers in shaping the future and developing employees, in-house teams and companies.

Experience the benefits of online training from the comfort of your own home. Our online formats meet the highest quality standards and are in no way inferior to face-to-face events in terms of practical relevance. Learn together live online in interactive groups or digitally at a time of your choice.

2,500+ further training
600,000+ apprentices per year
Over 95% positive reviews
2,500 trainers and coaches
17,500+ training courses held per year
Call us or send an email

Do you have any questions?

We are there for you Monday to Friday 8:00 a.m. - 5:00 p.m.

Stephanie Göpfert

Head of Customer Service

*Mandatory fields
FAQs

Questions & Answers

In our Questions & Answers (FAQ) section, you will find all the answers and the most frequently asked questions about your selected topic.