Building Data Pipelines with Apache Kafka Training Course
Apache Kafka serves as a distributed streaming platform and has become the de facto standard for constructing data pipelines. It addresses a wide array of data processing scenarios, functioning effectively as a message queue, a distributed log, a stream processor, and more.
The course begins by exploring the underlying theory of data pipelines, followed by an in-depth look at the core principles of Kafka. We will also examine key components such as Kafka Streams and Kafka Connect.
This course is available as onsite live training in Slovakia or online live training.Course Outline
- Data pipelines 101: ingestion, storage, processing
- Kafka fundamentals: topics, partitions, brokers, replication, etc.
- Producer and Consumer APIs
- Kafka Streams as a processing layer
- Kafka Connect for integrating with external systems
- Kafka best practices and tuning
Requirements
Basic proficiency in Java 8 or Scala is recommended. If you plan to execute the examples locally, please ensure that Docker and Docker Compose are installed.
Open Training Courses require 5+ participants.
Building Data Pipelines with Apache Kafka Training Course - Booking
Building Data Pipelines with Apache Kafka Training Course - Enquiry
Building Data Pipelines with Apache Kafka - Consultancy Enquiry
Testimonials (2)
Possibility to perform independent exercises in the training environment.
Tomasz - PKO Zycie Towarzystwo Ubezpieczen S.A.
Course - Kafka for Administrators
The trainer tried to make the most complicated topics , explain it in simpler way
Calvin Raj Antony - SICPA SA
Course - Administration of Kafka Message Queue
Upcoming Courses
Related Courses
Administration of Confluent Apache Kafka
21 HoursConfluent Apache Kafka is a distributed event streaming platform engineered for high-throughput, fault-tolerant data pipelines and real-time analytics.
This instructor-led, live training (available online or onsite) targets intermediate-level system administrators and DevOps professionals looking to install, configure, monitor, and troubleshoot Confluent Apache Kafka clusters.
Upon completion of this training, participants will be able to:
- Grasp the components and architecture of Confluent Kafka.
- Deploy and manage Kafka brokers, Zookeeper quorums, and essential services.
- Configure advanced features such as security, replication, and performance tuning.
- Utilize management tools to monitor and maintain Kafka clusters.
Format of the Course
- Interactive lectures and discussions.
- Extensive exercises and practice sessions.
- Hands-on implementation within a live-lab environment.
Course Customization Options
- To request customized training for this course, please contact us to arrange.
Apache Kafka Connect
7 HoursThis instructor-led, live training in Slovakia (online or onsite) is tailored for developers who wish to integrate Apache Kafka with existing databases and applications for processing, analysis, and other purposes.
By the end of this training, participants will be able to:
- Use Kafka Connect to ingest large amounts of data from a database into Kafka topics.
- Ingest log data generated by application servers into Kafka topics.
- Make any collected data available for stream processing.
- Export data from Kafka topics into secondary systems for storage and analysis.
Big Data Streaming for Developers
14 HoursGain the skills to implement complete big data streaming use cases. Master real-time data preparation and maintenance using Informatica, Edge, Kafka, and Spark. This course is applicable to software versions 10.2.1 and later.
Confluent Apache Kafka: Cluster Operations and Configuration
16 HoursConfluent Apache Kafka is an enterprise-grade distributed event streaming platform built on Apache Kafka. It supports high-throughput, fault-tolerant data pipelines and real-time streaming applications.
This instructor-led, live training (online or onsite) is aimed at intermediate-level engineers and administrators who wish to deploy, configure, and optimize Confluent Kafka clusters in production environments.
By the end of this training, participants will be able to:
- Install, configure, and operate Confluent Kafka clusters with multiple brokers.
- Design high-availability setups using Zookeeper and replication techniques.
- Tune performance, monitor metrics, and apply recovery strategies.
- Secure, scale, and integrate Kafka with enterprise environments.
Format of the Course
- Interactive lecture and discussion.
- Lots of exercises and practice.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
Building Kafka Solutions with Confluent
14 HoursThis instructor-led live training, available online or on-site, is designed for engineers who want to utilize Confluent (a distribution of Kafka) to build and manage a real-time data processing platform for their applications.
Upon completion of this training, participants will be able to:
- Install and configure the Confluent Platform.
- Leverage Confluent’s management tools and services to simplify Kafka operations.
- Store and process incoming stream data.
- Optimize and manage Kafka clusters.
- Secure data streams.
Course Format
- Interactive lectures and discussions.
- Extensive exercises and practical work.
- Hands-on implementation within a live lab environment.
Course Customization Options
- This course is based on the open-source version of Confluent: Confluent Open Source.
- To request customized training for this course, please contact us to arrange.
A Practical Introduction to Stream Processing
21 HoursIn this instructor-led, live training at Slovakia (onsite or remote), participants will learn how to set up and integrate various Stream Processing frameworks with existing big data storage systems, as well as related software applications and microservices.
By the end of this training, participants will be able to:
- Install and configure different Stream Processing frameworks, such as Spark Streaming and Kafka Streaming.
- Understand and select the most appropriate framework for the job.
- Process data continuously, concurrently, and in a record-by-record fashion.
- Integrate Stream Processing solutions with existing databases, data warehouses, data lakes, etc.
- Integrate the most appropriate stream processing library with enterprise applications and microservices.
Distributed Messaging with Apache Kafka
14 HoursThis course is designed for enterprise architects, developers, system administrators, and anyone seeking to understand and utilize a high-throughput distributed messaging system. If you have more specific needs (e.g., focusing solely on system administration), this course can be customized to better fit your requirements.
Kafka for Administrators
21 HoursThis instructor-led live training in Slovakia (online or onsite) is aimed at beginner-level, intermediate-level, or advanced-level system administrators and operations engineers who wish to use Apache Kafka to deploy, secure, monitor, and troubleshoot Kafka clusters.
By the end of this training, participants will be able to: explain Kafka architecture and KRaft mode, operate and secure Kafka clusters, monitor performance and reliability, and resolve common production issues.
Apache Kafka for Developers
21 HoursThis instructor-led, live training in Slovakia (online or onsite) is aimed at intermediate-level developers who wish to develop big data applications with Apache Kafka.
By the end of this training, participants will be able to:
- Develop Kafka producers and consumers to send and read data from Kafka.
- Integrate Kafka with external systems using Kafka Connect.
- Write streaming applications with Kafka Streams & ksqlDB.
- Integrate a Kafka client application with Confluent Cloud for cloud-based Kafka deployments.
- Gain practical experience through hands-on exercises and real-world use cases.
Apache Kafka for Python Programmers
7 HoursThis instructor-led live training in Slovakia (online or onsite) is designed for data engineers, data scientists, and developers who want to leverage Apache Kafka capabilities for data streaming using Python.
Upon completion of this training, participants will be able to use Apache Kafka to monitor and manage conditions within continuous data streams via Python programming.
Kafka Fundamentals for Java Developers
14 HoursThis instructor-led, live training in Slovakia (online or onsite) is aimed at intermediate-level Java developers who wish to integrate Apache Kafka into their applications for reliable, scalable, and high-throughput messaging.
By the end of this training, participants will be able to:
- Understand the architecture and core components of Kafka.
- Set up and configure a Kafka cluster.
- Produce and consume messages using Java.
- Implement Kafka Streams for real-time data processing.
- Ensure fault tolerance and scalability in Kafka applications.
Administration of Kafka Message Queue
14 HoursThis instructor-led, live training in Slovakia (online or in-person) is tailored for intermediate-level system administrators seeking to effectively leverage Kafka's message queuing features.
By the conclusion of this training, participants will be able to:
- Understand Kafka's message queuing capabilities and architecture.
- Configure Kafka topics for message queuing scenarios.
- Produce and consume messages using Kafka.
- Monitor and manage Kafka as a message queue.
Security for Apache Kafka
7 HoursThis instructor-led, live training in Slovakia (online or onsite) is aimed at software testers who wish to implement network security measures into an Apache Kafka application.
By the end of this training, participants will be able to:
- Deploy Apache Kafka onto a cloud based server.
- Implement SSL encryption to prevent attacks.
- Add ACL authentication to track and control user access.
- Ensure credible clients have access to Kafka clusters with SSL and SASL authentication.
Apache Kafka and Spring Boot
7 HoursThis instructor-led, live training in Slovakia (online or onsite) is designed for intermediate-level developers who want to learn the fundamentals of Kafka and integrate it with Spring Boot.
By the end of this training, participants will be able to:
- Understand Kafka and its architecture.
- Learn how to install, configure, and set up a basic Kafka environment.
- Integrate Kafka with Spring Boot.
Stream Processing with Kafka Streams
7 HoursKafka Streams is a client-side library designed for building applications and microservices that exchange data with a Kafka messaging system. Traditionally, processing data between message producers and consumers has relied on Apache Spark or Apache Storm. By invoking the Kafka Streams API directly within an application, data can be processed natively within Kafka, eliminating the need to route data to a separate cluster for processing.
In this instructor-led live training, participants will learn how to integrate Kafka Streams into sample Java applications that pass data to and from Apache Kafka for stream processing.
By the end of this training, participants will be able to:
- Understand the features and advantages of Kafka Streams compared to other stream processing frameworks
- Process stream data directly within a Kafka cluster
- Develop a Java or Scala application or microservice that integrates with Kafka and Kafka Streams
- Write concise code to transform input Kafka topics into output Kafka topics
- Build, package, and deploy the application
Audience
- Developers
Format of the course
- Part lecture, part discussion, exercises and heavy hands-on practice
Notes
- To request a customized training for this course, please contact us to arrange