Data Engineering Zoomcamp
   
Master the fundamentals of data engineering by building an end-to-end data pipeline from scratch.
A free 9-week course on building production-ready data pipelines.
**Course Overview**
Data Engineering Zoomcamp is a comprehensive, hands-on course designed to help you master modern data engineering tools and practices. You'll learn by doing: you build a complete data pipeline, from ingestion to visualization, using industry-standard technologies.
Key Features:
**2026 Cohort**

| Item | Details |
|------|---------|
| Start Date | January 12, 2026 |
| Duration | 9 weeks |
| Format | Cohort-based with self-paced option |
| Registration | Sign up here |

Self-Paced Option: All materials are available year-round for independent learners!
**Syllabus**
**Module 1: Containerization & Infrastructure as Code**
**Module 2: Workflow Orchestration**
**Workshop 1: Data Ingestion**
**Module 3: Data Warehousing**
**Module 4: Analytics Engineering**
**Module 5: Batch Processing**
**Module 6: Streaming**
**Final Project**
**Prerequisites**
To get the most out of this course, you should have:
**Getting Started**
**For Cohort Participants:**
1. Register for the 2026 cohort
2. Join the Slack community
3. Check the #course-data-engineering channel
4. Set up your development environment (instructions in Week 1)
**For Self-Paced Learners:**
1. Clone this repository:
   `git clone https://github.com/DataTalksClub/data-engineering-zoomcamp.git`
Data Engineering Zoomcamp is a free, community-driven, 9-week course designed to teach the fundamentals of building production-ready data pipelines. It's structured around hands-on projects, allowing participants to master key technologies like Docker, Terraform, BigQuery, Apache Spark, and Kafka. The course is led by experienced instructors and supported by an active global community on Slack for collaboration and troubleshooting.