Articles by Tag #dezoomcamp

Data Engineering Zoomcamp 2025 Cohort: Introduction - Self-Study Notes

Overview Course Duration and Structure Duration: 6 modules plus 2...

Jan 25

Study Notes: DE Zoomcamp 1.2.1 - Introduction to Docker

Overview Topic: Introduction to Docker and its importance for data engineers. Purpose:...

Jan 26

Notes on Data Engineering Zoomcamp 2025 - Launch Stream

Overview: Course Edition: Fourth edition of the Data Engineering Zoomcamp. Purpose of...

Jan 24

Study Notes 2.2.3: Setting Up an ETL Pipeline with Kestra and Postgres

Introduction This study note covers the key steps to set up an ETL (Extract, Transform,...

Feb 4

Study Note 2.2.5: Orchestrate dbt Models with Postgres in Kestra

Overview This introduces how to use dbt (data build tool) with Kestra to perform data...

Feb 4

Study Notes 5.1.1-2 Introduction to Batch Processing & Spark

1. Introduction to Batch Processing What is Batch Processing? Batch processing...

Mar 4

Study Notes 6.5-6: Kafka Producer, Consumer & Configuration

1. Overview of Kafka Producer & Consumer Objective: Learn how to produce and consume...

Mar 18

Study Notes 5.3.3-4 Data Processing & SQL with Spark

Study Notes 5.3.3: Preparing Yellow and Green Taxi Data 1. Overview and...

Mar 4

ETL with DLTHUB

Just wrapped up the Data Engineering Zoomcamp workshop on dlt! Learned how to build data pipelines...

Feb 16

Study Notes 6.13-14: Kafka Streaming with Python & PySpark Structured Streaming with Kafka

1. Overview of Kafka Streaming with Python Purpose & Context: This session...

Mar 18

Study Notes 1.2.3: Connecting pgAdmin and PostgreSQL

Overview from lecture 1.2.3. Purpose: To connect pgAdmin, a web-based GUI tool, to a...

Jan 28

InsightFlow Part 1: Building an Integrated Retail & Economic Data Pipeline - Project Introduction

Introduction I'm thrilled to begin documenting my journey building "InsightFlow" - an...

Apr 9

Study Notes 5.3.1-2 First Look at Spark/PySpark & Spark Dataframes

Study Notes 5.3.1 - on Spark/PySpark These notes cover the basics and some intermediate...

Mar 4

Study Notes 3.1.2: Partitioning and Clustering in BigQuery

1. Introduction Partitioning and clustering are key optimization techniques in Google...

Feb 11

Study Notes 1.2.5: Running PostgreSQL and pgAdmin with Docker-Compose

1. Introduction This video continues the Data Engineering Zoomcamp series, focusing on...

Feb 4

Peer Review 1: Poland's Real Estate Market Dashboards and Insights with Streamlit (Part 2)

Introduction Welcome to the second part of Peer Review 1, where we continue exploring the...

Apr 30

Study Notes 1.2.6: SQL Refresher

1. Introduction This session covers SQL basics and is part of a series on Docker and...

Feb 4

Study Note 3.3.2: BigQuery Machine Learning Model Deployment using Docker

This lecture outlines the steps to export a BigQuery Machine Learning model, deploy it in a Docker...

Feb 11

Study Notes 4.3.2 - Testing and Documenting the Project

1. Introduction to Data Testing Objective: Ensure data delivered to end-users is...

Feb 25

Study Notes 3.3.1: BigQuery Machine Learning (BQML)

Overview & Key Benefits Target Audience: Data analysts and managers No Python/Java...

Feb 11

Study Notes 6.3-4: What is Kafka & Confluent Cloud

1. Introduction to Kafka in Stream Processing Context of Stream Processing: Stream...

Mar 18

InsightFlow Part 4: Data Exploration & Understanding the Datasets

InsightFlow GitHub Repo Before diving into building any data pipeline, a crucial first step is Data...

Apr 29

Data Engg Bootcamp module 1

🚀 Just completed Module 1 of #DEZoomcamp! Built a data pipeline analyzing NYC taxi data using Docker,...

Jan 27

Study Notes: DE Zoomcamp 1.2.2 - Ingesting NY Taxi Data to Postgres

Introduction and Context The video builds upon the previous lesson where Docker was...

Jan 28

Study Notes 5.4.1-3 Anatomy of a Spark Cluster, GroupBy & Joins in Spark

Study Notes 5.4.1 - Anatomy of a Spark Cluster 1. Introduction In this lesson,...

Mar 4

Study Notes dlt Fundamentals Course: Lesson 7 Inspecting & Adjusting Schema

1. Introduction In many data processing frameworks (like Apache Spark or even when working...

Feb 17

Study Notes: Data Query on S3 Bucket Using Athena

Introduction to Athena Athena is a serverless interactive query service that allows users...

Feb 11

Study Notes 2.2.2: Learning Kestra

Introduction to Kestra Key Resources Getting Started with Kestra...

Feb 4

Study Note 3.2.1: BigQuery Best Practices

Cost Reduction Strategies Column Selection Avoid using SELECT * Always specify required...

Feb 11

Study Notes 1.3.1: Terraform Primer

Introduction to Terraform Definition: Terraform is an Infrastructure-as-Code (IaC) tool...

Feb 4