Browse our collection of articles on a range of IT topics. Dive in and explore something new!
Overview Course Duration and Structure Duration: 6 modules plus 2...
Overview Topic: Introduction to Docker and its importance for data engineers. Purpose:...
Overview Course Edition: Fourth edition of the Data Engineering Zoomcamp. Purpose of...
Introduction This study note covers the key steps to set up an ETL (Extract, Transform,...
Overview This introduces how to use dbt (data build tool) with Kestra to perform data...
1. Introduction to Batch Processing What is Batch Processing? Batch processing...
1. Overview of Kafka Producer & Consumer Objective: Learn how to produce and consume...
Study Notes 5.3.3: Preparing Yellow and Green Taxi Data 1. Overview and...
Just wrapped up the Data Engineering Zoomcamp workshop on dlt! Learned how to build data pipelines...
1. Overview of Kafka Streaming with Python Purpose & Context: This session...
Overview from lecture 1.2.3. Purpose: To connect pgAdmin, a web-based GUI tool, to a...
Introduction I'm thrilled to begin documenting my journey building "InsightFlow" - an...
Study Notes 5.3.1 - on Spark/PySpark These notes cover the basics and some intermediate...
1. Introduction Partitioning and clustering are key optimization techniques in Google...
1. Introduction This video continues the Data Engineering Zoomcamp series, focusing on...
Introduction Welcome to the second part of Peer Review 1, where we continue exploring the...
1. Introduction This session covers SQL basics and is part of a series on Docker and...
This lecture outlines the steps to export a BigQuery Machine Learning model, deploy it in a Docker...
1. Introduction to Data Testing Objective: Ensure data delivered to end-users is...
Overview & Key Benefits Target Audience: Data analysts and managers No Python/Java...
1. Introduction to Kafka in Stream Processing Context of Stream Processing: Stream...
InsightFlow GitHub Repo Before diving into building any data pipeline, a crucial first step is Data...
🚀 Just completed Module 1 of #DEZoomcamp! Built a data pipeline analyzing NYC taxi data using Docker,...
Introduction and Context The video builds upon the previous lesson where Docker was...
Study Notes 5.4.1 - Anatomy of a Spark Cluster 1. Introduction In this lesson,...
1. Introduction In many data processing frameworks (like Apache Spark or even when working...
Introduction to Athena Athena is a serverless interactive query service that allows users...
Introduction to Kestra Key Resources Getting Started with Kestra...
Cost Reduction Strategies Column Selection Avoid using SELECT * Always specify required...
Introduction to Terraform Definition: Terraform is an Infrastructure-as-Code (IaC) tool...