Denzel Kanyeki

Denzel Kanyeki @dkkinyua

About: I'm a young budding data engineer, backend developer, and civil engineering student. I'm up for any Python, Data Engineering discussions, programmer memes and how did I miss Manchester United?

Location:
Nairobi, Kenya
Joined:
Sep 27, 2023

Denzel Kanyeki
articles - 18 total

Building a Fraud Detection Pipeline using Python, PostgreSQL, Apache Kafka, PySpark, Grafana and Scikit-learn

Introduction Fraud doesn’t happen once in a while, it happens every second. Every card...

Learn More 11 0Aug 18

Building a Real-Time Data Pipeline using Binance Websocket API, PySpark, Kafka and Grafana

Introduction Ever wondered how data is consumed in real-time via websockets? In this...

Learn More 3 1Aug 4

Using Data Engineering to Track Food Prices and Inflation in Kenya from 2006 to 2025

Rising food prices continue to be a critical issue affecting many households in Kenya. As a data...

Learn More 8 0Jul 23

A Real-Time Earthquake Monitoring Pipeline with Kafka, MySQL, PostgreSQL, and Grafana

In this project, I designed and built an end-to-end real-time data pipeline that monitors earthquakes...

Learn More 3 0Jul 14

Building a Real-Time Crypto Pipeline with Binance APIs, PostgreSQL, Debezium, Kafka, Spark & Cassandra

Introduction In this blog post, I’ll walk you through a real-time crypto data pipeline I...

Learn More 2 0Jul 4

Tracking Kenya’s External Debt Using Python, PostgreSQL, and Grafana

We always hear about Kenya’s external rising debt in headlines, but how fast is it growing? And what...

Learn More 3 0Jun 20

Extracting Data from the Premier League YouTube Channel Using `googleapiclient`, PySpark, Airflow, PostgreSQL and Grafana

Introduction In this post, I’ll walk you through how I built a scalable ETL data pipeline...

Learn More 1 1Jun 12

Testing Airflow DAGs 101: A Practical Guide for Modern Data Teams.

Apache Airflow is an essential tool for data engineers. Data engineers use Airflow to automate,...

Learn More 0 0Jun 6

Building a Stock Data Pipeline with requests, Apache Airflow and PostgreSQL

In this tutorial, I will walk you through how I built a fully functional ETL pipeline using Apache...

Learn More 1 0May 26

dbt for Normies - A Complete Guide.

Whether you are new to data engineering or you are quite conversant with the dataverse, you should...

Learn More 1 1May 11

Building and Deploying My First Python ETL Package to PyPI

Introduction Creating your own Python package is one of the most satisfying steps as a...

Learn More 1 1Apr 30

Learn Python Basics for Data Engineering (with Mini Project)

In today’s data-driven world, Data Engineers are the architects of pipelines that move and transform...

Learn More 0 2Apr 14

SQL for Beginners - A Complete Guide.

Introduction Structured Query Language (SQL) is a language that is used to interact with...

Learn More 1 0Apr 14

🚀 Building an ETL Pipeline with Python to Scrape Internship Jobs and Load into Excel

Have you ever needed up-to-date job listings but struggled to find one clean source? In this project,...

Learn More 1 2Apr 6

The Complete Guide to Time Series Models.

In this post, I would like to cover in detail what is time series analysis and time series models....

Learn More 2 0Nov 7 '23

Data Modeling and its Importance in Data Science.

What is data modeling? Data modeling is the process used in database management to create a...

Learn More 1 0Oct 22 '23

Exploratory Data Analysis(EDA) using Data Visualization Techniques.

Introduction to Exploratory Data Analysis: Exploratory Data Analysis is essential in the Data Science...

Learn More 0 0Oct 11 '23

Data Science Roadmap 2023-2024 for beginners.

Before we get to the roadmap, let's answer the question, What's data science? Data science is the...

Learn More 14 1Sep 30 '23