Why use data vis When you need to work with a new data source, with a huge amount of data,...
In this cheatsheet I summed up the most used commands. When in doubt, you can always consult kubectl...
With Kubernetes, large and diverse workloads can be handled. To keep track of all these processes,...
To expose applications from our Kubernetes cluster we need different service types. Service...
In this post you are going to learn about the basics of Kubernetes security. You will see how the...
Volumes Volumes are needed to store data within a container or share data among other...
Deployment A K8s Deployment is a declarative configuration in a .yaml or .json file to...
In this blogpost, you will get a crisp guide through the design concepts of Kubernetes. Let's go: ...
This post sums up the steps to build a Kubernetes application. CRI - Container Runtime...
TL;DR - Delta Live Tables aka DLT DLT is a framework on top of a Delta Lake, and does...
Kubernetes is an open-source system for automating deployment, scaling and management of...
In the following lines I write up everything I learned about data pipelines at the...
WHAT - the famous analogy A data warehouse is like a producer of water where you are...
When we work with Spark we usually want to first prototype to see if everything works as expected,...
Why do we need distributed data technologies If the dataset is larger than our memory, the...
Technology The Amazon Redshift technology is a column-oriented storage in the cloud. It is...
What Cloud computing is the use of a network of remote servers hosted on the internet to...
WHAT An OLAP cube is an aggregation of a fact metric on a number of dimensions. CUBE...
Kimball's Bus Architecture Results in a common dimension data model, that is shared by...
WHY Different business processes need different data structures due to different needs....
What NoSQL databases are distributed databases. They are needed to have high availability,...
What The relational data model was invented in 1969 at the IBM research department, with...
Recently I started the online nanodegree program Data Engineering. In order to learn the new...
SQL is an abbreviation for Structured Query Language. It was born in 1974 to handle data that is...
When you are working as a backend developer, data engineer, scientist or analyst you normally don't...
When we want to make simple changes to a file, we don't want to open an integrated development...
Unix was published in 1973. Since then, there have been many different operating systems built using the...
When creating a web application in western countries, we want to read it from left to right and...
As you don't switch computers every day, this blogpost will provide a guide on how to set up your new M1...