Articles by Tag #parquet

Browse our collection of articles on various topics related to IT technologies. Dive in and explore something new!

Compression algorithms in Parquet Java

Apache Parquet is a columnar storage format optimized for analytical workloads, though it can also be...

Learn More 3 2Jan 20

The two versions of Parquet

A few days ago, the creators of DuckDB wrote the article: Query Engines: Gatekeepers of the Parquet...

Learn More 2 0Feb 20

The Carpet feature that nobody will use

This week I released a new version of Carpet, the Java library for working with Parquet files. In...

Learn More 0 0May 15

Turning Parquet File into a Queryable RESTful with DuckDB, Quarkus & Kotlin

Parquet files are a powerhouse for storing large, columnar datasets in big data workflows....

Learn More 0 0Feb 23

Crawling web sites using “Data Prep Kit”

A hands-on exercise using “data-prep-kit” and storing the result as parquet files. ...

Learn More 0 0Apr 4

Working with Parquet files

Parquet files offer significant advantages over traditional formats like CSV or JSON. This is more...

Learn More 0 0Apr 5