Building My First End-to-End ETL Pipeline with Airflow, BigQuery, and Docker
Recently, I completed my first full Data Engineering project: building an end-to-end ETL pipeline using real-world Australian weather data spanning 10…
Latest DevOps news from Tech News
Recently, I completed my first full Data Engineering project: building an end-to-end ETL pipeline using real-world Australian weather data spanning 10…
Over the years, I've seen many data platforms start with good intentions. A few scripts are created to move data from one system to another, and every…
The core paradox of modern BitTorrent: how do you search without a central index? Searching needs metadata, an address that you can query. BitTorrent …
Introduction In the evolving landscape of data engineering, DuckLake is emerging as a powerful solution for building data lakes with ACID transactions…
The best way to actually understand data engineering is to build something that breaks, fix it, and watch it successfully run. In this article, we bui…
Introduction ClickHouse is a columnar OLAP database. It runs aggregate queries across billions of rows in seconds. MySQL is what most apps run on for …
Introduction Apache Iceberg is the table format that turns a pile of Parquet files in object storage into something that behaves like a warehouse tabl…
В мае 2024 года Broadcom заархивировал публичный репозиторий Greenplum: последний коммит остался на месте, дальнейшая разработка ушла в закрытый репоз…