IngestThis Logo
BLOG
COMMUNITY
PODCAST

Featured Post

2025 Year in Review Apache Iceberg, Polaris, Parquet, and Arrow
2025-12-29 β€’ Alex Merced

2025 Year in Review Apache Iceberg, Polaris, Parquet, and Arrow

A look back at key developments in Apache Iceberg, Polaris, Parquet, and Arrow in 2025....

2025-12-05 β€’ Alex Merced

dremioframe & iceberg - Pythonic interfaces for Dremio and Apache Iceberg

Discover DremioFrame and IceFrame, two new Python libraries that simplify working with Dremio and Apache Iceberg. Learn ...

2025-11-29 β€’ Alex Merced

Introducing dremioframe - A Pythonic DataFrame Interface for Dremio

Discover dremioframe, a new Python library that offers a DataFrame-like experience for interacting with Dremio's data la...

2025-11-12 β€’ Alex Merced

Comprehensive Hands-on Walk Through of Dremio Cloud Next Gen (Hands-on with Free Trial)

Walkthrough with the new trial of the Dremio Cloud Platform...

2025-10-23 β€’ Alex Merced

2025-2026 Guide to Learning about Apache Iceberg, Data Lakehouse & Agentic AI

A curated guide to mastering Apache Iceberg, data lakehouse architectures, and the emerging field of Agentic AI for data...

2025-10-21 β€’ Alex Merced

An Exploration of the Commercial Iceberg Catalog Ecosystem

Dive into the world of commercial Iceberg catalogs and discover how they enhance data lakehouse architectures for modern...

2025-10-17 β€’ Alex Merced

Building a Universal Lakehouse Catalog - Beyond Iceberg Tables

Exploring paths to a universal lakehouse catalog that supports multiple data formats and engines, building on Apache Ice...

2025-10-16 β€’ Alex Merced

Intro to Apache Iceberg with Apache Polaris and Apache Spark

Learn how to leverage Apache Iceberg with Apache Polaris and Apache Spark to build scalable and efficient data lakehouse...

2025-10-14 β€’ Alex Merced

The State of Apache Iceberg v4 - October 2025 Edition

What's Coming in Apache Iceberg v4: A Deep Dive into the Future of Open Table Formats...

2025-09-24 β€’ Alex Merced

The Ultimate Guide to Open Table Formats - Iceberg, Delta Lake, Hudi, Paimon, and DuckLake

Understanding Iceberg, Delta Lake, Hudi, Paimon, and DuckLake...

2025-09-23 β€’ Alex Merced

The 2025 & 2026 Ultimate Guide to the Data Lakehouse and the Data Lakehouse Ecosystem

What is the Data Lakehouse and the Data Lakehouse Ecosystem? This comprehensive guide covers everything you need to know...

2025-09-16 β€’ Alex Merced

The Endgame β€” Building an Autonomous Optimization Pipeline for Apache Iceberg

Learn how to automate compaction, snapshot expiration, and layout optimization in Apache Iceberg using metadata-driven t...

2025-09-09 β€’ Alex Merced

Managing Large-Scale Optimizations β€” Parallelism, Checkpointing, and Fail Recovery

Learn how to scale Apache Iceberg table optimizations across large datasets using parallelism, checkpointing, and fail r...

2025-09-05 β€’ Alex Merced

Unlocking the Power of Agentic AI with Apache Iceberg and Dremio

Unlocking the Power of Agentic AI with Apache Iceberg and Dremio...

2025-09-02 β€’ Alex Merced

Hidden Pitfalls β€” Compaction and Partition Evolution in Apache Iceberg

Partition evolution in Apache Iceberg is a powerful feature, but if not managed carefully, it can introduce fragmentatio...

2025-08-26 β€’ Alex Merced

Using Iceberg Metadata Tables to Determine When Compaction Is Needed

Discover how to use Apache Iceberg's metadata tables to proactively detect small files, bloated manifests, and table fra...

2025-08-19 β€’ Alex Merced

Designing the Ideal Cadence for Compaction and Snapshot Expiration

Learn how to design an effective schedule for compaction and snapshot expiration in Apache Iceberg to balance cost, perf...

2025-08-12 β€’ Alex Merced

Avoiding Metadata Bloat with Snapshot Expiration and Rewriting Manifests

Learn how to prevent and clean up metadata bloat in Apache Iceberg by expiring snapshots and rewriting manifests for bet...

2025-08-05 β€’ Alex Merced

Smarter Data Layout β€” Sorting and Clustering Iceberg Tables

Improve query performance in Apache Iceberg by organizing your data layout with sorting and Z-order clustering. Learn ho...

2025-07-29 β€’ Alex Merced

Optimizing Compaction for Streaming Workloads in Apache Iceberg

Learn how to design fast, incremental compaction strategies in Apache Iceberg to support high-throughput streaming pipel...

Categories

data engineering
oltp
database
data
frontend
data lakehouse
Data Engineering
Data Lakehouse
Javascript
Data Architecture
Data Analytics
Devops
Data Modeling
DevOps
python
sql
rust
AI
Apache Iceberg
copyright 2022 by Alex Merced of alexmercedcoder.dev