IngestThis
BLOG
COMMUNITY
PODCAST

Featured Post

Use Hermes Agent for Free With DeepSeek V4 and Slack
2026-05-25 • Alex Merced

Use Hermes Agent for Free With DeepSeek V4 and Slack

Hermes Agent is a free, open-source AI agent from Nous Research. Connect it to DeepSeek V4 for zero-cost inference and S...

2026-05-24 • Alex Merced

Automating Table Maintenance Before Small Files Accumulate

Learn how Databricks Predictive Optimization, AWS S3 Tables, and Iceberg native actions automate compaction and snapshot...

2026-05-24 • Alex Merced

Choosing the Right Iceberg Control Plane: Polaris vs. Unity Catalog vs. Cloud REST

Choosing an Apache Iceberg catalog? Compare open-source Apache Polaris, open Unity Catalog, and managed cloud REST contr...

2026-05-24 • Alex Merced

Clean Rooms for Privacy-Preserving Analytics

Data clean rooms enable secure multi-party analytics without sharing raw data. Learn how Databricks Clean Rooms, AWS Cle...

2026-05-24 • Alex Merced

Building Composable Query Engines with Rust Runtimes

Apache DataFusion, Velox, and Substrait form the foundation of modern composable query engine stacks. Learn how these co...

2026-05-24 • Alex Merced

Data Mesh After the Hype: What Actually Works

Three years after Zhamak Dehghani's original papers, data mesh has proven valuable in specific organizational contexts a...

2026-05-24 • Alex Merced

How dbt Fusion Reshapes Analytics Engineering

dbt Fusion entered public beta in May 2025 with a Rust-powered runtime that changes how analytics engineers develop, val...

2026-05-24 • Alex Merced

Using DuckDB and Polars to Query Iceberg Tables

DuckDB 1.4 LTS and Polars streaming engine now both support reading and writing Apache Iceberg tables. Learn how to use ...

2026-05-24 • Alex Merced

FinOps for Data Warehouses with Open Billing Data

The FOCUS 1.3 specification and native warehouse cost views make real-time cost attribution practical. Learn how to buil...

2026-05-24 • Alex Merced

Designing Governed RAG on Data Products

Enterprise RAG architecture that trusts its own data requires governance at the retrieval layer. Learn how to build gove...

2026-05-24 • Alex Merced

What Iceberg V3 Advances Mean for CDC Pipelines

Apache Iceberg V3 brings deletion vectors and row lineage that reshape CDC pipeline design. Learn what these features me...

2026-05-24 • Alex Merced

Kafka 4.0 Changes Streaming Platform Operations

Kafka 4.0 removes ZooKeeper and ships KRaft and KIP-848 by default. Learn what those changes mean for platform operation...

2026-05-24 • Alex Merced

Lance and Iceberg for Multimodal AI Data

LanceDB and Apache Iceberg serve complementary roles in a multimodal AI lakehouse. Learn when to use Lance for embedding...

2026-05-24 • Alex Merced

Bringing MLflow and Data Pipelines Closer Together

MLflow 3 extends observability from classic ML experiments to GenAI tracing and data pipeline lineage. Learn how to conn...

2026-05-24 • Alex Merced

Modern Feature Stores Beyond Batch Pipelines

Feature stores like Feast now support streaming feature views from Kafka and Kinesis alongside batch pipelines. Learn ho...

2026-05-24 • Alex Merced

OpenLineage as the Spine of Data Observability

OpenLineage provides a standard API for collecting pipeline lineage across Airflow, Spark, Flink, and dbt. Learn how it ...

2026-05-24 • Alex Merced

When Paimon Beats Iceberg for Mutable Streams

Apache Paimon uses LSM-Tree storage for native CDC upserts without restart. Learn when Paimon outperforms Iceberg for hi...

2026-05-24 • Alex Merced

Policy as Code for Lakehouse Governance

OPA, ABAC, row filters, and column masks make lakehouse governance programmable and scalable. Learn how Databricks, Snow...

2026-05-24 • Alex Merced

Real-Time Lakehouse Patterns with Apache Flink and Iceberg

Learn how to build a real-time lakehouse with Apache Flink 2.1 and the Dynamic Iceberg Sink, covering schema evolution, ...

2026-05-24 • Alex Merced

Why Semantic Layers Make Enterprise Text-to-SQL Safer

Text-to-SQL accuracy jumps from 40% to 85-95% when grounded in a semantic layer. Learn how Dremio, Snowflake Cortex Anal...

Categories

data engineering
oltp
database
data
frontend
data lakehouse
Data Engineering
Data Lakehouse
Javascript
Data Architecture
Data Analytics
Devops
Data Modeling
DevOps
python
sql
rust
AI
Apache Iceberg
Software Development
Semantic Layer
AI Tools & Software Development
TopicsData EngineeringApache IcebergData LakehouseAI & Machine Learning
SiteAll ArticlesRSS FeedSitemap
AuthorAlex MercedLinkedInTwitter / X

© 2026 Alex Merced — alexmercedcoder.dev