IngestThis!
Data Engineering, Science, and Architecture Content
Home for data professionals. Articles, tutorials, and resources for Data Engineers, Scientists, Analysts, and Architects.
Guest submissions are welcome. Pitch your idea by emailing alex@ingestthis.com or by joining the devNursery Slack community.
Read the Blog →
Data Lakehouses & Agentic Analytics
Authoritative guides to the modern data ecosystem — curated from Dremio's engineering blog.
The Semantic Layer: Definitive Guide
A comprehensive guide to the Semantic Layer — how it creates a single source of truth for metrics, powers headless BI, and makes AI agents answer business questions accurately.
Read on Dremio.com →
Apache Polaris
Apache Polaris: The Catalog Standard for Lakehouses and AI
How Apache Polaris is emerging as the universal Iceberg catalog standard, enabling multi-engine interoperability and governed AI access across the lakehouse ecosystem.
Read on Dremio.com →
Table Formats
What Are Table Formats and Why Were They Needed?
The origin story of open table formats — the problems with Hive, why Apache Iceberg, Delta Lake, and Hudi were created, and what they unlock for modern data platforms.
Read on Dremio.com →
Dremio
What Is Dremio?
A clear-eyed breakdown of what Dremio is, how its semantic layer, query federation, Reflections, and Apache Arrow Flight power the Intelligent Lakehouse Platform.
Read on Dremio.com →
Apache Iceberg
What Apache Iceberg Native Actually Means
Not all 'Iceberg support' is equal. This piece breaks down what it means to be genuinely Apache Iceberg native versus bolt-on, and why it matters for your lakehouse.
Read on Dremio.com →
Open Source
Open Source and the Data Lakehouse
How the Apache Software Foundation's open-source projects — Iceberg, Arrow, Parquet, Polaris — form the modular foundation of the modern open data lakehouse.
Read on Dremio.com →
Agentic AI
What Is Agentic Analytics?
Agentic AI is reshaping how organizations interact with data. This guide explains agentic analytics, the role of the semantic layer, and why query performance matters for AI agents.
Read on Dremio.com →
Data Lakehouse
Definitive Guide to the Data Lakehouse
The complete, authoritative guide to the Data Lakehouse architecture — what it is, why it supersedes the data warehouse + data lake combination, and how to build one.
Read on Dremio.com →
AI & Performance
How Dremio Keeps Agentic Analytics Fast Without Manual Tuning
How Dremio's layered autonomous performance architecture — Reflections, caching, vectorized execution — handles unpredictable AI agent query patterns at interactive speed.
Read on Dremio.com →