Active Monitoring: How Agentic AI Auto-Heals and Protects Enterprise Data Pipelines
Static alerts miss cascading pipeline failures. Learn how agentic AI monitors, traces root causes, and automatically rol...
Home for data professionals. Articles, tutorials, and resources for Data Engineers, Scientists, Analysts, and Architects.
Guest submissions are welcome. Pitch your idea by emailing alex@ingestthis.com or join The Data Lakehouse Hub Slack community.
Authoritative guides to the modern data ecosystem, curated from Dremio's engineering blog.
A comprehensive guide to the Semantic Layer: how it creates a single source of truth for metrics, powers headless BI, and makes AI agents answer business questions accurately.
Apache Polaris: How Apache Polaris is emerging as the universal Iceberg catalog standard, enabling multi-engine interoperability and governed AI access across the lakehouse ecosystem.
Table Formats: The origin story of open table formats, covering the problems with Hive, why Apache Iceberg, Delta Lake, and Hudi were created, and what they unlock for modern data platforms.
Dremio: A clear-eyed breakdown of what Dremio is, how its semantic layer, query federation, Reflections, and Apache Arrow Flight power the Intelligent Lakehouse Platform.
Apache Iceberg: Not all 'Iceberg support' is equal. This piece breaks down what it means to be genuinely Apache Iceberg native versus bolt-on, and why it matters for your lakehouse.
Open Source: How the Apache Software Foundation's open-source projects (Iceberg, Arrow, Parquet, Polaris) form the modular foundation of the modern open data lakehouse.
Agentic AI: Agentic AI is reshaping how organizations interact with data. This guide explains agentic analytics, the role of the semantic layer, and why query performance matters for AI agents.
Data Lakehouse: The complete, authoritative guide to the Data Lakehouse architecture: what it is, why it supersedes the data warehouse + data lake combination, and how to build one.
AI & Performance: How Dremio's layered autonomous performance architecture (Reflections, caching, vectorized execution) handles unpredictable AI agent query patterns at interactive speed.
Data Engineering, Data Architecture, and AI insights fresh from our writers.
How does an agentic analytics system actually work? Inside the ReAct loop, tool calling, schema exploration, and self-co...
Apache Iceberg v3 adds deletion vectors, VARIANT type, row lineage, and table encryption. Here's what changed and how to...
Build a custom agentic analytics system using Python, LangChain, and Dremio. A developer tutorial covering SQL tool bind...
Data lakehouses become data swamps without active governance. Learn how schema enforcement, catalog stewardship, and dri...
Apache Iceberg gives regulated enterprises data sovereignty with hybrid-cloud deployments. Learn how open catalogs and I...