Modern Lakehouse Concepts & Interoperability Last updated: May 29, 2026

Apache DataFusion

A highly extensible Rust-native SQL query engine that supports native querying and management of Apache Iceberg tables.

Tags: datafusion, apache datafusion, rust query engine, iceberg rust datafusion

Apache DataFusion is a fast, extensible SQL query engine written in Rust. It is designed to be embedded as the SQL planning and execution engine for custom databases, analytics platforms, and data pipelines. DataFusion uses Apache Arrow as its in-memory columnar format, which enables vectorized execution and zero-copy data sharing between components.
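To make "vectorized execution" concrete, here is a minimal, standard-library-only sketch. It does not use the DataFusion or Arrow APIs; the `Batch` struct, the helper functions, and the sample data are hypothetical names chosen for illustration. It shows the general idea of evaluating a predicate over a whole column at once and then applying the resulting selection mask to each column, which is roughly the shape of work a columnar engine performs for a simple filter query.

```rust
// Toy columnar batch: each field is stored as one contiguous column,
// loosely mirroring a columnar layout. All names here are hypothetical.
struct Batch {
    city: Vec<&'static str>,
    temp_c: Vec<f64>,
}

// Evaluate `value > threshold` over the whole column in one pass,
// producing a boolean selection mask.
fn gt_mask(col: &[f64], threshold: f64) -> Vec<bool> {
    col.iter().map(|v| *v > threshold).collect()
}

// Apply a selection mask to a column, keeping only the selected values.
fn filter<T: Copy>(col: &[T], mask: &[bool]) -> Vec<T> {
    col.iter()
        .zip(mask)
        .filter(|(_, keep)| **keep)
        .map(|(v, _)| *v)
        .collect()
}

// Roughly: SELECT city, temp_c FROM batch WHERE temp_c > threshold
fn select_warm(batch: &Batch, threshold: f64) -> (Vec<&'static str>, Vec<f64>) {
    let mask = gt_mask(&batch.temp_c, threshold);
    (filter(&batch.city, &mask), filter(&batch.temp_c, &mask))
}

// Small made-up data set used by the example.
fn sample_batch() -> Batch {
    Batch {
        city: vec!["Oslo", "Lisbon", "Cairo"],
        temp_c: vec![12.0, 24.5, 31.0],
    }
}

fn main() {
    let batch = sample_batch();
    let (cities, temps) = select_warm(&batch, 20.0);
    println!("{:?} {:?}", cities, temps);
}
```

A real engine would operate on Arrow arrays and use optimized compute kernels rather than `Vec` and iterator chains, but the column-at-a-time structure is the same.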

Iceberg Integration Scope

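DataFusion integrates with Apache Iceberg natively, primarily through the official iceberg-datafusion crate (part of the iceberg-rust project). This integration implements DataFusion’s core trait interfaces, such as CatalogProvider and TableProvider, so that Iceberg catalogs and tables can be queried like any other DataFusion data source.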
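As a rough illustration of this trait-based extension pattern, the sketch below defines a simplified stand-in trait and an in-memory implementation. It is not the DataFusion or iceberg-datafusion API: `ToyTableProvider`, `InMemoryTable`, and `sum_column` are hypothetical names, and the real traits are asynchronous and exchange Arrow data. The point is only that the engine code depends on the trait, so any table format that implements it can be plugged in.

```rust
// Hypothetical, simplified stand-in for a table-provider style trait.
// The real DataFusion traits are async and work with Arrow schemas and batches.
trait ToyTableProvider {
    // Names of the columns this table exposes.
    fn column_names(&self) -> Vec<String>;
    // Return only the requested columns, identified by index (a projection).
    fn scan(&self, projection: &[usize]) -> Vec<Vec<i64>>;
}

// One possible source: a table held entirely in memory.
struct InMemoryTable {
    names: Vec<String>,
    columns: Vec<Vec<i64>>,
}

impl ToyTableProvider for InMemoryTable {
    fn column_names(&self) -> Vec<String> {
        self.names.clone()
    }

    fn scan(&self, projection: &[usize]) -> Vec<Vec<i64>> {
        projection.iter().map(|&i| self.columns[i].clone()).collect()
    }
}

// "Engine" code: it only knows about the trait, not the concrete source.
// Returns None if the column does not exist.
fn sum_column(table: &dyn ToyTableProvider, name: &str) -> Option<i64> {
    let idx = table.column_names().iter().position(|n| n == name)?;
    let cols = table.scan(&[idx]);
    Some(cols[0].iter().sum())
}

// Small made-up table used by the example.
fn sample_table() -> InMemoryTable {
    InMemoryTable {
        names: vec!["id".to_string(), "amount".to_string()],
        columns: vec![vec![1, 2, 3], vec![10, 20, 30]],
    }
}

fn main() {
    let table = sample_table();
    println!("{:?}", sum_column(&table, "amount"));
}
```

In the same spirit, an Iceberg-backed implementation would resolve table metadata and data files behind the trait while the query engine stays unchanged.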
