Skip to content
Iceberg Specification, Schema & Internals Last updated: May 29, 2026

Iceberg Snapshot Summary

A map of key-value metadata properties embedded inside a snapshot that describes the write operation and counts of affected files and records.

snapshot summaryiceberg summary metadatasnapshot operations

Iceberg Snapshot Summary

An Iceberg Snapshot Summary is a key-value map metadata block stored within each snapshot definition inside the table’s metadata.json file. It records the characteristics of the commit that created the snapshot, including the type of write operation, the number of files added or deleted, and counts of modified records. This block provides audit tracking and performance metrics for table historical analyses.

Common Summary Properties

The snapshot summary contains a mix of standard and operation-specific metrics:

Example Metadata Representation

This excerpt from an Iceberg metadata.json file shows how the summary map is represented:

{
  "snapshot-id": 123456789012345,
  "timestamp-ms": 1716982400000,
  "summary": {
    "operation": "append",
    "added-data-files": "4",
    "added-records": "45000",
    "total-data-files": "28",
    "total-records": "315000",
    "spark.app.id": "app-20260529140000-0001"
  },
  "manifest-list": "s3://my-bucket/db/table/metadata/snap-123456789012345-list.avro"
}

Engines and diagnostic scripts parse these properties to track dataset growth, monitor data ingestion pipelines, and audit user modification activities.

πŸ“š Go Deeper on Apache Iceberg

Alex Merced has authored three hands-on books covering Apache Iceberg, the Agentic Lakehouse, and modern data architecture. Pick up a copy to master the full ecosystem.

← Back to Iceberg Knowledge Base