Table of Contents

SeaweedFS Iceberg Catalog

1. Start SeaweedFS
2. Create a Table Bucket
3. Connect Your Query Engine

Client Integrations
Metadata Storage

Namespaces
Tables

Authentication and Authorization

Authentication Methods
Authorization (IAM)
Anonymous Access (Development)

Configuration Reference
See Also

SeaweedFS Iceberg Catalog

SeaweedFS provides a built-in Iceberg REST Catalog that can be used with popular analytics engines.

Architecture

The SeaweedFS S3 Tables feature implements the Iceberg REST Catalog API. This allows clients to talk directly to SeaweedFS to manage Iceberg namespaces and tables, while the underlying data files (Parquet, Avro, Metadata JSON) are stored in SeaweedFS S3 buckets.

Iceberg REST Catalog: Available on a dedicated port (default 8181)
S3 Data Access: Available on the S3 port (default 8333)
Authentication: SigV4 (Spark, Trino, RisingWave), OAuth2 (DuckDB, Doris), or unsigned REST + S3 access keys (Dremio)

Catalog and Bucket Relationship

In SeaweedFS, an Iceberg Catalog corresponds 1:1 with a Table Bucket.

When you configure a client with a URI prefix like http://localhost:8181/v1/my-catalog/, SeaweedFS maps requests to the bucket named my-catalog.
If no catalog/prefix is provided in the URL (e.g., http://localhost:8181/v1/), it defaults to using a bucket named warehouse.

This architecture allows you to manage multiple independent Iceberg catalogs on the same SeaweedFS cluster simply by creating multiple buckets.

Quick Start

1. Start SeaweedFS

weed mini

This starts:

S3 API on port 8333
Iceberg REST Catalog on port 8181

2. Create a Table Bucket

weed shell
> s3tables.bucket -create -name my-catalog

3. Connect Your Query Engine

See the integration guide for your engine below.

Client Integrations

Engine	Auth Method	Guide
Apache Spark	SigV4	Spark Iceberg Integration
Trino	SigV4	Trino Iceberg Integration
Dremio	S3 access keys (REST source)	Dremio Iceberg Integration
DuckDB	OAuth2	DuckDB Iceberg Integration
Apache Doris	OAuth2	Doris Iceberg Integration
RisingWave	SigV4	RisingWave Iceberg Integration
Lakekeeper	STS + SigV4	Lakekeeper Iceberg Integration

Metadata Storage

SeaweedFS stores Iceberg metadata using a hybrid approach to maximize performance and compatibility:

Namespaces

Namespace metadata (creation time, properties) is stored as Extended Attributes (xattrs) on the directory corresponding to the namespace in the Filer.

This ensures lightweight namespace operations.
The directory structure in the Filer mirrors the namespace hierarchy.

Tables

Table metadata follows the standard Iceberg V2 specification:

Metadata Location: Stored in the metadata/ subdirectory of the table.
Data Location: Stored in the data/ subdirectory.
Format:
- vN.metadata.json: The table metadata file.
- snap-*.avro: Snapshot manifest lists.
- *.avro: Manifest files.
- *.parquet: Data files.

Authentication and Authorization

Authentication Methods

SeaweedFS supports two authentication methods for the Iceberg REST Catalog:

SigV4 (Spark, Trino, RisingWave) — Clients sign each request using AWS Signature Version 4. This is the standard method used by most Iceberg-compatible engines.

OAuth2 (DuckDB, Doris) — Clients exchange S3 credentials for a bearer token via POST /v1/oauth/tokens using the client_credentials grant type. The S3 access key is used as client_id and the secret key as client_secret.

Authorization (IAM)

Permissions are managed via S3 Bucket Policies applied to the Table Bucket.

You can define granular permissions for CreateNamespace, CreateTable, WriteTable, etc.

Example Policy to allow read-only access:

{
  "Version": "2012-10-17",
  "Statement": [
    {
      "Effect": "Allow",
      "Action": [
        "s3tables:ListNamespaces",
        "s3tables:GetTable",
        "s3tables:ListTables"
      ],
      "Resource": "arn:aws:s3tables:region:account:bucket/my-catalog/*"
    }
  ]
}

Anonymous Access (Development)

If SeaweedFS is running without IAM configuration (e.g., weed mini with no -s3.config), the Iceberg Catalog allows anonymous access by default. This is useful for local development and testing. See each integration page for anonymous configuration details.

Configuration Reference

Parameter	CLI Flag	Default
Iceberg REST port	`-s3.port.iceberg` (mini) / `--port.iceberg` (standalone)	`8181`
S3 port	`-s3.port` (mini) / `--port` (standalone)	`8333`
Disable Iceberg	Set port to `0`	Enabled
IAM config	`-s3.config`	None (anonymous)

SeaweedFS Iceberg Catalog

Architecture

Catalog and Bucket Relationship

Quick Start

1. Start SeaweedFS

2. Create a Table Bucket

3. Connect Your Query Engine

Client Integrations

Metadata Storage

Namespaces

Tables

Authentication and Authorization

Authentication Methods

Authorization (IAM)

Anonymous Access (Development)

Configuration Reference

See Also

Introduction

API

Configuration

Filer

Filer Stores

Management

Advanced Filer Configurations

FUSE Mount

WebDAV

SFTP Server

Cloud Drive

AWS S3 API

S3 Table Bucket

Iceberg Integrations

S3 Authentication & IAM

Server-Side Encryption

S3 Client Tools

Machine Learning

HDFS

Replication and Backup

Metadata Change Events

Messaging

Use Cases

Operations

Rust Volume Server

Advanced

Security

Misc Use Case Examples