layout	default
title	Frequently Asked Questions
body_class	faq
toc	false

Frequently Asked Questions

Overview

Why should I use DuckLake?

DuckLake provides a lightweight one-stop solution if you need a lakehouse, i.e., a data lake with a catalog.

You can use DuckLake for a “multiplayer DuckDB” setup with multiple DuckDB instances reading and writing the same dataset – a concurrency model not supported by vanilla DuckDB.

If you only use DuckDB for both your DuckLake entry point and your catalog database, you can still benefit from DuckLake: you can run [time travel queries]({% link docs/stable/duckdb/usage/time_travel.md %}), exploit [data partitioning]({% link docs/stable/duckdb/advanced_features/partitioning.md %}), and can store your data in multiple files instead of using a single (potentially very large) database file.

Is DuckLake an open table format?

DuckLake is both a lakehouse format and an open table format. When comparing to other technologies, DuckLake is similar to Delta Lake with Unity Catalog and Iceberg with Lakekeeper or Polaris.

What is “DuckLake”?

“DuckLake” can refer to a number of things:

The DuckLake lakehouse format that uses a catalog database and a Parquet storage to store data.
A DuckLake instance storing a dataset with the DuckLake lakehouse format.
The [ducklake DuckDB extension]({% link docs/stable/duckdb/introduction.md %}), which supports reading/writing datasets using the DuckLake format.

Where can I download the DuckLake logo?

You can download the [logo package]({% link images/logo/DuckLake_Logo-package.zip %}). You can also download individual logos:

Dark mode, inline layout: [png]({% link images/logo/DuckLake-dark-inline.png %}), [svg]({% link images/logo/DuckLake-dark-inline.svg %})
Dark mode, stacked layout: [png]({% link images/logo/DuckLake-dark-stacked.png %}), [svg]({% link images/logo/DuckLake-dark-stacked.svg %})
Dark mode, logo only: [png]({% link images/logo/DuckLake-dark-icon.png %}), [svg]({% link images/logo/DuckLake-dark-icon.svg %})
Light mode, inline layout: [png]({% link images/logo/DuckLake-light-inline.png %}), [svg]({% link images/logo/DuckLake-light-inline.svg %})
Light mode, stacked layout: [png]({% link images/logo/DuckLake-light-stacked.png %}), [svg]({% link images/logo/DuckLake-light-stacked.svg %})
Light mode, logo only: [png]({% link images/logo/DuckLake-light-icon.png %}), [svg]({% link images/logo/DuckLake-light-icon.svg %})

Where can I find more resources on DuckLake?

We have several [talks and podcast episodes on DuckLake]({% link media/index.html %}). Additionally, consider visiting the awesome-ducklake repository maintained by community member Emil Sadek.

Architecture

What are the main components of DuckLake?

DuckLake needs a storage layer and a catalog database. Both components can be picked from a wide range of options. The storage system can be a blob storage (object storage), a block storage or a file storage. For the catalog database, any SQL-compatible database works that supports ACID operations and primary keys.

Does DuckLake work on AWS S3 (or a compatible storage)?

DuckLake can store the data files (Parquet files) on the AWS S3 blob storage or compatible solutions such as Azure Blob Storage, Google Cloud Storage or Cloudflare R2. You can run the catalog database anywhere, e.g., in an AWS Aurora database.

DuckLake in Operation

Is DuckLake production-ready?

Yes! We released [DuckLake v1.0]({% post_url 2026-04-13-ducklake-10 %}) in April 2026. This includes a production-ready specification and a production-ready [ducklake DuckDB extension]({% link docs/stable/duckdb/introduction.md %}), with guaranteed backward-compatibility.

How is authentication implemented in DuckLake?

DuckLake relies on the authentication of the metadata catalog database. For example, if your catalog database is PostgreSQL, you can use PostgreSQL's authentication and authorization methods to protect your DuckLake. This is particularly effective when enabling encryption of DuckLake files.

How does DuckLake deal with the “small files problem”?

The “small files problem” is a well-known problem in data lake formats and occurs e.g. when data is inserted in small batches, yielding many small files with each storing only a small amount of data. DuckLake significantly mitigates this problem by storing the metadata in a database system (catalog database) and making the compaction step simple. DuckLake also uses a technique called “data inlining”, i.e., it harnesses the catalog database to stage data before serializing it into Parquet files. For more details, see the [“Data Inlining in DuckLake” blog post]({% post_url 2026-04-02-data-inlining-in-ducklake %}).

Is there an example DuckLake?

Yes, we published a DuckLake that contains the Dutch Railway Dataset. This DuckLake uses DuckDB as its catalog database and is served from [object storage]({% link docs/stable/duckdb/guides/public_ducklake_on_object_storage.md %}). To attach to it from a DuckDB instance, run:

ATTACH 'https://blobs.duckdb.org/datalake/nl-railway.ducklake' AS nl_railway
    (TYPE ducklake);
USE nl_railway;
FROM services LIMIT 1;

What is a “Frozen DuckLake”?

A Frozen DuckLake is a read-only DuckLake, served through, for example, an HTTPS endpoint. There are multiple ways to implement a Frozen DuckLake, see the blog post [“Frozen DuckLakes for Multi-User, Serverless Data Access”]({% post_url 2025-10-24-frozen-ducklake %}) and the guide [“Public DuckLake on Object Storage”]({% link docs/stable/duckdb/guides/public_ducklake_on_object_storage.md %}). Despite being “frozen”, you can update a Frozen DuckLake provided that you replace the catalog database.

Features

Are constraints such as primary keys and foreign keys supported?

No. Similarly to other lakehouse technologies, DuckLake does not support constraints, keys, or indexes. For more information, see the [list of unsupported features]({% link docs/stable/duckdb/unsupported_features.md %}).

Can I export my DuckLake into other formats?

Yes, you can copy from [DuckLake to Iceberg]({% post_url 2025-09-17-ducklake-03 %}#interoperability-with-iceberg).

Are DuckDB database files supported as the data files for DuckLake?

The data files of DuckLake must be stored in Parquet. Using DuckDB files as storage is not supported at the moment.

Are there any practical limits to the size of data and the number of snapshots?

No. The only limitation is the catalog database's performance but even with a relatively slow catalog database, you can have terabytes of data and millions of snapshots.

Development

How is DuckLake tested?

DuckLake receives extensive testing, including running the applicable subset of DuckDB's thorough test suite. That said, if you encounter any problems using DuckLake, please submit an issue in the DuckLake issue tracker.

How can I contribute to DuckLake?

If you encounter any problems using DuckLake, please submit an issue in the DuckLake issue tracker. If you have any suggestions or feature requests, please open a ticket in DuckLake's discussion forum. You are also welcome to implement support in other systems for DuckLake following the [specification]({% link docs/stable/specification/introduction.md %}).

What is the license of DuckLake?

The [DuckLake specification]({% link docs/stable/specification/introduction.md %}) and the [ducklake DuckDB extension]({% link docs/stable/duckdb/introduction.md %}) are released under the MIT license.

Is the documentation available as a single file?

Yes, you can download the documentation as a single Markdown file and as a PDF.

When is the next version of the DuckLake standard released and what features will it include?

The DuckLake v1.1 standard is expected to be released in September 2026. See the [release calendar]({% link release_calendar.md %}) for the latest information and the [roadmap]({% link roadmap.md %}) for upcoming features.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Frequently Asked Questions

Overview

Why should I use DuckLake?

Is DuckLake an open table format?

What is “DuckLake”?

Where can I download the DuckLake logo?

Where can I find more resources on DuckLake?

Architecture

What are the main components of DuckLake?

Does DuckLake work on AWS S3 (or a compatible storage)?

DuckLake in Operation

Is DuckLake production-ready?

How is authentication implemented in DuckLake?

How does DuckLake deal with the “small files problem”?

Is there an example DuckLake?

What is a “Frozen DuckLake”?

Features

Are constraints such as primary keys and foreign keys supported?

Can I export my DuckLake into other formats?

Are DuckDB database files supported as the data files for DuckLake?

Are there any practical limits to the size of data and the number of snapshots?

Development

How is DuckLake tested?

How can I contribute to DuckLake?

What is the license of DuckLake?

Is the documentation available as a single file?

When is the next version of the DuckLake standard released and what features will it include?

FilesExpand file tree

faq.md

Latest commit

History

faq.md

File metadata and controls

Frequently Asked Questions

Overview

Why should I use DuckLake?

Is DuckLake an open table format?

What is “DuckLake”?

Where can I download the DuckLake logo?

Where can I find more resources on DuckLake?

Architecture

What are the main components of DuckLake?

Does DuckLake work on AWS S3 (or a compatible storage)?

DuckLake in Operation

Is DuckLake production-ready?

How is authentication implemented in DuckLake?

How does DuckLake deal with the “small files problem”?

Is there an example DuckLake?

What is a “Frozen DuckLake”?

Features

Are constraints such as primary keys and foreign keys supported?

Can I export my DuckLake into other formats?

Are DuckDB database files supported as the data files for DuckLake?

Are there any practical limits to the size of data and the number of snapshots?

Development

How is DuckLake tested?

How can I contribute to DuckLake?

What is the license of DuckLake?

Is the documentation available as a single file?

When is the next version of the DuckLake standard released and what features will it include?