[connector] pass Fluss schema to lake writer by xx789633 · Pull Request #1192 · apache/fluss

xx789633 · 2025-06-25T03:44:32Z

Purpose

Use the schema in Fluss as the single source of truth for the lake writers to avoid any inconsistency.

Moreover, we sometimes need the Fluss schema to pick the correct field writers when creating a lake writer. For example, the LocalZonedTimestampType and TimestampType types in Fluss currently map to the same Arrow type:
https://github.com/alibaba/fluss/blob/main/fluss-common/src/main/java/com/alibaba/fluss/utils/ArrowUtils.java#L476
https://github.com/alibaba/fluss/blob/main/fluss-common/src/main/java/com/alibaba/fluss/utils/ArrowUtils.java#L489

When we tier the table to a data lake with only the Arrow schema, we are not able to choose the appropriate field writers.
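The ambiguity described above can be illustrated with a minimal sketch. Everything in it is a hypothetical stand-in (the enums, the writer names, and the mapping function are not the real Fluss or Arrow classes); it only shows that once two logical types collapse into one physical type, the original schema is needed to choose between writers.

```java
import java.util.Arrays;

// Hypothetical stand-ins for illustration only; NOT the real Fluss or Arrow classes.
final class FieldWriterSketch {

    // Simplified logical types on the Fluss side.
    enum FlussType { TIMESTAMP, TIMESTAMP_LTZ, INT }

    // Simplified physical types on the Arrow side.
    enum ArrowType { TIMESTAMP_NO_TZ, INT32 }

    // Mirrors the situation in the PR description: both timestamp flavours
    // end up as the same Arrow type, so the distinction is lost here.
    static ArrowType toArrow(FlussType type) {
        switch (type) {
            case TIMESTAMP:
            case TIMESTAMP_LTZ:
                return ArrowType.TIMESTAMP_NO_TZ;
            default:
                return ArrowType.INT32;
        }
    }

    // With the Fluss type at hand, a distinct (hypothetical) writer can be chosen.
    static String writerFor(FlussType type) {
        switch (type) {
            case TIMESTAMP:
                return "TimestampNtzWriter";
            case TIMESTAMP_LTZ:
                return "TimestampLtzWriter";
            default:
                return "IntWriter";
        }
    }

    public static void main(String[] args) {
        for (FlussType type : Arrays.asList(FlussType.TIMESTAMP, FlussType.TIMESTAMP_LTZ)) {
            System.out.println(type + " -> arrow=" + toArrow(type) + ", writer=" + writerFor(type));
        }
    }
}
```

Selecting the writer from the Arrow type alone would force both timestamp columns onto the same writer, which is the inconsistency this PR aims to avoid.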

Brief change log

Add the schema in Fluss to WriterInitContext.

Tests

n/a

API and Format

n/a

Documentation

n/a

CLAassistant · 2025-06-25T03:44:38Z

All committers have signed the CLA.

luoyuxia

@cwang9208 Thanks for the PR. Left minor comments. Otherwise, LGTM!

luoyuxia · 2025-06-25T03:48:07Z

fluss-common/src/main/java/com/alibaba/fluss/lake/writer/WriterInitContext.java

    @Nullable
    String partition();
+
+    Schema schema();


nit: add Javadoc for this method.

Do we need to consider compatibility? For example, use default Optional<Schema> schema() { return Optional.empty(); }

I prefer not to, in order to keep the code clean, since it is currently still for internal use to some degree.

Sorry I forgot it. Fixed.

I don't think there will be any compatibility issues. The tiering reader will automatically fill the WriterInitContext struct with the Fluss schema, and it is up to the connector to decide whether to use it. For now, Paimon doesn't even touch this field.

The compatibility issue actually exists. WriterInitContext is annotated as a public API, and adding a method to this API is a compatibility-breaking change. Imagine this case: a user implements their own xxLakeWriter based on the Fluss 0.7 API. Once they bump the Fluss version from 0.7 to 0.8, they need to adjust their xxLakeWriter implementation and build fluss-lake-xx.jar for 0.8 instead of using the existing fluss-lake-xx.jar built for 0.7.
But since this use case should not exist at the moment and 0.8 will be an Apache version, I agree to keep the code clean and introduce this method here.

Noted. Thanks for the clarification! @leonardBang
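For reference, the default-method alternative raised in this thread could look roughly like the sketch below. It was not adopted in this PR, which adds a plain abstract method. The interface and classes here are trimmed-down, hypothetical stand-ins rather than the actual Fluss types; the point is only that an implementation written before schema() existed keeps compiling when the new method has a default body.

```java
import java.util.Optional;

// Hypothetical, trimmed-down stand-ins; NOT the real Fluss interfaces.
final class DefaultMethodCompatSketch {

    // Stand-in for the Fluss schema class.
    static final class Schema {
        final String description;

        Schema(String description) {
            this.description = description;
        }
    }

    // Stand-in for the writer init context.
    interface WriterInitContextSketch {
        String partition();

        // Default body: implementations written against the older interface
        // do not have to override this method.
        default Optional<Schema> schema() {
            return Optional.empty();
        }
    }

    // Written against the "old" interface: only partition() is implemented.
    static final class LegacyContext implements WriterInitContextSketch {
        @Override
        public String partition() {
            return "p0";
        }
    }

    // A newer implementation that actually supplies a schema.
    static final class SchemaAwareContext implements WriterInitContextSketch {
        private final Schema schema;

        SchemaAwareContext(Schema schema) {
            this.schema = schema;
        }

        @Override
        public String partition() {
            return null;
        }

        @Override
        public Optional<Schema> schema() {
            return Optional.of(schema);
        }
    }

    // A consumer must handle both the present and the absent case.
    static String describe(WriterInitContextSketch context) {
        return context.schema().map(s -> "schema: " + s.description).orElse("no schema");
    }

    public static void main(String[] args) {
        System.out.println(describe(new LegacyContext()));
        System.out.println(describe(new SchemaAwareContext(new Schema("id INT, ts TIMESTAMP"))));
    }
}
```

The trade-off discussed above is visible here: the default keeps old implementations source-compatible, but every consumer has to deal with an Optional, which is the extra complexity the reviewers chose to avoid.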

luoyuxia · 2025-06-25T03:49:04Z

...fluss-lake-paimon/src/test/java/com/alibaba/fluss/lake/paimon/tiering/PaimonTieringTest.java

+
+                    @Override
+                    public com.alibaba.fluss.metadata.Schema schema() {
+                        return null;


nit: throw UnsupportedOperationException instead of returning null.

Thanks for the suggestion. Fixed.
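A minimal sketch of the pattern suggested in this comment, using a hypothetical stand-in interface rather than the real test code: a stub method that the test never expects to be called throws instead of returning null, so an unexpected call fails at the call site rather than as a later NullPointerException.

```java
// Hypothetical stand-in for illustration; NOT the actual PaimonTieringTest code.
final class FailFastStubSketch {

    // Trimmed-down context with an accessor the test does not rely on.
    interface ContextSketch {
        String partition();

        Object schema();
    }

    // Test stub: schema() is not needed by the test, so it fails loudly if called.
    static ContextSketch stub() {
        return new ContextSketch() {
            @Override
            public String partition() {
                return null;
            }

            @Override
            public Object schema() {
                throw new UnsupportedOperationException("schema() is not used by this test");
            }
        };
    }

    // Returns true if calling schema() is rejected immediately.
    static boolean schemaCallFailsFast(ContextSketch context) {
        try {
            context.schema();
            return false;
        } catch (UnsupportedOperationException expected) {
            return true;
        }
    }

    public static void main(String[] args) {
        System.out.println("fails fast: " + schemaCallFailsFast(stub()));
    }
}
```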

pass fluss schema to lake writer

83da50e

luoyuxia reviewed Jun 25, 2025

pass fluss schema to lake writer

6d4013b

luoyuxia merged commit 32c066f into apache:main Jun 25, 2025
4 checks passed

polyzos pushed a commit to polyzos/fluss that referenced this pull request Aug 30, 2025

[lake] Pass Fluss schema to lake writer (apache#1192)

513ce9a

polyzos pushed a commit to Alibaba-HZY/fluss that referenced this pull request Aug 31, 2025

[lake] Pass Fluss schema to lake writer (apache#1192)

e6add87

luoyuxia merged 2 commits into apache:main from xx789633:lake_schema

4 participants
