Skip to content

fix: add Parquet UUID type support for read/write #65#71

Open
MisterRaindrop wants to merge 1 commit intoapache:mainfrom
MisterRaindrop:support_uuid
Open

fix: add Parquet UUID type support for read/write #65#71
MisterRaindrop wants to merge 1 commit intoapache:mainfrom
MisterRaindrop:support_uuid

Conversation

@MisterRaindrop
Copy link
Collaborator

@MisterRaindrop MisterRaindrop commented Feb 28, 2026

closes: #65


Change logs

Add UUID (OID 2950) handling to ParquetFileAccessor by mapping it to
BINARY with STRING logical type annotation, consistent with how ORC and
JDBC connectors handle UUID. UUIDARRAY is also supported automatically
through the existing array type framework.

Contributor's checklist

Here are some reminders before you submit your pull request:

  Add UUID (OID 2950) handling to ParquetFileAccessor by mapping it to
  BINARY with STRING logical type annotation, consistent with how ORC and
  JDBC connectors handle UUID. UUIDARRAY is also supported automatically
  through the existing array type framework.
case BPCHAR:
case TEXT:
primitiveTypeName = PrimitiveTypeName.BINARY;
logicalTypeAnnotation = LogicalTypeAnnotation.stringType();
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Will test on weekdays that it works with parquet files from other databases. Probably they encode UUID as 16-byte FIXED_LEN_BYTE_ARRAY primitive type.

https://parquet.apache.org/docs/file-format/types/logicaltypes/

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you very much. This is indeed very detailed.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Parquet UUID support

2 participants