Skip to content

parquet reading improvements#650

Merged
richox merged 1 commit intomasterfrom
dev-parquet-read
Nov 20, 2024
Merged

parquet reading improvements#650
richox merged 1 commit intomasterfrom
dev-parquet-read

Conversation

@richox
Copy link
Contributor

@richox richox commented Nov 19, 2024

supports parquet parallel reading.

use bufreader to read page header, avoid reading in fragments.

@richox richox force-pushed the dev-parquet-read branch 3 times, most recently from be6c9bf to 5f8000c Compare November 19, 2024 14:50
use bufreader to read page header, avoid reading in fragments.

add hadoop shim.
@richox richox merged commit 22040b0 into master Nov 20, 2024
richox added a commit that referenced this pull request Nov 25, 2024
use bufreader to read page header, avoid reading in fragments.

add hadoop shim.

Co-authored-by: zhangli20 <zhangli20@kuaishou.com>
richox added a commit that referenced this pull request Nov 29, 2024
use bufreader to read page header, avoid reading in fragments.

add hadoop shim.

Co-authored-by: zhangli20 <zhangli20@kuaishou.com>
@richox richox mentioned this pull request Dec 6, 2024
@cxzl25 cxzl25 deleted the dev-parquet-read branch September 5, 2025 08:31
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants