HADOOP-19859. Speed up GHA jobs by image cache #8451
Conversation
note: the cache will only be used after this is merged, so don't expect this PR itself to benefit from the cache.
jobs:
  main:
    name: build-image-cache-${{ inputs.os }}-${{ github.ref_name }}
github.ref_name is the branch name for the push event
      - 'branch-*'
    paths:
      - 'dev-support/docker/**'
  workflow_dispatch:
Both push and workflow_dispatch indicate that the person has write permission on the repo, so it's safe.
Agreed. It is a "trusted" action. (We are still careful to use best practices below, and our CodeQL scanning helps enforce that in the future.)
@ajfabbri I followed your practice of adding "Security ..." comments too; could you take a look?
🎊 +1 overall
This message was automatically generated.
      - 'trunk'
      - 'branch-*'
    paths:
      - 'dev-support/docker/**'
Cool. Makes sense to rebuild when anything in here changes. In the future we might just publish an image and use it directly instead of always doing a cached build? We can iterate on it though. 👍
I think the cache is better:
- in most cases there is no Docker file change, so the cache hits and it works well
- if trunk merges a PR that changes the Docker files, it takes ~15 min to refresh the cache; a new push to a forked PR in that window will miss the cache and do a fresh build
- if a PR itself changes the Docker files, it must do a fresh build
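Putting the quoted trigger fragments together, the cache-refresh workflow's `on:` block presumably looks something like the sketch below. The exact nesting is an assumption reassembled from the branch and path patterns quoted in the diff hunks.

```yaml
# Sketch only: trigger section reassembled from the quoted diff fragments;
# the real workflow's layout may differ.
on:
  push:
    branches:
      - 'trunk'
      - 'branch-*'
    paths:
      - 'dev-support/docker/**'   # refresh the cache only when Docker files change
  workflow_dispatch:              # allow a manual first run / manual refresh
```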
    push: true
    tags: ghcr.io/apache/hadoop/gha-build-${{ inputs.os }}-image-cache:${{ github.ref_name }}-static
    cache-from: type=registry,ref=ghcr.io/apache/hadoop/gha-build-${{ inputs.os }}-image-cache:${{ github.ref_name }}
    cache-to: type=registry,ref=ghcr.io/apache/hadoop/gha-build-${{ inputs.os }}-image-cache:${{ github.ref_name }},mode=max
Just getting familiar with this and reading the docs. Is this based on the Spark CI workflows?
type=registry (docs)
registry: embeds the build cache into a separate image and pushes it to a dedicated location, separate from the main output.
cache-to exports the cache to a particular backend (the registry) after a build; cache-from specifies how to import it at the start of a build. IIUC the local BuildKit cache is always enabled but has no persistence between runs, so it only helps with multiple builds within the same workflow run.
The locations passed in (ref=) act as the key for the cache lookup, and we separate these by OS and branch name.
mode=max exports all intermediate layers of the image build, whereas mode=min only exports those that end up in the final image. This looks good to me. 👍
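Based on the fragments above, the docker/build-push-action step presumably looks roughly like this sketch. The step name and `context` path are assumptions for illustration; the tags and cache refs come from the quoted diff.

```yaml
# Hypothetical sketch of the registry-backed cache step; tags and cache
# refs are taken from the quoted diff, the other fields are assumed.
- name: Build and push image cache
  uses: docker/build-push-action@v6
  with:
    context: dev-support/docker
    push: true
    tags: ghcr.io/apache/hadoop/gha-build-${{ inputs.os }}-image-cache:${{ github.ref_name }}-static
    # Import layers exported by a previous run for this OS/branch.
    cache-from: type=registry,ref=ghcr.io/apache/hadoop/gha-build-${{ inputs.os }}-image-cache:${{ github.ref_name }}
    # mode=max exports all intermediate layers, not just final-image layers.
    cache-to: type=registry,ref=ghcr.io/apache/hadoop/gha-build-${{ inputs.os }}-image-cache:${{ github.ref_name }},mode=max
```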
Is this based on the Spark CI workflows?
I basically replicated it from Spark:
https://github.com/apache/spark/blob/branch-4.1/.github/workflows/build_infra_images_cache.yml
Thanks, merged to trunk. I will manually trigger the first image cache build; subsequent cache refreshes will happen automatically once the manually triggered image cache jobs complete: https://github.com/apache/hadoop/actions/runs/24869306467
Description of PR
This PR adds an image cache for the GHA workflow; on a cache hit, the build-image step likely decreases from ~15 min to ~1 min.
The image cache is created on pushing a commit to the apache/hadoop repo when dev-support/docker/** changes, or on a manual trigger. Since this is a public repo, forked repos can read the cache to speed up their image-building workflows.

How was this patch tested?
Tested on my forked repo.

For code changes:
LICENSE, LICENSE-binary, NOTICE-binary files?

AI Tooling
No AI usage.