[WIP] compression: add specific prefix for zstd:chunked by giuseppe · Pull Request #2183 · containers/image

giuseppe · 2023-11-10T14:18:21Z

This makes it possible to differentiate between zstd and zstd:chunked.

Needs: containers/storage#1756

Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com>

mtrmac

Implementation LGTM, but when do we need this?

The two zstd variants should be differentiated by annotations; and the layer can’t be pulled without the ManifestChecksumKey annotation present.[1] Are there any situations where we need to treat such “chunked blob, but no chunked annotations” layers specially?

[1]… actually that sounds like another thing we don’t handle: when pushing from c/storage to a registry, TryReusingBlobWithOptions will find a record of a pre-existing zstd:chunked layer and return that it should be reused, but we don’t set the right annotations on that blob. (So we would need to record these annotations in BlobInfoCache, and take good care that we only record them when we created them ourselves, or after we have verified them when pulling.)
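The recording rule in the footnote could look roughly like the following minimal Go sketch. Everything here is hypothetical and for illustration only: `annotationCache`, `provenance`, and `record` are not part of the BlobInfoCache API, and the annotation string is only assumed to be the value behind ManifestChecksumKey. The point is just that annotations are stored when we created them ourselves or verified them on pull, and never otherwise.

```go
package main

import "fmt"

// Assumption: this is the annotation name behind ManifestChecksumKey.
// Check containers/storage before relying on the exact string.
const manifestChecksumKey = "io.github.containers.zstd-chunked.manifest-checksum"

// provenance says how we learned about a blob's annotations (hypothetical type).
type provenance int

const (
	unverified     provenance = iota // seen in a manifest, never checked
	createdLocally                   // we compressed the layer ourselves
	verifiedOnPull                   // we validated the annotations while pulling
)

// annotationCache is a hypothetical in-memory stand-in for the extra
// per-blob data a blob info cache would need to carry.
type annotationCache struct {
	byDigest map[string]map[string]string
}

func newAnnotationCache() *annotationCache {
	return &annotationCache{byDigest: map[string]map[string]string{}}
}

// record stores the annotations only if they are trusted and actually
// describe a chunked layer. It reports whether anything was stored.
func (c *annotationCache) record(digest string, annotations map[string]string, p provenance) bool {
	if p == unverified {
		return false
	}
	if _, ok := annotations[manifestChecksumKey]; !ok {
		return false
	}
	copied := make(map[string]string, len(annotations))
	for k, v := range annotations {
		copied[k] = v
	}
	c.byDigest[digest] = copied
	return true
}

// lookup returns the trusted annotations for a digest, if any were recorded.
func (c *annotationCache) lookup(digest string) (map[string]string, bool) {
	a, ok := c.byDigest[digest]
	return a, ok
}

func main() {
	c := newAnnotationCache()
	ann := map[string]string{manifestChecksumKey: "sha256:placeholder"}
	fmt.Println(c.record("sha256:aaa", ann, unverified))
	fmt.Println(c.record("sha256:bbb", ann, verifiedOnPull))
}
```

On reuse, a push would attach annotations only when `lookup` succeeds; a blob with no trusted record would be treated as not carrying chunked annotations.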

giuseppe · 2023-11-11T21:01:13Z

I am not even sure if we need any of these. The blob info cache probably doesn't make sense when using a layer that was partially pulled.

mtrmac · 2023-11-14T20:54:26Z

The compression data in the BIC exists for CandidateLocations2, i.e. for pushes only. We probably don’t need it for pulls: reuse (currently?) happens mostly (but not exclusively) by relying on data stored natively in c/storage, not in the BIC.

But we do need it for pushes:

- If the same image is pushed twice to the same destination, we should typically re-use blobs and not re-push.
- In particular, if we (pull), build, + push the result, users expect us to optimize out re-pushing the base image layers if possible. Edit+build+push+run iterations should be fast.

Also, if users ask for zstd, I think it’s fine to reuse a zstd:chunked layer (possibly without adding the annotations, especially if we don’t trust them). If they ask for zstd:chunked, we need to be able to reuse only chunked layers and not the non-chunked ones. So eventually we should be able to reliably differentiate in the BIC between the two, and to carry the required annotations to allow reuse. Right now it seems to me that the difference is primarily trusted-annotation-driven, but there may well be something I’m missing.
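The reuse rule described above can be sketched as a small Go predicate. This is only an illustration under stated assumptions: `Compression`, `CachedBlob`, and `CanReuse` are hypothetical names, not the containers/image API, and the `AnnotationsTrusted` field stands in for "we can carry the required annotations".

```go
package main

import "fmt"

// Compression is a hypothetical label for how a cached blob is compressed.
type Compression string

const (
	Gzip        Compression = "gzip"
	Zstd        Compression = "zstd"
	ZstdChunked Compression = "zstd:chunked"
)

// CachedBlob is a hypothetical cache record for a blob already present
// at the destination.
type CachedBlob struct {
	Compression Compression
	// AnnotationsTrusted is true only if the chunked annotations were
	// created by us or verified during a pull.
	AnnotationsTrusted bool
}

// CanReuse reports whether a cached blob satisfies a push that asks for
// the requested compression.
func CanReuse(requested Compression, blob CachedBlob) bool {
	switch requested {
	case Zstd:
		// A zstd:chunked blob is still a valid zstd stream, so it can be
		// reused for a plain zstd request, with or without annotations.
		return blob.Compression == Zstd || blob.Compression == ZstdChunked
	case ZstdChunked:
		// Only chunked blobs qualify, and only when the annotations needed
		// for partial pulls can be attached.
		return blob.Compression == ZstdChunked && blob.AnnotationsTrusted
	default:
		return blob.Compression == requested
	}
}

func main() {
	chunked := CachedBlob{Compression: ZstdChunked, AnnotationsTrusted: false}
	fmt.Println(CanReuse(Zstd, chunked))
	fmt.Println(CanReuse(ZstdChunked, chunked))
}
```

The asymmetry is deliberate: plain zstd requests accept either variant, while zstd:chunked requests reject plain zstd blobs and chunked blobs whose annotations cannot be trusted.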

giuseppe added 2 commits November 10, 2023 15:01

compression: use a function instead of a static prefix

4957ef9

Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com>

compression: add specific prefix for zstd:chunked

a2b0692

Needs: containers/storage#1756

Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com>

giuseppe marked this pull request as draft November 10, 2023 14:18

giuseppe mentioned this pull request Nov 10, 2023

copy: do not fail if digest mismatches #1980

Merged

mtrmac reviewed Nov 10, 2023

mtrmac mentioned this pull request Aug 27, 2025

Zstd(:chunked) work tracking checklist podman-container-tools/container-libs#205

Open

38 tasks

giuseppe closed this Nov 15, 2023

mtrmac mentioned this pull request Aug 27, 2025

Zstd(:chunked) work tracking checklist podman-container-tools/container-libs#210

Closed

37 tasks

giuseppe wants to merge 2 commits into containers:main from giuseppe:zstd-detection
