container: Add deployed commits into set of GC roots #404

cgwalters · 2022-11-10T19:47:27Z

Prep for handling image pruning better. The way things are kind of expected to work today is that for a deployed ostree commit, we have two refs which point to it - one like e.g. fedora:fedora/x86_64/coreos/stable, as well as the "deployment ref" like "ostree/0/1/1" which is a synthetic ref generated by the sysroot core.

We want to be able to remove the container image refs - but doing so today subjects the layer branches to garbage collection.

Fix this by looking at the deployment refs as well as the set of images when computing the set of references for container images.

Prep for handling image pruning better. The way things are kind of expected to work today is that for a deployed ostree commit, we have *two* refs which point to it - one like e.g. `fedora:fedora/x86_64/coreos/stable`, as well as the "deployment ref" like "ostree/0/1/1" which is a synthetic ref generated by the sysroot core. We want to be able to remove the container image refs - but doing so today subjects the *layer* branches to garbage collection. Fix this by looking at the deployment refs as well as the set of images when computing the set of references for container images.

lucab · 2022-11-11T09:07:59Z

lib/src/container/store.rs

@@ -983,6 +983,40 @@ pub async fn copy(
    Ok(())
 }

+/// Iterate over deployment commits, returning the manifests from
+/// commits which point to a container image.
+fn list_container_deployment_manifests(


Code LGTM. However I have a general observation and alternative suggestion on GC logic.

I have a feeling that we should try to rework all the code behind gc_image_layers() so that it becomes infallible (or at least provide an infallible version to consumers). The idea is that a minor unexpected state or a single error in the GC logic can quickly spiral down into a instance with a full sysroot and hard to recover.
To that extent, it is probably more useful to keep going cleaning up whatever we can without failing, and log error messages if we encounter any failures.

Hmm. I see your point. Arguably we should do the same inside ostree_repo_prune() for the objects right?

I think the most cases of failures here are going to be filesystem corruption...in which case, trying to continue probably won't help much.

But in the case of a bug in the logic where we too-eagerly pruned a layer in earlier code; yeah I'd agree continuing makes sense because the user can always re-pull that layer. This really gets into the need for a similar ostree fsck --repair type logic.

Will look at this as a followup

OK one thing I did verify is that the key bit of repo.set_ref_immediate(None, layer_ref.as_str(), None, cancellable)?; is already idempotent. So I think that addresses a major source of possible logic errors already.

This needs some thought; filed #407

Yes I guess the main part of this would be on objects pruning instead, I agree.

Overall on GC there are many little things that could be just barely misaligned enough to cascade into larger issues. So it seems a good idea to at least trying to keep going whenever possible and clean at least some of the things.

cgwalters force-pushed the image-layer-gc-roots branch from c8d4c24 to 7e071e9 Compare November 10, 2022 19:56

cgwalters force-pushed the image-layer-gc-roots branch from 7e071e9 to aec69c8 Compare November 10, 2022 20:34

cgwalters mentioned this pull request Nov 10, 2022

rebasing does not prune previous container image coreos/rpm-ostree#4136

Closed

lucab approved these changes Nov 11, 2022

View reviewed changes

cgwalters merged commit 1ff6bdd into ostreedev:main Nov 11, 2022

cgwalters mentioned this pull request Nov 11, 2022

best effort GC #407

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

container: Add deployed commits into set of GC roots #404

container: Add deployed commits into set of GC roots #404

cgwalters commented Nov 10, 2022

lucab Nov 11, 2022

cgwalters Nov 11, 2022

cgwalters Nov 11, 2022

cgwalters Nov 11, 2022

lucab Nov 11, 2022

container: Add deployed commits into set of GC roots #404

container: Add deployed commits into set of GC roots #404

Conversation

cgwalters commented Nov 10, 2022

lucab Nov 11, 2022

Choose a reason for hiding this comment

cgwalters Nov 11, 2022

Choose a reason for hiding this comment

cgwalters Nov 11, 2022

Choose a reason for hiding this comment

cgwalters Nov 11, 2022

Choose a reason for hiding this comment

lucab Nov 11, 2022

Choose a reason for hiding this comment