Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Provide richer diff of values changes #1121

Open
jcogilvie opened this issue Apr 18, 2023 · 35 comments
Open

Provide richer diff of values changes #1121

jcogilvie opened this issue Apr 18, 2023 · 35 comments
Labels

Comments

@jcogilvie
Copy link

jcogilvie commented Apr 18, 2023

Description

When I have an existing helm_release with a set of values[], and the values change, often the values are only changing by a few lines. I would like to see a more narrow diff of the selected part of the values that have changed from my last apply, rather than the current view of "old values file replaced by new values file."

I'm not sure if this is feasible. But, if I have a long values file, the diff becomes meaningless if it just prints the 100 lines of old file as being replaced by 101 lines of new file, without any indicator of which line has changed.

Take this example diff given a small values file change. Can you immediately tell which line has changed?

Terraform will perform the following actions:

  # helm_release.datadog[0] will be updated in-place
!   resource "helm_release" "datadog" {
        id                         = "datadog"
        name                       = "datadog"
!       values                     = [
-           <<-EOT
                  dogstatsd:
                    originDetection: true
                    useHostPort: true
                    useHostPID: true
                    useSocketVolume: true
                    nonLocalTraffic: true
                    # unix domain socket
                    # keep this value in sync with helm-charts DD_DOGSTATSD_SOCKET (default is in flux)
                    socketPath: /var/run/datadog/statsd.sock
            EOT,
+           <<-EOT
                  dogstatsd:
                    originDetection: true
                    useHostPort: true
                    useHostPID: true
                    useSocketVolume: true
                    nonLocalTraffic: true
                    # unix domain socket
                    # keep this value in sync with helm-charts DD_DOGSTATSD_SOCKET (default is in flux)
                    socketPath: /var/run/datadog/statsd.sock
                    someAddedLine: foo  # for example
            EOT,
        ]
        # (25 unchanged attributes hidden)

        # (3 unchanged blocks hidden)
    }

Versus a more narrowly scoped change:

Terraform will perform the following actions:

  # helm_release.datadog[0] will be updated in-place
!   resource "helm_release" "datadog" {
        id                         = "datadog"
        name                       = "datadog"
!       values                     = [
!                 dogstatsd:
                    # unix domain socket
                    # keep this value in sync with helm-charts DD_DOGSTATSD_SOCKET (default is in flux)
                    socketPath: /var/run/datadog/statsd.sock
+                   someAddedLine: foo  # for example
        ]
        # (25 unchanged attributes hidden)

        # (3 unchanged blocks hidden)
    }

I know that 'smarter' diffs are possible in terraform, because I am able to see line by line diffs in helm manifests with helm_template if I set them as outputs. Maybe this could be achieved by changing values to a map internally with its map key as the index or something, to allow diffs between objects that may not be possible in the current list form.

Potential Terraform Configuration

Same as today.

Community Note

  • Please vote on this issue by adding a 👍 reaction to the original issue to help the community and maintainers prioritize this request
  • If you are interested in working on this issue or have submitted a pull request, please leave a comment
@sheneska
Copy link
Contributor

Hi @jcogilvie, can you explain why you defined values with a block of yaml instead of HCL?

@jcogilvie
Copy link
Author

jcogilvie commented Apr 26, 2023

Sure I can.

First, the interface for values takes a list of strings. And using a file ref to load that string is the first thing shown in the documentation for the resource. So, that's how we started.

We did try HCL in a local in some cases. But helm charts come bundled with values as yaml, showing us a template of how to use the chart. That meant it was a context switch and a mental translation any time we wanted to include new values. We couldn't just copy and paste, and it was problematic having people context switch between yaml and HCL (and yaml is a little bit lighter). And then we'd just have to yamlencode the HCL before passing it into values anyway. So that's reason number 2.

We also separately tried to use a bunch of set blocks but that's a lot less legible, and meant changes could live in two places (in the values file or in the helm_release). This became more problematic when we started using a sister helm_template that's a duplicate of the helm_release (in order to get plan-time manifests as an output that can be diffed), as it meant we'd have to perform the set on two objects.

I hope this helps.

@sheneska
Copy link
Contributor

Hi @jcogilvie, Thanks for sharing your use case! However the block of yaml is treated as plain text by terraform so there really isn't any way to break it down further.

@jcogilvie
Copy link
Author

I could be off base here but I think I've seen this happen with lists of strings.

I think if there were an option for a singular values file, that might cause terraform to diff the value instead of treating it like a replacement of one string with another.

@jcogilvie
Copy link
Author

I can confirm that a more robust diff behavior is possible if the provider were to expose a single-string variant of values that can accept a merged yaml instead of a list.

I know this from usage of the argocd_application in the argocd provider, which does exactly that: it accepts values as a singular string on the interface, and therefore does a nice clean diff of that string.

@Kenterfie
Copy link

As a intermediate solution i have created a small python script which converts the plan output to make it easier to read

Without script
image

With script
image

Download: https://gist.github.com/Kenterfie/a7ec9e50f17a749b8bb6469f21a6be4f

Maybe it will help others as well, as long as no solution exists

@rd-michel
Copy link

i think a proper diff view is really neccessary here.

e.g. kube-prometheus-stack helm chart contains a approx 3000 line values file. changing one line of code outputs 6000 lines of "terraform plan" which is nearly impossible to understand/diff.

@towolf
Copy link

towolf commented Sep 13, 2023

In addition to the missing diff for Helm values, the provider also now started printing all metadata as invalidated now, since 2.10 or so. This makes reading the plan output for a large Helm chart very painful.

I mean the stuff that starts with this:

image

... and thousands of lines later ends with this:

image

@rayjanoka
Copy link

rayjanoka commented Sep 14, 2023

The diff has been an issue for this module for a while. Sometimes it shows the difference and sometimes it just prints the entire yaml twice like @rd-michel is saying.

I wrote a little thing like @Kenterfie to diff the helm_release resource using the tfplan file.

➜ terraform-diff helm_release.onepassword
Generating a diff of 'helm_release.onepassword' in '/var/folders/r1/xx/T/tmp.REIC8M5s'
INFO[0003] Executing hook: create_plugin_directory  prefix=[/Users/xxx/asdf/1password]
@@ -23,6 +23,13 @@
     - "hosts":
       - "1password.xxx.io"
       "secretName": "tls-io-xxx-1password"
+  "redis":
+    "master":
+      "resources":
+        "limits":
+          "cpu": 512
+        "requests":
+          "cpu": "250m"
   "service":
     "type": "ClusterIP"
function terraform-diff () {
    TEMP=$(mktemp -d)

    echo "Generating a diff of '$1' in '$TEMP'"

    terraform plan -out=$TEMP/tfplan >/dev/null
    terraform show -json $TEMP/tfplan | jq -r '.resource_changes[] | select(.address=="'"$1"'") | .change.before.values | add' > $TEMP/before.txt
    terraform show -json $TEMP/tfplan | jq -r '.resource_changes[] | select(.address=="'"$1"'") | .change.after.values | add' > $TEMP/after.txt

    diff -u --color=always $TEMP/before.txt $TEMP/after.txt | sed -e '1,2d'
}

@alyssahardy
Copy link

The metadata diffs is present in versions >= 2.10.0. I run a workaround by wrapping the values in sensitive so it doesn't display in the helm resource, then adding a null resource which prints the diffs more nicely. One can use a templatefile for this, or just jsonencode values directly.

terraform {
  required_providers {
    helm = {
      source  = "hashicorp/helm"
      version = "< 2.10.0"
    }
  }
}

resource "helm_release" "chart" {
  name       = var.chart_name
  namespace  = var.chart_namespace
  chart      = var.helm_chart
  repository = var.chart_repo
  version    = var.chart_version
  values     = [sensitive(local.chart_values)]
}

resource "null_resource" "helm_values" {
  triggers = {
    contents = local.helm_values
  }
}

locals {
  chart_values = templatefile(format("%s/values.yaml.tpl", path.module), {
    value1 = var.value1,
    value2 = "some-value",
  })
}

@Punkoivan
Copy link

Hey there. Was looking for that feature and looks like there is no reason to use terraform for helm at all.
I'd switch to helmfile or whatever, because terraform's way to manage helm is not so good for complicated manifests.

@alyssahardy
Copy link

@Punkoivan I would usually prefer that if I didn't need to create additional resources such as buckets or keys, and I didn't need values that I obtain from Terraform.

@Punkoivan
Copy link

@alyssahardy you can try to create just a raw configmap within terraform and then use helm's "lookup" function to get the values.

@joaocc
Copy link

joaocc commented Sep 27, 2023

Hi. Coming from a helm world, where helm diff works very well, and even comparing with the normal plans available on terraform, having 10s and 100s of lines marked as diff (and in our case, not even being able to see the planned changes) does indeed making working with the kubernetes_helm provider very challenging... Workaround cost time and introduce variability in the processes, which is why it would be preferable that the diff produced more useful results.

@duxing
Copy link

duxing commented Jan 6, 2024

I agree with the benefits of using collections of yamls over hcl since it's more helm-native. been suffering from this diff issue for a while and would love to see it being properly addressed!

thx in advance!

@eytanhanig
Copy link

One alternative would be to provide a method to convert a data structure (in our case obtained from yamldecode) into keys/values that could be put in a dynamic set { block. This would also save people who don't use values files from having to deal with the headache of rewriting keys to use the \\. syntax.

@towolf
Copy link

towolf commented Apr 15, 2024

BTW, this seems to be better in TF 1.8.0.

@rd-michel
Copy link

nothing changed for me with tf 1.8.0

example:

  • deploy kube-prometheus-stack helm chart
  • afterwards add a single new line to the helm values file
  • rollout again

result:
❯ terraform plan |wc -l
8372

8372 lines of diff ... minus some terraform description output

@towolf
Copy link

towolf commented Apr 16, 2024

@rd-michel which helm Provider version? Please try pinning down to Version 2.9.0

@tanadeau
Copy link

I'm using Terraform 1.8.0 and version 2.13.0 of the provider, and there's definitely been an improvement. It's definitely still a lot of output though. I see a "minus" diff for the entire old set of values and then the entire new set of values with the actual proper diffs within it. It would be great if it just showed the proper diff with some context.

@towolf
Copy link

towolf commented Apr 16, 2024

With new versions of the provider you also get this problem:

#1315

I mentioned that earlier in this issue:

#1121 (comment)

@duxing
Copy link

duxing commented Apr 16, 2024

you can try enabling manifest from the helm provider. note it's experimental.

when it works, it's a big improvement. however, it has provided me with some weird errors that can only be bypassed by turning this off.

@rd-michel
Copy link

@towolf we always use the latest version of the helm provider (2.13.1 atm). why is it necessary to downgrade to 2.9.0?

@towolf
Copy link

towolf commented Apr 17, 2024

@rd-michel see my previous comment. I refer to the metadata change introduced here: Always recompute metadata when a release is updated.

But I don't know exactly what you are complaining about. Too much clutter from metadata or too much context in the actual values diff?

@rd-michel
Copy link

Thanks @towolf

Ive pretty much the same problem as described by @tanadeau

The displayed helm values context for large helm values files can be overwhelming during a terraform plan/apply

Perfect would be:

  • show me the actual diff + some context

@alexgottscha
Copy link

alexgottscha commented May 23, 2024

Ideally, I'd like similar output to helm diff -C3 upgrade when running an terraform plan. Unlike the solution from @rayjanoka, helm diff shows the difference in the actual cluster resources, rather than the values yaml.

@raffraffraff
Copy link

I have a monitoring terraform module that deploys Loki, Mimir and Grafana. It has to feed in values for the AWS resources it creates (buckets, roles, namespace, secrets). Every time I touch this deployment, no matter how trivial the change, I get a pain in my face as I scroll through a hundred lines of -<<EOT gone bad and a hundred more +<<EOT all new! and somehow figure out wtf has changed. Oh and the metadata:

 ~ metadata                   = [
          - {
                 ~  hundreds of lines of stuff, apparently all bad now! Bye bye!
          - }
    ] -> (known after apply)

Known after apply. In other words ¯\_(ツ)_/¯

@towolf
Copy link

towolf commented Sep 3, 2024

@raffraffraff please try terraform > 1.8.0 and helm provider version 2.9.0.

@meysam81
Copy link

meysam81 commented Sep 4, 2024

September 2024, this is still an issue ❌

OpenTofu v1.8.1
on linux_amd64
+ provider registry.opentofu.org/hashicorp/helm v2.15.0

Here's a minimal working example for your reference 👇

resource "helm_release" "this" {
  name       = "external-secrets"
  repository = "https://charts.external-secrets.io"
  chart      = "external-secrets"
  version    = "0.10.x"

  namespace        = "external-secrets"
  create_namespace = true

  set {
    name  = "crds.createClusterSecretStore"
    value = "true"
  }

}

Apply this stack two times and you will see metadata change. It's exhausting. 😮‍💨

@GreasyAvocado
Copy link

I feel like I'm missing something... How are everyone using the Provider, if it lacks this functionality?
How do you manage without it?
You just manually compare the before and after every time?

@meysam81
Copy link

meysam81 commented Oct 7, 2024

@GreasyAvocado
I do not use this provider for precisely the same reason.
I have been using helmwave and I am a happy user.

@pandvan
Copy link

pandvan commented Oct 14, 2024

I have been using helmwave and I am a happy user.

Do you use helmwave in Terraform runs? If so, how did you integrated it in Terraform?
Thanks.

@meysam81
Copy link

@pandvan

No, I use helmwave CLI, outside TF tooling.

Additionally, in the v1.16, the helm provider has fixed the metadata issue. Read the following comment:

#1344 (comment)

@jrluis
Copy link

jrluis commented Jan 8, 2025

The lack of better diff is the main reason why we are avoiding usage of helm, it just feels must more controllable to use the kubernetes provider with manifests.

@rd-michel
Copy link

@jrluis
render (only) the yaml files to roll them out with kubernetes provider is still a helm option.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests