How Terraform Implicit Dependencies Work: A Toy Example

2022-09-09

Have you ever wondered how Terraform Implicit Dependencies actually work in practice?

In today's blog, I'm going to craft a toy example that demonstrates how I understand that they work using local_file resources and thus not requiring any cloud connection or even internet connectivity.

What do you mean by Implicit Dependencies?

You might be wondering, "Implicit Dependencies... are those like the stuff artists do to avoid getting a Parental Advisory: Explicit Content sticker on their albums?" Which, yeah, I guess it could be but I'm talking about Terraform here. In Terraform, every time that you run a terraform plan or terraform apply, the tool is internally building a directed acyclic graph that models the dependencies between resources in your configuration that you specify using HCL. The edges in this graph are the dependency relationships between your resources and they are typically inferred by Terraform without you explicitly specifying them. It is possible to specify these dependencies explicitly using the depends_on meta-argument but, as I will discuss in this post, that won't necessarily do what you might expect.

If you aren't required to specify anything about the resource dependencies explicitly, then how does Terraform know what the implicit dependencies are? Like any good private investigator, Terraform knows who you're talking to. It uses the references that you are already making every time that you reference the output values of a resource, data source, or module somewhere else in your configuration and uses those to piece together how all the pieces fit together.

Connecting All the Dots

The example provided by Hashicorp for dependencies is a good guide, but it has a dependency of its own that makes it less accessible for some of us. Specifically, it uses the AWS provider which is fine if you're an AWS user but what if you're not a DevOps Wizard flying on Cloud to Nirvana? What if you don't even have internet access? How are you supposed to play around with these TF language constructs then? That's why I decided to play around with local_file resources on my laptop instead.

To get started, I pulled down Terraform 1.2.9 using asdf.

 asdf install terraform 1.2.9
Downloading terraform version 1.2.9 from https://releases.hashicorp.com/terraform/1.2.9/terraform_1.2.9_linux_amd64.zip
...
 asdf global terraform 1.2.9

 terraform version
Terraform v1.2.9
on linux_amd64

Next, I wrote up these files:

main.tf (GitHub)

locals {
  teams = {
    a = {
      summary = "hey, I'm a. aaaaaay!"
    },
    b = {
      summary = "great team. we change names a lot. bi-weekly, at least."
    },
    c = {
      summary = "int main () { return 0; }"
    }
  }
}

module "teams" {
  source = "./team"

  for_each = local.teams

  team = each.key
  summary = each.value.summary
}

resource "local_file" "team_guide" {
  content = join("\n", [for k, v in module.teams : join(": ", [k, v.content])])
  filename = "team_guide.txt"
}

team/main.tf (GitHub)

variable "team" {
  type = string
}

variable "summary" {
  type = string
}

resource "local_file" "team_data" {
  content = var.summary
  filename = "${var.team}.data"
}

output "content" {
  value = local_file.team_data.content
}

During the initial terraform apply to create the initial state, we can see that it knows that it must create each of the module.teams before the team_guide. Output is abbreviated.

 terraform apply
...
  # local_file.team_guide will be created
  + resource "local_file" "team_guide" {
      + content              = <<-EOT
            a: hey, I'm a. aaaaaay!
            b: great team. we change names a lot. bi-weekly, at least.
            c: int main () { return 0; }
        EOT
      + directory_permission = "0777"
      + file_permission      = "0777"
      + filename             = "team_guide.txt"
      + id                   = (known after apply)
    }

  # module.teams["a"].local_file.team_data will be created
  + resource "local_file" "team_data" {
      + content              = "hey, I'm a. aaaaaay!"
      + directory_permission = "0777"
      + file_permission      = "0777"
      + filename             = "a.data"
      + id                   = (known after apply)
    }

  # module.teams["b"].local_file.team_data will be created
  + resource "local_file" "team_data" {
      + content              = "great team. we change names a lot. bi-weekly, at least."
      + directory_permission = "0777"
      + file_permission      = "0777"
      + filename             = "b.data"
      + id                   = (known after apply)
    }

  # module.teams["c"].local_file.team_data will be created
  + resource "local_file" "team_data" {
      + content              = "int main () { return 0; }"
      + directory_permission = "0777"
      + file_permission      = "0777"
      + filename             = "c.data"
      + id                   = (known after apply)
    }
...
module.teams["b"].local_file.team_data: Creating...
module.teams["a"].local_file.team_data: Creating...
module.teams["c"].local_file.team_data: Creating...
module.teams["a"].local_file.team_data: Creation complete after 0s [id=fc2e8a361bfa03467011fc4eaffb66e25edf10fa]
module.teams["c"].local_file.team_data: Creation complete after 0s [id=aa42d468cc318c333239fdd23bea3c498d755a56]
module.teams["b"].local_file.team_data: Creation complete after 0s [id=4e6f79afa05a3112ec94f284923bed560435504c]
local_file.team_guide: Creating...
local_file.team_guide: Creation complete after 0s [id=400f628fc381aad58ca6cf47b75468c9535f2f87]

Apply complete! Resources: 4 added, 0 changed, 0 destroyed.

I can then make changes to the locals and when I do terraform apply, it automatically knows that it needs to destroy the team_guide resource before it updates the teams modules.

 terraform apply
...
  # local_file.team_guide must be replaced
-/+ resource "local_file" "team_guide" {
      ~ content              = <<-EOT # forces replacement
            a: hey, I'm a. aaaaaay!
          - b: great team. we change names a lot. bi-weekly, at least.
          + b: great team. we change names a lot. weekly, at least.
            c: int main () { return 0; }
        EOT
      ~ id                   = "400f628fc381aad58ca6cf47b75468c9535f2f87" -> (known after apply)
        # (3 unchanged attributes hidden)
    }

  # module.teams["b"].local_file.team_data must be replaced
-/+ resource "local_file" "team_data" {
      ~ content              = "great team. we change names a lot. bi-weekly, at least." -> "great team. we change names a lot. weekly, at least." # forces replacement
      ~ id                   = "4e6f79afa05a3112ec94f284923bed560435504c" -> (known after apply)
        # (3 unchanged attributes hidden)
    }
...
local_file.team_guide: Destroying... [id=400f628fc381aad58ca6cf47b75468c9535f2f87]
local_file.team_guide: Destruction complete after 0s
module.teams["b"].local_file.team_data: Destroying... [id=4e6f79afa05a3112ec94f284923bed560435504c]
module.teams["b"].local_file.team_data: Destruction complete after 0s
module.teams["b"].local_file.team_data: Creating...
module.teams["b"].local_file.team_data: Creation complete after 0s [id=2c2d10292d625d948193136bfa981ffa84c1a6dd]
local_file.team_guide: Creating...
local_file.team_guide: Creation complete after 0s [id=f5533e02419d3af8acb8736662de1eb54e755544]

Apply complete! Resources: 2 added, 0 changed, 2 destroyed.

So, how does this work? The answer is here in the content attribute of team_guide.

resource "local_file" "team_guide" {
  content = join("\n", [for k, v in --> *module.teams* <-- : join(": ", [k, v.content])])
  filename = "team_guide.txt"
}

Because this for expression is referencing module.teams, Terraform creates an implicit dependency between these resources. That's it! You could technically add an explicit depends_on to the resource, but it wouldn't change anything about Terraform's resource graph.

One tricky thing that I learned about the dependency graph when using depends_on is that it doesn't work the way that I would have expected it to in a system like Puppet, which refreshes its "state" every time it runs (because, for the most part, it doesn't have any state). To demonstrate this, let's modify the configuration in main.tf so that the file content is gathered using the file() function instead.

resource "local_file" "team_guide" {
  content = join("\n", [for k, v in local.teams : "${k}: ${file(k)}"])
  filename = "team_guide.txt"
}

Now if we modify one of the team again and terraform apply, we see that the only change that is picked up is on the change to the team itself and not the team_guide, which is reading in the file.

  # module.teams["b"].local_file.team_data must be replaced
-/+ resource "local_file" "team_data" {
      ~ content              = "great team. we change names a lot. weekly, at least." -> "great team. we don't change our name often." # forces replacement
      ~ id                   = "2c2d10292d625d948193136bfa981ffa84c1a6dd" -> (known after apply)
        # (3 unchanged attributes hidden)
    }
...
module.teams["b"].local_file.team_data: Destroying... [id=2c2d10292d625d948193136bfa981ffa84c1a6dd]
module.teams["b"].local_file.team_data: Destruction complete after 0s
module.teams["b"].local_file.team_data: Creating...
module.teams["b"].local_file.team_data: Creation complete after 0s [id=f18ea0e62d069cf8b1383dd84f46ffd3d5d3555d]

Apply complete! Resources: 1 added, 0 changed, 1 destroyed.

If we run terraform apply again then we will see that the Terraform plan that is implicitly run as part of the apply action has picked up the changes to the files on disk, and the diff has been handed that off to apply so now it will converge as expected. For more details on why this is, read about the resource lifecycle

  # local_file.team_guide must be replaced
-/+ resource "local_file" "team_guide" {
      ~ content              = <<-EOT # forces replacement
            a: hey, I'm a. aaaaaay!
          - b: great team. we change names a lot. weekly, at least.
          + b: great team. we don't change our name often.
            c: int main () { return 0; }
        EOT
      ~ id                   = "f5533e02419d3af8acb8736662de1eb54e755544" -> (known after apply)
        # (3 unchanged attributes hidden)
    }

Plan: 1 to add, 0 to change, 1 to destroy.

If your brain is used to the patterns used by Puppet, you would expect that adding a depends_on to the resource would resolve this issue. It does not. The reason for this is that adding this dependency does not change the diff that will be generated during the plan stage. The file being read by file() simply will not have changed at that point. There is no concept of a "refresh" between resources in Terraform that would force an update of resources in the dependency graph based on another resource change. Instead, all of the resources that are known are queried during the plan stage, and a diff is generated using those values against the current state file.

I found this counter-intuitive when I first started learning Terraform due to the familiar patterns that I knew from Puppet. Still, once I learned and updated my understanding of how Terraform works, I found that I could reason about how my configurations would be applied much better. I hope that this post helped you have some similar epiphanies of your own about Terraform. Thanks for reading!

If you found this guide helpful, consider reading Matthew Rose's blog post "Talking to the Locals" about the terraform console to get some pointers on how to better explore your Terraform code and state with a handy REPL!