6 Front door
Oct 12. Slides. After class, read Hernán and Robins 2020 Technical Point 7.4. Optionally, see Glynn and Kashin 2018.
This lecture is about how to engage with new methods for causal identification beyond backdoor adjustment. The learning goals are general:
- engage with a new causal identification approach
- translate that method to code
- critique the identification assumptions
The front-door method is one case we use to show how the building blocks you already know prepare you to learn new approaches to causal identification.
Identification
We focus on the simplest case of front-door identification, depicted in the DAG below, where the variables \(A\), \(M\), and \(Y\) are binary.
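In this DAG, an unobserved \(U\) confounds \(A\) and \(Y\), and \(A\) affects \(Y\) only through the mediator \(M\), matching the simulation later on this page. As a supplement to the figure, here is a sketch of that structure in code using the dagitty package (the object name front_door_dag is ours):
library(dagitty)

# Front-door DAG: A affects Y only through M; unobserved U confounds A and Y
front_door_dag <- dagitty("dag {
  U [latent]
  U -> A
  U -> Y
  A -> M
  M -> Y
}")

# Lay out and plot the graph
plot(graphLayout(front_door_dag))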
In this setting, the slides show the following identification result.
\[P(Y^a)=\sum_m P(M = m\mid A = a) \sum_{a'}P(A = a')P(Y\mid M = m, A = a')\]
Code example
The lecture slides translate this method into code with one simulated example. We provide the code below so it is easy to copy and follow along.
library(dplyr)

sim_data <- function(n = 100) {
  # U is an unobserved confounder of A and Y
  data.frame(U = runif(n)) %>%
    # Generate a binary treatment affected by U
    mutate(A = rbinom(n(), prob = U, size = 1)) %>%
    # Generate a binary mediator affected only by A
    mutate(M = rbinom(n(), prob = .1 + .8 * A, size = 1)) %>%
    # Generate a binary outcome affected by U and M, but not directly by A
    mutate(Y = rbinom(n(), prob = plogis(U + .5 * M), size = 1))
}
data <- sim_data(n = 10e3)
Examine the descriptive relationship between \(A\) and \(Y\).
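The code for this table is not reproduced on this page; a minimal dplyr sketch that computes it is:
data %>%
  # Mean of Y within each observed value of A (descriptive, not causal)
  group_by(A) %>%
  summarize(Y = mean(Y))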
## # A tibble: 2 × 2
## A Y
## <int> <dbl>
## 1 0 0.601
## 2 1 0.751
Estimate the probability of each \(M\) given \(A\). Under the causal assumptions, this corresponds to the probability of each value of \(M\) under assignment to each value of \(A\), since the edge \(A\rightarrow M\) is unconfounded.
p_M_given_A <- data %>%
  # Count size of each group
  group_by(A, M) %>%
  count() %>%
  # Convert to probability within A
  group_by(A) %>%
  mutate(p_M_under_A = n / sum(n)) %>%
  select(A, M, p_M_under_A) %>%
  print()
## # A tibble: 4 × 3
## # Groups: A [2]
## A M p_M_under_A
## <int> <int> <dbl>
## 1 0 0 0.891
## 2 0 1 0.109
## 3 1 0 0.0908
## 4 1 1 0.909
The front-door identification formula also requires the marginal probability of each treatment value.
# Probability of each A
p_A <- data %>%
  # Count size of each group
  group_by(A) %>%
  count() %>%
  # Convert to probability
  ungroup() %>%
  mutate(p_A = n / sum(n)) %>%
  select(A, p_A) %>%
  print()
## # A tibble: 2 × 2
## A p_A
## <int> <dbl>
## 1 0 0.494
## 2 1 0.506
You also need the outcome distribution given \(M\) and \(A\).
# Probability of Y = 1 given M and A
p_Y_given_M_A <- data %>%
  group_by(A, M) %>%
  summarize(P_Y_given_A_M = mean(Y),
            .groups = "drop") %>%
  print()
## # A tibble: 4 × 3
## A M P_Y_given_A_M
## <int> <int> <dbl>
## 1 0 0 0.589
## 2 0 1 0.693
## 3 1 0 0.65
## 4 1 1 0.761
Given the above, you can identify the outcome under intervention on \(M\) by backdoor adjustment for \(A\), which is the inner sum of the identification formula:
\[P(Y^m)=\sum_{a'}P(A = a')P(Y\mid M = m, A = a')\]
# Probability of Y = 1 under intervention on M
p_Y_under_M <- p_Y_given_M_A %>%
  left_join(p_A, by = "A") %>%
  group_by(M) %>%
  summarize(p_Y_under_M = sum(P_Y_given_A_M * p_A)) %>%
  print()
## # A tibble: 2 × 2
## M p_Y_under_M
## <int> <dbl>
## 1 0 0.620
## 2 1 0.727
Bringing the above together, we have front-door identification.
# Probability of Y = 1 under intervention on A
p_Y_under_A <- p_M_given_A %>%
  left_join(p_Y_under_M, by = "M") %>%
  group_by(A) %>%
  summarize(estimate = sum(p_M_under_A * p_Y_under_M)) %>%
  print()
## # A tibble: 2 × 2
## A estimate
## <int> <dbl>
## 1 0 0.632
## 2 1 0.717
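Because the data are simulated, you can sanity-check these estimates by simulating the potential outcomes directly: draw \(U\), set \(A = a\) for everyone, and generate \(M\) and \(Y\) from the same equations as sim_data(). A sketch (the function name sim_potential is ours, not from the slides):
# Simulate P(Y = 1) under an intervention setting A = a for everyone,
# using the same structural equations as sim_data()
sim_potential <- function(a, n = 10e3) {
  data.frame(U = runif(n)) %>%
    mutate(A = a) %>%
    mutate(M = rbinom(n(), prob = .1 + .8 * A, size = 1)) %>%
    mutate(Y = rbinom(n(), prob = plogis(U + .5 * M), size = 1)) %>%
    summarize(p_Y_under_A = mean(Y))
}

# These should be close to the front-door estimates, up to simulation error
sim_potential(a = 0)
sim_potential(a = 1)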