Penalized Estimation of Cumulative EffectsAndreas Bender, F Scheipl, W Hartl, A G Day, H KüchenhoffDepartement of Statistics, LMU Munich2017/12/171 / 30

Outline

Motivation
Exposure-Lag-Response Associations
Application

2 / 30

Motivation

Multi-center study of critical care patients from 457 ICUs ( $\approx 10 k$ patients)
maximum follow up of 60 days (we only consider short term survival $t \leq 30$ )
Various confounders:
- Age, Gender, BMI
- Diagnosis, Admission Category
- year of ICU admission
- Apache II Score
- ICU random effect
11-day nutrition protocol
- prescribed calories (determined at baseline $t = 0$ )
- daily caloric intake
- daily caloric adequacy (CA) = caloric intake/prescribed calories

3 / 30

Caloric Intake

4 / 30

Motivation

We are interested in how artificial nutrition (exposure) affects short term survival (outcome)
Difficulty:
- effect of nutrition might have a temporal delay (e.g. nutrition today affects survival 4 days later)
- effect of nutrition might "wear off" after some time (e.g. nutrition on day 1 likely won't affect the hazard on day 30)
- the (delayed) effect of nutrition also depends on the amount of nutrition (caloric adequacy) provided, possibly non-linearly
- the same amount of exposure might have a different effect depending on the follow up and exposure time
- the effect may be cumulative (i.e., 5 days of malnutrition in a row may be worse than only 2 in a row or 5 days malnutrition scattered throughout the follow up while on the other days "correct" amount was provided)

5 / 30

Terminology

We use the following terminology and notation:

Time-to-event $t$ : Time at which event times are observed
Time of exposure $t_{e}$ : Time at which values of the exposure are observed (must not necessarily overlap temporally with $t$ , measured in the same units or be in the same domain as $t$ , e.g. calendar days ( $t_{e}$ ) vs. 24h periods (days) since admission to ICU $t$ )
time-varying effects (TVE): Effects of time-constant covariates (covariates observed at the beginning of the follow-up) that can vary over time $t$
time-dependent covariates (TDC): Covariates whose values change over time. Value changes are recorded at exposure time $t_{e}$ (here synonymous to exposure)
Exposure value $z (t_{e})$ : The value of the TDC observed at exposure time $t_{e}$
Exposure history $z$ : The complete history of observed values of the exposure/TDC $z = (z (t_{e, 1}), z (t_{e, 2}), . . ., z (t_{e, Q}))$

6 / 30

Terminology

A general cumulative effect/Exposure-Lag-Response Association (ELRA) can be defined as

$g (z, t) = \int_{t_{e} : t_{e} \leq t} h (t, t_{e}, z (t_{e})) d t_{e}$

Partial effects $h (t, t_{e}, z (t_{e}))$ : The effect of the TDC recorded at exposure time $t_{e}$ with value $z (t_{e})$ on the hazard at follow up time $t$ (the tri-variate function $h$ is potentially non-linear in all three dimensions)
Cumulative effect $g (z, t)$ : The total (cumulated) effect of the partial effects on the log-hazard at time $t$ given exposure history $z$

7 / 30

Lag-Lead-Window

The integration borders can be defined more general, such that $g (z, t) = \int_{t - t_{lag} - t_{lead}}^{t - t_{lag}} h (t, t_{e}, z (t_{e})) d t_{e}$

Lag time $t_{lag}$ : The length of the delay until the TDC recorded at exposure time $t_{e}$ starts to affect the hazard (often $t_{lag} = 0$ )
Lead time $t_{lead}$ : The duration of the effect of the TDC observed at exposure time $t_{e}$
$t_{lag}$ and $t_{lead}$ define the set of exposures that contribute to the cumulative effect at time $t$ as ${z (t_{e}) : t_{e} \in [t - t_{lag} - t_{lead}, t - t_{lag}]}$
Minimal requirement: $\int_{t_{e} : t_{e} \leq t}$
Special case $\int_{0}^{t}$ follows with $t_{lag} = 0$ and $t_{lead} = t$
Example ( $t_{lag} = 4$ , $t_{lead} = 3$ ):
- The last nutrition that will enter the cumulative effect at time $t = 10$ is nutrition at $t_{e} \leq t - t_{lag} = 10 - 4 = 6$ , i.e. $z (t_{e} = 6)$
- The earliest nutrition that will contribute to the cumulative effect at time $t = 10$ is nutrition at $t_{e} \geq t - t_{lag} - t_{lead} = 10 - 4 - 3 = 3$

8 / 30

Lag-Lead-Window

The integration borders can be defined even more general, such that

$g (z, t) = \int_{t - t_{lag} (t_{e}) - t_{lead} (t_{e})}^{t - t_{lag} (t_{e})} h (t, t_{e}, z (t_{e})) d t_{e} = \int_{T_{e} (t)} h (t, t_{e}, z (t_{e})) d t_{e}$

$t_{lag}$ and $t_{lead}$ times can themselves depend on (exposure) time
$T_{e} (t)$ is the set of exposure times $t_{e}$ relevant to the cumulative effect at time $t$
We call $T_{e} (t)$ the Lag-Lead-Window or Window of effectiveness

9 / 30

Lag-Lead Window (Example)

10 / 30

ELRAs in the literature

Some models known from the literature follow as special cases of the general specification $g (z, t) = \int_{T_{e} (t)} h (t, t_{e}, z (t_{e}))$ when we assume that partial effects $h$ only depend on latency $t - t_{e}$ instead of concrete combination of $t$ and $t_{e}$ , i.e., $h (t = 30, t_{e} = 3, z (t_{e})) \overset{!}{=} h (t = 40, t_{e} = 13, z (t_{e})) \overset{!}{=} \tilde{h} (t - t_{e} = 27, z (t_{e}))$

DLNM: Distributed Lag Non-linear Models (Gasparrini et al, 2014, 2017): $g (z, t) = \int_{T_{e} (t)} h (t - t_{e}, z (t_{e}))$

WCE: Weighted Cumulative Exposure (Sylvestre and Abrahamowicz, 2009): $g (z, t) = \int_{T_{e} (t)} h (t - t_{e}) z (t_{e})$
Also possible within general framework:
- more flexible WCE: $g (z, t) = \int_{T_{e} (t)} h (t, t_{e}) z (t_{e})$
- time-varying DLNM (TV DLNM): $g (z, t) = \int_{T_{e} (t)} h (t, t - t_{e}, z (t_{e}))$

11 / 30

Exposure-Lag-Response Association

$g (z, t)$ represents the cumulative, time-varying effect of exposure history $z$ on the log-hazard at time $t$
we define its contribution to the model's additive predictor as

$\begin{aligned} g (z_{i}, t) = \int_{T_{e} (t)} h ({\tilde{t}}_{j}, t_{e}, z_{i} (t_{e})) d t_{e} \approx \sum_{q : t_{e, q} \in T_{e} (t)} Δ_{q} h ({\tilde{t}}_{j}, t_{e, q}, z_{i} (t_{e, q})) \forall t \in (κ_{j - 1}, κ_{j}], \end{aligned}$

with

${\tilde{t}}_{j} := (κ_{j} - κ_{j - 1}) / 2, j = 1, \dots, J$
partial effects $h ({\tilde{t}}_{j}, t_{e}, z_{i} (t_{e}))$
quadrature weights $Δ_{q} = t_{e, q} - t_{e, q - 1}$ for numerical integration are given by the time between two consecutive exposure measurements

12 / 30

Tensor product smooths

Low rank representation of the tri-variate smooth function $h (t, t_{e}, z (t_{e})) = \sum_{ℓ = 1}^{L} \sum_{r = 1}^{R} \sum_{m = 1}^{M} γ_{ℓ r m} B_{m} (z (t_{e})) B_{r} (t_{e}) B_{ℓ} (t)$

with

model matrix $X = X_{t} ⊙ X_{t_{e}} ⊙ X_{z (t_{e})}$ and
penalty $S = ν_{z (t_{e})} I_{d_{R}} \otimes I_{d_{L}} \otimes S_{z (t_{e})} + ν_{t_{e}} I_{d_{L}} \otimes S_{t_{e}} \otimes I_{d_{M}} + ν_{t} S_{t} \otimes I_{d_{R}} \otimes I_{d_{M}}$

$\to$ Estimate parameters $γ$ by optimizing $D (γ) + \sum_{k} ν_{k} γ^{'} S_{k} γ$ (Wood, 2011), where

$D (γ)$ is the model deviance (of the Poisson GAMM)
$γ$ contains all Spline basis coefficients and random effects
$ν_{k}$ and $S_{k}, k = 1, \dots, K$ are the smoothing parameters and penalty matrices for the $k$ -th smooth term, respectively

13 / 30

Exposure-Lag Response AssociationIf we restrict the ELRA to be linear in the exposure, i.e.,
h(zi(te),te,t)=~h(te,t)⋅zi(te)h(zi(te),te,t)=h~(te,t)⋅zi(te) we can simplify to
g(zi,t)≈Q∑q=1~Δi,q~h(te,q,t)g(zi,t)≈∑q=1QΔ~i,qh~(te,q,t)
with
~Δi,q={zi(te,q)Δq if te,q∈Te(t)0 elseΔ~i,q={zi(te,q)Δq if te,q∈Te(t)0 else
14 / 30

Exposure-Lag Response Association

Spline bases for the bivariate functions $\tilde{h} (t_{e}, t)$ are set up via tensor product B-spline basis with marginal bases $B_{m} (t_{e}), m = 1, \dots, M$ and $B_{k} (t), k = 1, \dots, K$ defined over the exposure and hazard time domains, respectively
$M$ and $K$ delimit the maximal complexity of the ELRA
$\tilde{h} (t_{e}, t) = \sum_{m = 1}^{M} \sum_{k = 1}^{K} γ_{m, k} B_{m} (t_{e}) B_{k} (t)$
Combining above equations yields: $\begin{aligned} g (z_{i}, t) \approx \sum_{m = 1}^{M} \sum_{k = 1}^{K} γ_{m, k} {\tilde{B}}_{i, m} (t_{e}, t) B_{k} (t), \end{aligned}$ where ${\tilde{B}}_{i, m} (t_{e}, t) = \sum_{q = 1}^{Q} {\tilde{Δ}}_{i, q} B_{m} (t_{e})$ .

15 / 30

Simulation - DLNM

$λ (t | z) = λ_{0} (t) \exp (\int h (t - t_{e}, z (t_{e})) d t_{e})$
$t \in (0, 40]$ , $t_{e} \in [- 40, 40]$ , $z (t_{e}) \in [0, 10]$

16 / 30

Simulation (2) - TV DLNM

$λ (t | z) = λ_{0} (t) \exp (\int \tilde{h} (t, t - t_{e}, z (t_{e})) d t_{e}) = λ_{0} (t) \exp (\int f (t) \cdot h (t - t_{e}, z (t_{e})) d t_{e})$ and $f (t) = - \cos (π t / t_{max})$

17 / 30

Application

In the application example (categorical nutrition), we estimate $\begin{aligned} \log (λ_{i} (t | x_{i}, z_{i}, ℓ_{i})) & = f_{0} (t) + \sum_{p = 1}^{P} f_{p} (x_{i, p}, t) + g (z_{i}, t) + b_{ℓ_{i}} \end{aligned}$ with

$f_{0} (t_{j}) = \sum_{m = 1}^{M} γ_{0 m} B_{m} (t_{j})$ represents the log baseline-hazard
$f (x_{i, p}, t_{j}) = \sum_{m = 1}^{M} \sum_{ℓ = 1}^{L} γ_{m ℓ} B_{m} (x_{i, p}) B_{ℓ} (t_{j})$ are potentially non-linear, potentially non-linearly time-varying effects of confounders $x_{i, p}$
$g (z_{i}, t) = g_{C 2} (z_{i}^{C 2}, t) + g_{C 3} (z_{i}^{C 3}, t)$
- $z_{i}^{C 2}$ and $z_{i}^{C 3}$ dummy variables that indicate whether subject $i$ received category $C 2$ and $C 3$ nutrition on day $t_{e, q}, q = 1, \dots, 11$ , respectively
- $g_{C 2} (z_{i}, t) \approx \sum_{q = 1}^{Q} {\tilde{Δ}}_{i, q}^{C 2} {\tilde{h}}_{C 2} (t_{e, q}, t)$
$b_{ℓ_{i}}$ is the random effect associated with ICU (cluster) $ℓ_{i}$ at which subject $i$ is treated

$\to C 1$ reference category

18 / 30

PAMM

We know how the estimate such models in the framework of
Generalized Additive Mixed Models (GAMMs)

Fortunately, we can fit survival models via Poisson GLMs/GAMMs by representing them as a Piece-wise exponential Additive Mixed Model (PAMMs)
to do so requires to
- divide the follow up $(0, t_{m a x}]$ into $J$ intervals with $J + 1$ cut-points $0 = κ_{0} < \dots < κ_{J} = t_{max}$
- transform the data into appropriate format (pseudo observations in each interval):
  - interval specific event-indicators $δ_{i j}$ , where $δ_{i j} = 1$ if subject $i$ experienced an event in interval $j$ (i.e. $t_{i} \in (κ_{j - 1}, κ_{j}]$ and $T_{i} < C_{i}$ ) and $δ_{i j} = 0$ else
  - offsets $o_{i j} = \log (t_{i j})$ , where $t_{i j} = m i n (t_{i} - κ_{j - 1}, κ_{j} - κ_{j - 1})$ is the time subject $i$ spent in interval $j$
- in the $j^{t h}$ interval $(κ_{j - 1}, κ_{j}]$ estimate a piece-wise constant hazard rate $λ (t) = λ_{j} \forall t \in (κ_{j - 1}, κ_{j}]$ (more intervals lead to better approximation)
See Holford 1980, Laird 1981, Friedman 1982, Whitehead 1982

19 / 30

Application (Results)

20 / 30

Application (Results)

Example:

$z = (5 \times C 2, 6 \times C 3)$
$z^{C 2} = (1, 1, 1, 1, 1, 0, 0, 0, 0, 0, 0)$ , $z^{C 3} = (0, 0, 0, 0, 0, 1, 1, 1, 1, 1, 1)$
$g (z, {\tilde{t}}_{j} = 18.5) = g (z^{C 2}, 18.5) + g (z^{C 3}, 18.5) \approx - 0.57$

$\to$ Risk reduction of $\exp (- 0.57) = 0.57$ compared to subject with $11 \times C 1$ nutrition (c.p)

21 / 30

Application (Results)

These bivariate surfaces are difficult to interpret as

they must be interpreted with respect to a subject who received $C 1$ nutrition on all 11 days of nutrition protocol
partial effects $h_{C 2} (t, t_{e})$ and $h_{C 3} (t, t_{e})$ can both contribute to the cumulative effect, depending on the specific nutrition profile
for these reasons, we prefer to analyze and interpret estimated hazard ratios between hypothetical patients with different clinically relevant exposure histories ( $z_{1}$ and $z_{2}$ )

$e_{j} = \frac{λ ({\tilde{t}}_{j} | z_{2})}{λ ({\tilde{t}}_{j} | z_{1})}$

22 / 30

Application (Results)

We compare the following nutrition profiles:

23 / 30

Application (Results)

$\to$ Complete, mildly hypocaloric nutrition reduces risk of mortality compared to a complete, severely hypocaloric nutrition (Comparison B)

$\to$ No further risk reduction when moving from mildly hypocaloric to partial or complete near target nutrition (Comparisons E, F)

$\to$ Sensitivity analyses (Imputation of missing protocols, lag/lead specification, penalty structure, ...) show no substantive deviation from main results

24 / 30

Limitations (and outlook)

currently, $t_{lag}$ and $t_{lead}$ must be specified a priori $\to$ would be nice if the lag-lead window could be selected data-driven (e.g. Obermeier et al., 2015)
we assume that patients released from hospital survived until the end of the follow-up ( $t = 30$ ). Sensitivity analysis with hospital discharge as censoring event do not change the results $\to$ Competing risks model for outcomes hospital discharge and death would be preferable
Modeling and interpretation of TDCs always difficult, especially if exogeneity is unclear, e.g.
- although nutrition is provided by hospital staff, amount provided might depend on patients' health status
- more recent values provide better confounder adjustment but may also be fully indicative of the outcome (indication bias)
Model choice becomes difficult when all effects are potentially non-linear and/or non-linearly time-varying (boosting ad double-penalty procedures promising)

25 / 30

Links and Acknowledgments

Talk is based on two publications
- Andreas Bender, Andreas Groll, and Fabian Scheipl. 2018. “A Generalized Additive Model Approach to Time-to-Event Analysis.” Statistical Modelling. https://doi.org/10.1177/1471082X17748083.
- Andreas Bender, Fabian Scheipl, Wolfgang Hartl, Andrew G Day, Helmut Küchenhoff; "Penalized estimation of complex, non-linear exposure-lag-response associations", Biostatistics, , kxy003, https://doi.org/10.1093/biostatistics/kxy003
pammtools: Package for Piece-wise exponential Additive Mixed Models (in development)
Slides created via Yihui Xie's R package xaringan with (modified) Metropolis theme
All graphics have been created using Hadley Whickham's ggplot2
Models are estimated using Simon Wood's mgcv
Web: adibender.netlify.com
Social:

26 / 30

References

Friedman, Michael. “Piecewise Exponential Models for Survival Data with Covariates.” The Annals of Statistics 10, no. 1 (1982): 101–113.
Gasparrini, Antonio. “Modeling Exposure–lag–response Associations with Distributed Lag Non-Linear Models.” Statistics in Medicine 33, no. 5 (February 28, 2014): 881–99. https://doi.org/10.1002/sim.5963.
Gasparrini, Antonio, Fabian Scheipl, Ben Armstrong, and Michael G. Kenward. “A Penalized Framework for Distributed Lag Non-Linear Models.” Biometrics, January 1, 2017. https://doi.org/10.1111/biom.12645.
Holford, Theodore R. “The Analysis of Rates and of Survivorship Using Log-Linear Models.” Biometrics 36, no. 2 (1980): 299–305. https://doi.org/10.2307/2529982.
Laird, Nan, and Donald Olivier. “Covariance Analysis of Censored Survival Data Using Log-Linear Analysis Techniques.” Journal of the American Statistical Association 76, no. 374 (1981): 231–240. https://doi.org/10.2307/2287816.
Marra, Giampiero, and Simon N. Wood. “Coverage Properties of Confidence Intervals for Generalized Additive Model Components.” Scandinavian Journal of Statistics 39, no. 1 (March 1, 2012): 53–74. https://doi.org/10.1111/j.1467-9469.2011.00760.x.
Sylvestre, Marie-Pierre, and Michal Abrahamowicz. “Flexible Modeling of the Cumulative Effects of Time-Dependent Exposures on the Hazard.” Statistics in Medicine 28, no. 27 (2009): 3437–3453. https://doi.org/10.1002/sim.3701.

27 / 30

References

Whitehead, John. “Fitting Cox’s Regression Model to Survival Data Using GLIM.” Journal of the Royal Statistical Society. Series C (Applied Statistics) 29, no. 3 (1980): 268–75. https://doi.org/10.2307/2346901.
Wood, Simon N. Generalized Additive Models: An Introduction with R. Boca Raton and FL: Chapman & Hall/CRC, 2006.
Wood, Simon N. “Low-Rank Scale-Invariant Tensor Product Smooths for Generalized Additive Mixed Models.” Biometrics 62, no. 4 (December 1, 2006): 1025–36. https://doi.org/10.1111/j.1541-0420.2006.00574.x.
Wood, Simon N. “Fast Stable Restricted Maximum Likelihood and Marginal Likelihood Estimation of Semiparametric Generalized Linear Models.” Journal of the Royal Statistical Society: Series B (Statistical Methodology) 73, no. 1 (2011): 3–36. https://doi.org/10.1111/j.1467-9868.2010.00749.x.
Wood, Simon N. “On P-Values for Smooth Components of an Extended Generalized Additive Model.” Biometrika 100, no. 1 (March 1, 2013): 221–28. https://doi.org/10.1093/biomet/ass048.
Wood, Simon N., Fabian Scheipl, and Julian J. Faraway. “Straightforward Intermediate Rank Tensor Product Smoothing in Mixed Models.” Statistics and Computing, 2012. https://doi.org/10.1007/s11222-012-9314-z.

28 / 30

References

Wickham, Hadley. Ggplot2: Elegant Graphics for Data Analysis. 2nd ed. 2016. New York, NY: Springer, 2016.
Yihui Xie (2017). xaringan: Presentation Ninja. R package version 0.4.4. https://github.com/yihui/xaringan
Hadley Wickham, Romain Francois, Lionel Henry and Kirill Müller (2017). dplyr: A Grammar of Data Manipulation. R package version 0.7.4. https://CRAN.R-project.org/package=dplyr

29 / 30

Caloric Adequacy

caloric intake = calories from EN + PN + PF
caloric adequacy (CA):
$C A (%) = caloric intake / prescribed calories \cdot 100$
discretized caloric adequacy (in 3 categories):
- $C 1$ : $0 % \leq C A < 30 %$ and no OI
- $C 2$ :
  - $30 % \leq C A < 70 %$ and no OI or
  - $0 % \leq C A < 30 %$ and additional OI
- $C 3$ :
  - $C A \geq 70 %$ or
  - $30 % \leq C A < 70 %$ and additional OI

↑, ←, Pg Up, k	Go to previous slide
↓, →, Pg Dn, Space, j	Go to next slide
Home	Go to first slide
End	Go to last slide
Number + Return	Go to specific slide
b / m / f	Toggle blackout / mirrored / fullscreen mode
c	Clone slideshow
p	Toggle presenter mode
t	Restart the presentation timer
?, h	Toggle this help

Penalized Estimation of Cumulative Effects

Andreas Bender, F Scheipl, W Hartl, A G Day, H Küchenhoff

Departement of Statistics, LMU Munich

2017/12/17

Outline

Motivation

Caloric Intake

Motivation

Terminology

Terminology

Lag-Lead-Window

Lag-Lead-Window

Lag-Lead Window (Example)

ELRAs in the literature

Exposure-Lag-Response Association

Tensor product smooths

Exposure-Lag Response Association

Exposure-Lag Response Association

Simulation - DLNM

Simulation (2) - TV DLNM

Application

PAMM

Application (Results)

Application (Results)

Application (Results)

Application (Results)

Application (Results)

Limitations (and outlook)

Links and Acknowledgments

References

References

References

Caloric Adequacy

Outline

Help