Motivation
Exposure-Lag-Response Associations
Application
Multi-center study of critical care patients from 457 ICUs
( ≈10k
patients)
maximum follow up of 60 days
(we only consider short term survival t≤30
)
Various confounders:
11-day nutrition protocol
t=0
)We are interested in how artificial nutrition (exposure) affects short term survival (outcome)
Difficulty:
effect of nutrition might have a temporal delay (e.g. nutrition today affects survival 4 days later)
effect of nutrition might "wear off" after some time (e.g. nutrition on day 1 likely won't affect the hazard on day 30)
the (delayed) effect of nutrition also depends on the amount of nutrition (caloric adequacy) provided, possibly non-linearly
the same amount of exposure might have a different effect depending on the follow up and exposure time
the effect may be cumulative (i.e., 5 days of malnutrition in a row may be worse than only 2 in a row or 5 days malnutrition scattered throughout the follow up while on the other days "correct" amount was provided)
We use the following terminology and notation:
Time-to-event t
: Time at which event times are observed
Time of exposure te
: Time at which values of the exposure are observed
(must not necessarily overlap temporally with t
, measured in the same units
or be in the same domain as t
, e.g. calendar days ( te
) vs. 24h periods
(days) since admission to ICU t
)
time-varying effects (TVE): Effects of time-constant covariates (covariates
observed at the beginning of the follow-up) that can vary over time t
time-dependent covariates (TDC): Covariates whose values change over time.
Value changes are recorded at exposure time te
(here synonymous to exposure)
Exposure value z(te)
: The value of the TDC observed at exposure time
te
Exposure history z
: The complete history of observed
values of the exposure/TDC z=(z(te,1),z(te,2),...,z(te,Q))
A general cumulative effect/Exposure-Lag-Response Association (ELRA) can be defined as
g(z,t)=∫te:te≤th(t,te,z(te))dte
Partial effects h(t,te,z(te))
: The effect of the TDC recorded at
exposure time te
with value z(te)
on the hazard at follow up time t
(the tri-variate function h
is potentially non-linear in all three dimensions)
Cumulative effect g(z,t)
: The total (cumulated) effect of the
partial effects on the log-hazard at time t
given exposure history z
The integration borders can be defined more general, such that
g(z,t)=∫t−tlagt−tlag−tleadh(t,te,z(te))dte
tlag
: The length of the delay until the TDC recorded at
exposure time te
starts to affect the hazard (often tlag=0
)tlead
: The duration of the effect of the TDC observed at exposure time te
tlag
and tlead
define the set of exposures that contribute to the
cumulative effect at time t
as {z(te):te∈[t−tlag−tlead,t−tlag]}
∫te:te≤t
∫t0
follows with tlag=0
and tlead=t
tlag=4
, tlead=3
):t=10
is nutrition at te≤t−tlag=10−4=6
, i.e. z(te=6)
t=10
is nutrition at te≥t−tlag−tlead=10−4−3=3
The integration borders can be defined even more general, such that
g(z,t)=∫t−tlag(te)t−tlag(te)−tlead(te)h(t,te,z(te))dte=∫Te(t)h(t,te,z(te))dte
tlag
and tlead
times can themselves depend on (exposure) time
Te(t)
is the set of exposure times te
relevant to the cumulative effect
at time t
We call Te(t)
the Lag-Lead-Window or Window of effectiveness
Some models known from the literature follow as special cases of the general
specification
g(z,t)=∫Te(t)h(t,te,z(te))
when we assume that partial effects h
only depend on latency t−te
instead of concrete combination of t
and te
, i.e.,
h(t=30,te=3,z(te))!=h(t=40,te=13,z(te))!=~h(t−te=27,z(te))
WCE: Weighted Cumulative Exposure (Sylvestre and Abrahamowicz, 2009):
g(z,t)=∫Te(t)h(t−te)z(te)
Also possible within general framework:
g(z,t)=∫Te(t)h(t,te)z(te)
g(z,t)=∫Te(t)h(t,t−te,z(te))
g(z,t)
represents the cumulative, time-varying effect of
exposure history z
on the log-hazard at time t
we define its contribution to the model's additive predictor as
g(zi,t)=∫Te(t)h(~tj,te,zi(te))dte≈∑q:te,q∈Te(t)Δqh(~tj,te,q,zi(te,q))∀t∈(κj−1,κj],
with
~tj:=(κj−κj−1)/2,j=1,…,J
h(~tj,te,zi(te))
Δq=te,q−te,q−1
for numerical integration are given by the time between two consecutive exposure measurementsLow rank representation of the tri-variate smooth function
h(t,te,z(te))=L∑ℓ=1R∑r=1M∑m=1γℓrmBm(z(te))Br(te)Bℓ(t)
with
model matrix X=Xt⊙Xte⊙Xz(te)
and
penalty S=νz(te)IdR⊗IdL⊗Sz(te)+νteIdL⊗Ste⊗IdM+νtSt⊗IdR⊗IdM
→
Estimate parameters γ
by optimizing
D(γ)+∑kνkγ′Skγ
(Wood, 2011), where
D(γ)
is the model deviance (of the Poisson GAMM)γ
contains all Spline basis coefficients and random effectsνk
and Sk,k=1,…,K
are the smoothing parameters and penalty
matrices for the k
-th smooth term, respectivelyh(zi(te),te,t)=~h(te,t)⋅zi(te)
we can simplify to
g(zi,t)≈Q∑q=1~Δi,q~h(te,q,t)
with
~Δi,q={zi(te,q)Δq if te,q∈Te(t)0 else
Spline bases for the bivariate functions ~h(te,t)
are set up
via tensor product B-spline basis with marginal bases
Bm(te),m=1,…,M
and Bk(t),k=1,…,K
defined over the exposure and
hazard time domains, respectively
M
and K
delimit the maximal complexity of the ELRA
~h(te,t)=∑Mm=1∑Kk=1γm,kBm(te)Bk(t)
Combining above equations yields:
g(zi,t)≈M∑m=1K∑k=1γm,k~Bi,m(te,t)Bk(t),
where ~Bi,m(te,t)=∑Qq=1~Δi,qBm(te)
.
λ(t|z)=λ0(t)exp(∫h(t−te,z(te))dte)
t∈(0,40]
, te∈[−40,40]
, z(te)∈[0,10]
In the application example (categorical nutrition), we estimate
log(λi(t|xi,zi,ℓi))=f0(t)+P∑p=1fp(xi,p,t)+g(zi,t)+bℓi
with
f0(tj)=∑Mm=1γ0mBm(tj)
represents the log baseline-hazardf(xi,p,tj)=∑Mm=1∑Lℓ=1γmℓBm(xi,p)Bℓ(tj)
are potentially non-linear, potentially non-linearly time-varying effects
of confounders xi,p
g(zi,t)=gC2(zC2i,t)+gC3(zC3i,t)
zC2i
and zC3i
dummy variables that indicate whether
subject i
received category C2
and C3
nutrition on day
te,q,q=1,…,11
, respectivelygC2(zi,t)≈∑Qq=1~ΔC2i,q~hC2(te,q,t)
bℓi
is the random effect associated with ICU (cluster) ℓi
at which subject i
is treated→C1
reference category
Fortunately, we can fit survival models via Poisson GLMs/GAMMs by representing them as a Piece-wise exponential Additive Mixed Model (PAMMs)
to do so requires to
(0,tmax]
into J
intervals
with J+1
cut-points 0=κ0<…<κJ=tmax
δij
, where δij=1
if subject i
experienced an event in interval j
(i.e.
ti∈(κj−1,κj]
and Ti<Ci
) and δij=0
elseoij=log(tij)
, where
tij=min(ti−κj−1,κj−κj−1)
is the time
subject i
spent in interval j
jth
interval (κj−1,κj]
estimate a
piece-wise constant hazard rate
λ(t)=λj ∀ t∈(κj−1,κj]
(more intervals lead to better approximation)These bivariate surfaces are difficult to interpret as
they must be interpreted with respect to a subject who received
C1
nutrition on all 11 days of nutrition protocol
partial effects hC2(t,te)
and hC3(t,te)
can both contribute
to the cumulative effect, depending on the specific nutrition profile
for these reasons, we prefer to analyze and interpret estimated
hazard ratios between hypothetical patients with different clinically
relevant exposure histories ( z1
and z2
)
ej=λ(~tj|z2)λ(~tj|z1)
We compare the following nutrition profiles:
→
Complete, mildly hypocaloric nutrition reduces risk of mortality
compared to a complete, severely hypocaloric nutrition (Comparison B)
→
No further risk reduction when moving from mildly hypocaloric to
partial or complete near target nutrition (Comparisons E, F)
→
Sensitivity analyses (Imputation of missing protocols,
lag/lead specification, penalty structure, ...) show no substantive deviation
from main results
currently, tlag
and tlead
must be specified a priori
→
would be nice if the lag-lead window could be selected data-driven
(e.g. Obermeier et al., 2015)
we assume that patients released from hospital survived until the end
of the follow-up ( t=30
). Sensitivity analysis with hospital discharge
as censoring event do not change the results
→
Competing risks model for outcomes hospital discharge and
death would be preferable
Modeling and interpretation of TDCs always difficult, especially if exogeneity is unclear, e.g.
Model choice becomes difficult when all effects are potentially non-linear and/or non-linearly time-varying (boosting ad double-penalty procedures promising)
pammtools: Package for
Piece-wise exponential Additive Mixed Models (in development)
Slides created via Yihui Xie's R package xaringan with (modified) Metropolis theme
All graphics have been created using Hadley Whickham's ggplot2
Models are estimated using Simon Wood's mgcv
caloric intake = calories from EN + PN + PF
caloric adequacy (CA):
CA(%)=caloric intake/prescribed calories⋅100
discretized caloric adequacy (in 3 categories):
C1
: 0%≤CA<30%
and no OIC2
:30%≤CA<70%
and no OI or0%≤CA<30%
and additional OIC3
:CA≥70%
or30%≤CA<70%
and additional OI
Motivation
Exposure-Lag-Response Associations
Application
Keyboard shortcuts
↑, ←, Pg Up, k | Go to previous slide |
↓, →, Pg Dn, Space, j | Go to next slide |
Home | Go to first slide |
End | Go to last slide |
Number + Return | Go to specific slide |
b / m / f | Toggle blackout / mirrored / fullscreen mode |
c | Clone slideshow |
p | Toggle presenter mode |
t | Restart the presentation timer |
?, h | Toggle this help |
Esc | Back to slideshow |