Skip to contents

Introduction

This vignette develops the mathematical framework for biomarker-dependent treatment effects used in the ForestSearch simulation machinery. The central idea is a piecewise-linear spline model embedded in a Weibull regression that is simultaneously an accelerated failure time (AFT) model and a Cox proportional hazards model. This dual representation allows treatment effects to be specified as causal log-hazard-ratios on the Cox scale while survival times are generated on the AFT scale.

Three treatment effect estimands are defined for summarizing biomarker-modulated heterogeneity:

  • the Average Hazard Ratio (AHR) — a geometric-mean summary on the log-hazard scale,
  • the Controlled Direct Effect (CDE) — a ratio of average hazards on the natural scale (Aalen, Cook & Roysland, 2015),
  • the Marginal (causal) HR — a population-level Cox estimate from stacked potential outcomes.

The first two are deterministic functions of the data-generating mechanism (DGM); the third serves as a stochastic validation target. Their theoretical relationships, conditions for agreement and divergence, and causal standing are developed below.

The Weibull AFT/Cox Model

Weibull Distribution Essentials

For the Weibull distribution with shape parameter ν>0\nu > 0 and scale parameter θ>0\theta > 0, the density, survival, hazard, and cumulative hazard functions are: f(t)=νθνtν1exp((t/θ)ν),S(t)=exp((t/θ)ν),λ(t)=νθνtν1,Λ(t)=(t/θ)ν. \begin{aligned} f(t) &= \nu\,\theta^{-\nu}\,t^{\nu-1}\, \exp\!\bigl(-(t/\theta)^\nu\bigr), \\[4pt] S(t) &= \exp\!\bigl(-(t/\theta)^\nu\bigr), \\[4pt] \lambda(t) &= \nu\,\theta^{-\nu}\,t^{\nu-1}, \\[4pt] \Lambda(t) &= (t/\theta)^\nu. \end{aligned}

For ν(0,1)\nu \in (0,1) the hazard is strictly decreasing; for ν>1\nu > 1 it is strictly increasing. A useful distributional identity is that the cumulative hazard evaluated at the random variable TT is unit-exponential:

Probability-Integral Property. Λ(T)=(T/θ)νExp(1). \Lambda(T) = (T/\theta)^\nu \;\sim\; \mathrm{Exp}(1). \tag{1} This follows because Pr(Λ(T)t)=Pr(TΛ1(t))=F(Λ1(t))=1et\Pr(\Lambda(T) \leq t) = \Pr(T \leq \Lambda^{-1}(t)) = F(\Lambda^{-1}(t)) = 1 - e^{-t}.

The Dual AFT / Cox Representation

Writing Q=Λ(T)Exp(1)Q = \Lambda(T) \sim \mathrm{Exp}(1) gives logQ=ν(logTlogθ)\log Q = \nu(\log T - \log\theta), which rearranges to the AFT form:

log(T)=log(θ)+τε, \log(T) = \log(\theta) + \tau\,\varepsilon, \tag{2}

where τ=1/ν\tau = 1/\nu is the AFT scale parameter and ε=logQ\varepsilon = \log Q has the standard extreme-value distribution with density fε(x)=exp(xex)f_\varepsilon(x) = \exp(x - e^x).

Incorporating a covariate vector LL, the Cox (hazard) parameterization is: λ(t;L)=λ0(t)exp(Lβ), \lambda(t;\,L) = \lambda_0(t)\,\exp(L'\beta), and the corresponding AFT parameterization is: log(T)=log(θ)+Lγ+τε, \log(T) = \log(\theta) + L'\gamma + \tau\,\varepsilon, \tag{3} where the two coefficient vectors are linked by the AFT-to-hazard transformation:

AFT / Hazard-Scale Transformation. β=γ/τ. \beta = -\,\gamma\,/\,\tau. \tag{4} R’s survreg estimates (logθ̂,γ̂,τ̂)(\log\hat\theta,\;\hat\gamma,\;\hat\tau) on the AFT scale; hazard-ratio coefficients β̂\hat\beta are obtained via (4).

This transformation is the linchpin of the framework: potential outcomes are simulated on the AFT scale (3), while treatment effects are interpreted on the hazard-ratio scale via β\beta.

Biomarker-Dependent Treatment Effects

The Two-Phase Spline Model

To induce treatment effects that vary with a continuous biomarker ZZ, the Cox linear predictor is specified as a piecewise-linear spline with a single interior knot at Z=kZ = k:

Lβ:=β1X+β2Z+β3ZX+β4(Zk)𝟏(Z>k)+β5(Zk)𝟏(Z>k)X, L'\beta \;:=\; \beta_1\,X + \beta_2\,Z + \beta_3\,Z\!X + \beta_4\,(Z - k)\,\mathbf{1}(Z > k) + \beta_5\,(Z - k)\,\mathbf{1}(Z > k)\,X, \tag{5}

where X{0,1}X \in \{0,1\} denotes the treatment indicator. The five terms have the following roles:

Parameter Term Role
β1\beta_1 XX Main treatment effect (intercept of log-HR)
β2\beta_2 ZZ Prognostic biomarker effect (both arms)
β3\beta_3 ZXZX Biomarker ×\times treatment interaction (slope of log-HR for ZkZ \leq k)
β4\beta_4 (Zk)𝟏(Z>k)(Z-k)\mathbf{1}(Z>k) Spline term: change in prognostic slope above kk
β5\beta_5 (Zk)𝟏(Z>k)X(Z-k)\mathbf{1}(Z>k)\,X Spline term: change in treatment-effect slope above kk

The Causal Log-Hazard-Ratio Function

Under the potential outcomes framework, let λx,z(t)\lambda_{x,z}(t) denote the hazard function for a subject with biomarker level Z=zZ = z had they followed treatment X=xX = x. The causal log-hazard-ratio at biomarker level zz is:

ψ0(z):=log(λ1,z(t)/λ0,z(t))=β10+β30z+β50(zk)𝟏(z>k). \psi^0(z) \;:=\; \log\!\bigl(\lambda_{1,z}(t)\,/\,\lambda_{0,z}(t)\bigr) \;=\; \beta_1^0 + \beta_3^0\,z + \beta_5^0\,(z - k)\,\mathbf{1}(z > k). \tag{6}

Because the baseline hazard λ0(t)\lambda_0(t) cancels in the ratio, ψ0(z)\psi^0(z) is free of tt — a direct consequence of the proportional hazards structure conditional on ZZ. Expanding (6) piecewise:

ψ0(z)={β10+β30z,zk,β10+β30z+β50(zk),z>k. \psi^0(z) = \begin{cases} \beta_1^0 + \beta_3^0\,z, & z \leq k, \\[6pt] \beta_1^0 + \beta_3^0\,z + \beta_5^0\,(z - k), & z > k. \end{cases}

Anchor-Point Parameterization

Rather than specifying (β10,β30,β50)(\beta_1^0, \beta_3^0, \beta_5^0) directly, the framework specifies the log-HR at three interpretable anchor pointsψ0(0)\psi^0(0), ψ0(k)\psi^0(k), and ψ0(ζ)\psi^0(\zeta) for some ζ>k\zeta > k — and solves:

β10=ψ0(0),β30=ψ0(k)ψ0(0)k,β50=ψ0(ζ)ψ0(0)β30ζζk. \beta_1^0 = \psi^0(0), \qquad \beta_3^0 = \frac{\psi^0(k) - \psi^0(0)}{k}, \qquad \beta_5^0 = \frac{\psi^0(\zeta) - \psi^0(0) - \beta_3^0\,\zeta}{\zeta - k}. \tag{7}

Verification. These formulas are verified by substitution. For instance, evaluating ψ0(ζ)\psi^0(\zeta) from (6) gives ψ0(ζ)=β10+β30ζ+β50(ζk)\psi^0(\zeta) = \beta_1^0 + \beta_3^0\,\zeta + \beta_5^0(\zeta - k); solving for β50\beta_5^0 recovers the third expression in (7) exactly. The same check holds at z=0z = 0 and z=kz = k, confirming internal consistency.

Connection to Potential-Outcome Means

There is an important link between the log-HR ψ0(z)\psi^0(z) and the conditional means of the potential log-survival times. Define m(x,z):=E[logTTreat=x,Z=z]m(x,z) := E[\log T \mid \text{Treat} = x,\, Z = z] under the AFT model (3). Then:

ψ0(z)=m(0,z)m(1,z)τ. \psi^0(z) = \frac{m(0,z) - m(1,z)}{\tau}.

Derivation. On the AFT scale the treatment-related terms in log(T)\log(T) contribute Lγ=τ(Lβ)L'\gamma = -\tau(L'\beta) to m(x,z)m(x,z). Hence the difference in conditional means between control (x=0x=0) and treatment (x=1x=1) at biomarker level zz is:

m(1,z)m(0,z)=γ1+γ3z+γ5(zk)𝟏(z>k)=τψ0(z), m(1,z) - m(0,z) = \gamma_1 + \gamma_3\,z + \gamma_5\,(z - k)\,\mathbf{1}(z > k) = -\tau\,\psi^0(z),

which gives ψ0(z)={m(0,z)m(1,z)}/τ\psi^0(z) = \{m(0,z) - m(1,z)\}/\tau as required. This confirms that ψ0(z)\psi^0(z) measures the causal difference in expected log-survival between the two treatment arms, scaled by τ\tau.

Sign Convention. Throughout this vignette, ψ0(z)<0\psi^0(z) < 0 means treatment is beneficial (HR <1< 1), while ψ0(z)>0\psi^0(z) > 0 means treatment is detrimental (HR >1> 1). This follows the standard Cox model convention where negative log-HR favors the experimental arm.

Extension to Prognostic Factors

When an additional baseline prognostic factor WW is included, the linear predictor becomes:

lx,z,w=β10x+β20z+β30zx+β40(zk)𝟏(z>k)+β50(zk)𝟏(z>k)x+βw0w. l_{x,z,w} = \beta_1^0\,x + \beta_2^0\,z + \beta_3^0\,zx + \beta_4^0\,(z-k)\,\mathbf{1}(z>k) + \beta_5^0\,(z-k)\,\mathbf{1}(z>k)\,x + \beta_w^0\,w.

The prognostic term βw0w\beta_w^0\,w does not interact with treatment, so the causal log-HR remains: ψ0(z,w)=m(0,z,w)m(1,z,w)τ=ψ0(z), \psi^0(z,w) = \frac{m(0,z,w) - m(1,z,w)}{\tau} = \psi^0(z), i.e., it depends on zz but not on ww. When prognostic factors do interact with treatment, ψ0\psi^0 would depend on (z,w)(z,w) jointly, and the AHR/CDE definitions below would average over both.

Working Example

A representative configuration for an oncology biomarker setting:

Anchor zz ψ0(z)\psi^0(z) HR Interpretation
Low 00 log(3)1.10\log(3) \approx 1.10 3.003.00 Strongly detrimental
Knot 55 log(1.25)0.22\log(1.25) \approx 0.22 1.251.25 Modestly detrimental
High 1010 log(0.5)0.69\log(0.5) \approx -0.69 0.500.50 Strongly beneficial

This profile produces a biomarker-modulated treatment effect that transitions from harm at low biomarker levels to substantial benefit at high levels, crossing the null (HR =1= 1) between z=5z = 5 and z=10z = 10. The overall average hazard ratio across the full biomarker distribution is approximately 0.740.74, indicating net benefit in the population.

Treatment Effect Estimands

Three estimands are used to summarize treatment effects across biomarker-defined sub-populations. Each captures a different aspect of treatment effect heterogeneity.

Average Hazard Ratio (AHR)

The biomarker AHR is the exponentiated mean of ψ0(Z)\psi^0(Z) across a biomarker-threshold sub-population:

AHR(z+):=exp{EZzψ0(Z)},AHR(z):=exp{EZzψ0(Z)}. \text{AHR}(z^+) := \exp\!\bigl\{E_{Z \geq z}\;\psi^0(Z)\bigr\}, \qquad \text{AHR}(z^-) := \exp\!\bigl\{E_{Z \leq z}\;\psi^0(Z)\bigr\}. \tag{8}

The AHR is the geometric mean of the individual causal hazard ratios exp(ψ0(Zi))\exp(\psi^0(Z_i)) across the sub-population. In ForestSearch’s super-population data, this is computed from the loghr_po column as AHR(𝒮)=exp(loghr_po¯𝒮)\text{AHR}(\mathcal{S}) = \exp\!\bigl(\overline{\text{loghr\_po}}_{\mathcal{S}}\bigr).

Key properties.

  • Deterministic: depends only on the model coefficients and covariate distribution, not on the random error εi\varepsilon_i.
  • Reproducible: identical across simulation replications for a fixed super-population.
  • Dual causal status: as a geometric-mean ratio of potential-outcome hazards, the AHR achieves both individual-level and population-level causal interpretability under the Fay & Li (2024) taxonomy. This is because log\log linearizes the ratio, so E[log(HRi)]=E[logλ1,Z]E[logλ0,Z]E[\log(\text{HR}_i)] = E[\log\lambda_{1,Z}] - E[\log\lambda_{0,Z}], and the order of comparison and summarization is irrelevant.

Controlled Direct Effect (CDE)

The CDE is the ratio of average exponentiated log-hazards under treatment versus control (Aalen, Cook & Roysland, 2015):

CDE(z+):=θ1(z+)θ0(z+),CDE(z):=θ1(z)θ0(z), \text{CDE}(z^+) := \frac{\bar\theta^1(z+)}{\bar\theta^0(z+)}, \qquad \text{CDE}(z^-) := \frac{\bar\theta^1(z-)}{\bar\theta^0(z-)}, \tag{9}

where θx(z+)=EZzexp(lx,Z)\bar\theta^x(z+) = E_{Z \geq z}\,\exp(l_{x,Z}) and lx,zl_{x,z} is the Cox linear predictor at treatment xx and biomarker zz. In ForestSearch, this is computed from the theta_1 and theta_0 columns: CDE(𝒮)=exp(θi(1))¯𝒮/exp(θi(0))¯𝒮\text{CDE}(\mathcal{S}) = \overline{\exp(\theta_i(1))}_\mathcal{S} \;/\; \overline{\exp(\theta_i(0))}_\mathcal{S}.

Key properties.

  • Deterministic (like the AHR): no dependence on εi\varepsilon_i.
  • Natural-scale averaging: by averaging on the hazard (exponential) scale rather than the log-hazard scale, the CDE gives more weight to subjects with larger absolute hazard contributions.
  • Differs from AHR under heterogeneity: Jensen’s inequality implies CDE(𝒮)AHR(𝒮)\text{CDE}(\mathcal{S}) \neq \text{AHR}(\mathcal{S}) in general, with the discrepancy growing as within-subgroup treatment-effect variability increases.

Marginal (Causal) Hazard Ratio

The marginal HR for a subgroup 𝒮\mathcal{S} is obtained by fitting a Cox model to stacked potential outcomes — a dataset where each subject contributes two rows (one under treatment, one under control), both with event indicator 11:

HRmarg(𝒮)=exp(β̂Cox), \text{HR}_{\text{marg}}(\mathcal{S}) = \exp(\hat\beta_\text{Cox}),

where β̂Cox\hat\beta_\text{Cox} is the coefficient from coxph(Surv(time, event) ~ treat) fit to the stacked data within 𝒮\mathcal{S}.

Key properties.

  • Stochastic: depends on the realized error terms εi\varepsilon_i, introducing Monte Carlo variability across simulation replications.
  • Population-level: the Cox partial likelihood implicitly weights subjects by their risk-set contributions, targeting a population-averaged effect.
  • Validation role: serves as the DGM-level ground truth against which per-replicate estimator bias is assessed.

When the Three Estimands Agree — and When They Diverge

Under a constant treatment effect (no heterogeneity), the individual log-HR is identical for all subjects, and all three measures coincide: HRmarg=AHR=CDE=exp(βtreat). \text{HR}_\text{marg} = \text{AHR} = \text{CDE} = \exp(\beta_\text{treat}).

Under heterogeneous treatment effects, three sources of divergence arise:

  1. Jensen’s inequality separates AHR from CDE: exp(logHRi¯)exp(θi(1))¯/exp(θi(0))¯\exp\!\bigl(\overline{\log\text{HR}_i}\bigr) \neq \overline{\exp(\theta_i(1))}\;/\;\overline{\exp(\theta_i(0))} unless all logHRi\log\text{HR}_i are identical.

  2. Cox partial-likelihood weighting separates the marginal HR from the AHR, because risk-set membership creates implicit non-uniform weights.

  3. Monte Carlo variability affects the marginal HR (which depends on εi\varepsilon_i) but not the AHR or CDE.

The AHR is the recommended primary estimand for simulation studies because it is deterministic, directly interpretable as a geometric-mean HR, and achieves dual causal status. The marginal HR serves as a validation target, and the CDE provides a complementary natural-scale perspective.

Causal Foundations

Why the AFT Model Serves as the Causal DGM

Both Aalen, Cook & Roysland (2015) and Fay & Li (2024) identify the AFT model as a natural causal foundation for survival analysis. The Weibull AFT scale-change parameter exp(βtreat)\exp(-\beta_\text{treat}) satisfies: exp(βtreat)=γ1/γ0, \exp(-\beta_\text{treat}) = \gamma_1 / \gamma_0, where γx\gamma_x is the geometric mean of the potential survival time Ti(x)T_i(x). Crucially, this geometric-mean ratio is simultaneously:

  • Individual-level causal: it equals exp(E[logTi(1)logTi(0)])\exp\!\bigl(E[\log T_i(1) - \log T_i(0)]\bigr), with comparison formed within individuals before averaging.
  • Population-level causal: it is identifiable from marginal distributions in a randomized trial.

This dual status — unique among common survival ratio estimands — is why the ForestSearch DGM is built on the Weibull AFT. The biomarker log-HR ψ0(z)={m(0,z)m(1,z)}/τ\psi^0(z) = \{m(0,z) - m(1,z)\}/\tau inherits this property at each biomarker level.

The AHR as a Causally Valid Functional

The ForestSearch AHR is defined from individual potential-outcome log-HR differences: AHR(𝒮)=exp(1|𝒮|i𝒮[θi(1)θi(0)]), \text{AHR}(\mathcal{S}) = \exp\!\left(\frac{1}{|\mathcal{S}|} \sum_{i \in \mathcal{S}} [\theta_i(1) - \theta_i(0)]\right), where θi(x)=𝐗i(x)𝛃0\theta_i(x) = \mathbf{X}_i(x)'\boldsymbol{\beta}_0. Because this is a geometric mean of individual causal log-hazard differences, it falls within the class of estimands that achieves both individual-level and population-level causal status under the Fay & Li (2024) taxonomy.

This property is not shared by the instantaneous Cox HR eβ̂e^{\hat\beta}, which suffers from conditioning-set distortion at each event time (the “collider” argument of Hernan 2010; Aalen et al. 2015; Martinussen 2022): among survivors at time t>0t > 0, the treatment and frailty are no longer independent even in a randomized trial.

Proportional Hazards and Marginal Non-PH

The Weibull DGM generates data satisfying proportional hazards conditional on the biomarker. Under conditional PH, the Cox HR is a valid population-level causal estimand (Fay & Li 2024; Beyersmann, Schmoor & Schumacher 2025). However, marginally — averaging over the biomarker distribution — PH need not hold when treatment effects are heterogeneous. This is precisely why the framework reports AHR and CDE (marginal summaries) rather than a single Cox HR as the primary characterization of biomarker-modulated treatment effects.

The Abrahamowicz et al. (2025) simulations further support this design: frailty-induced bias in Cox HR estimates is substantial only under very strong unmeasured susceptibility. Since the ForestSearch DGM has no unmodeled frailty, the Cox HR within simulations targets the correct conditional parameter, while the AHR provides the appropriate marginal summary for real-data applications where unmeasured heterogeneity may exist.

Cox HR as an Operational Tool for Subgroup Selection

In the ForestSearch workflow, per-replicate Cox HR estimates and thresholds (e.g., β̂log(0.90)\hat\beta \geq \log(0.90)) serve as operational selection criteria for identifying candidate subgroups. This is distinct from their role as causal estimands. The recent literature (Edelmann 2025; Martinussen 2022; Fay & Li 2024) notes that the instantaneous Cox HR at time tt is a causal effect for the baseline population but not necessarily for the population at risk at tt.

Within the ForestSearch framework, this distinction is addressed as follows:

  • The true treatment effects are defined through ψ0(z)\psi^0(z), a baseline causal quantity.
  • Subgroup selection uses the Cox HR as an efficient screening tool.
  • Subgroup evaluation uses the AHR and CDE, which avoid the conditioning-set problem.
  • Bootstrap bias correction (hr.H.bc) and the decomposition of error into subgroup misidentification versus finite-sample noise provide further safeguards.

Computation Pipeline

The complete pipeline from model specification to estimand computation is summarized below.

Step Operation Formula
1 Fit Weibull AFT logTi=μ+𝐗i𝛄+σεi\log T_i = \mu + \mathbf{X}_i'\boldsymbol{\gamma} + \sigma\varepsilon_i
2 Hazard-scale transform 𝛃0=𝛄/σ\boldsymbol{\beta}_0 = -\boldsymbol{\gamma}/\sigma
3 Individual log-hazards θi(x)=𝐗i(x)𝛃0\theta_i(x) = \mathbf{X}_i(x)'\boldsymbol{\beta}_0
4 Individual causal log-HR loghr_poi=θi(1)θi(0){\text{loghr\_po}}_i = \theta_i(1) - \theta_i(0)
5a AHR (subgroup 𝒮\mathcal{S}) AHR(𝒮)=exp(loghr_po¯𝒮)\text{AHR}(\mathcal{S}) = \exp\bigl(\overline{\text{loghr\_po}}_{\mathcal{S}}\bigr)
5b CDE (subgroup 𝒮\mathcal{S}) CDE(𝒮)=exp(θ1)¯𝒮/exp(θ0)¯𝒮\text{CDE}(\mathcal{S}) = \overline{\exp(\theta_1)}_{\mathcal{S}}\;/\;\overline{\exp(\theta_0)}_{\mathcal{S}}
5c Marginal HR (subgroup 𝒮\mathcal{S}) exp(β̂Cox)\exp(\hat\beta_{\text{Cox}}) from stacked potential outcomes

In ForestSearch, Steps 1–4 are performed by generate_aft_dgm_flex(), which stores 𝛃0\boldsymbol{\beta}_0 in dgm$model_params$b0 and the individual-level quantities (loghr_po, lin_pred_0, lin_pred_1) in dgm$df_super. Steps 5a–5b are computed by cox_ahr_cde_analysis(); Step 5c is computed within calculate_hazard_ratios().

Consistency Across the ForestSearch Vignettes. The mathematical pipeline above — AFT model \rightarrow hazard-scale transformation (4) \rightarrow individual log-HR \rightarrow AHR/CDE aggregation — is consistent across the MRCT analysis document, the treatment_effect_definitions vignette, and the causal_effects_brief_review vignette. All three use the same sign conventions (ψ0(z)<0\psi^0(z) < 0 \Leftrightarrow beneficial), the same transformation β=γ/τ\beta = -\gamma/\tau, and the same averaging operations for AHR and CDE.

The correspondences are:

This vignette treatment_effect_definitions causal_effects_brief_review
ψ0(z)\psi^0(z) loghr_po =θi(1)θi(0)= \theta_i(1) - \theta_i(0) Individual-level AFT causal quantity
AHR(𝒮)\text{AHR}(\mathcal{S}) exp(loghr_po¯𝒮)\exp(\overline{\text{loghr\_po}}_{\mathcal{S}}) Geometric-mean HR with dual causal status
CDE(𝒮)\text{CDE}(\mathcal{S}) exp(θ1)¯𝒮/exp(θ0)¯𝒮\overline{\exp(\theta_1)}_{\mathcal{S}}/\overline{\exp(\theta_0)}_{\mathcal{S}} Natural-scale complement to AHR
β=γ/τ\beta = -\gamma/\tau 𝛃0=𝛄/σ\boldsymbol{\beta}_0 = -\boldsymbol{\gamma}/\sigma AFT-to-hazard transformation

References

  • Aalen, O.O., Cook, R.J. & Roysland, K. (2015). Does Cox analysis of a randomized survival study yield a causal treatment effect? Lifetime Data Analysis, 21(4), 579–593.

  • Abrahamowicz, M., Beauchamp, M.-E., Roberts, E.K. & Taylor, J.M.G. (2025). Revisiting the hazards of hazard ratios through simulations and case studies. European Journal of Epidemiology, 40, 611–629.

  • Beyersmann, J., Schmoor, C. & Schumacher, M. (2025). Hazards constitute key quantities for analyzing, interpreting and understanding time-to-event data. Biometrical Journal, 67, e70057.

  • Edelmann, D. (2025). Revisiting hazard ratios: Can we define causal estimands for time-dependent treatment effects? Biometrical Journal, 67, e70100.

  • Fay, M.P. & Li, F. (2024). Causal interpretation of the hazard ratio in randomized clinical trials. Clinical Trials, 21(5), 623–635.

  • Hernan, M.A. (2010). The hazards of hazard ratios. Epidemiology, 21(1), 13–15.

  • Leon, L.F., Jemielita, T., Guo, Z., Marceau West, R. & Anderson, K.M. (2024). Exploratory subgroup identification in the heterogeneous Cox model. Statistics in Medicine. doi:10.1002/sim.10163.

  • Martinussen, T. (2022). Causality and the Cox regression model. Annual Review of Statistics and Its Application, 9, 249–259.

  • Prentice, R.L. & Aragaki, A.K. (2022). Intention-to-treat comparisons in randomized trials. Statistical Science, 37(3), 380–393.