Skip to contents

Overview

This vignette evaluates the operating characteristics of weighted log-rank (WLR) tests and their corresponding Wald (weighted Cox model) counterparts under various non-proportional hazards (NPH) scenarios. We compare test statistics computed by weightedsurv with those from simtrial to validate alignment, and examine the correspondence between log-rank and Wald-based inference across weighting schemes including Fleming–Harrington FH(ρ, γ), Magirr–Burman, and a zero-one step-function weight.

The simulation framework uses simtrial for data generation under piecewise-exponential models and weightedsurv for weighted Cox regression, producing z-statistics, hazard ratio estimates, standard errors, and confidence intervals. We focus on a zero-one weighting scheme where weights are zero for the first 6 months and one thereafter — a design motivated by settings where treatment is “not active” by design during an initial period, as in vaccine trials.

Six scenarios are evaluated: proportional hazards (PH), 3-month delayed effect, 6-month delayed effect, crossing hazards, weak null, and strong null.

Simulation Helper Functions

The following helper functions provide the simulation infrastructure: general utilities (sim_utils), scenario definition (sim_scenarios), and the parallel execution driver (sim_driver).

Note: These functions will be included in a future CRAN release of weightedsurv (in R/sim_utils.R, R/sim_scenarios.R, and R/sim_driver.R). Once available via library(weightedsurv), the three chunks below can be removed and the remainder of this vignette will work unchanged.

Simulation utilities

#' Check that required R packages are installed
#'
#' @param pkgs Character vector of package names to check.
#' @return Invisible TRUE if all packages are available; otherwise an
#'   error listing the missing packages.

check_required_packages <- function(pkgs) {
  # Probe each package without attaching it; quietly suppresses the
  # "there is no package called ..." messages.
  available <- vapply(pkgs, requireNamespace, logical(1), quietly = TRUE)
  not_installed <- pkgs[!available]
  if (length(not_installed) > 0L) {
    stop("The following required packages are missing: ",
         paste(not_installed, collapse = ", "),
         ". Please install them before running this function.",
         call. = FALSE)
  }
  invisible(TRUE)
}


#' Set up parallel processing
#'
#' Configures a future parallel backend with defensive capping of
#' workers to available cores. Falls back gracefully on Windows
#' when "multicore" is requested.
#'
#' @param approach Character: "multisession", "multicore", or "callr".
#' @param workers Integer; number of parallel workers. Capped at
#'   parallelly::availableCores(omit = 1).
#' @return Invisibly returns the actual number of workers used.

setup_parallel <- function(approach = c("multisession", "multicore", "callr"),
                           workers = 4) {
  approach <- match.arg(approach)

  # Namespace-qualified future:: calls below avoid attaching the package
  # from inside a function (library() here was a hidden side effect).
  # parallelly is checked too since availableCores() comes from it.
  check_required_packages(c("future", "parallelly"))

  # Leave one core free for the main R session / OS.
  max_workers <- parallelly::availableCores(omit = 1L)
  if (workers > max_workers) {
    message("Requested ", workers, " workers but only ", max_workers,
            " available. Capping at ", max_workers, ".")
    workers <- max_workers
  }

  if (approach == "multisession") {
    future::plan(future::multisession, workers = workers)
  } else if (approach == "multicore") {
    if (.Platform$OS.type == "windows") {
      # Forked processing is unsupported on Windows.
      message("multicore not available on Windows; falling back to multisession.")
      future::plan(future::multisession, workers = workers)
    } else {
      future::plan(future::multicore, workers = workers)
    }
  } else if (approach == "callr") {
    check_required_packages("future.callr")
    future::plan(future.callr::callr, workers = workers)
  }

  message("Parallel plan: ", approach, " with ", workers, " workers ",
          "(", parallelly::availableCores(), " cores detected).")
  invisible(workers)
}


#' Check whether a target value lies inside a confidence interval
#'
#' @param target Numeric value to check.
#' @param ci List with elements lower and upper.
#' @return Integer: 1 if covered, 0 otherwise. Vectorized over target
#'   and the interval bounds.

is_covered <- function(target, ci) {
  inside <- ci$lower <= target & ci$upper >= target
  ifelse(inside, 1L, 0L)
}


#' Extract results from a weighted Cox model fit
#'
#' Pulls the z-statistics (raw and de-biased), log-HR estimates,
#' standard errors, and Wald upper confidence limits out of a
#' cox_rhogamma() result object, and — when a true hazard ratio is
#' supplied — appends asymptotic and resampling-based CI coverage
#' indicators.
#'
#' @param fit_result Object returned by cox_rhogamma() with draws > 0.
#' @param prefix Character string prepended to each output column name.
#' @param hr_true Numeric; true hazard ratio for coverage evaluation.
#'   If NULL, coverage columns are omitted.
#' @param z_name Optional name for the z-score entry; defaults to
#'   paste0(prefix, "z").
#' @return A named list of scalar results.

extract_cox_results <- function(fit_result, prefix, hr_true = NULL,
                                z_name = NULL) {
  if (is.null(z_name)) z_name <- paste0(prefix, "z")

  out <- list()
  out[[z_name]]                          <- fit_result$z.score
  out[[paste0(prefix, "z_debiased")]]    <- fit_result$z.score_debiased
  out[[paste0(prefix, "_bhat")]]         <- fit_result$fit$bhat
  out[[paste0(prefix, "_bhatdebiased")]] <- fit_result$hr_ci_star$beta
  out[[paste0(prefix, "_wald")]]         <- fit_result$hr_ci_asy$upper
  out[[paste0(prefix, "_waldstar")]]     <- fit_result$hr_ci_star$upper
  out[[paste0(prefix, "_sigbhat")]]      <- fit_result$fit$sig_bhat_asy
  out[[paste0(prefix, "_sigbhatstar")]]  <- fit_result$fit$sig_bhat_star

  if (!is.null(hr_true)) {
    # Coverage indicator: 1 when hr_true lies inside [lower, upper].
    asy  <- fit_result$hr_ci_asy
    star <- fit_result$hr_ci_star
    out[[paste0(prefix, "_cover")]] <-
      ifelse(asy$lower <= hr_true & asy$upper >= hr_true, 1L, 0L)
    out[[paste0(prefix, "_coverstar")]] <-
      ifelse(star$lower <= hr_true & star$upper >= hr_true, 1L, 0L)
  }

  out
}

#' Default NPH scenario table
#'
#' Returns six standard non-proportional hazards scenarios:
#' PH, 3-month delayed, 6-month delayed, crossing, weak null, strong null,
#' plus a Scenario-0 control reference.
#'
#' @param surv_24 Experimental arm survival probability at 24 months
#'   under PH (default: 0.35).
#' @param control_median Control arm median survival in months (default: 12).
#' @return A tibble with columns Scenario, Name, Period, duration, Survival.

default_nph_scenarios <- function(surv_24 = 0.35, control_median = 12) {

  # Hazard ratio implied by the PH scenario: S_e(24) = S_c(24)^hr with
  # S_c(24) = 0.25 and S_e(24) = surv_24.
  hr <- log(surv_24) / log(0.25)

  # Control-arm hazard over months 0-24 (exponential with the given
  # median). The post-24-month control rate is not needed here; it is
  # derived in build_sim_scenarios(), so the unused second element of
  # the old control_rate vector has been dropped.
  control_rate_1 <- log(2) / control_median

  # Period 0 rows anchor each survival curve at S(0) = 1; delayed-effect
  # arms track the control hazard for the delay, then match the PH curve.
  tibble::tribble(
    ~Scenario, ~Name,           ~Period, ~duration, ~Survival,
    0,         "Control",       0,       0,         1,
    0,         "Control",       1,       24,        .25,
    0,         "Control",       2,       12,        .2,
    1,         "PH",            0,       0,         1,
    1,         "PH",            1,       24,        surv_24,
    1,         "PH",            2,       12,        .2^hr,
    2,         "3-month delay", 0,       0,         1,
    2,         "3-month delay", 1,       3,         exp(-3 * control_rate_1),
    2,         "3-month delay", 2,       21,        surv_24,
    2,         "3-month delay", 3,       12,        .2^hr,
    3,         "6-month delay", 0,       0,         1,
    3,         "6-month delay", 1,       6,         exp(-6 * control_rate_1),
    3,         "6-month delay", 2,       18,        surv_24,
    3,         "6-month delay", 3,       12,        .2^hr,
    4,         "Crossing",      0,       0,         1,
    4,         "Crossing",      1,       3,         exp(-3 * control_rate_1 * 1.3),
    4,         "Crossing",      2,       21,        surv_24,
    4,         "Crossing",      3,       12,        .2^hr,
    5,         "Weak null",     0,       0,         1,
    5,         "Weak null",     1,       24,        .25,
    5,         "Weak null",     2,       12,        .2,
    6,         "Strong null",   0,       0,         1,
    6,         "Strong null",   1,       3,         exp(-3 * control_rate_1 * 1.5),
    6,         "Strong null",   2,       3,         exp(-6 * control_rate_1),
    6,         "Strong null",   3,       18,        .25,
    6,         "Strong null",   4,       12,        .2
  )
}


#' Build simulation setup from a scenario table
#'
#' Computes fail rates, hazard ratios, enrollment rates (via gsDesign2),
#' and dropout rates from any piecewise-exponential scenario table.
#'
#' @param scenarios Tibble with columns Scenario, Name, Period, duration, Survival.
#' @param control_median Control arm median survival in months (default: 12).
#' @param dropout_rate_value Constant dropout hazard per month (default: 0.001).
#' @param power_scenario Scenario index for sample size calculation (default: 2).
#' @param alpha One-sided type-I error (default: 0.025).
#' @param power Target power (default: 0.85).
#' @param study_duration Study duration in months (default: 36).
#' @param mb_tau Magirr-Burman delay parameter (default: 12).
#' @param enroll_duration Enrollment duration in months (default: 12).
#' @return List with scenarios, fr (fail-rate table), er (enrollment
#'   rates), dropout_rate, n_sample, study_duration.

build_sim_scenarios <- function(scenarios,
                                control_median = 12,
                                dropout_rate_value = 0.001,
                                power_scenario = 2,
                                alpha = 0.025,
                                power = 0.85,
                                study_duration = 36,
                                mb_tau = 12,
                                enroll_duration = 12) {

  check_required_packages(c("dplyr", "gsDesign2", "simtrial"))

  # Control hazard: element 1 applies through month 24 (exponential with
  # the given median), element 2 afterwards (chosen so control survival
  # drops from 25% at 24 mo to 20% at 36 mo).
  control_rate <- c(log(2) / control_median, (log(0.25) - log(0.2)) / 12)

  # Per scenario: convert cumulative Survival values to interval hazards
  # (x_rate) via successive log-survival differences, then express each
  # as an HR against the control hazard for that interval.
  fr <- scenarios |>
    dplyr::group_by(Scenario) |>
    dplyr::mutate(
      # Month = interval end time (cumulative duration within scenario).
      Month  = cumsum(duration),
      # Piecewise-exponential hazard on this interval. Period-0 anchor
      # rows (duration 0) yield NaN here but are filtered out below.
      x_rate = -(log(Survival) - log(dplyr::lag(Survival, default = 1))) / duration,
      # Hard-coded month-24 changepoint matches the control_rate design.
      rate   = ifelse(Month > 24, control_rate[2], control_rate[1]),
      hr     = x_rate / rate
    ) |>
    dplyr::select(-x_rate) |>
    # Drop anchor rows and the control-only Scenario 0.
    dplyr::filter(Period > 0, Scenario > 0) |>
    dplyr::ungroup()

  # Columns required by downstream gsDesign2 / simtrial calls.
  fr <- fr |> dplyr::mutate(
    fail_rate    = rate,
    dropout_rate = dropout_rate_value,
    stratum      = "All"
  )

  # Magirr-Burman fixed design sized under the power_scenario fail rates;
  # to_integer() rounds the design to whole-subject enrollment.
  mwlr <- gsDesign2::fixed_design_mb(
    tau         = mb_tau,
    enroll_rate = gsDesign2::define_enroll_rate(duration = enroll_duration, rate = 1),
    fail_rate   = fr |> dplyr::filter(Scenario == power_scenario),
    alpha       = alpha,
    power       = power,
    ratio       = 1,
    study_duration = study_duration
  ) |> gsDesign2::to_integer()

  er <- mwlr$enroll_rate

  # Constant dropout hazard for both arms; duration 100 covers the
  # whole study horizon.
  dropout_rate <- data.frame(
    stratum   = rep("All", 2),
    period    = rep(1, 2),
    treatment = c("control", "experimental"),
    duration  = rep(100, 2),
    rate      = rep(dropout_rate_value, 2)
  )

  list(
    scenarios      = scenarios,
    fr             = fr,
    er             = er,
    dropout_rate   = dropout_rate,
    # Total sample size implied by the integer enrollment rates.
    n_sample       = sum(er$rate * er$duration),
    study_duration = study_duration
  )
}

#' Run weighted Cox model simulations in parallel
#'
#' Executes a user-defined per-trial analysis function across multiple
#' scenarios and replications using chunked parallelism via foreach/doFuture.
#'
#' @param sim_setup List returned by build_sim_scenarios().
#' @param analysis_fn Per-trial analysis function (see vignette for template).
#' @param n_sim Integer; number of replicates per scenario.
#' @param scenarios Integer vector of scenario indices (default: all).
#' @param hr_true Numeric; true HR for coverage (default: from PH scenario).
#' @param dof_approach Character; parallel backend (default: "callr").
#' @param num_workers Integer; number of parallel workers (default: 4).
#' @param seedstart Integer; base random seed, forwarded to analysis_fn.
#' @param mart_draws Integer; martingale resampling draws (default: 100).
#' @param packages Character vector of packages loaded on workers.
#' @param file_togo File path for saving results.
#' @param save_results Logical; save to file? (default: FALSE).
#' @param verbose Logical; print progress? (default: TRUE).
#' @return List with get_setup, results_sims, tminutes, thours,
#'   number_sims, hr_target, seedstart.

run_weighted_cox_sims <- function(sim_setup,
                                  analysis_fn,
                                  n_sim,
                                  scenarios = NULL,
                                  hr_true = NULL,
                                  dof_approach = "callr",
                                  num_workers = 4,
                                  seedstart = 8316951,
                                  mart_draws = 100,
                                  packages = c("simtrial", "weightedsurv",
                                               "data.table"),
                                  file_togo = "results/sims_output.RData",
                                  save_results = FALSE,
                                  verbose = TRUE) {

  # future.callr only needed when that backend is requested.
  required_pkgs <- c("foreach", "future", "tictoc", "doFuture", "data.table")
  if (dof_approach == "callr") required_pkgs <- c(required_pkgs, "future.callr")
  check_required_packages(required_pkgs)

  if (!is.function(analysis_fn))
    stop("analysis_fn must be a function.", call. = FALSE)

  # Fail fast before the (possibly long) run if results can't be saved.
  if (save_results) {
    dir_name <- dirname(file_togo)
    if (!dir.exists(dir_name))
      stop("Directory does not exist: ", dir_name, call. = FALSE)
  }

  # Unpack the setup; plain data frames serialize cleanly to workers.
  fr           <- sim_setup$fr
  dropout_rate <- as.data.frame(sim_setup$dropout_rate)
  enroll_rate  <- as.data.frame(sim_setup$er)
  n_sample     <- sim_setup$n_sample

  if (is.null(scenarios))
    scenarios <- sort(unique(fr$Scenario))
  n_scenarios <- length(scenarios)

  # Default true HR for coverage: taken from the first PH-scenario row.
  if (is.null(hr_true)) {
    fr_PH <- fr[fr$Name == "PH", ]
    if (nrow(fr_PH) == 0)
      stop("No PH scenario found and hr_true not specified.", call. = FALSE)
    hr_true <- fr_PH[1, ]$hr
  }

  fr$stratum <- "All"
  # One fail-rate table per scenario, indexed by scenario id (as character).
  fr_list <- split(fr, fr$Scenario)

  # Every (scenario, replicate) pair is one task; tasks are dealt
  # round-robin into ~4 chunks per worker to balance load.
  task_grid <- expand.grid(scen = scenarios, sim = seq_len(n_sim))
  n_chunks  <- min(nrow(task_grid), num_workers * 4)
  task_grid$chunk <- ((seq_len(nrow(task_grid)) - 1) %% n_chunks) + 1

  setup_parallel(approach = dof_approach, workers = num_workers)

  # Attaches %dofuture% for the foreach loop below.
  library(doFuture)
  tictoc::tic(log = FALSE)

  # One foreach iteration per chunk; seed = TRUE gives workers
  # parallel-safe RNG streams, and chunk results are row-bound with
  # fill = TRUE so differing column sets don't error.
  results_list <- foreach::foreach(
    ch = seq_len(n_chunks),
    .combine  = function(...) data.table::rbindlist(list(...), fill = TRUE),
    .options.future = list(seed = TRUE, packages = packages)
  ) %dofuture% {

    tasks <- task_grid[task_grid$chunk == ch, , drop = FALSE]
    chunk_results <- vector("list", nrow(tasks))

    for (i in seq_len(nrow(tasks))) {
      s   <- tasks$scen[i]
      sim <- tasks$sim[i]

      # A failing replicate is logged as a warning and skipped rather
      # than aborting the whole chunk.
      chunk_results[[i]] <- tryCatch(
        analysis_fn(
          scen         = s,
          enroll_rate  = enroll_rate,
          dropout_rate = dropout_rate,
          fr           = fr_list[[as.character(s)]],
          n_sample     = n_sample,
          sim_num      = sim,
          mart_draws   = mart_draws,
          hr_true      = hr_true,
          seedstart    = seedstart
        ),
        error = function(e) {
          warning("Scenario ", s, " sim ", sim, ": ", conditionMessage(e))
          NULL
        }
      )
    }

    data.table::rbindlist(
      Filter(Negate(is.null), chunk_results),
      fill = TRUE
    )
  }

  # Tear down the parallel backend.
  future::plan(future::sequential)

  toc_result <- tictoc::toc(log = FALSE)
  elapsed_seconds <- as.numeric(toc_result$toc - toc_result$tic)

  if (verbose) {
    n_total  <- n_scenarios * n_sim
    # NOTE(review): this bookkeeping assumes analysis_fn returns exactly
    # one row per task — confirm against the analysis-function template.
    n_done   <- nrow(results_list)
    n_failed <- n_total - n_done
    cat("Scenarios:", n_scenarios,
        "| Sims per scenario:", n_sim,
        "| Total tasks:", n_total, "\n")
    cat("Completed:", n_done, "| Failed:", n_failed, "\n")
    cat("Elapsed:", round(elapsed_seconds / 60, 2), "minutes\n")
  }

  results_sims <- as.data.frame(results_list)

  res_out <- list(
    get_setup    = sim_setup,
    results_sims = results_sims,
    tminutes     = elapsed_seconds / 60,
    thours       = elapsed_seconds / 3600,
    number_sims  = n_sim,
    hr_target    = hr_true,
    seedstart    = seedstart
  )

  if (save_results) save(res_out, file = file_togo)

  return(res_out)
}

Scenario Setup

We define six NPH scenarios using piecewise-exponential survival models. The control arm has 25% survival at 24 months (median ≈ 12 months). The PH scenario sets 35% survival at 24 months for the experimental arm. Delayed-effect scenarios match the control hazard for the initial delay period, then switch to the experimental hazard. Under the crossing scenario, the experimental arm has a higher initial hazard that reverses after 3 months. Two null scenarios (weak and strong) provide type-I error evaluation.

Sample size is determined using a Magirr–Burman fixed design targeting 85% power under the 3-month delayed-effect scenario at one-sided α = 0.025.

Scenario Name Period duration Survival Month rate hr fail_rate dropout_rate stratum
1 PH 1 24 0.3500 24 0.0578 0.7573 0.0578 0.001 All
1 PH 2 12 0.2956 36 0.0186 0.7573 0.0186 0.001 All
2 3-month delay 1 3 0.8409 3 0.0578 1.0000 0.0578 0.001 All
2 3-month delay 2 21 0.3500 24 0.0578 0.7226 0.0578 0.001 All
2 3-month delay 3 12 0.2956 36 0.0186 0.7573 0.0186 0.001 All
3 6-month delay 1 6 0.7071 6 0.0578 1.0000 0.0578 0.001 All
3 6-month delay 2 18 0.3500 24 0.0578 0.6764 0.0578 0.001 All
3 6-month delay 3 12 0.2956 36 0.0186 0.7573 0.0186 0.001 All
4 Crossing 1 3 0.7983 3 0.0578 1.3000 0.0578 0.001 All
4 Crossing 2 21 0.3500 24 0.0578 0.6798 0.0578 0.001 All
4 Crossing 3 12 0.2956 36 0.0186 0.7573 0.0186 0.001 All
5 Weak null 1 24 0.2500 24 0.0578 1.0000 0.0578 0.001 All

# Attach the packages this chunk uses directly: gsDesign2 for design /
# sample-size machinery and simtrial for trial simulation.
library(gsDesign2)
library(simtrial)

# Build the scenario table (control: 25% survival at 24 months; PH
# experimental arm: 35% at 24 months) and derive fail rates, enrollment
# rates, dropout rates, and sample size from it.
scenarios <- default_nph_scenarios(surv_24 = 0.35, control_median = 12)
sim_setup <- build_sim_scenarios(scenarios)

# Inspect the fail-rate table
sim_setup$fr |> head(12) |> knitr::kable(digits = 4)

Users can substitute their own scenario tables by replacing default_nph_scenarios() with a custom tibble following the same column structure (Scenario, Name, Period, duration, Survival).

Study-Specific Analysis Function

The per-trial analysis function is the study-specific component that users define and customize. It generates one simulated trial via simtrial::sim_pw_surv, computes z-statistics from both simtrial (for validation) and weightedsurv (via cox_rhogamma), and extracts hazard ratio estimates, standard errors, and coverage indicators using extract_cox_results().

#' Per-trial analysis function (study-specific)
#'
#' Generates one simulated trial via simtrial::sim_pw_surv(), computes
#' weighted log-rank z-statistics with simtrial (for cross-validation)
#' and weighted Cox results with weightedsurv::cox_rhogamma(), and
#' returns a single-row data frame of results.
#'
#' @param scen Integer scenario index (recorded in the output row).
#' @param enroll_rate Enrollment-rate table for sim_pw_surv().
#' @param dropout_rate Dropout-rate table for sim_pw_surv().
#' @param fr Fail-rate table for this scenario (uses Period, duration,
#'   rate, hr).
#' @param n_sample Integer; subjects per simulated trial.
#' @param sim_num Integer replicate index; combined with seedstart so
#'   each replicate gets a reproducible, distinct seed.
#' @param mart_draws Integer; martingale resampling draws for cox_rhogamma().
#' @param hr_true Numeric scalar; true hazard ratio for coverage.
#' @param seedstart Integer; base random seed.
#' @return A one-row data frame of z-statistics, estimates, SEs, CI
#'   bounds, and coverage indicators.

sim_fn_analysis <- function(scen, enroll_rate, dropout_rate, fr,
                            n_sample = 698, sim_num, mart_draws = 300,
                            hr_true, seedstart = 8316951) {
  # Check length first and short-circuit with ||. The original
  # `is.na(hr_true) | length(hr_true) != 1` evaluated is.na() on a
  # possibly non-scalar (or NULL) hr_true, giving a non-scalar `if`
  # condition — an error in R >= 4.2.
  if (length(hr_true) != 1 || is.na(hr_true))
    stop("Target hazard-ratio hr_true is missing or of length > 1")

  # Accumulate results as a one-row data.table, starting with the
  # scenario id.
  res <- data.table(Scenario = scen)

  # Expand the per-scenario fail-rate table to the two-arm layout that
  # sim_pw_surv() expects; experimental rate = control rate * hr.
  fail_rate <- data.frame(
    stratum   = rep("All", 2 * nrow(fr)),
    period    = rep(fr$Period, 2),
    treatment = c(rep("control", nrow(fr)), rep("experimental", nrow(fr))),
    duration  = rep(fr$duration, 2),
    rate      = c(fr$rate, fr$rate * fr$hr)
  )

  # Generate a dataset: distinct, reproducible seed per replicate.
  set.seed(seedstart + 1000 * sim_num)

  dat <- sim_pw_surv(n = n_sample, enroll_rate = enroll_rate,
                     fail_rate = fail_rate, dropout_rate = dropout_rate)
  # Analyze at the month-36 data cut; add a 0/1 treatment indicator.
  analysis_data <- cut_data_by_date(dat, 36)
  dfa <- analysis_data
  dfa$treat <- ifelse(dfa$treatment == "experimental", 1, 0)

  # --- simtrial z-statistics (for cross-validation) ---
  # maxcombo() returns z < 0 for benefit; flip sign so positive z
  # indicates superiority (wlr() already uses that convention).
  maxcombopz <- analysis_data |>
    maxcombo(rho = c(0, 0), gamma = c(0, 0.5), return_corr = TRUE)
  res$logrankz  <- (-1) * maxcombopz$z[1]
  res$fh05z     <- (-1) * maxcombopz$z[2]
  res$maxcombop <- maxcombopz$p_value
  res$fh01z  <- (analysis_data |> wlr(weight = fh(rho = 0, gamma = 1)))$z
  res$mb6z   <- (analysis_data |> wlr(weight = mb(delay = 6, w_max = Inf)))$z
  res$mb12z  <- (analysis_data |> wlr(weight = mb(delay = 12, w_max = Inf)))$z
  res$mb16z  <- (analysis_data |> wlr(weight = mb(delay = 16, w_max = Inf)))$z

  # --- weightedsurv: weighted Cox via cox_rhogamma ---
  dfcount <- get_dfcounting(df = dfa, tte.name = "tte", event.name = "event",
                            treat.name = "treat", arms = "treatment",
                            by.risk = 6, check.KM = FALSE, check.seKM = FALSE)

  # MB z-statistics only (no draws, for comparison with simtrial)
  res$mb6z_mine  <- cox_rhogamma(dfcount, scheme = "MB",
                                  scheme_params = list(mb_tstar = 6))$z.score
  res$mb16z_mine <- cox_rhogamma(dfcount, scheme = "MB",
                                  scheme_params = list(mb_tstar = 16))$z.score

  # --- Full extraction via extract_cox_results() for each scheme ---
  # Each fit below uses draws = mart_draws so the de-biased estimates,
  # CIs, and coverage indicators are available.

  # MB(12)
  temp <- cox_rhogamma(dfcount, scheme = "MB",
                       scheme_params = list(mb_tstar = 12), draws = mart_draws)
  res <- c(res, extract_cox_results(temp, "mb12", hr_true, z_name = "mb12z_mine"))
  rm(temp)

  # FH(0,0) log-rank
  temp <- cox_rhogamma(dfcount, scheme = "fh",
                       scheme_params = list(rho = 0, gamma = 0), draws = mart_draws)
  res <- c(res, extract_cox_results(temp, "fh00", hr_true, z_name = "fh00z_mine"))
  rm(temp)

  # FH exponential weight variant 1
  temp <- cox_rhogamma(dfcount, scheme = "fh_exp1", draws = mart_draws)
  res <- c(res, extract_cox_results(temp, "fhe1", hr_true))
  rm(temp)

  # FH exponential weight variant 2
  temp <- cox_rhogamma(dfcount, scheme = "fh_exp2", draws = mart_draws)
  res <- c(res, extract_cox_results(temp, "fhe2", hr_true))
  rm(temp)

  # FH(0,1)
  temp <- cox_rhogamma(dfcount, scheme = "fh",
                       scheme_params = list(rho = 0, gamma = 1), draws = mart_draws)
  res <- c(res, extract_cox_results(temp, "fh01", hr_true, z_name = "fh01z_mine"))
  rm(temp)

  # FH(0,0.5)
  temp <- cox_rhogamma(dfcount, scheme = "fh",
                       scheme_params = list(rho = 0, gamma = 0.5), draws = mart_draws)
  res <- c(res, extract_cox_results(temp, "fh05", hr_true, z_name = "fh05z_mine"))
  rm(temp)

  # zero-one(6): weights = 0 for t <= 6, weights = 1 for t > 6
  temp <- cox_rhogamma(dfcount, scheme = "custom_time",
                       scheme_params = list(t.tau = 6, w0.tau = 0, w1.tau = 1),
                       draws = mart_draws)
  res <- c(res, extract_cox_results(temp, "t601", hr_true))
  rm(temp)

  # c() turned res into a plain list; return a one-row data frame.
  return(as.data.frame(res))
}

Simulation Output Dictionary

The sim_fn_analysis() function produces a single-row data frame per simulated trial containing z-statistics from both simtrial (weighted log-rank) and weightedsurv (weighted Cox / Wald), enabling cross-validation of implementations. The following tables document the full set of output variables.

Z-Statistic Definitions

Z-statistic definitions in sim_fn_analysis()
Paired variables (e.g. mb12z / mb12z_mine) enable cross-validation between simtrial and weightedsurv.
Variable Weighting Scheme Package Function Call Notes
Log-rank
logrankz FH(0, 0) simtrial maxcombo(rho=0, gamma=0) Sign reversed (−z)
fh00z_mine FH(0, 0) weightedsurv cox_rhogamma(scheme='fh', rho=0, gamma=0) Wald z from weighted Cox
fh00z_debiased FH(0, 0) weightedsurv cox_rhogamma(..., draws=M) Martingale de-biased
FH(0, 0.5)
fh05z FH(0, 0.5) simtrial maxcombo(rho=0, gamma=0.5) Sign reversed (−z)
fh05z_mine FH(0, 0.5) weightedsurv cox_rhogamma(scheme='fh', rho=0, gamma=0.5) Wald z from weighted Cox
fh05z_debiased FH(0, 0.5) weightedsurv cox_rhogamma(..., draws=M) Martingale de-biased
FH(0, 1)
fh01z FH(0, 1) simtrial wlr(weight=fh(rho=0, gamma=1))
fh01z_mine FH(0, 1) weightedsurv cox_rhogamma(scheme='fh', rho=0, gamma=1) Wald z from weighted Cox
fh01z_debiased FH(0, 1) weightedsurv cox_rhogamma(..., draws=M) Martingale de-biased
MB(6)
mb6z Magirr–Burman (τ=6) simtrial wlr(weight=mb(delay=6)) t* = 6 months
mb6z_mine Magirr–Burman (τ=6) weightedsurv cox_rhogamma(scheme='MB', mb_tstar=6) z only (no draws)
MB(12)
mb12z Magirr–Burman (τ=12) simtrial wlr(weight=mb(delay=12)) t* = 12 months
mb12z_mine Magirr–Burman (τ=12) weightedsurv cox_rhogamma(scheme='MB', mb_tstar=12) Wald z from weighted Cox
mb12z_debiased Magirr–Burman (τ=12) weightedsurv cox_rhogamma(..., draws=M) Martingale de-biased
MB(16)
mb16z Magirr–Burman (τ=16) simtrial wlr(weight=mb(delay=16)) t* = 16 months
mb16z_mine Magirr–Burman (τ=16) weightedsurv cox_rhogamma(scheme='MB', mb_tstar=16) z only (no draws)
FH-exp₁
fhe1z FH exponential variant 1 weightedsurv cox_rhogamma(scheme='fh_exp1') Wald z from weighted Cox
fhe1z_debiased FH exponential variant 1 weightedsurv cox_rhogamma(..., draws=M) Martingale de-biased
FH-exp₂
fhe2z FH exponential variant 2 weightedsurv cox_rhogamma(scheme='fh_exp2') Wald z from weighted Cox
fhe2z_debiased FH exponential variant 2 weightedsurv cox_rhogamma(..., draws=M) Martingale de-biased
Zero-one(6)
t601z Zero–one step (τ=6) weightedsurv cox_rhogamma(scheme='custom_time', t.tau=6, w0=0, w1=1) Wald z from weighted Cox
t601z_debiased Zero–one step (τ=6) weightedsurv cox_rhogamma(..., draws=M) Martingale de-biased
MaxCombo
maxcombop MaxCombo simtrial maxcombo(rho=c(0,0), gamma=c(0,0.5)) p-value (not a z-statistic)
M = mart_draws (default 300). De-biased variants use the martingale-residual bootstrap of Xu & O’Quigley.
Sign convention: maxcombo() returns z < 0 for treatment benefit; reversed here (−z) so large positive z = superiority. wlr() already follows this convention.

library(gt)

# Metadata backing the "Z-Statistic Definitions" display table: one row
# per output variable of sim_fn_analysis(), grouped by weighting scheme.
# Unicode escapes: \u2212 minus, \u2013 en dash, \u03C4 tau,
# \u2081/\u2082 subscript 1/2.
z_defs <- tibble::tribble(
  ~group,          ~variable,          ~scheme,                            ~source,        ~fn_call,                                                        ~notes,
  "Log-rank",      "logrankz",         "FH(0, 0)",                         "simtrial",     "maxcombo(rho=0, gamma=0)",                                      "Sign reversed (\u2212z)",
  "Log-rank",      "fh00z_mine",       "FH(0, 0)",                         "weightedsurv", "cox_rhogamma(scheme='fh', rho=0, gamma=0)",                     "Wald z from weighted Cox",
  "Log-rank",      "fh00z_debiased",   "FH(0, 0)",                         "weightedsurv", "cox_rhogamma(..., draws=M)",                                    "Martingale de-biased",
  "FH(0, 0.5)",    "fh05z",            "FH(0, 0.5)",                       "simtrial",     "maxcombo(rho=0, gamma=0.5)",                                    "Sign reversed (\u2212z)",
  "FH(0, 0.5)",    "fh05z_mine",       "FH(0, 0.5)",                       "weightedsurv", "cox_rhogamma(scheme='fh', rho=0, gamma=0.5)",                   "Wald z from weighted Cox",
  "FH(0, 0.5)",    "fh05z_debiased",   "FH(0, 0.5)",                       "weightedsurv", "cox_rhogamma(..., draws=M)",                                    "Martingale de-biased",
  "FH(0, 1)",      "fh01z",            "FH(0, 1)",                         "simtrial",     "wlr(weight=fh(rho=0, gamma=1))",                                "",
  "FH(0, 1)",      "fh01z_mine",       "FH(0, 1)",                         "weightedsurv", "cox_rhogamma(scheme='fh', rho=0, gamma=1)",                     "Wald z from weighted Cox",
  "FH(0, 1)",      "fh01z_debiased",   "FH(0, 1)",                         "weightedsurv", "cox_rhogamma(..., draws=M)",                                    "Martingale de-biased",
  "MB(6)",         "mb6z",             "Magirr\u2013Burman (\u03C4=6)",     "simtrial",     "wlr(weight=mb(delay=6))",                                       "t* = 6 months",
  "MB(6)",         "mb6z_mine",        "Magirr\u2013Burman (\u03C4=6)",     "weightedsurv", "cox_rhogamma(scheme='MB', mb_tstar=6)",                          "z only (no draws)",
  "MB(12)",        "mb12z",            "Magirr\u2013Burman (\u03C4=12)",    "simtrial",     "wlr(weight=mb(delay=12))",                                      "t* = 12 months",
  "MB(12)",        "mb12z_mine",       "Magirr\u2013Burman (\u03C4=12)",    "weightedsurv", "cox_rhogamma(scheme='MB', mb_tstar=12)",                         "Wald z from weighted Cox",
  "MB(12)",        "mb12z_debiased",   "Magirr\u2013Burman (\u03C4=12)",    "weightedsurv", "cox_rhogamma(..., draws=M)",                                    "Martingale de-biased",
  "MB(16)",        "mb16z",            "Magirr\u2013Burman (\u03C4=16)",    "simtrial",     "wlr(weight=mb(delay=16))",                                      "t* = 16 months",
  "MB(16)",        "mb16z_mine",       "Magirr\u2013Burman (\u03C4=16)",    "weightedsurv", "cox_rhogamma(scheme='MB', mb_tstar=16)",                         "z only (no draws)",
  "FH-exp\u2081",  "fhe1z",            "FH exponential variant 1",          "weightedsurv", "cox_rhogamma(scheme='fh_exp1')",                                 "Wald z from weighted Cox",
  "FH-exp\u2081",  "fhe1z_debiased",   "FH exponential variant 1",          "weightedsurv", "cox_rhogamma(..., draws=M)",                                    "Martingale de-biased",
  "FH-exp\u2082",  "fhe2z",            "FH exponential variant 2",          "weightedsurv", "cox_rhogamma(scheme='fh_exp2')",                                 "Wald z from weighted Cox",
  "FH-exp\u2082",  "fhe2z_debiased",   "FH exponential variant 2",          "weightedsurv", "cox_rhogamma(..., draws=M)",                                    "Martingale de-biased",
  "Zero-one(6)",   "t601z",            "Zero\u2013one step (\u03C4=6)",     "weightedsurv", "cox_rhogamma(scheme='custom_time', t.tau=6, w0=0, w1=1)",       "Wald z from weighted Cox",
  "Zero-one(6)",   "t601z_debiased",   "Zero\u2013one step (\u03C4=6)",     "weightedsurv", "cox_rhogamma(..., draws=M)",                                    "Martingale de-biased",
  "MaxCombo",      "maxcombop",        "MaxCombo",                          "simtrial",     "maxcombo(rho=c(0,0), gamma=c(0,0.5))",                          "p-value (not a z-statistic)"
)

# Render z_defs as a grouped gt table (groups = weighting scheme).
# NOTE(review): select() is not namespace-qualified and dplyr is not
# attached in this chunk (only gt is); presumably dplyr is attached
# earlier in the vignette — confirm, or use dplyr::select().
z_defs |>
  select(group, variable, scheme, source, fn_call, notes) |>
  gt(groupname_col = "group") |>
  cols_label(
    variable = "Variable",
    scheme   = "Weighting Scheme",
    source   = "Package",
    fn_call  = "Function Call",
    notes    = "Notes"
  ) |>
  tab_header(
    title    = md("**Z-statistic definitions in `sim_fn_analysis()`**"),
    subtitle = md("Paired variables (e.g. `mb12z` / `mb12z_mine`) enable cross-validation between *simtrial* and *weightedsurv*.")
  ) |>
  # Monospace for code-like columns.
  tab_style(
    style = cell_text(font = "monospace", size = px(12)),
    locations = cells_body(columns = c(variable, fn_call))
  ) |>
  tab_style(
    style = cell_text(weight = "bold"),
    locations = cells_row_groups()
  ) |>
  tab_style(
    style = cell_text(style = "italic", color = "#555555"),
    locations = cells_body(columns = notes)
  ) |>
  # Color-code the Package column: blue = simtrial, yellow = weightedsurv.
  tab_style(
    style = list(cell_fill(color = "#EBF5FB"), cell_text(color = "#1A5276")),
    locations = cells_body(columns = source, rows = source == "simtrial")
  ) |>
  tab_style(
    style = list(cell_fill(color = "#FEF9E7"), cell_text(color = "#7D6608")),
    locations = cells_body(columns = source, rows = source == "weightedsurv")
  ) |>
  tab_source_note(
    source_note = md("*M* = `mart_draws` (default 300). De-biased variants use the martingale-residual bootstrap of Xu & O'Quigley.")
  ) |>
  tab_source_note(
    source_note = md("Sign convention: `maxcombo()` returns z < 0 for treatment benefit; reversed here (\u2212z) so large positive z = superiority. `wlr()` already follows this convention.")
  ) |>
  tab_options(
    table.font.size = px(13),
    heading.title.font.size = px(15),
    heading.subtitle.font.size = px(12),
    column_labels.font.weight = "bold",
    row_group.font.size = px(13),
    row_group.padding = px(6),
    data_row.padding = px(4),
    table.width = pct(100),
    source_notes.font.size = px(11)
  ) |>
  opt_horizontal_padding(scale = 2)

Companion Estimates per Weighting Scheme

For each scheme fitted with draws > 0, eight companion columns are appended using the convention {prefix}{suffix}.

Companion estimates per weighting scheme
Naming convention: {prefix}{suffix}, e.g. fh00_bhat, t601_waldstar.
Suffix Quantity Description
_bhat β̂ Weighted Cox log-HR estimate
_bhatdebiased β̂* De-biased log-HR via martingale resampling
_wald exp(β̂ + z₀.₉₇₅ σ̂) Upper 95% CI bound (asymptotic)
_waldstar exp(β̂* + z₀.₉₇₅ σ̂*) Upper 95% CI bound (de-biased)
_sigbhat σ̂_asy Asymptotic SE of β̂
_sigbhatstar σ̂* De-biased SE via martingale resampling
_cover I(θ₀ ∈ CI_asy) Coverage indicator (asymptotic CI)
_coverstar I(θ₀ ∈ CI*) Coverage indicator (de-biased CI)
Prefixes with companion estimates: fh00 (log-rank), fh05 (FH(0,0.5)), fh01 (FH(0,1)), mb12 (MB(12)), fhe1 (FH-exp₁), fhe2 (FH-exp₂), t601 (zero–one(6)). MB(6) and MB(16) produce z-statistics only.

# Reference table of the eight companion-estimate suffixes appended per
# weighting scheme: suffix string, LaTeX quantity, and plain description.
est_defs <- tibble::tibble(
  suffix = c(
    "_bhat", "_bhatdebiased", "_wald", "_waldstar",
    "_sigbhat", "_sigbhatstar", "_cover", "_coverstar"
  ),
  quantity = c(
    "$\\hat{\\beta}$",
    "$\\hat{\\beta}^*$",
    "$\\exp(\\hat{\\beta} + z_{0.975} \\hat{\\sigma})$",
    "$\\exp(\\hat{\\beta}^* + z_{0.975} \\hat{\\sigma}^*)$",
    "$\\hat{\\sigma}_{\\text{asy}}$",
    "$\\hat{\\sigma}^*$",
    "$I(\\theta_0 \\in \\text{CI}_{\\text{asy}})$",
    "$I(\\theta_0 \\in \\text{CI}^*)$"
  ),
  description = c(
    "Weighted Cox log-HR estimate",
    "De-biased log-HR via martingale resampling",
    "Upper 95% CI bound (asymptotic)",
    "Upper 95% CI bound (de-biased)",
    "Asymptotic SE of $\\hat{\\beta}$",
    "De-biased SE via martingale resampling",
    "Coverage indicator (asymptotic CI)",
    "Coverage indicator (de-biased CI)"
  )
)

# Render the companion-estimate reference table: monospace suffix column,
# math/description columns formatted as markdown, plus a prefix footnote.
est_tbl <- gt(est_defs) |>
  cols_label(
    suffix      = "Suffix",
    quantity    = "Quantity",
    description = "Description"
  ) |>
  fmt_markdown(columns = c(quantity, description))

# Header, suffix styling, and the footnote listing which prefixes carry
# companion estimates.
est_tbl <- est_tbl |>
  tab_header(
    title    = md("**Companion estimates per weighting scheme**"),
    subtitle = md("Naming convention: `{prefix}{suffix}`, e.g. `fh00_bhat`, `t601_waldstar`.")
  ) |>
  tab_style(
    style = cell_text(font = "monospace", size = px(12)),
    locations = cells_body(columns = suffix)
  ) |>
  tab_source_note(
    source_note = md("**Prefixes with companion estimates:** `fh00` (log-rank), `fh05` (FH(0,0.5)), `fh01` (FH(0,1)), `mb12` (MB(12)), `fhe1` (FH-exp\u2081), `fhe2` (FH-exp\u2082), `t601` (zero\u2013one(6)). MB(6) and MB(16) produce z-statistics only.")
  )

# Final sizing/padding options; the chunk's value is the finished table.
est_tbl |>
  tab_options(
    table.font.size = px(13),
    heading.title.font.size = px(15),
    heading.subtitle.font.size = px(12),
    column_labels.font.weight = "bold",
    data_row.padding = px(5),
    table.width = pct(95),
    source_notes.font.size = px(11)
  ) |>
  opt_horizontal_padding(scale = 2)

Simulation Execution

The run_weighted_cox_sims() driver accepts the scenario setup and analysis function, handles chunked parallel execution, and returns consolidated results.

# --- This chunk is NOT evaluated during vignette build ---
# Run in batch mode (e.g., via Rscript or an HPC scheduler)

# Driver call: runs the chunked parallel simulation (see sim_driver helper
# description above) over all scenarios and returns consolidated results.
res_out <- run_weighted_cox_sims(
  sim_setup    = sim_setup,       # scenario definitions
  analysis_fn  = sim_fn_analysis, # per-trial analysis function
  n_sim        = 5000,            # simulated trials per scenario
  dof_approach = "multisession",  # presumably the future plan -- confirm
  num_workers  = 12,              # parallel workers
  seedstart    = 8316951,         # starting seed for reproducibility
  mart_draws   = 200,             # martingale-resampling draws (M)
  save_results = FALSE,           # do not write results to disk
  file_togo    = NULL             # no output path needed when not saving
)
## 5008.624 sec elapsed
## Scenarios: 6 | Sims per scenario: 5000 | Total tasks: 30000 
## Completed: 30000 | Failed: 0 
## Elapsed: 83.48 minutes
# Load pre-computed simulation results (if conducted)
# Note: adjust path to match your save_results directory 
n_sim <- res_out$number_sims    # simulated trials per scenario
time_inhours <- res_out$thours  # total wall-clock run time, hours

Results

Simulations are based on 5,000 trials simulated for each scenario. Type-1 error upper bounds are 0.0293 and 0.056 for $2.5\%$ (1-sided) and $5\%$ (2-sided) alpha-levels. The computational time was 1.39 hours.

Main 3-Test Comparison Under NPH Scenarios

Operating characteristics of the standard log-rank, FH(0,1), and zero-one(6) weighting. Note that under the NPH scenario of a 6-month late separation effect, the zero-one weighting would be optimal in the sense of estimating the treatment effect post 6-months.

Focus on zero-one weighting

The weights are zero for time-points within 6 months and one thereafter. In general, such time-dependent weighting is controversial, however zero-one type of weighting could be viable in scenarios where treatment is “not active” by design in terms of the timing of therapy administration — as in vaccine trials.

library(ggplot2)
library(tidyr)
library(gt)

# Format a proportion as a compact 3-decimal label with the leading zero
# and trailing zeros stripped (e.g. 0.850 -> ".85", 0.025 -> ".025").
fmt_prop <- function(p) {
  sub("^0+", "", sub("\\.?0+$", "", sprintf("%.3f", p)))
}

# Select the 3 main z-statistics
z_main <- c('fh00z_mine', 'fh01z_mine', 't601z')

df2 <- res_out$results_sims[, c('Scenario', z_main)]

# Display names for the three tests. (The previous group_by(Scenario) was
# a no-op before pivoting and left a grouped tibble behind; removed.)
df2 <- df2 %>%
  rename(
    "log-rank" = fh00z_mine,
    "FH(0,1)"  = fh01z_mine,
    "0/1(6)"   = t601z
  )

# Reshape to long format
long_df2 <- tidyr::pivot_longer(df2, cols = -Scenario, 
                                 names_to = 'z_stat', values_to = 'z_value')

# Create scenario labels
scenario_labels <- c('PH', '3-month', '6-month', 'Crossing', 'Weak', 'Strong')
long_df2$scenario_name <- factor(long_df2$Scenario,
                                  levels = 1:6, labels = scenario_labels)

# Proportion of simulations rejecting at the one-sided 2.5% level per cell
ann_df <- long_df2 %>%
  group_by(scenario_name, z_stat) %>%
  summarise(prop_signif = mean(z_value > qnorm(0.975)), .groups = 'drop')

# Annotation positions: null scenarios labelled at each facet's minimum,
# alternative scenarios at the maximum
min_z <- long_df2 %>% group_by(z_stat) %>%
  summarise(min_z = min(z_value), .groups = 'drop')
ann_df_ws <- ann_df %>% filter(scenario_name %in% c('Weak', 'Strong'))
ann_df_ws <- left_join(ann_df_ws, min_z, by = 'z_stat')
ann_df_ws$label <- fmt_prop(ann_df_ws$prop_signif)

max_z <- long_df2 %>% group_by(z_stat) %>%
  summarise(max_z = max(z_value), .groups = 'drop')
ann_df_Nonws <- ann_df %>% filter(!(scenario_name %in% c('Weak', 'Strong')))
ann_df_Nonws <- left_join(ann_df_Nonws, max_z, by = 'z_stat')
ann_df_Nonws$label <- fmt_prop(ann_df_Nonws$prop_signif)

# Monte-Carlo upper bound on the nominal 2.5% type-1 error
n_sim <- res_out$number_sims
pnull_threshold <- round(0.025 + 1.96 * sqrt(0.025 * 0.975 / n_sim), 4)

# Color-code labels: red = type-1 error above the MC bound (null scenarios);
# purple = power above 0.85 (non-null scenarios)
ann_df_ws$label_color <- ifelse(ann_df_ws$prop_signif > pnull_threshold, 
                                 "red", "black")
ann_df_Nonws$label_color <- ifelse(ann_df_Nonws$prop_signif > 0.85, 
                                    "purple", "grey")

# Merge for fill mapping
long_df2_fill <- left_join(long_df2, ann_df, by = c('scenario_name', 'z_stat'))

# Boxplots of z by scenario, faceted by test; fill encodes rejection
# proportion; dashed line marks the one-sided 2.5% critical value.
p_fill <- ggplot(long_df2_fill, aes(x = scenario_name, y = z_value, 
                                     fill = prop_signif)) +
  geom_boxplot(outlier.size = 0.2) +
  facet_wrap(~z_stat, scales = 'free_y') +
  labs(x = 'Scenario', y = 'Z value',
       title = 'Distribution of Z statistics by Scenario (box fill by prop_signif)',
       fill = 'Prop. Significant') +
  theme_bw() +
  theme(axis.text.x = element_text(angle = 45, hjust = 1, size = 8)) +
  scale_fill_gradient(low = 'pink', high = 'purple') +
  geom_hline(yintercept = qnorm(0.975), linetype = 'dashed', 
             color = 'black', linewidth = 0.5) +
  geom_text(data = ann_df_ws, aes(x = scenario_name, y = min_z, label = label,
                                   color = label_color),
            size = 3.5, fontface = 'bold', inherit.aes = FALSE, vjust = -0.5) +
  scale_color_identity() +
  geom_text(data = ann_df_Nonws, aes(x = scenario_name, y = max_z, label = label,
                                      color = label_color),
            size = 3.5, fontface = 'bold', inherit.aes = FALSE, vjust = 0.5)

print(p_fill)

Wald vs Log-Rank Correspondence

Correspondence between log-rank (weighted log-rank) and Wald (Cox model) tests, evaluated as the proportion of simulations where each test rejects $H_0$ (upper CI bound $< 1$ for Wald; $z > z_{0.975}$ for log-rank).

Scenario
Standard Cox
FH(0,1)
zero/one(6)
Wald logrank Wald logrank Wald logrank
PH 0.865 0.866 0.764 0.764 0.694 0.696
3m delay 0.783 0.784 0.834 0.836 0.804 0.805
6m delay 0.705 0.706 0.861 0.863 0.915 0.915
Crossing 0.672 0.672 0.900 0.902 0.911 0.913
Weak null 0.025 0.025 0.025 0.026 0.025 0.025
Strong null 0.016 0.016 0.047 0.049 0.025 0.025

results <- res_out$results_sims
results$Scenario <- factor(results$Scenario, levels = 1:6,
     labels = c("PH", "3m delay", "6m delay", "Crossing", 
                "Weak null", "Strong null"))

# Rejection proportion per scenario and test: Wald rejects when the upper
# 95% CI bound is below 1; log-rank rejects when z exceeds the one-sided
# 2.5% critical value. (fmt_number was previously applied twice; once is
# sufficient, and the pipe operator is now used consistently.)
table_gt <- results |>
  group_by(Scenario) |>
  summarise(
    Cox     = mean(fh00_wald < 1.0),
    Logrank = mean(fh00z_mine > qnorm(0.975)),
    Cox01   = mean(fh01_wald < 1.0),
    FH01    = mean(fh01z_mine > qnorm(0.975)),
    Cox601  = mean(t601_wald < 1.0),
    LR601   = mean(t601z > qnorm(0.975))
  ) |>
  gt() |>
  fmt_number(columns = 2:7, decimals = 3)

# Column labels (trailing spaces keep gt labels unique) and spanners
# grouping each Wald/logrank pair under its weighting scheme.
table_gt2 <- table_gt |>
  cols_label(
    Cox     = "Wald",
    Logrank = "logrank",
    Cox01   = "Wald ",
    FH01    = "logrank ",
    Cox601  = "Wald  ",
    LR601   = "logrank  "
  ) |>
  tab_spanner(label = "Standard Cox",  columns = c("Cox", "Logrank")) |>
  tab_spanner(label = "FH(0,1)",       columns = c("Cox01", "FH01")) |>
  tab_spanner(label = "zero/one(6)",   columns = c("Cox601", "LR601"))

# Highlight the 6-month delayed-effect row: zero-one(6) logrank (grey)
# versus the standard logrank (yellow).
table_gt3 <- table_gt2 |>
  tab_style(
    style = cell_fill(color = "#D3D3D3"),
    locations = cells_body(columns = "LR601", rows = Scenario == "6m delay")
  ) |>
  tab_style(
    style = cell_fill(color = "yellow"),
    locations = cells_body(columns = "Logrank", rows = Scenario == "6m delay")
  )

table_gt3

Cox Hazard Ratio Estimates and SE Estimation

Operating characteristics of Cox hazard-ratio estimates, de-biased estimates, empirical standard errors, and estimated standard errors (asymptotic and de-biased) under standard and zero-one(6) weighting.

Scenario
Standard Cox Estimates
zero/one(6) Estimators
HR db(HR) SE est(SE) est*(SE) HR db(HR) SE est(SE) est*(SE)
PH 0.762 0.755 0.091 0.090 0.085 0.762 0.755 0.112 0.112 0.110
3m delay 0.782 0.776 0.091 0.090 0.085 0.730 0.723 0.114 0.114 0.112
6m delay 0.801 0.794 0.091 0.090 0.085 0.686 0.680 0.117 0.117 0.115
Crossing 0.808 0.801 0.091 0.090 0.085 0.689 0.683 0.116 0.117 0.115
Weak null 1.005 0.997 0.087 0.086 0.082 1.006 0.997 0.109 0.110 0.110
Strong null 1.022 1.013 0.087 0.086 0.082 1.006 0.997 0.109 0.110 0.110

results <- res_out$results_sims
results$Scenario <- factor(results$Scenario, levels = 1:6,
     labels = c("PH", "3m delay", "6m delay", "Crossing", 
                "Weak null", "Strong null"))

# Per scenario: mean HR (exp of the log-HR estimate), de-biased HR,
# empirical SE (SD of the log-HR draws; sd() replaces the equivalent
# sqrt(var())), and mean estimated SEs (asymptotic and de-biased), for the
# standard-Cox and zero-one(6) weightings.
table_gt <- results |>
  group_by(Scenario) |>
  summarise(
    HR          = mean(exp(fh00_bhat), na.rm = TRUE),
    HR_db       = mean(exp(fh00_bhatdebiased), na.rm = TRUE),
    SE          = sd(fh00_bhat, na.rm = TRUE),
    SE_est      = mean(fh00_sigbhat, na.rm = TRUE),
    SE_db       = mean(fh00_sigbhatstar, na.rm = TRUE),
    HR2         = mean(exp(t601_bhat), na.rm = TRUE),
    HR_db2      = mean(exp(t601_bhatdebiased), na.rm = TRUE),
    SE2         = sd(t601_bhat, na.rm = TRUE),
    SE_est2     = mean(t601_sigbhat, na.rm = TRUE),
    SE_db2      = mean(t601_sigbhatstar, na.rm = TRUE)
  ) |>
  gt() |>
  fmt_number(columns = 2:11, decimals = 3) |>
  cols_label(
    HR     = "HR",      HR_db   = "db(HR)",
    HR2    = "HR ",     HR_db2  = "db(HR) ",
    SE     = "SE",      SE_est  = "est(SE)",  SE_db  = "est*(SE)",
    SE2    = "SE ",     SE_est2 = "est(SE) ", SE_db2 = "est*(SE) "
  ) |>
  tab_spanner(label = "Standard Cox Estimates",
              columns = c("HR", "HR_db", "SE", "SE_est", "SE_db")) |>
  tab_spanner(label = "zero/one(6) Estimators",
              columns = c("HR2", "HR_db2", "SE2", "SE_est2", "SE_db2"))

# Highlight the 6-month delayed-effect row: zero-one(6) HR (grey) versus
# the standard-Cox HR (yellow).
table_gt2 <- table_gt |>
  tab_style(
    style = cell_fill(color = "#D3D3D3"),
    locations = cells_body(columns = "HR2", rows = Scenario == "6m delay")
  ) |>
  tab_style(
    style = cell_fill(color = "yellow"),
    locations = cells_body(columns = "HR", rows = Scenario == "6m delay")
  )

table_gt2

All FH Weighting Variants

Distribution of z-statistics across all implemented weighting variants: FH(0,0), FH(0,0.5), FH(0,1), Magirr-Burman (6 and 12 months), FH-exponential variants (1 and 2), and zero-one(6).

# All implemented weighting variants computed by weightedsurv
z_main <- c('fh00z_mine', 'fh05z_mine', 'fh01z_mine',
            'mb6z_mine', 'mb12z_mine', 'fhe1z', 'fhe2z', 't601z')

df2 <- res_out$results_sims[, c('Scenario', z_main)]

# Reshape to long format
long_df2 <- tidyr::pivot_longer(df2, cols = -Scenario, 
                                 names_to = 'z_stat', values_to = 'z_value')

scenario_labels <- c('PH', '3-month', '6-month', 'Crossing', 'Weak', 'Strong')
long_df2$scenario_name <- factor(long_df2$Scenario,
                                  levels = 1:6, labels = scenario_labels)

# Proportion of simulations rejecting at the one-sided 2.5% level per cell
ann_df <- long_df2 %>%
  group_by(scenario_name, z_stat) %>%
  summarise(prop_signif = mean(z_value > qnorm(0.975)), .groups = 'drop')

# Null scenarios labelled at each facet's minimum z
min_z <- long_df2 %>% group_by(z_stat) %>%
  summarise(min_z = min(z_value), .groups = 'drop')
ann_df_ws <- ann_df %>% filter(scenario_name %in% c('Weak', 'Strong'))
ann_df_ws <- left_join(ann_df_ws, min_z, by = 'z_stat')
ann_df_ws$label <- sub("^0+", "", sub("\\.?0+$", "", 
                       sprintf("%.3f", ann_df_ws$prop_signif)))

# Alternative scenarios labelled at each facet's maximum z
max_z <- long_df2 %>% group_by(z_stat) %>%
  summarise(max_z = max(z_value), .groups = 'drop')
ann_df_Nonws <- ann_df %>% filter(!(scenario_name %in% c('Weak', 'Strong')))
ann_df_Nonws <- left_join(ann_df_Nonws, max_z, by = 'z_stat')
ann_df_Nonws$label <- sub("^0+", "", sub("\\.?0+$", "", 
                          sprintf("%.3f", ann_df_Nonws$prop_signif)))

n_sim <- res_out$number_sims
pnull_threshold <- round(0.025 + 1.96 * sqrt(0.025 * 0.975 / n_sim), 4)
# Fix: compare against the Monte-Carlo bound computed above; a stale
# hard-coded 0.0264 was used here before, which does not match n_sim.
ann_df_ws$label_color <- ifelse(ann_df_ws$prop_signif > pnull_threshold,
                                "red", "black")
ann_df_Nonws$label_color <- ifelse(ann_df_Nonws$prop_signif > 0.85, 
                                    "purple", "grey")

long_df2_fill <- left_join(long_df2, ann_df, by = c('scenario_name', 'z_stat'))

# Boxplots of z by scenario, one facet per weighting variant; fill encodes
# rejection proportion; dashed line is the one-sided 2.5% critical value.
p_fill_all <- ggplot(long_df2_fill, aes(x = scenario_name, y = z_value, 
                                         fill = prop_signif)) +
  geom_boxplot(outlier.size = 0.2) +
  facet_wrap(~z_stat, scales = 'free_y') +
  labs(x = 'Scenario', y = 'Z value',
       title = 'Distribution of Z statistics by Scenario (box fill by prop_signif)',
       fill = 'Prop. Significant') +
  theme_bw() +
  theme(axis.text.x = element_text(angle = 45, hjust = 1, size = 8)) +
  scale_fill_gradient(low = 'yellow', high = 'lightblue') +
  geom_hline(yintercept = qnorm(0.975), linetype = 'dashed', 
             color = 'black', linewidth = 0.5) +
  geom_text(data = ann_df_ws, aes(x = scenario_name, y = min_z, label = label,
                                   color = label_color),
            size = 3.5, fontface = 'bold', inherit.aes = FALSE, vjust = -0.5) +
  scale_color_identity() +
  geom_text(data = ann_df_Nonws, aes(x = scenario_name, y = max_z, label = label,
                                      color = label_color),
            size = 3.5, fontface = 'bold', inherit.aes = FALSE, vjust = 0.5)

print(p_fill_all)

Checking Alignment with simtrial

To validate weightedsurv implementations, we compare z-statistics computed by weightedsurv (suffix _mine) against those produced by simtrial for the same simulated datasets. Paired facets (e.g., logrankz vs fh00z_mine, mb6z vs mb6z_mine) should show near-identical distributions.

# Full comparison of all z-statistics: simtrial vs weightedsurv
z_cols <- grep('z', colnames(res_out$results_sims), value = TRUE)
z_cols_nondebiased <- z_cols[!grepl('debiased', z_cols)]
df2 <- res_out$results_sims[, c('Scenario', z_cols_nondebiased)]

long_df2 <- tidyr::pivot_longer(df2, cols = -Scenario, 
                                 names_to = 'z_stat', values_to = 'z_value')

# Custom order: group simtrial and weightedsurv variants together so
# paired facets (e.g. logrankz vs fh00z_mine) sit side by side
z_main <- c('logrankz', 'fh00z_mine', 'fh05z', 'fh05z_mine', 'fh01z', 'fh01z_mine',
            'mb6z', 'mb6z_mine', 'mb12z', 'mb12z_mine', 'fhe1z', 'fhe2z', 't601z')

z_order2 <- z_main[z_main %in% z_cols_nondebiased]
z_order2 <- c(z_order2, setdiff(z_cols_nondebiased, z_order2))
long_df2$z_stat <- factor(long_df2$z_stat, levels = z_order2)

scenario_labels <- c('PH', '3-month', '6-month', 'Crossing', 'Weak', 'Strong')
long_df2$scenario_name <- factor(long_df2$Scenario,
                                  levels = 1:6, labels = scenario_labels)

# Proportion of simulations rejecting at the one-sided 2.5% level per cell
ann_df <- long_df2 %>%
  group_by(scenario_name, z_stat) %>%
  summarise(prop_signif = mean(z_value > qnorm(0.975)), .groups = 'drop')

# Null scenarios labelled at each facet's minimum z
min_z <- long_df2 %>% group_by(z_stat) %>%
  summarise(min_z = min(z_value), .groups = 'drop')
ann_df_ws <- ann_df %>% filter(scenario_name %in% c('Weak', 'Strong'))
ann_df_ws <- left_join(ann_df_ws, min_z, by = 'z_stat')
ann_df_ws$label <- sub("^0+", "", sub("\\.?0+$", "", 
                       sprintf("%.3f", ann_df_ws$prop_signif)))

n_sim <- res_out$number_sims
pnull_threshold <- round(0.025 + 1.96 * sqrt(0.025 * 0.975 / n_sim), 4)
# Fix: compare against the Monte-Carlo bound computed above; a stale
# hard-coded 0.0264 was used here before, which does not match n_sim.
ann_df_ws$label_color <- ifelse(ann_df_ws$prop_signif > pnull_threshold,
                                "red", "black")

long_df2_fill <- left_join(long_df2, ann_df, by = c('scenario_name', 'z_stat'))

# Paired facets should show near-identical distributions when the two
# packages agree; dashed line marks the one-sided 2.5% critical value.
p_fill_compare <- ggplot(long_df2_fill, aes(x = scenario_name, y = z_value, 
                                              fill = prop_signif)) +
  geom_boxplot(outlier.size = 0.2) +
  facet_wrap(~z_stat, scales = 'free_y') +
  labs(x = 'Scenario', y = 'Z value',
       title = 'Distribution of Z statistics by Scenario (box fill by prop_signif)',
       fill = 'Prop. Significant') +
  theme_bw() +
  theme(axis.text.x = element_text(angle = 45, hjust = 1, size = 8)) +
  scale_fill_gradient(low = 'yellow', high = 'lightblue') +
  geom_hline(yintercept = qnorm(0.975), linetype = 'dashed', 
             color = 'black', linewidth = 0.5) +
  geom_text(data = ann_df_ws, aes(x = scenario_name, y = min_z, label = label,
                                   color = label_color),
            size = 3.5, fontface = 'bold', inherit.aes = FALSE, vjust = -0.5) +
  scale_color_identity()

print(p_fill_compare)