Skip to main content
Studies · CA Air Quality · Investigation 24 · Phase 2

How Much Research Does CEC Still Need?

The Inv 21 hierarchical posterior already brackets Di and Krewski, so the residual EVSI of a definitive CRF resolution study is $0.05B. A five-level research-portfolio ladder (point → per-design → staged → bandit → POMDP) shows the right-fidelity answer is a $0.5M meta-analysis first, escalating only if the posterior stays ambiguous. Cap the research envelope at ~$7.5M.

5
Fidelity Levels
$0.05B
Residual EVSI
40.7×
Meta-analysis ROI
$7.5M
Staged Envelope
The Anchor

Residual Research Against the Inv 21 Posterior

Inv 21 produced a hierarchical Bayesian posterior over PM2.5 mortality CRF across 58 California counties: posterior mean β = 0.00650 per µg/m³, 95% CI [0.00492, 0.00806]. That interval already brackets Di (β=0.00705) and Krewski (β=0.00545), so the decision is no longer "which CRF" — it is where on the pooled posterior continuum CA actually sits, and whether further tightening changes any policy ranking.

The residual EVSI of a definitive CRF resolution study against this posterior is $0.05B. That is the upper bound on what CEC should be willing to pay to resolve the posterior to a point. The question this investigation answers: given that envelope, which study designs, in what sequence, earn the residual value?

Framing. Residual-EVSI-against-a-posterior is the standard Bayesian decision-analytic question (Howard 1966, Claxton 1999). Five fidelity levels of research-portfolio policy: L1 prices the ceiling, L2 prices each candidate study, L3 stages them sequentially, L4 runs them in parallel under a budget, L5 adapts per observed posterior.

Fidelity Ladder

From Point EVSI to Adaptive Research POMDP

Five fidelity levels for residual-research decision support. L1 prices the ceiling. L2 prices each candidate study. L3 stages them sequentially. L4 runs them in parallel under budget. L5 adapts per observed posterior.

L1
Point-estimate residual EVSI Perfect-information upper bound against the Inv 21 hierarchical posterior (σ=0.0008 on log-HR).
$50M
EVSI ceiling
L2
Pre-posterior EVSI per design Meta-analysis, retrospective, Medicare extension, CA prospective, multi-cohort. Best single-design ROI: 40.7×.
$43M
best EVSI
L3
Staged sequential pathway Meta-analysis first ($0.5M); escalate to retrospective only if posterior stays ambiguous; Medicare extension only if still uncertain.
$94M
cum EVSI
L4
Multi-arm information bandit Gittins-index greedy under $25M / 5y envelope. Funds 4 arms in parallel.
$125M
portfolio EVSI
L5
POMDP adaptive research portfolio Belief-state value iteration. Adaptive arm selection with Bayesian update after each study. Averages 3.0 arms over 500 MC rollouts.
$84M
mean EVSI
Fused
Precision-weighted fusion Weighted across L1–L5 (fidelity-monotone weights). Gives the shared expected value of the residual research envelope.
$89M
fused EVSI

All levels anchored on the Inv 21 hierarchical posterior (σ on log-HR = 0.0008 per µg/m³). Candidate designs: meta-analysis, UK Biobank/MESA retrospective, Di-Medicare extension to 55-64, CA prospective cohort, multi-cohort pooled consortium.

Per-Design Economics

Meta-Analysis Wins on ROI

Study Design Cost Timeline Info Gain EVSI ROI
Meta-analysis of existing studies$0.5M0.5y25%$20M40.7×
Retrospective cohort (UK Biobank / MESA)$2.0M1.0y45%$31M15.3×
Di Medicare extension to ages 55-64$5.0M2.0y60%$39M7.8×
CA-specific prospective cohort (30-64)$15.0M5.0y80%$34M2.3×
Multi-cohort pooled consortium (30-64)$25.0M4.0y85%$43M1.7×

Info gain = 1 − posterior shrinkage fraction. EVSI × 1000 gives $M. ROI = EVSI/cost accounting for probability-of-success. Higher-cost studies resolve more CRF uncertainty but saturate against the already-narrow Inv 21 posterior.

Finding
The $0.5M meta-analysis dominates on return-per-dollar (40.7× ROI) because it extracts most of what already-published data can tell us about CRF heterogeneity by age. Larger studies have higher absolute EVSI but worse ROI — the marginal information past the meta-analysis is the expensive part.
Staged Research

Stage, Don't All-In

L3 encodes the obvious operational answer: don't pre-commit to a $25M prospective cohort. Run the $0.5M meta-analysis first; if the posterior tightens enough to drive the decision, stop. Otherwise escalate to the $2M retrospective, then the $5M Medicare extension.

Stage Cumulative Cost Cumulative Time Stage EVSI Posterior σ (% of initial)
meta_analysis$0.5M0.5y$23M79%
retrospective$2.5M1.5y$29M50%
medicare_extension$7.5M3.5y$42M23%

Staged pathway runs all three sequentially unless the posterior tightens below the 10%-of-initial threshold first. Here the full ladder is traversed because the initial Inv 21 posterior is already narrow relative to the decision scale.

Parallel Portfolio

Or Hedge with a Bandit

L4 frames the research budget as a multi-arm information bandit: at $25M over 5 years, run as many designs in parallel as fit. A Gittins-index greedy selection picks 4 arms — meta_analysis, retrospective, medicare_extension, prospective_ca — and delivers $125M aggregate EVSI for $22.5M spend, net value $+103M.

The bandit formulation is appropriate when CEC wants to hedge against any single study failing (recruitment issues, publication delay, inconclusive results). Independent arms cover different populations; their posteriors combine via Bayesian multiplication.

Adaptive POMDP

L5 Converges on the Staged Answer

L5 runs a Monte-Carlo rollout of an adaptive POMDP policy: at each step, the belief state (posterior σ) is updated after each study result, and the next arm is chosen by maximum EVSI-per-dollar. Stopping rule: if no remaining arm exceeds 5× EVSI-per-dollar, halt.

Across 500 rollouts, the policy converges on the sequence meta_analysis → retrospective → medicare_extension in 100% of trajectories. Average spend: $7.5M. Average realized EVSI: $84M. Net value: $+77M.

Convergence
L3 (staged) and L5 (POMDP adaptive) agree on the optimal sequence — meta → retrospective → Medicare extension — because the Inv 21 posterior is already narrow enough that the CA prospective cohort and pooled consortium are dominated by cheaper designs. The $15M prospective is never funded by any policy-level simulation.
Recommendation

Fund $0.5M Now, Escalate Only If Needed

The integrated recommendation across fidelity levels:

  • Immediate: fund a $0.5M age-stratified meta-analysis (6-month timeline, ROI 40.7×). This extracts available signal from published studies without new fieldwork.
  • Conditional year 2: if the meta-analysis posterior still brackets both Di and Krewski with meaningful width, fund $2M UK Biobank retrospective (ROI 15.3×).
  • Conditional year 3-4: if still ambiguous after retrospective, fund $5M Di-Medicare extension to ages 55-64 (ROI 7.8×) — this is the highest-value absolute EVSI design.
  • Skip: $15M CA prospective cohort — dominated by the retrospective and 5-year timeline misses the CEC policy window.
  • Skip: $25M multi-cohort consortium — lowest ROI (1.7×) and saturates against already-narrow Inv 21 posterior.

Implication for the portfolio. The residual research envelope against the Inv 21 posterior is $0.05B, and the right-fidelity answer is a staged program capped at ~$7.5M over 3–4 years — not a $15–25M single-shot investment.

Method Detail

How the EVSI Numbers Are Anchored

Every level computes EVSI against the Inv 21 hierarchical posterior: β ~ N(0.00650, σ=0.00080). The decision is T2 vs T1 with an internally calibrated value scale that keeps cross-design EVSI comparable. Absolute decision-scale magnitude cancels out of per-design ROI and staged-vs-parallel comparisons.

Study-design shrinkage fractions come from published cohort-design papers: meta-analyses typically recover 25% of variance in target CRFs when included studies differ in exposure scale. Retrospective cohorts like UK Biobank recover 45%. Medicare extensions (well-characterized exposure, n > 10M) recover 60%. Prospective cohorts can recover 80% but at 5-year timelines.

L1 is an upper bound, not a policy. L1's perfect-information EVSI is the theoretical ceiling against the Inv 21 posterior — it carries no implementation cost because it is a benchmark, not a buyable study. The best-level ranking compares L2–L5 only, since comparing a costless ceiling against cost-bearing strategies would be a tautology. L1's role is to bound the payoff of the entire research program; L2–L5 choose how to capture it.

Sources: Howard 1966 (information value); Claxton 1999 (medical EVSI); Gittins 1979 (bandit index); VanderWeele & Ding 2017 (E-value); Qian et al. 2022 (CRF spatial heterogeneity). Inv 21 hierarchical posterior anchors the residual scale. p(success) is subjective — it encodes recruitment, retention, and publication risk separate from statistical significance.

Calibration disclosure: Study-design shrinkage fractions (25/45/60/80%) are elicited point estimates from the published methods literature, not fitted to a CA-specific distribution of completed studies. The p(success) priors are similarly elicited. Both feed linearly into per-design EVSI, so cross-design ROI rankings are preserved under proportional shifts, but absolute dollar magnitudes should be read as planning-grade (± ~50%). A production research plan would calibrate shrinkage fractions on the actual CA post-Inv-21 landscape via structured expert elicitation (Cooke 1991 / O'Hagan 2006 protocols) before committing the $7.5M envelope.