Abstract

The paper challenges the assumption that more data plus a strong predictor suffices for world modeling. Across hundreds of structural causal models, predictors and Bayesian baselines recover the diagonal (posterior over observed worlds) but collapse to a point on the off-diagonal — the coupling between counterfactual worlds — on 28% of models to values no valid model can produce, while the true answer is an interval that more data never narrows. WorldKernel casts a world model as a single positive semidefinite coupling kernel K(T,T′) over admissible worlds: the diagonal is the ordinary posterior, the off-diagonal is the cross-world coupling that fixes counterfactuals, and PSD-enforcement yields tractable bounds on quantities that exact response-type programs cannot compute.

Key Contributions

Method Details

Conceptual architecture:

  1. Admissible worlds — the set of all structural causal models (SCMs) consistent with the observed data
  2. Coupling kernel K(T,T′) — a positive semidefinite matrix indexed by pairs of admissible worlds; entries encode joint plausibility of two worlds under the same observations
  3. Diagonal = posterior — K(T,T) is exactly the Bayesian posterior over worlds given the data (what predictors recover)
  4. Off-diagonal = counterfactual coupling — K(T,T′) for T≠T′ fixes the answer to "what would have happened in T′ if the world were T," a quantity no point predictor can express
  5. PSD-enforcement as partial identification — positive semidefiniteness is a constraint the marginal posteriors lack; it bounds counterfactuals in polynomial time
  6. Ontology axioms — logical structure over admissible worlds tightens the PSD bound by propagating constraints to couplings that were not directly constrained (up to 1/3 tighter)
  7. Targeted scars — additional constraints learned from observed infeasibilities during data collection, prioritized to close the kernel gap faster than uniform sampling
  8. Reconstruction complexity — exact K is equivalent to #P-hard counting of admissible worlds; tractable below the Sly–Sun threshold, inapproximable above

Key Results

Limitations and Future Work

Relevance to Patrick's Research

WorldKernel is the rare world-model paper that attacks the epistemic foundations rather than the architecture. For Patrick's tracking, it formalizes a sharp distinction between diagonal (what any predictor learns) and off-diagonal (what counterfactual reasoning needs) — a distinction that doesn't show up in JEPA, Genie, or Sora-style models but matters for any agent that asks "what would have happened if I had done X." The PSD-coupling formulation is also conceptually aligned with the kernel-method tradition Yann LeCun's JEPA lineage draws on. The 28% failure rate is a concrete number to cite when arguing that generative world models alone are insufficient for counterfactual planning.