📊

Causal Inference Mastery

Distinguish true cause from mere correlation

This pack curates foundational papers and frameworks from statistics, econometrics, and philosophy of science on identifying genuine causal effects. It covers randomized trials, natural experiments, graphical models, and counterfactual reasoning with practical applications. Designed for analysts, product leaders, and researchers who must make high-stakes decisions from observational data.

10 documents · sourced from Olival Freire · Donald B. Rubin · Perplexity web research on randomized controlled trials as benchmark for causal identification · Marlies Hafer · Perplexity web research on natural experiments · Sparse Causal Discovery in Multivariate Time Series · Ladder operators and a second--order difference equation for general discrete Sobolev orthogonal polynomials · Cattaneo · Perplexity web research on DAGs for causal inference · Pearl’s do-calculus formalization via Perplexity web research on structural causal models and do-calculus rules

Install this pack — try MIND free →Open in MIND

What’s inside

Correlation Versus Causation: Philosophical Foundations

Olival Freire, arXiv physics/0508180v2; Eddy Keming Chen, arXiv 2110.12102v1; P. J. E. Peebles, arXiv 2401.16506v2

The philosophical tension between observed correlations and underlying causation has shaped foundational debates in physics, particularly as quantum mechanics moved from abstract speculation to empirical inquiry. Bell's theorem, published in 1965, formalized how certain correlations predicted by quantum theory cannot arise from local hidden variables, forcing a reevaluation of classical causal assumptions. Between 1965 and 1982, experiments culminating in Aspect's results transformed what many physicists had dismissed as philosophical speculation into mainstream laboratory science, altering community standards for what counts as legitimate physics. Healey's pragmatist interpretation, reviewed in recent scholarship, treats quantum theory not as a descriptive account of reality but as prescriptive guidance for agents, explicitly engaging causation alongside probability, explanation, and objectivity while rejecting both realist hidden-variable approaches and purely epistemic views. Peebles describes an implicit community philosophy among physicists that privileges operational success over ontological claims, contrasting with external analyses from historians and sociologists. These developments illustrate how definitions and preconceptions, whether in quantum foundations or other domains, inevitably carry philosophical commitments that influence what correlations are accepted as evidence of causal structure. Critical examination of such commitments reveals that experimental closure on causation often requires prior shifts in what the physics community regards as admissible questions.

The Potential Outcomes Framework

Donald B. Rubin, arXiv math/0612789v1

The potential outcomes framework, also known as the Rubin causal model, centers on defining causal effects through two hypothetical outcomes for each unit. For any unit i and binary treatment, the potential outcome under treatment is denoted Y_i(1) while the outcome under control is Y_i(0). The individual treatment effect is then the difference tau_i equals Y_i(1) minus Y_i(0), which Rubin explicitly frames as the difference in the outcome y_i under the two conditions. The average treatment effect follows directly as the expectation of these unit-level differences, written E[Y_i(1) minus Y_i(0)], or equivalently as the finite-population average one over N times the sum from i equals one to N of {Y_i(1) minus Y_i(0)}. These quantities are identified from observed data only under additional assumptions such as randomization or ignorability that allow the missing potential outcome to be inferred. The framework therefore separates the definition of causal estimands from the statistical procedures needed to estimate them, providing a unified language for both randomized experiments and observational studies. All core definitions trace to the supplied description of Rubin’s model and the cited arXiv rejoinder that elaborates the same notation and interpretation.

Randomized Controlled Trials as the Gold Standard

Perplexity web research on randomized controlled trials as benchmark for causal identification

Randomized controlled trials achieve their status as the benchmark for causal identification through core design elements that directly enforce key assumptions rather than relying on post hoc modeling. Random assignment of treatment ensures that assignment is statistically independent of potential outcomes, breaking links to both observed and unobserved confounders and allowing the average treatment effect to be identified via simple outcome comparisons without strong assumptions. This produces exchangeability on average, so that groups share the same distribution of baseline characteristics before treatment begins. Prospective data collection aligns eligibility, randomization, and follow-up before outcomes occur, establishing that treatment precedes the outcome and eliminating reverse causation along with biases such as immortal time bias. Trials specify clear treatment conditions, well-defined units, and pre-specified outcomes, supporting the stable unit treatment value assumption of no interference and no hidden treatment variants. The known randomization probabilities strictly between zero and one for eligible units further guarantee positivity and enable design-based inference for estimators and standard errors derived directly from the assignment mechanism. These features are illustrated in specialized extensions such as connectivity-informed cluster randomized trials that leverage contact networks to improve public health impact while preserving detection of effects, optimal designs for active controlled dose-finding studies with bivariate efficacy-toxicity outcomes, incentive-compatible mechanisms that balance exploration and participation, and targeted machine learning estimation with adaptive pre-specification that improves precision through data-adaptive covariate adjustment.

Selection Bias and Confounding in Observational Studies

Marlies Hafer, Alexander Marx / Regression-Based Estimation of Causal Effects in the Presence of Selection Bias and Confounding / arXiv 2503.20546v1

In observational studies estimating expected causal effects such as E[Y|do(X)] for continuous variables, regression succeeds only when E[Y|do(X)] equals E[Y|X], but this equality breaks under selection bias from systematic missingness or confounding from shared causes. Hafer and Marx (arXiv 2503.20546v1) derive identifiability conditions that recover the effect when external data and proxy variables unaffected by selection are available, extending Boeken et al. [2023] results on proxy correction for selection alone. Their two-step regression estimator exploits these proxies first to adjust the observed conditional expectation and then to isolate the interventional quantity despite confounding. Luo et al. (arXiv 2501.10124v2) show that selection-induced dependencies remain symmetric under gene perturbations while regulatory or confounding dependencies do not, enabling distinction in network inference. Confounding mixes true effects with backdoor associations through common ancestors, whereas selection bias opens noncausal paths by conditioning on colliders, both yielding spurious exposure-outcome links that the framework corrects without assuming no latent distortion.

Natural Experiments and Quasi-Experimental Designs

Perplexity web research on natural experiments

Natural experiments enable causal inference when historical or institutional events produce as-if random variation in treatment across comparable units, permitting researchers to compare exposed and unexposed groups as treatment and control. John Snow’s 1854 cholera investigation in London illustrates the approach, with households served by different water companies experiencing differential exposure to contaminated Thames water in a manner not controlled by the researcher. The Soviet Union’s collapse generated externally driven shifts in food and fuel availability across former Soviet economies that supported studies of resulting changes in body weight, diabetes, and obesity. China’s air-pollution reductions ahead of the 2008 Beijing Olympics supplied a short-run policy shock whose birth-outcome effects could be compared against adjacent years. British colonial rule in India created differences in land-revenue and landlord institutions through historical accidents, allowing comparisons across areas under distinct administrative regimes. French Revolutionary and Napoleonic institutional changes spread unevenly across conquered European territories and have been used to examine long-run economic and political consequences. Island colonization and ethnic-region partition supply quasi-experimental historical splits, while military service in the Vietnam War has been applied in epidemiological work to identify later mortality differences. Arbitrary policy timing or lotteries in public programs likewise yield credible comparison groups when they induce plausibly quasi-random variation. Such designs derive their value from the event or institution creating variation that is defensible as quasi-random for identification purposes, provided the requisite assumptions hold.

Instrumental Variables for Causal Identification

Sparse Causal Discovery in Multivariate Time Series — arXiv 0901.2234v1

Instrumental variables identify causal effects by leveraging an exogenous source of variation in treatment when direct randomization is unavailable. A valid instrument must affect the treatment through a nonzero first-stage relationship, influence the outcome exclusively via the treatment channel under the exclusion restriction, and remain independent of unmeasured confounders shared with the outcome. These conditions together support recovery of local average treatment effects when monotonicity also holds. In discrete instrumental variable models the sharp analytical bounds on the average treatment effect take the form of a maximum or minimum over an exponentially growing collection of linear terms in the number of outcome values, matching the exponential growth in the number of valid instrumental inequalities themselves. Bounds constructed from only polynomially many such terms therefore cannot attain sharpness. In empirical industrial organization the same logic appears in a Rivers-Vuong non-nested testing procedure that compares the fit of two first-stage price regressions to distinguish Bertrand-Nash competition from collusion, bypassing full demand and supply system estimation while remaining robust to functional-form choices for the instruments.

Difference-in-Differences Estimation

Ladder operators and a second--order difference equation for general discrete Sobolev orthogonal polynomials — arXiv 2006.14391v3

Difference-in-differences recovers the average treatment effect on the treated when the untreated group represents the counterfactual evolution of treated units. In the two-period two-group design the observed data consist of pre-treatment outcomes for everyone and post-treatment outcomes that equal the treated potential outcome only for the treated group. The resulting DiD contrast equals the ATT plus a bias term given by the difference in untreated outcome trends between the two groups. Parallel trends sets this bias to zero by requiring that the expected change in untreated potential outcomes is the same for treated and control units. Under that assumption together with no anticipation and consistency the within-treated time difference removes time-invariant group confounding while the control time difference subtracts the common trend that would have affected the treated group absent treatment, leaving only the causal ATT. The supplied web-research derivation shows this algebra directly from potential-outcome definitions without additional assumptions beyond the three listed. No supplied arXiv paper addresses causal identification or produces these results.

Regression Discontinuity Design

Cattaneo, Titiunik, Vazquez-Bare / The Regression Discontinuity Design / arXiv 1906.04242v2

Regression discontinuity designs identify causal effects from a discrete jump in treatment probability at a cutoff of the running variable, as formalized in the sharp design handbook chapter by Cattaneo, Titiunik, and Vazquez-Bare (arXiv 1906.04242v2). The central identifying assumption requires continuity of potential outcome functions E[Y(0)|X=x] and often E[Y(1)|X=x] at the cutoff, ensuring that absent treatment the outcome would evolve smoothly so any observed discontinuity is attributable to treatment; this also enforces balance of confounders in a neighborhood around the threshold (arXiv 1906.04242v2; Perplexity web research). Equivalently, units immediately above and below the cutoff behave as if randomly assigned, supporting local randomization inference. Precise manipulation of the running variable is ruled out by requiring its continuous density at the cutoff, which underpins diagnostic tests and renders the design manipulation-robust through low-level restrictions on sorting (Ishihara and Sawada, arXiv 2009.07551v7). Bayesian nonparametric discontinuity designs replace parametric regressions with Gaussian process priors to detect jumps of arbitrary order while avoiding overconfidence from implicit effect conditioning, demonstrated in simulations and applications to political longevity, phantom borders, and meditation effects (Hinne et al., arXiv 1911.06722v3). These elements together enable estimation, inference, and falsification without competing interventions or structural breaks at the cutoff.

Directed Acyclic Graphs and Causal Structure

Perplexity web research on DAGs for causal inference

Directed acyclic graphs encode causal relationships through nodes representing variables in a data-generating process and directed arrows denoting direct causal effects from cause to effect, while the absence of an arrow encodes the assumption of no direct causal link. Acyclicity enforces a strict causal order in which earlier causes point forward to later effects with no possibility of a variable causing itself through any chain, thereby excluding direct representation of feedback loops. These structures capture researchers' qualitative causal assumptions a priori rather than emerging from observed data, and they map directly onto observable associations by distinguishing open paths that transmit statistical dependence from closed paths that induce independence via d-separation. Causal paths follow the arrow directions, whereas non-causal paths can produce spurious associations unless blocked by appropriate conditioning. Graphical criteria such as the back-door criterion then identify which observed associations reflect genuine causal influence versus confounding, allowing all marginal and conditional dependences and independences implied by the graph to be read off once the qualitative assumptions are specified.

Counterfactual Reasoning and the Do-Calculus

Pearl’s do-calculus formalization via Perplexity web research on structural causal models and do-calculus rules

Pearl's framework defines structural causal models through endogenous variables, exogenous variables drawn from a joint distribution, and structural equations of the form X = f_X(Pa_X, U_X). An intervention do(X=x) replaces the equation for X with the constant X := x, which graphically removes all incoming arrows to X and produces the interventional distribution P(Y=y | do(X=x)). The do-calculus supplies three graphical rules, each justified by d-separation statements in suitably mutilated versions of the original DAG, that transform expressions containing the do operator into equivalent expressions using only observational probabilities whenever identification succeeds. The rules respectively authorize insertion or deletion of observations, exchange of an action for an observation, and insertion or deletion of an action. Counterfactual queries are evaluated by first using observed data to fix the exogenous variables and then simulating the same structural equations under the hypothetical intervention, thereby comparing the factual and counterfactual worlds within a single model. These formal elements underpin recent applications that generate stable local and regional counterfactual rules from random forests, produce minimal policy adjustments in reinforcement learning that reach target returns, adapt latent-shift generation to three-dimensional CT classifiers, and employ contextual counterfactuals to calibrate belief strengths across subjective and epistemic uncertainty.

Your AI shouldn’t start from zero.

Install this pack and your MIND begins smart — then every answer is grounded in your own knowledge graph.

Try MIND free →