Comprehensive Causal Inference Cheatsheet: Methods, Assumptions & Applications

Introduction: Understanding Causal Inference

Causal inference is the process of determining the actual effect of a particular phenomenon (the cause) on another phenomenon (the effect). Unlike traditional statistical methods that focus on correlation and association, causal inference seeks to establish whether and how a change in one variable directly influences another. This is essential for answering “what if” questions, making effective interventions, and developing policies based on true cause-effect relationships rather than mere associations. This cheatsheet provides a structured overview of key causal inference frameworks, methodologies, assumptions, and applications for researchers, data scientists, and analysts working with observational and experimental data.

Core Causal Frameworks and Concepts

Fundamental Causal Models

FrameworkKey ConceptsPrimary ContributorsBest For
Potential Outcomes (Rubin Causal Model)Counterfactuals, treatment effects, ignorabilityDonald Rubin, Jerzy Neyman, Paul HollandExperimental design, treatment effect estimation
Structural Causal Models (SCM)Directed graphs, structural equations, do-calculusJudea Pearl, Peter SpirtesComplex causal relationships, identification
Graphical Models (DAGs)Directed acyclic graphs, d-separation, Markov conditionJudea Pearl, James RobinsVisualization, bias identification, confounder selection
Granger CausalityTime series precedence, predictive causalityClive GrangerTime series data, economic forecasting
Instrumental VariablesExogenous variation, exclusion restrictionPhilip Wright, Joshua AngristNatural experiments, econometrics

Key Terminology in Causal Inference

TermDefinitionRelevance
CounterfactualWhat would have happened to a unit under alternative treatment conditionFoundation of potential outcomes framework
Causal EffectDifference between potential outcomes under different treatmentsPrimary quantity of interest in causal analysis
ConfoundingBias due to common causes of treatment and outcomeMajor threat to causal inference from observational data
Selection BiasSystematic differences between compared groupsCompromises validity of causal conclusions
MediatorVariable in causal path between cause and effectImportant for understanding mechanisms
ColliderCommon effect of two variablesCan create spurious associations when conditioned upon
Instrumental VariableAffects outcome only through treatmentEnables causal inference despite unmeasured confounding
do-operatorMathematical notation for interventionEssential for calculating intervention effects
Backdoor PathNon-causal path creating associationMust be blocked to estimate causal effects
IdentifiabilityWhether causal effect can be estimated from observed dataDetermines feasibility of causal inference

Treatment Effect Types

Effect TypeDefinitionFormula (Potential Outcomes)When to Use
Average Treatment Effect (ATE)Average effect across entire populationE[Y(1) – Y(0)]General policy evaluation
Average Treatment Effect on Treated (ATT)Average effect for those who received treatmentE[Y(1) – Y(0) | T=1]Program evaluation, selective treatments
Average Treatment Effect on Untreated (ATU)Average effect for those who didn’t receive treatmentE[Y(1) – Y(0) | T=0]Policy extension considerations
Conditional Average Treatment Effect (CATE)Average effect for subgroup defined by covariatesE[Y(1) – Y(0) | X=x]Heterogeneous treatment effects, personalization
Local Average Treatment Effect (LATE)Average effect for compliers (IV context)E[Y(1) – Y(0) | compliers]Instrumental variable analysis
Intention-to-Treat (ITT)Effect of being assigned to treatmentE[Y | assigned to T=1] – E[Y | assigned to T=0]Randomized trials with non-compliance

Causal Inference Methods and Approaches

Experimental Methods

MethodKey FeaturesStrengthsLimitationsExample Application
Randomized Controlled Trials (RCT)Random assignment to treatment/controlGold standard for causal inferenceCostly, ethical limitations, external validity concernsClinical drug trials, A/B testing
Factorial DesignsMultiple treatments tested simultaneouslyEfficiency, interaction effectsComplex analysis, larger sample requirementsAgricultural experiments, multi-factor interventions
Crossover DesignsSubjects receive multiple treatments in sequenceWithin-subject comparison, higher powerCarryover effects, limited to reversible treatmentsBioequivalence studies, short-term interventions
Cluster RandomizationGroups rather than individuals randomizedAccounts for intra-group dependenciesReduced statistical power, implementation challengesCommunity interventions, classroom-based studies
Stepped Wedge DesignSequential rollout of interventionPractical for universal implementation, temporal controlsComplex analysis, temporal confoundingHealth system changes, policy implementations

Quasi-Experimental Methods

MethodKey FeaturesAssumptionsLimitationsExample Application
Difference-in-Differences (DiD)Before-after comparison between treated and control groupsParallel trends, no anticipationSensitive to time-varying confoundersPolicy evaluation, natural experiments
Regression Discontinuity (RD)Treatment assigned based on threshold of running variableContinuity around thresholdLimited to threshold vicinity, requires threshold-based assignmentScholarship effects, age-based eligibility
Instrumental Variables (IV)External variable influences treatment but not outcome directlyRelevance, exclusion restriction, monotonicityFinding valid instruments, weak instrument biasMendelian randomization, lottery-based assignments
Synthetic ControlWeighted combination of control units to match treated unitConvex hull, no anticipationFew treated units, requires long pre-treatment periodState/country policy changes, single-unit interventions
Interrupted Time SeriesBefore-after comparison with multiple time pointsNo contemporaneous events, stabilityConfounding by time-varying factorsPolicy changes, interventions with defined start times
Propensity Score MethodsBalancing treatment groups based on probability of treatmentNo unmeasured confounders, positivitySensitivity to specification, cannot adjust for unobserved confoundersObservational healthcare data, program evaluations

Observational Methods

MethodKey ApproachCritical AssumptionsAdvantagesLimitations
MatchingPair treated units with similar controlsConditional exchangeability, common supportIntuitive, transparentDimensionality issues, exact matches often impossible
Covariate AdjustmentRegression controlling for confoundersNo unmeasured confounding, correct model specificationFamiliar, adjusts for multiple variablesModel dependence, extrapolation dangers
Inverse Probability Weighting (IPW)Reweighting to create balanced pseudo-populationPositivity, correct propensity score modelCan estimate various treatment effectsExtreme weights, model sensitivity
Doubly Robust MethodsCombine outcome modeling and weightingEither propensity or outcome model correctProtection against model misspecificationComputational complexity, implementation challenges
G-Methods (G-Computation)Standardization using predicted counterfactualsCorrect specification of outcome modelHandles time-varying confounding, treatmentsComputational intensity, “g-null paradox”
Targeted Maximum Likelihood (TMLE)Efficient estimation with targeted bias correctionInitial estimators correctly specifiedEfficiency, double robustnessImplementation complexity, computational demands

Causal Discovery Methods

MethodApproachAssumptionsStrengthsLimitations
PC AlgorithmConstraint-based learning of graph skeletonCausal sufficiency, faithfulness, no latent confoundersComputationally efficient, handles high dimensionsSensitive to statistical decisions, assumes no latent variables
FCI AlgorithmExtension of PC allowing latent confoundersAcyclicity, faithfulnessCan work with latent confoundersComputational complexity, often returns equivalence classes
GES (Greedy Equivalence Search)Score-based search over graph spaceAcyclicity, correct scoring functionOften more accurate than constraint-basedComputationally intensive for many variables
LiNGAMExploits non-Gaussian distributionsLinear relationships, non-Gaussian errorsCan identify unique DAG, not just equivalence classRequires non-Gaussian distributions, assumes linearity
Additive Noise ModelsIdentifies direction via independence of residualsAdditive noise, no latent confoundersCan identify direction between two variablesStrong functional form assumptions
Invariant Causal PredictionUses stability across environmentsCausal sufficiency, invariance principleRobust to certain forms of model misspecificationRequires multiple environments/datasets

Step-by-Step Causal Inference Process

1. Define Causal Question

  • Clearly specify treatment/exposure and outcome
  • Define target population
  • Formulate precise causal estimand (ATE, ATT, etc.)
  • Determine whether average effects or heterogeneous effects are of interest

2. Represent Causal Knowledge

  • Develop causal diagram (DAG) based on domain knowledge
  • Identify potential confounders, mediators, colliders
  • Determine minimum adjustment set(s) for identification
  • Check for possible violations of assumptions

3. Design Data Collection Strategy

  • Choose between experimental, quasi-experimental, or observational approaches
  • Determine sample size requirements for adequate power
  • Plan for collecting data on all necessary confounders
  • Consider instrumental variables or other identification strategies

4. Select Appropriate Method

  • Match method to data structure and causal question
  • Consider robustness vs. efficiency tradeoffs
  • Plan for sensitivity analyses to assess assumption violations
  • Determine whether bounds rather than point estimates are needed

5. Implement Analysis

  • Check balance between treatment groups
  • Apply selected causal inference method(s)
  • Estimate treatment effects and construct confidence intervals
  • Consider multiple methods for robustness

6. Conduct Sensitivity Analysis

  • Assess robustness to unmeasured confounding
  • Vary model specifications and assumptions
  • Apply bounds or partial identification approaches if needed
  • Determine thresholds at which conclusions would change

7. Interpret and Communicate Results

  • Distinguish between statistical and causal inference
  • Clarify assumptions underlying causal claims
  • Discuss limitations and generalizability
  • Translate findings into practical implications

Key Assumptions and Their Assessment

Fundamental Causal Assumptions

AssumptionDefinitionAssessment MethodsImplications if Violated
ConsistencyWell-defined treatments lead to same outcome regardless of assignment mechanismClarify treatment definition, assess variation in implementationUndefined treatment effects, misleading estimates
Positivity (Overlap)Positive probability of each treatment level for all covariate valuesCheck propensity score distributions, identify sparse regionsExtreme weights, unreliable estimates, extrapolation
Exchangeability (No Unmeasured Confounding)Treatment assignment independent of potential outcomes given observed covariatesSensitivity analysis, negative controls, bias formulasBiased treatment effect estimates
Stable Unit Treatment Value Assumption (SUTVA)No interference between units, no hidden treatment variationsAssess spillover potential, treatment implementation consistencyBiased effect estimates, unclear interpretation
Instrument RelevanceInstrumental variable meaningfully affects treatment assignmentF-statistic in first stage, partial R²Weak instrument bias, large standard errors
Exclusion RestrictionInstrument affects outcome only through treatmentDomain knowledge, falsification testsBiased IV estimates
MonotonicityInstrument affects treatment in same direction for all unitsTheoretical justification, empirical checks where possibleBiased LATE estimates
Parallel Trends (DiD)Treatment and control would follow same trends without interventionVisual inspection of pre-treatment trends, placebo testsBiased DiD estimates

Testing Assumptions with Data

  • Balance checks: Compare covariate distributions between treatment groups
  • Placebo tests: Analyze outcomes known to be unaffected by treatment
  • Sensitivity analysis: Quantify how strong unmeasured confounding would need to be to invalidate results
  • Negative controls: Test effects on outcomes that shouldn’t be affected or from exposures that shouldn’t have an effect
  • Over-identification tests: When multiple instruments are available, test consistency of estimates
  • Cross-validation: Compare estimates from different methods or data subsets
  • E-values: Minimal strength of unmeasured confounder needed to explain away an effect

Common Challenges and Solutions in Causal Inference

Challenge: Unmeasured Confounding

  • Potential Causes:
    • Unavailable data on important confounders
    • Unknown confounding mechanisms
    • Limitations in measurement
  • Solutions:
    • Instrumental variable approaches
    • Difference-in-differences if confounders are stable over time
    • Sensitivity analysis to quantify potential bias
    • Bounded estimates incorporating uncertainty
    • Proxies for unmeasured confounders
    • Negative control outcomes for bias detection

Challenge: Selection Bias and Missing Data

  • Potential Causes:
    • Non-random sampling
    • Differential attrition/dropout
    • Self-selection into treatment
    • Incomplete observation of outcomes
  • Solutions:
    • Inverse probability of selection weighting
    • Multiple imputation for missing data
    • Heckman selection models
    • Bounds analysis incorporating range of possible values
    • Pattern-mixture models for sensitivity analysis
    • Addressing selection as part of causal model

Challenge: Treatment Heterogeneity

  • Potential Causes:
    • Individual variation in treatment response
    • Differences in treatment implementation
    • Interaction with observed or unobserved characteristics
  • Solutions:
    • Subgroup analysis with pre-specified groups
    • Causal forests or Bayesian additive regression trees
    • Varying coefficient models
    • Meta-regression across studies or contexts
    • Hierarchical models incorporating group structure
    • Testing for qualitative interactions

Challenge: Interference Between Units

  • Potential Causes:
    • Network effects
    • Spillovers between treated and control units
    • General equilibrium effects
  • Solutions:
    • Cluster randomization
    • Network exposure models
    • Partial identification approaches
    • Spatial models incorporating distance
    • Two-stage randomization designs
    • Explicit modeling of interference patterns

Applied Causal Inference by Field

Economics and Policy Evaluation

Common MethodsKey ConsiderationsExample Applications
Instrumental VariablesFinding valid instruments, relevance/exclusionEstimating returns to education, policy impacts
Difference-in-DifferencesUnit and time fixed effects, staggered adoptionMinimum wage effects, healthcare policy changes
Regression DiscontinuityBandwidth selection, manipulation testsProgram eligibility effects, election outcomes
Synthetic ControlPre-treatment fit, placebo testsState-level policy evaluation, country-level interventions
Structural ModelsBehavioral assumptions, parameter stabilityLabor market interventions, tax policy changes

Healthcare and Epidemiology

Common MethodsKey ConsiderationsExample Applications
Propensity Score MethodsConfounder selection, balance assessmentTreatment effectiveness, medication side effects
Marginal Structural ModelsTime-varying confounding, treatment adherenceLong-term treatment effects, dynamic treatment regimes
Mendelian RandomizationGenetic confounding, pleiotropyCausal effects of biomarkers, exposures on disease
Target Trial EmulationProtocol specification, eligibility criteriaComparing treatment strategies, preventive interventions
Risk-Score MatchingPrognostic score development, overlapComparative effectiveness, rare disease studies

Computer Science and Machine Learning

Common MethodsKey ConsiderationsExample Applications
Causal ForestsFeature selection, honest estimationHeterogeneous treatment effects, personalization
Double/Debiased Machine LearningSample splitting, regularizationHigh-dimensional confounder adjustment
Invariant PredictionEnvironment definition, stability testingDomain adaptation, transferable predictions
Neural Networks for Causal EffectsRepresentation learning, balance metricsIndividual treatment effect estimation, image data
Causal Reinforcement LearningExploration strategy, off-policy evaluationPolicy learning, sequential decision making

Social Sciences

Common MethodsKey ConsiderationsExample Applications
Mediation AnalysisSequential ignorability, direct/indirect effectsUnderstanding mechanisms, intervention components
Fixed Effects ModelsTime-invariant confounding, clusteringPanel data studies, educational interventions
Matching MethodsDistance metrics, common supportProgram evaluation, observational studies
Interrupted Time SeriesSeasonality, autocorrelationPolicy implementation, social media interventions
Instrumental Variables via Natural ExperimentsExogeneity, local effectsVoting behavior, peer effects

Best Practices for Credible Causal Inference

Research Design Principles

  • Pre-specify analysis plan: Document hypotheses, methods, variables before analysis
  • Power analysis: Ensure adequate sample size for meaningful causal estimates
  • Diversify methods: Apply multiple approaches with different assumptions
  • Triangulate evidence: Combine results across methods, datasets, contexts
  • Transparency: Share data, code, and detailed methodological decisions
  • Consider mechanisms: Test mediating pathways, not just overall effects
  • Plan for heterogeneity: Design to detect variation in treatment effects
  • Specify counterfactuals: Clearly define the alternative treatment scenario

Analytical Best Practices

  • Balance checks: Verify similarity between treatment and comparison groups
  • Covariate selection: Use domain knowledge and causal graphs to select confounders
  • Robustness checks: Vary specifications to assess stability of findings
  • Sensitivity analysis: Quantify vulnerability to assumption violations
  • Avoid overfitting: Use cross-validation or sample splitting
  • Report uncertainty: Provide confidence intervals, not just point estimates
  • Check functional form: Test for nonlinearities in confounder relationships
  • Beware of colliders: Avoid conditioning on variables affected by treatment/outcome

Reporting and Interpretation Guidelines

  • Distinguish descriptive and causal claims: Clearly separate association from causation
  • State assumptions explicitly: Acknowledge required assumptions for causal interpretation
  • Contextualize effect sizes: Relate findings to meaningful benchmarks
  • Address generalizability: Discuss external validity and transportability
  • Present visual evidence: Use graphs to support causal reasoning
  • Acknowledge limitations: Discuss threats to validity transparently
  • Consider alternative explanations: Address competing interpretations
  • Connect to theory: Link empirical findings to theoretical mechanisms

Resources for Further Learning

Foundational Books

  • “Causal Inference in Statistics: A Primer” by Judea Pearl, Madelyn Glymour, and Nicholas Jewell
  • “Causal Inference for Statistics, Social, and Biomedical Sciences” by Guido Imbens and Donald Rubin
  • “Causality: Models, Reasoning, and Inference” by Judea Pearl
  • “Counterfactuals and Causal Inference” by Stephen Morgan and Christopher Winship
  • “Causal Inference: What If” by Miguel Hernán and James Robins (freely available online)

Specialized Resources

  • “Mostly Harmless Econometrics” by Joshua Angrist and Jörn-Steffen Pischke
  • “Elements of Causal Inference” by Jonas Peters, Dominik Janzing, and Bernhard Schölkopf
  • “Statistical Methods in Epidemiology” by James Robins and Miguel Hernán
  • “Targeted Learning” by Mark van der Laan and Sherri Rose
  • “Explanation in Causal Inference” by Tyler VanderWeele

Software Packages

LanguagePackageSpecializationFeatures
RMatchItMatching methodsVarious matching algorithms, balance diagnostics
RgrfCausal forestsHeterogeneous treatment effects, honest estimation
RCausalImpactTime seriesBayesian structural time series for intervention analysis
PythonDoWhyCausal graph-based inferenceUnified API for multiple causal methods, graphical models
PythonEconMLMachine learning approachesDouble ML, metalearners, heterogeneous effects
PythonCausalMLUplift modelingTree-based and neural network methods
RmediationMediation analysisCausal mediation, sensitivity analysis
RdagittyGraphical modelsDAG creation, testable implications
RtwangPropensity methodsGBM for propensity score estimation
StatateffectsTreatment effects suiteComprehensive treatment effect models

Online Courses and Tutorials

  • “A Crash Course in Causality” on Coursera by Jason Roy
  • “Causal Diagrams” on edX by Miguel Hernán
  • “Causal Inference” at Stanford Online by Jas Sekhon
  • “Introduction to Causal Inference” from Brady Neal (freely available)
  • “Causal Inference with Python” tutorials from Microsoft Research

This cheatsheet provides a structured overview of causal inference concepts and methods, but the field continues to evolve rapidly. Always consult the latest research and domain-specific literature when applying these techniques. Causal claims require careful consideration of underlying assumptions and limitations, alongside substantive knowledge of the study context.

Scroll to Top