COREX — Documentation | Causal Origin Resolution

📖 Overview

"Causality is not assumed, inferred, or interpreted — it is survived or rejected under systematic perturbation tests."

COREX is a deterministic, graph-free, model-agnostic computational framework that treats causality as an empirically testable robustness property rather than an assumed structural characteristic. The framework implements a four-axis evaluation pipeline — Statistical Stability, Representation Invariance, Intervention Consistency, and Domain Robustness — and fuses their outputs through a weighted scoring function to produce a calibrated causal classification.

Contemporary machine learning systems routinely exploit shortcut correlations embedded in training distributions — associations that collapse the moment data distribution shifts, feature encodings change, or interventions are applied. COREX provides a principled audit pipeline to classify any observed X → Y relationship as CAUSAL, SPURIOUS, or REPRESENTATION ARTIFACT.

🏗️ 4-Module Architecture

Module 01 — Statistical Stability (S)

Tests whether P(Y|X) remains invariant across independently drawn subpopulations of the data. Performs random stratified partitioning into k folds, estimates conditional distributions using kernel density estimation, and computes pairwise KL divergence across fold estimates.

Formula

S = 1 - mean_{i≠j} KL[P(Y|X, D_i) ‖ P(Y|X, D_j)]

Module 02 — Representation Invariance (R)

Evaluates whether the observed relationship persists when the feature representation of X is subjected to a structured family of transformations: linear projections, nonlinear embeddings, Gaussian noise injection, feature dropout, and PCA compression.

Formula

R = 1 - (1/|Φ|) Σ_φ ||P(Y|X) - P(Y|φ(X))||₁

Module 03 — Intervention Consistency (I)

Simulates causal interventions by applying controlled perturbations to X and observing the consistency of the downstream effect on Y. Uses propensity-score matched observational comparisons and synthetic counterfactual generation.

Formula

I = Consistency(do(X=x₁)→Y₁, do(X=x₂)→Y₂)

Module 04 — Domain Robustness (D)

Evaluates whether the predictive relationship generalizes across environments with distinct data-generating distributions. Constructs pseudo-environments through clustering in covariate space and assesses stability via coefficient of variation.

Formula

D = 1 - CV(P(Y|X, e)) over e ∈ E

📐 Core Equations

Statistical Stability

S = 1 − mean KL[P(Y|X,D₁) ‖ P(Y|X,D₂)]

Cross-subpopulation conditional invariance

Representation Invariance

R = 1 − (1/|Φ|) Σ ‖P(Y|X) − P(Y|φ(X))‖₁

Stability under feature transformations

Intervention Consistency

I = Consistency(do(X=x₁)→Y₁, do(X=x₂)→Y₂)

Causal effect direction & magnitude stability

Domain Robustness

D = 1 − CV(P(Y|X, e)) over e ∈ E

Cross-environment generalization

📊 COREX Scoring Function

COREX Score Formula

COREX = w₁·S + w₂·R + w₃·I + w₄·D

Weight	Value	Module
w₁	0.25	Statistical Stability
w₂	0.25	Representation Invariance
w₃	0.30	Intervention Consistency (highest)
w₄	0.20	Domain Robustness

>

Decision Thresholds

Label	COREX Range	Interpretation
🟢 CAUSAL	≥ 0.80	All four modules stable; intervention consistent
🟡 SPURIOUS	0.50 – 0.79	Domain shift OR intervention instability
🔴 ARTIFACT	< 0.50	Representation invariance fails

📦 Installation

bash — pip install

pip install corex

# From source
git clone https://github.com/gitdeeper12/COREX.git
cd COREX
pip install -e .

Core Dependencies: numpy, scipy

🔧 API Reference

python — main interface

from corex import CausalEvaluator

# Initialize evaluator
evaluator = CausalEvaluator()

# Evaluate relationship between X and Y
result = evaluator.evaluate(X, y)

# Access results
print(result.label)         # "CAUSAL" | "SPURIOUS" | "REPRESENTATION_ARTIFACT"
print(result.corex_score)   # float in [0, 1]
print(result.breakdown)     # {"S": 0.91, "R": 0.88, "I": 0.85, "D": 0.90}

Parameters

Parameter	Description	Default
weights	Custom module weights	{'statistical':0.25, 'representation':0.25, 'intervention':0.30, 'domain':0.20}
meta_scorer	Optional learnable meta-layer	None

🧩 Core Modules

Module	Path	Description
statistical.py	corex/modules/statistical.py	Statistical Stability Module (S)
representation.py	corex/modules/representation.py	Representation Invariance Module (R)
domain.py	corex/modules/domain.py	Domain Robustness Module (D)
intervention.py	corex/modules/intervention.py	Intervention Consistency Engine (I)
score.py	corex/score.py	COREX scoring function and thresholds
pipeline.py	corex/pipeline.py	Main evaluation pipeline

📊 Validation Summary

Method	Accuracy	AUROC	FPR
COREX v1.0.0	91.4%	0.963	3.2%
IRM baseline	76.0%	0.871	23.0%
Conditional Independence	69.0%	0.741	31.0%

>

👤 Author

🔬

Samir Baladi

Principal Investigator — Causal Machine Learning & Biomedical AI

Samir Baladi is an independent interdisciplinary researcher affiliated with the Ronin Institute, developing the Rite of Renaissance research program. COREX is a framework for robust causal discrimination in data-driven AI systems.

📧 gitdeeper@gmail.com 🔗 ORCID: 0009-0003-8903-0029 🐙 GitHub 🦊 GitLab

📝 Citation

@software{baladi2026corex, author = {Baladi, Samir}, title = {COREX: Causal Origin Resolution and Empirical eXamination}, year = {2026}, version = {1.0.0}, doi = {10.5281/zenodo.20351233}, url = {https://github.com/gitdeeper12/COREX}, license = {MIT} }

"Causality is not assumed — it is survived."

COREX Documentation

📖 Overview

🏗️ 4-Module Architecture

Module 01 — Statistical Stability (S)

Module 02 — Representation Invariance (R)

Module 03 — Intervention Consistency (I)

Module 04 — Domain Robustness (D)

📐 Core Equations

📊 COREX Scoring Function

Decision Thresholds

📦 Installation

🔧 API Reference

Parameters

🧩 Core Modules

📊 Validation Summary

👤 Author

📝 Citation