Information-Theoretic Approximation to Causal Models
Inferring the causal direction and causal effect between two discrete random variables X and Y from a finite sample is often a crucial and challenging problem. However, if we have access to both observational and interventional data, it is possible to solve this problem. If X causes Y, then it does not matter whether we observe an effect in Y through changes in X or through active interventions on X. This invariance principle creates a link between observational and interventional distributions in a higher-dimensional probability space. We embed distributions that originate from samples of X and Y into that higher-dimensional space such that the embedded distribution is closest, with respect to relative entropy, to the distributions that follow the invariance principle. This allows us to calculate the best information-theoretic approximation of a given empirical distribution by a distribution that follows an assumed underlying causal model. We show that this information-theoretic approximation to causal models (IACM) can be computed by solving a linear optimization problem. In particular, by approximating the empirical distribution with a monotonic causal model, we can calculate probabilities of causation. It turns out that this approximation approach can also be used to solve causal discovery problems in the bivariate, discrete case. Experimental results on both labeled synthetic and real-world data demonstrate that our approach outperforms other state-of-the-art approaches in the discrete case with low cardinality.
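The central computational step described above, finding among all distributions consistent with the invariance constraints the one closest to the embedded empirical distribution in relative entropy, is an information projection. The sketch below illustrates this kind of projection in a generic form only; it is not the paper's linear-program formulation, and the constraint matrix, the direction of the KL divergence, and the solver are assumptions made purely for illustration.

```python
# Illustrative sketch (not the paper's exact method): project an empirical
# distribution p_hat onto the set of distributions q satisfying linear
# constraints A @ q = b (e.g. normalization plus invariance-style marginal
# constraints), minimizing relative entropy. The KL direction used here,
# KL(q || p_hat), is one possible choice and an assumption of this sketch.
import numpy as np
from scipy.optimize import minimize

def information_projection(p_hat, A, b):
    """Minimize KL(q || p_hat) over probability vectors q with A @ q = b."""
    n = p_hat.size

    def kl(q):
        q = np.clip(q, 1e-12, None)          # avoid log(0)
        return float(np.sum(q * np.log(q / p_hat)))

    constraints = [{"type": "eq", "fun": lambda q: A @ q - b}]
    bounds = [(0.0, 1.0)] * n
    q0 = np.full(n, 1.0 / n)                  # uniform starting point
    res = minimize(kl, q0, method="SLSQP", bounds=bounds, constraints=constraints)
    return res.x

# Toy usage: project a 2x2 empirical joint distribution p(x, y) onto the set
# of distributions whose X-marginal is fixed to 0.5 (a stand-in for an
# invariance-style constraint in the embedded space).
p_hat = np.array([0.40, 0.10, 0.20, 0.30])    # flattened p(x, y), row-major in x
A = np.array([
    [1.0, 1.0, 1.0, 1.0],                     # normalization: sum(q) = 1
    [1.0, 1.0, 0.0, 0.0],                     # marginal constraint: q(X=0) = 0.5
])
b = np.array([1.0, 0.5])
q = information_projection(p_hat, A, b)
print(q.reshape(2, 2))
```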