CausalityTools.CausalityToolsModule

CausalityTools

CausalityTools.jl is a package for quantifying associations, independence testing and causal inference.

All further information is provided in the documentation, which you can either find online or build locally by running the docs/make.jl file.

Key features

  • Association API: includes measures and their estimators for pairwise, conditional and other forms of association from conventional statistics, from dynamical systems theory, and from information theory: partial correlation, distance correlation, (conditional) mutual information, transfer entropy, convergent cross mapping and a lot more!
  • Independence testing API, which is automatically compatible with every association measure estimator implemented in the package.
  • Causal (network) inference API integrating the association measures and independence testing framework.

Additional features

Building on features from ComplexityMeasures.jl, we also offer

  • Discretization API for multiple (multivariate) input datasets.
  • Multivariate counting and probability estimation API.
  • Multivariate information measure API

Installation

To install the package, run import Pkg; Pkg.add("CausalityTools").
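
As a quick start, here is a minimal hypothetical sketch (both DistanceCorrelation and CorrTest are documented further below and require no dedicated estimator):

using CausalityTools

x, y = rand(1000), rand(1000)

# pairwise association with a measure that requires no estimator
association(DistanceCorrelation(), x, y)

# formal hypothesis test for independence, based on (partial) correlation
independence(CorrTest(), x, y)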

CausalityTools.AmplitudeType
Amplitude <: InstantaneousSignalProperty

Indicates that the instantaneous amplitudes of a signal should be used.

CausalityTools.AssociationMeasureType
AssociationMeasure

The supertype of all association measures.

Abstract implementations

Currently, the association measures are classified by abstract classes listed below. These abstract classes offer common functionality among association measures that are conceptually similar. This makes maintenance and framework extension easier than if each measure was implemented "in isolation".

Concrete implementations

Concrete subtypes are given as input to association. Many of these types require an AssociationMeasureEstimator to compute.

| Type | AssociationMeasure |
|------|--------------------|
| Correlation | PearsonCorrelation |
| Correlation | DistanceCorrelation |
| Closeness | SMeasure |
| Closeness | HMeasure |
| Closeness | MMeasure |
| Closeness (ranks) | LMeasure |
| Closeness | JointDistanceDistribution |
| Cross-mapping | PairwiseAsymmetricInference |
| Cross-mapping | ConvergentCrossMapping |
| Conditional recurrence | MCR |
| Conditional recurrence | RMCD |
| Shared information | MIShannon |
| Shared information | MIRenyiJizba |
| Shared information | MIRenyiSarbu |
| Shared information | MITsallisFuruichi |
| Shared information | PartialCorrelation |
| Shared information | CMIShannon |
| Shared information | CMIRenyiSarbu |
| Shared information | CMIRenyiJizba |
| Shared information | CMIRenyiPoczos |
| Shared information | CMITsallisPapapetrou |
| Information transfer | TEShannon |
| Information transfer | TERenyiJizba |
| Partial mutual information | PartialMutualInformation |
| Information measure | JointEntropyShannon |
| Information measure | JointEntropyRenyi |
| Information measure | JointEntropyTsallis |
| Information measure | ConditionalEntropyShannon |
| Information measure | ConditionalEntropyTsallisAbe |
| Information measure | ConditionalEntropyTsallisFuruichi |
| Divergence | HellingerDistance |
| Divergence | KLDivergence |
| Divergence | RenyiDivergence |
| Divergence | VariationDistance |
CausalityTools.AssociationMeasureEstimatorType
AssociationMeasureEstimator

The supertype of all association measure estimators.

Concrete subtypes are given as input to association.

Abstract subtypes

Concrete implementations

| AssociationMeasure | Estimators |
|--------------------|------------|
| PearsonCorrelation | Not required |
| DistanceCorrelation | Not required |
| PartialCorrelation | Not required |
| SMeasure | Not required |
| HMeasure | Not required |
| MMeasure | Not required |
| LMeasure | Not required |
| JointDistanceDistribution | Not required |
| PairwiseAsymmetricInference | RandomVectors, RandomSegment |
| ConvergentCrossMapping | RandomVectors, RandomSegment |
| MCR | Not required |
| RMCD | Not required |
| MIShannon | JointProbabilities, EntropyDecomposition, KraskovStögbauerGrassberger1, KraskovStögbauerGrassberger2, GaoOhViswanath, GaoKannanOhViswanath, GaussianMI |
| MIRenyiJizba | JointProbabilities, EntropyDecomposition |
| MIRenyiSarbu | JointProbabilities |
| MITsallisFuruichi | JointProbabilities, EntropyDecomposition |
| MITsallisMartin | JointProbabilities, EntropyDecomposition |
| CMIShannon | JointProbabilities, EntropyDecomposition, MIDecomposition, GaussianCMI, FPVP, MesnerShalizi, Rahimzamani |
| CMIRenyiSarbu | JointProbabilities |
| CMIRenyiJizba | JointProbabilities, EntropyDecomposition |
| CMIRenyiPoczos | PoczosSchneiderCMI |
| CMITsallisPapapetrou | JointProbabilities |
| TEShannon | JointProbabilities, EntropyDecomposition, Zhu1, Lindner |
| TERenyiJizba | JointProbabilities |
| PartialMutualInformation | JointProbabilities |
| JointEntropyShannon | JointProbabilities |
| JointEntropyRenyi | JointProbabilities |
| JointEntropyTsallis | JointProbabilities |
| ConditionalEntropyShannon | JointProbabilities |
| ConditionalEntropyTsallisAbe | JointProbabilities |
| ConditionalEntropyTsallisFuruichi | JointProbabilities |
| HellingerDistance | JointProbabilities |
| KLDivergence | JointProbabilities |
| RenyiDivergence | JointProbabilities |
| VariationDistance | JointProbabilities |
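
For example, a minimal sketch of the two calling patterns (assuming default constructors; parameter values are arbitrary):

using CausalityTools

x, y = rand(1000), rand(1000)

# measures marked "Not required" above can be given directly to association
association(PearsonCorrelation(), x, y)

# other measures are computed through one of their compatible estimators;
# here, Shannon mutual information via the KSG1 nearest-neighbor estimator
association(KSG1(; k = 5), x, y)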
CausalityTools.CMIDecompositionType
CMIDecomposition(definition::MultivariateInformationMeasure, 
    est::ConditionalMutualInformationEstimator)

Estimate some multivariate information measure specified by definition, by decomposing it into a combination of conditional mutual information terms.

Usage

Description

Each of the conditional mutual information terms is estimated using est, which can be any ConditionalMutualInformationEstimator. Finally, these estimates are combined according to the relevant decomposition formula.

This estimator is similar to EntropyDecomposition, but definition is expressed as conditional mutual information terms instead of entropy terms.

Examples
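
A minimal sketch, assuming Shannon transfer entropy (TEShannon) can be decomposed into conditional mutual information terms, and using the FPVP estimator documented below (parameter values are arbitrary):

using CausalityTools

x, y = rand(1000), rand(1000)

# estimate Shannon transfer entropy x → y by decomposing it into
# conditional mutual information terms, each estimated with FPVP
est = CMIDecomposition(TEShannon(), FPVP(k = 3))
association(est, x, y)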

See also: ConditionalMutualInformationEstimator, MultivariateInformationMeasureEstimator, MultivariateInformationMeasure.

CausalityTools.CMIRenyiJizbaType
CMIRenyiJizba <: ConditionalMutualInformation
CMIRenyiJizba(; base = 2, q = 1.5)

The Rényi conditional mutual information $I_q^{R_{J}}(X; Y | Z)$ defined in Jizba2012.

Usage

  • Use with association to compute the raw Rényi-Jizba conditional mutual information using one of the estimators listed below.
  • Use with independence to perform a formal hypothesis test for pairwise conditional independence using the Rényi-Jizba conditional mutual information.

Compatible estimators

Definition

\[I_q^{R_{J}}(X; Y | Z) = I_q^{R_{J}}(X; Y, Z) - I_q^{R_{J}}(X; Z),\]

where $I_q^{R_{J}}(X; Z)$ is the MIRenyiJizba mutual information.

Estimation
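
A minimal discrete sketch using the JointProbabilities estimator with a per-variable value-binning discretization (the binning choice is arbitrary):

using CausalityTools

x, y, z = rand(1000), rand(1000), rand(1000)
disc = CodifyVariables(ValueBinning(RectangularBinning(4)))
association(JointProbabilities(CMIRenyiJizba(q = 1.5), disc), x, y, z)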

CausalityTools.CMIRenyiPoczosType
CMIRenyiPoczos <: ConditionalMutualInformation
CMIRenyiPoczos(; base = 2, q = 1.5)

The differential Rényi conditional mutual information $I_q^{R_{P}}(X; Y | Z)$ defined in Poczos2012.

Usage

  • Use with association to compute the raw Rényi-Poczos conditional mutual information using one of the estimators listed below.
  • Use with independence to perform a formal hypothesis test for pairwise conditional independence using the Rényi-Poczos conditional mutual information.

Compatible estimators

Definition

\[\begin{align*} I_q^{R_{P}}(X; Y | Z) &= \dfrac{1}{q-1} \int \int \int \dfrac{p_Z(z) p_{X, Y | Z}^q}{( p_{X|Z}(x|z) p_{Y|Z}(y|z) )^{q-1}} \\ &= \mathbb{E}_{(X, Y, Z) \sim p_{X, Y, Z}} \left[ \dfrac{p_{X, Z}^{1-q}(X, Z) p_{Y, Z}^{1-q}(Y, Z) }{p_{X, Y, Z}^{1-q}(X, Y, Z) p_Z^{1-q}(Z)} \right] \end{align*}\]

Estimation

CausalityTools.CMIRenyiSarbuType
CMIRenyiSarbu <: ConditionalMutualInformation
CMIRenyiSarbu(; base = 2, q = 1.5)

The Rényi conditional mutual information from Sarbu2014.

Usage

  • Use with association to compute the raw Rényi-Sarbu conditional mutual information using one of the estimators listed below.
  • Use with independence to perform a formal hypothesis test for pairwise conditional independence using the Rényi-Sarbu conditional mutual information.

Compatible estimators

Discrete description

Assume we observe three discrete random variables $X$, $Y$ and $Z$. Sarbu (2014) defines discrete conditional Rényi mutual information as the conditional Rényi $\alpha$-divergence between the conditional joint probability mass function $p(x, y | z)$ and the product of the conditional marginals, $p(x |z) \cdot p(y|z)$:

\[I(X, Y; Z)^R_q = \dfrac{1}{q-1} \sum_{z \in Z} p(Z = z) \log \left( \sum_{x \in X}\sum_{y \in Y} \dfrac{p(x, y|z)^q}{\left( p(x|z)\cdot p(y|z) \right)^{q-1}} \right)\]

CausalityTools.CMIShannonType
CMIShannon <: ConditionalMutualInformation
CMIShannon(; base = 2)

The Shannon conditional mutual information (CMI) $I^S(X; Y | Z)$.

Usage

  • Use with association to compute the raw Shannon conditional mutual information using one of the estimators listed below.
  • Use with independence to perform a formal hypothesis test for pairwise conditional independence using the Shannon conditional mutual information.

Compatible estimators

Supported definitions

Consider random variables $X \in \mathbb{R}^{d_X}$ and $Y \in \mathbb{R}^{d_Y}$, given $Z \in \mathbb{R}^{d_Z}$. The Shannon conditional mutual information is defined as

\[\begin{align*} I(X; Y | Z) &= H^S(X, Z) + H^S(Y, Z) - H^S(X, Y, Z) - H^S(Z) \\ &= I^S(X; Y, Z) - I^S(X; Z) \end{align*},\]

where $I^S(\cdot; \cdot)$ is the Shannon mutual information MIShannon, and $H^S(\cdot)$ is the Shannon entropy.

Differential Shannon CMI is obtained by replacing the entropies by differential entropies.

Estimation
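
A minimal sketch using the FPVP nearest-neighbor estimator documented below (the chain x → y → z is chosen so that x and z are conditionally independent given y):

using CausalityTools
using Random; rng = MersenneTwister(1234)

x = rand(rng, 1000)
y = rand(rng, 1000) .+ x
z = rand(rng, 1000) .+ y

# I(x; z | y) should be near 0, since y screens off x from z
association(FPVP(CMIShannon(); k = 5), x, z, y)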

CausalityTools.CMITsallisPapapetrouType
CMITsallisPapapetrou <: ConditionalMutualInformation
CMITsallisPapapetrou(; base = 2, q = 1.5)

The Tsallis-Papapetrou conditional mutual information Papapetrou2020.

Usage

  • Use with association to compute the raw Tsallis-Papapetrou conditional mutual information using one of the estimators listed below.
  • Use with independence to perform a formal hypothesis test for pairwise conditional independence using the Tsallis-Papapetrou conditional mutual information.

Compatible estimators

Definition

Tsallis-Papapetrou conditional mutual information is defined as

\[I_T^q(X, Y \mid Z) = \frac{1}{1 - q} \left( 1 - \sum_{XYZ} \frac{p(x, y, z)^q}{p(x \mid z)^{q-1} p(y \mid z)^{q-1} p(z)^{q-1}} \right).\]

CausalityTools.CodifyPointsType
CodifyPoints{N}
CodifyPoints(encodings::NTuple{N, Encoding})

CodifyPoints is a Discretization scheme that encodes input data points without applying any sequential transformation to the input (as opposed to CodifyVariables, which may apply some transformation before encoding).

Usage

  • Use with codify to encode/discretize input variables on a point-by-point basis.

Compatible encodings

Description

Given x::AbstractStateSpaceSet..., where the i-th dataset is assumed to represent a single series of measurements, CodifyPoints encodes each point pₖ ∈ x[i] using some Encoding(s), without applying any (sequential) transformation to the x[i] first. This behaviour is different to CodifyVariables, which does apply a transformation to x[i] before encoding.

If length(x) == N (i.e. there are N input datasets), then encodings must be a tuple of N Encodings. Alternatively, if encodings is a single Encoding, then that same encoding is applied to every x[i].

Examples

using CausalityTools

# The same encoding on two input datasets
x = StateSpaceSet(rand(100, 3))
y = StateSpaceSet(rand(100, 3))
encoding_ord = OrdinalPatternEncoding(3)
cx, cy = codify(CodifyPoints(encoding_ord), x, y)

# Different encodings on multiple datasets
z = StateSpaceSet(rand(100, 2))
encoding_bin = RectangularBinEncoding(RectangularBinning(3), z)
d = CodifyPoints(encoding_ord, encoding_ord, encoding_bin)
cx, cy, cz = codify(d, x, y, z)
CausalityTools.CodifyVariablesType
CodifyVariables <: Discretization
CodifyVariables(outcome_space::OutcomeSpace)

The CodifyVariables discretization scheme quantises input data in a column-wise manner using the given outcome_space.

Compatible outcome spaces

Description

The main difference between CodifyVariables and CodifyPoints is that the former uses OutcomeSpaces for discretization. This usually means that some transformation is applied to the data before discretizing. For example, some outcome spaces construct a delay embedding from the input (and thus encode sequential information) before encoding the data.

Specifically, given x::AbstractStateSpaceSet..., where the i-th dataset x[i] is assumed to represent a single series of measurements, CodifyVariables encodes each x[i] by codify-ing it into a series of integers using an appropriate OutcomeSpace. This is typically done by first sequentially transforming the data, then running a sliding window (whose width is controlled by outcome_space) across the data, and encoding the values within each window to an integer.

Examples

using CausalityTools
x, y = rand(100), rand(100)
d = CodifyVariables(OrdinalPatterns(m=2))
cx, cy = codify(d, x, y)
CausalityTools.ConditionalEntropyShannonType
ConditionalEntropyShannon <: ConditionalEntropy
ConditionalEntropyShannon(; base = 2)

The Shannon conditional entropy measure.

Usage

  • Use with association to compute the Shannon conditional entropy between two variables.

Compatible estimators

Discrete definition

Sum formulation

The conditional entropy between discrete random variables $X$ and $Y$ with finite ranges $\mathcal{X}$ and $\mathcal{Y}$ is defined as

\[H^{S}(X | Y) = -\sum_{x \in \mathcal{X}, y \in \mathcal{Y}} p(x, y) \log(p(x | y)).\]

This is the definition used when calling association with a JointProbabilities estimator.

Two-entropies formulation

Equivalently, the following difference of entropies holds

\[H^S(X | Y) = H^S(X, Y) - H^S(Y),\]

where $H^S(\cdot)$ and $H^S(\cdot | \cdot)$ are the Shannon entropy and Shannon joint entropy, respectively. This is the definition used when calling association with a ProbabilitiesEstimator.

Differential definition

The differential conditional Shannon entropy is analogously defined as

\[H^S(X | Y) = h^S(X, Y) - h^S(Y),\]

where $h^S(\cdot)$ and $h^S(\cdot | \cdot)$ are the Shannon differential entropy and Shannon joint differential entropy, respectively. This is the definition used when calling association with a DifferentialInfoEstimator.

Estimation
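
A minimal discrete sketch using the JointProbabilities estimator with a per-variable value-binning discretization (the binning choice is arbitrary):

using CausalityTools

x, y = rand(1000), rand(1000)
disc = CodifyVariables(ValueBinning(RectangularBinning(4)))
association(JointProbabilities(ConditionalEntropyShannon(), disc), x, y)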

CausalityTools.ConditionalEntropyTsallisAbeType
ConditionalEntropyTsallisAbe <: ConditionalEntropy
ConditionalEntropyTsallisAbe(; base = 2, q = 1.5)

Abe2001's discrete Tsallis conditional entropy measure.

Usage

  • Use with association to compute the Tsallis-Abe conditional entropy between two variables.

Compatible estimators

Definition

Abe & Rajagopal's Tsallis conditional entropy between discrete random variables $X$ and $Y$ with finite ranges $\mathcal{X}$ and $\mathcal{Y}$ is defined as

\[H_q^{T_A}(X | Y) = \dfrac{H_q^T(X, Y) - H_q^T(Y)}{1 + (1-q)H_q^T(Y)},\]

where $H_q^T(\cdot)$ and $H_q^T(\cdot, \cdot)$ are the Tsallis entropy and the joint Tsallis entropy, respectively.

Estimation

CausalityTools.ConditionalEntropyTsallisFuruichiType
ConditionalEntropyTsallisFuruichi <: ConditionalEntropy
ConditionalEntropyTsallisFuruichi(; base = 2, q = 1.5)

Furuichi (2006)'s discrete Tsallis conditional entropy definition.

Usage

  • Use with association to compute the Tsallis-Furuichi conditional entropy between two variables.

Compatible estimators

Definition

Furuichi's Tsallis conditional entropy between discrete random variables $X$ and $Y$ with finite ranges $\mathcal{X}$ and $\mathcal{Y}$ is defined as

\[H_q^T(X | Y) = -\sum_{x \in \mathcal{X}, y \in \mathcal{Y}} p(x, y)^q \log_q(p(x | y)),\]

where $\ln_q(x) = \frac{x^{1-q} - 1}{1 - q}$ and $q \neq 1$. For $q = 1$, $H_q^T(X | Y)$ reduces to the Shannon conditional entropy:

\[H_{q=1}^T(X | Y) = -\sum_{x \in \mathcal{X}, y \in \mathcal{Y}} p(x, y) \log(p(x | y))\]

If any of the entries of the marginal distribution for Y are zero, or the q-logarithm is undefined for a particular value, then the measure is undefined and NaN is returned.

Estimation

CausalityTools.ConvergentCrossMappingType
ConvergentCrossMapping <: CrossmapMeasure
ConvergentCrossMapping(; d::Int = 2, τ::Int = -1, w::Int = 0,
    f = Statistics.cor, embed_warn = true)

The convergent cross mapping measure Sugihara2012.

Usage

Compatible estimators

Description

The Theiler window w controls how many temporal neighbors are excluded during neighbor searches (w = 0 means that only the point itself is excluded). f is a function that computes the agreement between observations and predictions (the default, f = Statistics.cor, gives the Pearson correlation coefficient).

Embedding

Let S(i) be the source time series variable and T(i) be the target time series variable. This version produces regular embeddings with fixed dimension d and embedding lag τ as follows:

\[( S(i), S(i+\tau), S(i+2\tau), \ldots, S(i+(d-1)\tau), T(i))_{i=1}^{N-(d-1)\tau}.\]

In this joint embedding, neighbor searches are performed in the subspace spanned by the first D-1 variables, while the last (D-th) variable is to be predicted.

With this convention, τ < 0 implies "past/present values of source used to predict target", and τ > 0 implies "future/present values of source used to predict target". The latter case may not be meaningful for many applications, so by default, a warning will be given if τ > 0 (embed_warn = false turns off warnings).

Estimation
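
A minimal sketch, assuming the RandomVectors estimator follows the same constructor pattern as ExpandingSegment documented below (the definition as first argument, plus libsizes and rng keywords):

using CausalityTools
using Random; rng = MersenneTwister(1234)

x = rand(rng, 300)
y = 0.5 .* x .+ rand(rng, 300)

# one cross-map estimate per library size in libsizes
est = RandomVectors(ConvergentCrossMapping(d = 3); libsizes = 50:50:200, rng = rng)
association(est, x, y)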

CausalityTools.CorrTestType
CorrTest <: IndependenceTest
CorrTest()

An independence test based on correlation (for two variables) and partial correlation (for three variables) Levy1978; as described in Schmidt2018.

Uses PearsonCorrelation and PartialCorrelation internally.

Assumes that the input data are (multivariate) normally distributed. Then ρ(X, Y) = 0 implies X ⫫ Y and ρ(X, Y | 𝐙) = 0 implies X ⫫ Y | 𝐙.

Description

The null hypothesis is H₀ := ρ(X, Y | 𝐙) = 0. We use the approach in Levy & Narula (1978)Levy1978 and compute the Z-transformation of the observed (partial) correlation coefficient $\hat{\rho}_{XY|\bf{Z}}$:

\[Z(\hat{\rho}_{XY|\bf{Z}}) = \log\dfrac{1 + \hat{\rho}_{XY|\bf{Z}}}{1 - \hat{\rho}_{XY|\bf{Z}}}.\]

To test the null hypothesis against the alternative hypothesis H₁ := ρ(X, Y | 𝐙) > 0, calculate

\[\hat{Z} = \dfrac{1}{2}\dfrac{Z(\hat{\rho}_{XY|\bf{Z}}) - Z(0)}{\sqrt{1/(n - d - 3)}},\]

and compute the two-sided p-value (Schmidt et al., 2018)

\[p(X, Y | \bf{Z}) = 2(1 - \phi(\sqrt{n - d - 3}Z(\hat{\rho}_{XY|\bf{Z}}))),\]

where $d$ is the dimension of $\bf{Z}$ and $n$ is the number of samples. For the pairwise case, the procedure is identical, but set $\bf{Z} = \emptyset$.

Examples

  • Example 1. Pairwise and conditional tests for independence on coupled noise processes.
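
A minimal sketch of both the pairwise and conditional variants, assuming a simple chain x → y → z:

using CausalityTools

x = randn(1000)
y = x .+ randn(1000)
z = y .+ randn(1000)

independence(CorrTest(), x, z)     # expect a small p-value (dependence)
independence(CorrTest(), x, z, y)  # expect a large p-value (x and z are conditionally independent given y)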
CausalityTools.CorrTestResultType
CorrTestResult(pvalue, ρ, z)

A simple struct that holds the results of a CorrTest test: the (partial) correlation coefficient ρ, Fisher's z, and pvalue - the two-sided p-value for the test.

CausalityTools.CrossmapEstimatorType
CrossmapEstimator{M<:CrossmapMeasure, LIBSIZES, RNG}

The abstract supertype for all cross-map estimators.

Concrete subtypes

Description

Because the type of the library may differ between estimators, and because RNGs from different packages may be used, subtypes must implement the LIBSIZES and RNG type parameters.

For efficiency purposes, subtypes may contain mutable containers that can be re-used for ensemble analysis (see Ensemble).

Libraries

A cross-map estimator uses the concept of "libraries". A library is essentially just a reference to a set of points, and usually, a library refers to indices of points, not the actual points themselves.

For example, for timeseries, RandomVectors(libsizes = 50:25:100) produces three separate libraries, where the first contains 50 randomly selected time indices, the second contains 75 randomly selected time indices, and the third contains 100 randomly selected time indices. This of course assumes that all quantities involved can be indexed using the same time indices, meaning that the concept of "library" only makes sense after relevant quantities have been jointly embedded, so that they can be jointly indexed. For non-instantaneous prediction, the maximum possible library size shrinks with the magnitude of the index/time-offset for the prediction.

For spatial analyses (not yet implemented), indices could be more complex and involve multi-indices.

CausalityTools.DistanceCorrelationType
DistanceCorrelation

The distance correlation Szekely2007 measure quantifies potentially nonlinear associations between pairs of variables. If applied to three variables, the partial distance correlation Szekely2014 is computed.

Usage

  • Use with association to compute the raw (partial) distance correlation coefficient.
  • Use with independence to perform a formal hypothesis test for pairwise dependence.

Description

The distance correlation can be used to compute the association between two variables, or the conditional association between three variables, like so:

association(DistanceCorrelation(), x, y) → dcor ∈ [0, 1]
association(DistanceCorrelation(), x, y, z) → pdcor

With two variables, we compute dcor, which is called the empirical/sample distance correlation Szekely2007. With three variables, the partial distance correlation pdcor is computed Szekely2014.

Warn

A partial distance correlation distance_correlation(X, Y, Z) = 0 doesn't always guarantee conditional independence X ⫫ Y | Z. See Szekely2014 for an in-depth discussion.

CausalityTools.DivergenceOrDistanceType
DivergenceOrDistance <: BivariateInformationMeasure

The supertype for bivariate information measures aiming to quantify some sort of divergence, distance or closeness between two probability distributions.

Some of these measures are proper metrics, while others are not, but they have in common that they aim to quantify how "far from each other" two probability distributions are.

Concrete implementations

CausalityTools.EmbeddingTEType
EmbeddingTE(; dS = 1, dT = 1, dTf = 1, dC = 1, τS = -1, τT = -1, ηTf = 1, τC = -1)
EmbeddingTE(opt::OptimiseTraditional, s, t, [c])

EmbeddingTE provides embedding parameters for transfer entropy analysis using either TEShannon, TERenyiJizba, or in general any subtype of TransferEntropy.

The second method finds parameters using the "traditional" optimised embedding techniques from DynamicalSystems.jl.

Convention for generalized delay reconstruction

We use the following convention. Let $s(i)$ be time series for the source variable, $t(i)$ be the time series for the target variable and $c(i)$ the time series for the conditional variable. To compute transfer entropy, we need the following marginals:

\[\begin{aligned} T^{+} &= \{ t(i+\eta^1), t(i+\eta^2), \ldots, t(i+\eta^{d_{T^{+}}}) \} \\ T^{-} &= \{ t(i+\tau^0_{T}), t(i+\tau^1_{T}), t(i+\tau^2_{T}), \ldots, t(i + \tau^{d_{T} - 1}_{T}) \} \\ S^{-} &= \{ s(i+\tau^0_{S}), s(i+\tau^1_{S}), s(i+\tau^2_{S}), \ldots, s(i + \tau^{d_{S} - 1}_{S}) \} \\ C^{-} &= \{ c(i+\tau^0_{C}), c(i+\tau^1_{C}), c(i+\tau^2_{C}), \ldots, c(i + \tau^{d_{C} - 1}_{C}) \} \end{aligned}\]

Depending on the application, the delay reconstruction lags $\tau^k_{T} \leq 0$, $\tau^k_{S} \leq 0$, and $\tau^k_{C} \leq 0$ may be equally spaced or non-equally spaced. The same applies to the prediction lag(s), but typically only a single prediction lag $\eta^k$ is used (so that $d_{T^{+}} = 1$).

For transfer entropy, traditionally at least one $\tau^k_{T}$, one $\tau^k_{S}$ and one $\tau^k_{C}$ equals zero. This way, the $T^{-}$, $S^{-}$ and $C^{-}$ marginals always contain present/past states, while the $\mathcal{T}$ marginal contains future states relative to the other marginals. However, this is not a strict requirement, and modern approaches that search for optimal embeddings can return embeddings without the instantaneous lag.

Combined, we get the generalized delay reconstruction $\mathbb{E} = (T^{+}_{(d_{T^{+}})}, T^{-}_{(d_{T})}, S^{-}_{(d_{S})}, C^{-}_{(d_{C})})$. Transfer entropy is then computed as

\[\begin{aligned} TE_{S \rightarrow T | C} = \int_{\mathbb{E}} P(T^{+}, T^-, S^-, C^-) \log_{b}{\left(\frac{P(T^{+} | T^-, S^-, C^-)}{P(T^{+} | T^-, C^-)}\right)}, \end{aligned}\]

or, if conditionals are not relevant,

\[\begin{aligned} TE_{S \rightarrow T} = \int_{\mathbb{E}} P(T^{+}, T^-, S^-) \log_{b}{\left(\frac{P(T^{+} | T^-, S^-)}{P(T^{+} | T^-)}\right)}, \end{aligned}\]

Here,

  • $T^{+}$ denotes the $d_{T^{+}}$-dimensional set of vectors furnishing the future states of $T$ (almost always equal to 1 in practical applications),
  • $T^{-}$ denotes the $d_{T}$-dimensional set of vectors furnishing the past and present states of $T$,
  • $S^{-}$ denotes the $d_{S}$-dimensional set of vectors furnishing the past and present of $S$, and
  • $C^{-}$ denotes the $d_{C}$-dimensional set of vectors furnishing the past and present of $C$.

Keyword arguments

  • dS, dT, dC, dTf (f for future) are the dimensions of the $S^{-}$, $T^{-}$, $C^{-}$ and $T^{+}$ marginals. The parameters dS, dT, dC and dTf must each be a positive integer.
  • τS, τT, τC are the embedding lags for $S^{-}$, $T^{-}$, $C^{-}$. Each parameter is an integer ∈ 𝒩⁰⁻, or a vector of integers ∈ 𝒩⁰⁻, so that $S^{-}$, $T^{-}$, $C^{-}$ always represent present/past values. If e.g. τT is an integer, then the $T^-$ marginal is constructed using lags $\tau_{T} = \{0, \tau, 2\tau, \ldots, (d_{T}- 1)\tau_T \}$. If it is a vector, e.g. τΤ = [-1, -5, -7], then the dimension dT must match the lags, and precisely those lags are used: $\tau_{T} = \{-1, -5, -7 \}$.
  • The prediction lag(s) ηTf is a positive integer. Combined with the requirement that the other delay parameters are zero or negative, this ensures that we're always predicting from past/present to future. In typical applications, ηTf = 1 is used for transfer entropy.

Examples

Say we wanted to compute the Shannon transfer entropy $TE^S(S \to T) = I^S(T^+; S^- | T^-)$. Using some modern procedure for determining optimal embedding parameters using methods from DynamicalSystems.jl, we find that the optimal embedding of $T^{-}$ is three-dimensional and is given by the lags [0, -5, -8]. Using the same procedure, we find that the optimal embedding of $S^{-}$ is two-dimensional with lags $[-1, -8]$. We want to predict a univariate version of the target variable one time step into the future (ηTf = 1). The total embedding is then the set of embedding vectors

$E_{TE} = \{ (T(i+1), S(i-1), S(i-8), T(i), T(i-5), T(i-8)) \}$. Translating this to code, we get:

using CausalityTools
EmbeddingTE(dT=3, τT=[0, -5, -8], dS=2, τS=[-1, -8], ηTf=1)

# output
EmbeddingTE(dS=2, dT=3, dC=1, dTf=1, τS=[-1, -8], τT=[0, -5, -8], τC=-1, ηTf=1)
CausalityTools.EntropyDecompositionType
EntropyDecomposition(definition::MultivariateInformationMeasure, 
    est::DifferentialInfoEstimator)
EntropyDecomposition(definition::MultivariateInformationMeasure,
    est::DiscreteInfoEstimator,
    discretization::CodifyVariables{<:OutcomeSpace},
    pest::ProbabilitiesEstimator = RelativeAmount())

Estimate the multivariate information measure specified by definition by rewriting its formula into some combination of entropy terms.

If calling the second method (discrete variant), then discretization is always done per variable/column and each column is encoded into integers using codify.

Usage

Description

The entropy terms are estimated using est, and then combined to form the final estimate of definition. No bias correction is applied. If est is a DifferentialInfoEstimator, then discretization and pest are ignored. If est is a DiscreteInfoEstimator, then discretization and a probabilities estimator pest must also be provided (pest defaults to RelativeAmount, which uses naive plug-in probabilities).

Compatible differential information estimators

If using the first signature, any compatible DifferentialInfoEstimator can be used.

Compatible outcome spaces for discrete estimation

If using the second signature, the following outcome spaces can be used for discretisation. Note that not all outcome spaces will work with all measures.

| Estimator | Principle |
|-----------|-----------|
| UniqueElements | Count of unique elements |
| ValueBinning | Binning (histogram) |
| OrdinalPatterns | Ordinal patterns |
| Dispersion | Dispersion patterns |
| BubbleSortSwaps | Sorting complexity |
| CosineSimilarityBinning | Cosine similarities histogram |

Bias

Estimating the definition by decomposition into a combination of entropy terms, which are estimated independently, will in general be more biased than using a dedicated estimator. One reason is that this decomposition may miss out on crucial information in the joint space. To remedy this, dedicated information measure estimators typically derive the marginal estimates by first considering the joint space, and then do some clever trick to eliminate the bias that is introduced through a naive decomposition. Unless specified below, no bias correction is applied for EntropyDecomposition.

Handling of overlapping parameters

If there are overlapping parameters between the measure to be estimated, and the lower-level decomposed measures, then the top-level measure parameter takes precedence. For example, if we want to estimate CMIShannon(base = 2) through a decomposition of entropies using the Kraskov(Shannon(base = ℯ)) Shannon entropy estimator, then base = 2 is used.

Info

Not all measures have the property that they can be decomposed into more fundamental information theoretic quantities. For example, MITsallisMartin can be decomposed into a combination of marginal entropies, while MIRenyiSarbu cannot. An error will be thrown if decomposition is not possible.

Discrete entropy decomposition

The second signature is for discrete estimation using DiscreteInfoEstimators, for example PlugIn. The given discretization scheme (typically an OutcomeSpace) controls how the joint/marginals are discretized, and the probabilities estimator pest controls how probabilities are estimated from counts.

Bias

Like for DifferentialInfoEstimator, using a dedicated estimator for the measure in question will be more reliable than using a decomposition estimate. Here's how different discretizations are applied:

  • ValueBinning. Bin visitation frequencies are counted in the joint space XY, then marginal visitations are obtained from the joint bin visits. This behaviour is the same for both FixedRectangularBinning and RectangularBinning (which adapts the grid to the data). When using FixedRectangularBinning, the range along the first dimension is used as a template for all other dimensions. This is a bit slower than naively binning each marginal, but lessens bias.
  • OrdinalPatterns. Each timeseries is separately codify-ed according to its ordinal pattern (no bias correction).
  • Dispersion. Each timeseries is separately codify-ed according to its dispersion pattern (no bias correction).

Examples
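
Two minimal sketches of the differential and discrete signatures (the estimator and outcome-space choices are arbitrary):

using CausalityTools

x, y = rand(1000), rand(1000)

# differential: Shannon MI as a combination of Kraskov-estimated entropies
association(EntropyDecomposition(MIShannon(), Kraskov(Shannon(); k = 5)), x, y)

# discrete: plug-in entropies over a per-variable ordinal-pattern discretization
disc = CodifyVariables(OrdinalPatterns(m = 3))
association(EntropyDecomposition(MIShannon(), PlugIn(Shannon()), disc), x, y)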

See also: MutualInformationEstimator, MultivariateInformationMeasure.

CausalityTools.ExpandingSegmentType
ExpandingSegment <: CrossmapEstimator
ExpandingSegment(definition::CrossmapMeasure; libsizes, rng = Random.default_rng())

Cross map once over N = length(libsizes) different "point libraries", where point indices are selected as time-contiguous segments/windows.

This is the method from Sugihara2012. See CrossmapEstimator for an in-depth explanation of what "library" means in this context.

Description

Point index segments are selected as the first available data point index up to the L-th data point index. This results in one library of contiguous time indices per L ∈ libsizes.

If used in an ensemble setting, the estimator is applied to time indices Lmin:step:Lmax of the joint embedding.

Returns

The return type when used with association depends on the type of libsizes.

  • If libsizes is an Int (a single library), then a single cross-map estimate is returned.
  • If libsizes is an AbstractVector{Int} (multiple libraries), then a vector of cross-map estimates is returned, one per library (see the sketch below).
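
A minimal sketch (library sizes are arbitrary):

using CausalityTools

x, y = rand(300), rand(300)
est = ExpandingSegment(ConvergentCrossMapping(d = 2); libsizes = 50:50:200)
association(est, x, y)  # one cross-map estimate per library size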
CausalityTools.FPVPType
FPVP <: ConditionalMutualInformationEstimator
FPVP(definition = CMIShannon(); k = 1, w = 0)

The Frenzel-Pompe-Vejmelka-Paluš (or FPVP for short) ConditionalMutualInformationEstimator is used to estimate the conditional mutual information using a k-th nearest neighbor approach that is analogous to that of the KraskovStögbauerGrassberger1 mutual information estimator from Frenzel2007 and Vejmelka2008.

k is the number of nearest neighbors. w is the Theiler window, which controls the number of temporal neighbors that are excluded during neighbor searches.

Compatible definitions

Usage

Examples
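
A minimal sketch (parameter values are arbitrary):

using CausalityTools

x, y, z = rand(1000), rand(1000), rand(1000)
association(FPVP(CMIShannon(); k = 5), x, y, z)  # should be near 0 (and can be negative)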

CausalityTools.GaoKannanOhViswanathType
GaoKannanOhViswanath <: MutualInformationEstimator
GaoKannanOhViswanath(; k = 1, w = 0)

The GaoKannanOhViswanath (Shannon) estimator is designed for estimating Shannon mutual information between variables that may be either discrete, continuous or a mixture of both GaoKannanOhViswanath2017.

Compatible definitions

Usage

  • Use with association to compute Shannon mutual information from input data.
  • Use with some IndependenceTest to test for independence between variables.

Description

The estimator starts by expressing mutual information in terms of the Radon-Nikodym derivative, and then estimates these derivatives using k-nearest neighbor distances from empirical samples.

The estimator avoids the common issue of having to add noise to data before analysis due to tied points, which may bias other estimators. Citing their paper, the estimator "strongly outperforms natural baselines of discretizing the mixed random variables (by quantization) or making it continuous by adding a small Gaussian noise."

Implementation note

In GaoKannanOhViswanath2017, they claim (roughly speaking) that the estimator reduces to the KraskovStögbauerGrassberger1 estimator for continuous-valued data. However, KraskovStögbauerGrassberger1 uses the digamma function, while GaoKannanOhViswanath uses the logarithm instead, so the estimators are not exactly equivalent for continuous data.

Moreover, in their algorithm 1, it is clearly not the case that the method falls back on the KraskovStögbauerGrassberger1 approach. The KraskovStögbauerGrassberger1 estimator uses k-th neighbor distances in the joint space, while the GaoKannanOhViswanath algorithm selects the maximum k-th nearest distances among the two marginal spaces, which are in general not the same as the k-th neighbor distance in the joint space (unless both marginals are univariate). Therefore, our implementation here differs slightly from algorithm 1 in GaoKannanOhViswanath. We have modified it in a way that mimics KraskovStögbauerGrassberger1 for continuous data. Note that because of using the log function instead of digamma, there will be slight differences between the methods. See the source code for more details.

Explicitly convert your discrete data to floats

Even though the GaoKannanOhViswanath estimator is designed to handle discrete data, our implementation demands that all input data are StateSpaceSets whose data points are floats. If you have discrete data, such as strings or symbols, encode them using integers and convert those integers to floats before passing them to association.

Examples

using CausalityTools
using Random; rng = MersenneTwister(1234)
x = rand(rng, 10000); y = rand(rng, 10000)
association(GaoKannanOhViswanath(; k = 10), x, y) # should be near 0 (and can be negative)
CausalityTools.GaoOhViswanathType
GaoOhViswanath <: MutualInformationEstimator

The GaoOhViswanath is a mutual information estimator based on nearest neighbors, and is also called the bias-improved-KSG estimator, or BI-KSG, by Gao2018.

Compatible definitions

Usage

  • Use with association to compute Shannon mutual information from input data.
  • Use with some IndependenceTest to test for independence between variables.

Description

The estimator is given by

\[\begin{align*} \hat{H}_{GAO}(X, Y) &= \hat{H}_{KSG}(X) + \hat{H}_{KSG}(Y) - \hat{H}_{KZL}(X, Y) \\ &= \psi{(k)} + \log{(N)} + \log{ \left( \dfrac{c_{d_{x}, 2} c_{d_{y}, 2}}{c_{d_{x} + d_{y}, 2}} \right) } - \\ & \dfrac{1}{N} \sum_{i=1}^N \left( \log{(n_{x, i, 2})} + \log{(n_{y, i, 2})} \right) \end{align*},\]

where $c_{d, 2} = \dfrac{\pi^{\frac{d}{2}}}{\Gamma{(\dfrac{d}{2} + 1)}}$ is the volume of a $d$-dimensional unit $\ell_2$-ball.

Example

using CausalityTools
using Random; rng = MersenneTwister(1234)
x = rand(rng, 10000); y = rand(rng, 10000)
association(GaoOhViswanath(; k = 10), x, y) # should be near 0 (and can be negative)
CausalityTools.GaussianCMIType
GaussianCMI <: ConditionalMutualInformationEstimator
GaussianCMI(definition = CMIShannon(); normalize::Bool = false)

GaussianCMI is a parametric ConditionalMutualInformationEstimator Vejmelka2008.

Compatible definitions

Usage

Description

GaussianCMI estimates Shannon CMI as a difference of two mutual information terms, each estimated using GaussianMI (the normalize keyword is the same as for GaussianMI):

\[\hat{I}_{Gaussian}(X; Y | Z) = \hat{I}_{Gaussian}(X; Y, Z) - \hat{I}_{Gaussian}(X; Z)\]

Examples
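
A minimal sketch for independent normally distributed variables:

using CausalityTools

x, y, z = randn(1000), randn(1000), randn(1000)
association(GaussianCMI(CMIShannon()), x, y, z)  # should be near 0 (and can be negative)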

CausalityTools.GaussianMIType
GaussianMI <: MutualInformationEstimator
GaussianMI(; normalize::Bool = false)

GaussianMI is a parametric estimator for Shannon mutual information.

Compatible definitions

Usage

  • Use with association to compute Shannon mutual information from input data.
  • Use with some IndependenceTest to test for independence between variables.

Description

Given $d_x$-dimensional and $d_y$-dimensional input data X and Y, GaussianMI first constructs the $d_x + d_y$-dimensional joint StateSpaceSet XY. If normalize == true, then we follow the approach in Vejmelka & Palus (2008)Vejmelka2008 and transform each column in XY to have zero mean and unit standard deviation. If normalize == false, then the algorithm proceeds without normalization.

Next, $\Sigma$, the correlation matrix of the (possibly normalized) joint data XY, is computed. GaussianMI assumes the input variables are distributed according to normal distributions with zero means and unit standard deviations.

The mutual information (for normalize == false) is then estimated as

\[\hat{I}^S_{Gaussian}(X; Y) = \dfrac{1}{2} \log \left( \dfrac{ \det(\Sigma_X) \det(\Sigma_Y) }{\det(\Sigma)} \right),\]

where $\Sigma_X$ and $\Sigma_Y$ appear in $\Sigma$ as

\[\Sigma = \begin{bmatrix} \Sigma_{X} & \Sigma^{'}\\ \Sigma^{'} & \Sigma_{Y} \end{bmatrix}.\]

If normalize == true, then the mutual information is estimated as

\[\hat{I}^S_{Gaussian}(X; Y) = -\dfrac{1}{2} \sum_{i = 1}^{d_x + d_y} \log(\sigma_i),\]

where $\sigma_i$ are the eigenvalues for $\Sigma$.

Example

using CausalityTools
using Random; rng = MersenneTwister(1234)
x = rand(rng, 10000); y = rand(rng, 10000)
association(GaussianMI(), x, y) # should be near 0 (and can be negative)
CausalityTools.GraphAlgorithmType
GraphAlgorithm

The supertype of all causal graph inference algorithms.

Concrete implementations

  • OCE. The optimal causation entropy algorithm for time series graphs.
  • PC.
CausalityTools.HMeasureType
HMeasure <: AssociationMeasure
HMeasure(; K::Int = 2, dx = 2, dy = 2, τx = -1, τy = -1, w = 0)

The HMeasure Arnhold1999 is a pairwise association measure. It quantifies the probability with which close states of a target timeseries/embedding are mapped to close states of a source timeseries/embedding.

Note that τx and τy are negative by convention. See docstring for SMeasure for an explanation.

Usage

  • Use with association to compute the raw h-measure statistic.
  • Use with independence to perform a formal hypothesis test for directional dependence.

Description

The HMeasure Arnhold1999 is similar to the SMeasure, but the numerator of the formula is replaced by $R_i(x)$, the mean squared Euclidean distance to all other points, and there is a $\log$-term inside the sum:

\[H^{(k)}(x|y) = \dfrac{1}{N} \sum_{i=1}^{N} \log \left( \dfrac{R_i(x)}{R_i^{(k)}(x|y)} \right).\]

Parameters are the same and $R_i^{(k)}(x|y)$ is computed as for SMeasure.

See also: ClosenessMeasure.

CausalityTools.HellingerDistanceType
HellingerDistance <: DivergenceOrDistance

The Hellinger distance.

Usage

  • Use with association to compute the Hellinger distance between two pre-computed probability distributions, or from raw data using one of the estimators listed below.

Compatible estimators

Description

The Hellinger distance between two probability distributions $P_X = (p_x(\omega_1), \ldots, p_x(\omega_n))$ and $P_Y = (p_y(\omega_1), \ldots, p_y(\omega_m))$, both defined over the same OutcomeSpace $\Omega = \{\omega_1, \ldots, \omega_n \}$, is defined as

\[D_{H}(P_X(\Omega) || P_Y(\Omega)) = \dfrac{1}{\sqrt{2}} \sum_{\omega \in \Omega} (\sqrt{p_x(\omega)} - \sqrt{p_y(\omega)})^2\]

Estimation
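
A minimal sketch using the JointProbabilities estimator with a per-variable ordinal-pattern discretization (the outcome-space choice is arbitrary):

using CausalityTools

x, y = rand(1000), rand(1000)
disc = CodifyVariables(OrdinalPatterns(m = 3))
association(JointProbabilities(HellingerDistance(), disc), x, y)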

CausalityTools.HilbertType
Hilbert(est;
    source::InstantaneousSignalProperty = Phase(),
    target::InstantaneousSignalProperty = Phase(),
    cond::InstantaneousSignalProperty = Phase()
) <: TransferDifferentialEntropyEstimator

Compute transfer entropy on instantaneous phases/amplitudes of relevant signals, which are obtained by first applying the Hilbert transform to each signal, then extracting the phases/amplitudes of the resulting complex numbers Palus2014. Original time series are thus transformed to instantaneous phase/amplitude time series. Transfer entropy is then estimated using the provided est on those phases/amplitudes (use e.g. ValueBinning, or OrdinalPatterns).

Info

Details on estimation of the transfer entropy (conditional mutual information) following the phase/amplitude extraction step is not given in Palus (2014). Here, after instantaneous phases/amplitudes have been obtained, these are treated as regular time series, from which transfer entropy is then computed as usual.

See also: Phase, Amplitude.

CausalityTools.JDDTestResultType
JDDTestResult(Δjdd, hypothetical_μ, pvalue)

Holds the results of JointDistanceDistributionTest. Δjdd is the Δ-distribution, hypothetical_μ is the hypothetical mean of the Δ-distribution under the null, and pvalue is the p-value for the one-sided t-test.

CausalityTools.JointDistanceDistributionType
JointDistanceDistribution <: AssociationMeasure
JointDistanceDistribution(; metric = Euclidean(), B = 10, D = 2, τ = -1, μ = 0.0)

The joint distance distribution (JDD) measure Amigo2018.

Usage

  • Use with association to compute the joint distance distribution measure Δ from Amigo2018.
  • Use with independence to perform a formal hypothesis test for directional dependence.

Keyword arguments

  • metric::Metric: An instance of a valid distance metric from Distances.jl. Defaults to Euclidean().
  • B::Int: The number of equidistant subintervals to divide the interval [0, 1] into when comparing the normalised distances.
  • D::Int: Embedding dimension.
  • τ::Int: Embedding delay. By convention, τ is negative.
  • μ: The hypothetical mean value of the joint distance distribution if there is no coupling between x and y (default is μ = 0.0).

Description

From input time series $x(t)$ and $y(t)$, we first construct the delay embeddings (note the positive sign in the embedding lags; therefore the input parameter τ is by convention negative).

\[\begin{align*} \{\bf{x}_i \} &= \{(x_i, x_{i+\tau}, \ldots, x_{i+(d_x - 1)\tau}) \} \\ \{\bf{y}_i \} &= \{(y_i, y_{i+\tau}, \ldots, y_{i+(d_y - 1)\tau}) \} \\ \end{align*}\]

The algorithm then proceeds to analyze the distribution of distances between points of these embeddings, as described in Amigo2018.

Examples
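
A minimal sketch (parameter values are arbitrary; the return value is the Δ-distribution):

using CausalityTools

x = rand(1000)
y = 0.5 .* x .+ rand(1000)
association(JointDistanceDistribution(D = 3, B = 10), x, y)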

CausalityTools.JointDistanceDistributionTestType
JointDistanceDistributionTest <: IndependenceTest
JointDistanceDistributionTest(measure::JointDistanceDistribution; rng = Random.default_rng())

An independence test for two variables based on the JointDistanceDistribution Amigo2018.

When used with independence, a JDDTestResult is returned.

Description

The joint distance distribution (labelled Δ in their paper) is used by Amigó & Hirata (2018) to detect directional couplings of the form $X \to Y$ or $Y \to X$. JointDistanceDistributionTest formulates their method as an independence test.

Formally, we test the hypothesis $H_0$ (the variables are independent) against $H_1$ (there is directional coupling between the variables). To do so, we use a right-sided/upper-tailed t-test to check whether the mean of Δ is skewed towards positive values, i.e.

  • $H_0 := \mu(\Delta) = 0$
  • $H_1 := \mu(\Delta) > 0$.

When used with independence, a JDDTestResult is returned, which contains the joint distance distribution and a p-value. If you only need Δ, use association with a JointDistanceDistribution instance directly.

Examples

  • Example 1. Detecting (in)dependence in bidirectionally coupled logistic maps.
CausalityTools.JointEntropyRenyiType
JointEntropyRenyi <: JointEntropy
JointEntropyRenyi(; base = 2, q = 1.5)

The Rényi joint entropy measure Golshani2009.

Usage

  • Use with association to compute the Golshani-Rényi joint entropy between two variables.

Compatible estimators

Definition

Given two discrete random variables $X$ and $Y$ with ranges $\mathcal{X}$ and $\mathcal{Y}$, Golshani2009 defines the Rényi joint entropy as

\[H_q^R(X, Y) = \dfrac{1}{1-q} \log \sum_{i = 1}^N p_i^q,\]

where $q > 0$ and $q \neq 1$.

Estimation

CausalityTools.JointEntropyShannonType
JointEntropyShannon <: JointEntropy
JointEntropyShannon(; base = 2)

The Shannon joint entropy measure CoverThomas1999.

Usage

  • Use with association to compute the Shannon joint entropy between two variables.

Compatible estimators

Definition

Given two discrete random variables $X$ and $Y$ with ranges $\mathcal{X}$ and $\mathcal{Y}$, CoverThomas1999 defines the Shannon joint entropy as

\[H^S(X, Y) = -\sum_{x\in \mathcal{X}, y \in \mathcal{Y}} p(x, y) \log p(x, y),\]

where we define $\log(p(x, y)) := 0$ if $p(x, y) = 0$.

Estimation

CausalityTools.JointEntropyTsallisType
JointEntropyTsallis <: JointEntropy
JointEntropyTsallis(; base = 2, q = 1.5)

The Tsallis joint entropy definition from Furuichi2006.

Usage

  • Use with association to compute the Furuichi-Tsallis joint entropy between two variables.

Compatible estimators

Definition

Given two discrete random variables $X$ and $Y$ with ranges $\mathcal{X}$ and $\mathcal{Y}$, Furuichi2006 defines the Tsallis joint entropy as

\[H_q^T(X, Y) = -\sum_{x\in \mathcal{X}, y \in \mathcal{Y}} p(x, y)^q \log_q p(x, y),\]

where $\log_q(x) = \dfrac{x^{1-q} - 1}{1-q}$ is the q-logarithm, and we define $\log_q(0) := 0$.

Estimation

CausalityTools.JointProbabilitiesType
JointProbabilities <: InformationMeasureEstimator
JointProbabilities(
    definition::MultivariateInformationMeasure,
    discretization::Discretization
)

JointProbabilities is a generic estimator for multivariate discrete information measures.

Usage

  • Use with association to compute an information measure from input data.

Description

It first encodes the input data according to the given discretization, then constructs probs, a multidimensional Probabilities instance. Finally, probs are forwarded to a PlugIn estimator, which computes the measure according to definition.

Compatible encoding schemes

  • CodifyVariables (encode each variable/column of the input data independently by applying an encoding in a sliding window over each input variable).
  • CodifyPoints (encode each point of the input data)

Works for any OutcomeSpace that implements codify.
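
For example, a minimal sketch estimating Shannon mutual information with a per-variable value-binning discretization (the binning choice is arbitrary):

using CausalityTools

x, y = rand(1000), rand(1000)
disc = CodifyVariables(ValueBinning(RectangularBinning(4)))
association(JointProbabilities(MIShannon(), disc), x, y)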

Joint probabilities vs decomposition methods

Using JointProbabilities to compute an information measure, e.g. conditional mutual information, is typically slower than other dedicated estimation procedures like EntropyDecomposition. The reason is that measures such as CMIShannon can be formulated as a sum of four entropies, which can be estimated individually and summed afterwards. This decomposition is fast because we avoid explicitly estimating the entire joint pmf, which demands many extra calculation steps. However, the decomposition is biased, because it fails to fully take into consideration the joint relationships between the variables. Pick your estimator according to your needs.

See also: Counts, Probabilities, ProbabilitiesEstimator, OutcomeSpace, DiscreteInfoEstimator.

CausalityTools.KLDivergenceType
KLDivergence <: DivergenceOrDistance

The Kullback-Leibler (KL) divergence.

Usage

  • Use with association to compute the KL-divergence between two pre-computed probability distributions, or from raw data using one of the estimators listed below.

Compatible estimators

Description

The KL-divergence between two probability distributions $P_X = (p_x(\omega_1), \ldots, p_x(\omega_n))$ and $P_Y = (p_y(\omega_1), \ldots, p_y(\omega_m))$, both defined over the same OutcomeSpace $\Omega = \{\omega_1, \ldots, \omega_n \}$, is defined as

\[D_{KL}(P_X(\Omega) || P_Y(\Omega)) = \sum_{\omega \in \Omega} p_x(\omega) \log\dfrac{p_x(\omega)}{p_y(\omega)}\]

Implements

Note

Distances.jl also defines KLDivergence. Qualify it if you're loading both packages, i.e. do association(CausalityTools.KLDivergence(), x, y).

Estimation

CausalityTools.KraskovStögbauerGrassberger1Type
KSG1 <: MutualInformationEstimator
KraskovStögbauerGrassberger1 <: MutualInformationEstimator
KraskovStögbauerGrassberger1(; k::Int = 1, w = 0, metric_marginals = Chebyshev())

The KraskovStögbauerGrassberger1 mutual information estimator (you can use KSG1 for short) is the $I^{(1)}$ k-th nearest neighbor estimator from Kraskov2004.

Compatible definitions

Usage

  • Use with association to compute Shannon mutual information from input data.
  • Use with some IndependenceTest to test for independence between variables.

Keyword arguments

  • k::Int: The number of nearest neighbors to consider. Only information about the k-th nearest neighbor is actually used.
  • metric_marginals: The distance metric for the marginals can be any metric from Distances.jl. It defaults to metric_marginals = Chebyshev(), which is the same as in Kraskov et al. (2004).
  • w::Int: The Theiler window, which determines if temporal neighbors are excluded during neighbor searches in the joint space. Defaults to 0, meaning that only the point itself is excluded.

Example

using CausalityTools
using Random; rng = MersenneTwister(1234)
x = rand(rng, 10000); y = rand(rng, 10000)
association(KSG1(; k = 10), x, y) # should be near 0 (and can be negative)
CausalityTools.KraskovStögbauerGrassberger2Type
KSG2 <: MutualInformationEstimator
KraskovStögbauerGrassberger2 <: MutualInformationEstimator
KraskovStögbauerGrassberger2(; k::Int = 1, w = 0, metric_marginals = Chebyshev())

The KraskovStögbauerGrassberger2 mutual information estimator (you can use KSG2 for short) is the $I^{(2)}$ k-th nearest neighbor estimator from Kraskov2004.

Compatible definitions

Usage

  • Use with association to compute Shannon mutual information from input data.
  • Use with some IndependenceTest to test for independence between variables.

Keyword arguments

  • k::Int: The number of nearest neighbors to consider. Only information about the k-th nearest neighbor is actually used.
  • metric_marginals: The distance metric for the marginals can be any metric from Distances.jl. It defaults to metric_marginals = Chebyshev(), which is the same as in Kraskov et al. (2004).
  • w::Int: The Theiler window, which determines if temporal neighbors are excluded during neighbor searches in the joint space. Defaults to 0, meaning that only the point itself is excluded.

Description

Let the joint StateSpaceSet $X := \{\bf{X}_1, \bf{X_2}, \ldots, \bf{X}_m \}$ be defined by the concatenation of the marginal StateSpaceSets $\{ \bf{X}_k \}_{k=1}^m$, where each $\bf{X}_k$ is potentially multivariate. Let $\bf{x}_1, \bf{x}_2, \ldots, \bf{x}_N$ be the points in the joint space $X$.

The KraskovStögbauerGrassberger2 estimator first locates, for each $\bf{x}_i \in X$, the point $\bf{n}_i \in X$, the k-th nearest neighbor to $\bf{x}_i$, according to the maximum norm (Chebyshev metric). Let $\epsilon_i$ be the distance $d(\bf{x}_i, \bf{n}_i)$.

Consider $x_i^m \in \bf{X}_m$, the $i$-th point in the marginal space $\bf{X}_m$. For each $\bf{x}_i^m$, we determine $\theta_i^m$ := the number of points $\bf{x}_k^m \in \bf{X}_m$ that are a distance less than $\epsilon_i$ away from $\bf{x}_i^m$. That is, we use the distance from a query point $\bf{x}_i \in X$ (in the joint space) to count neighbors of $x_i^m \in \bf{X}_m$ (in the marginal space).

Shannon mutual information between the variables $\bf{X}_1, \bf{X_2}, \ldots, \bf{X}_m$ is then estimated as

\[\hat{I}_{KSG2}(\bf{X}) = \psi{(k)} - \dfrac{m - 1}{k} + (m - 1)\psi{(N)} - \dfrac{1}{N} \sum_{i = 1}^N \sum_{j = 1}^m \psi{(\theta_i^j + 1)}\]

Example

using CausalityTools
using Random; rng = MersenneTwister(1234)
x = rand(rng, 10000); y = rand(rng, 10000)
association(KSG2(; k = 10), x, y) # should be near 0 (and can be negative)
CausalityTools.LMeasureType
LMeasure <: ClosenessMeasure
LMeasure(; K::Int = 2, dx = 2, dy = 2, τx = -1, τy = -1, w = 0)

The LMeasure Chicharro2009 is a pairwise association measure. It quantifies the probability with which close states of a target timeseries/embedding are mapped to close states of a source timeseries/embedding.

Note that τx and τy are negative by convention. See docstring for SMeasure for an explanation.

Usage

  • Use with association to compute the raw L-measure statistic.
  • Use with independence to perform a formal hypothesis test for directional dependence.

Description

LMeasure is similar to MMeasure, but uses distance ranks instead of the raw distances.

Let $\bf{x_i}$ be an embedding vector, and let $g_{i,j}$ denote the rank of the distance between $\bf{x_i}$ and some other vector $\bf{x_j}$ in an ascending-ordered list of the distances between $\bf{x_i}$ and all $\bf{x_{j \neq i}}$. In other words, $g_{i,j}$ ranks the $N-1$ distances from $\bf{x_i}$ to the other points.

LMeasure is then defined as

\[L^{(k)}(x|y) = \dfrac{1}{N} \sum_{i=1}^{N} \log \left( \dfrac{G_i(x) - G_i^{(k)}(x|y)}{G_i(x) - G_i^k(x)} \right),\]

where $G_i(x) = \frac{N}{2}$ and $G_i^K(x) = \frac{k+1}{2}$ are the mean and minimal rank, respectively.

The $y$-conditioned mean rank is defined as

\[G_i^{(k)}(x|y) = \dfrac{1}{K}\sum_{j=1}^{K} g_{i,w_{i, j}},\]

where $w_{i,j}$ is the index of the $j$-th nearest neighbor of $\bf{y_i}$.

See also: ClosenessMeasure.

CausalityTools.LindnerType
Lindner <: TransferEntropyEstimator
Lindner(definition = TEShannon(); k = 1, w = 0, base = 2)

The Lindner transfer entropy estimator Lindner2011, which is also used in the Trentool MATLAB toolbox, and is based on nearest neighbor searches.

Compatible definitions

Usage

Keyword parameters

w is the Theiler window, which determines if temporal neighbors are excluded during neighbor searches (defaults to 0, meaning that only the point itself is excluded when searching for neighbours).

The estimator can be used both for pairwise and conditional transfer entropy estimation.

Description

For a given point jᵢ in the joint embedding space, this estimator first computes the distance dᵢ from jᵢ to its k-th nearest neighbor. Then, for each point mₖ[i] in the k-th marginal space, it counts the number of points within radius dᵢ.

The Shannon transfer entropy is then computed as

\[TE_S(X \to Y) = \psi(k) + \dfrac{1}{N} \sum_{i}^n \left[ \sum_{k=1}^3 \left( \psi(m_k[i] + 1) \right) \right],\]

where the index k references the three marginal subspaces T, TTf and ST for which neighbor searches are performed. Here this estimator has been modified to allow for conditioning too (a simple modification to Lindner2011's equation 5 and 6).

Example

using CausalityTools
using Random; rng = MersenneTwister(1234)
x = rand(rng, 10000)
y = rand(rng, 10000) .+ x
z = rand(rng, 10000) .+ y
est = Lindner(TEShannon(), k = 10)
association(est, x, z, y) # should be near 0 (and can be negative)
CausalityTools.LocalPermutationTestType
LocalPermutationTest <: IndependenceTest
LocalPermutationTest(measure, [est];
    kperm::Int = 5,
    nshuffles::Int = 100,
    rng = Random.default_rng(),
    replace = true,
    w::Int = 0,
    show_progress = false)

LocalPermutationTest is a generic conditional independence test Runge2018LocalPerm for assessing whether two variables X and Y are conditionally independent given a third variable Z (all of which may be multivariate).

When used with independence, a LocalPermutationTestResult is returned.

Description

This is a generic one-sided hypothesis test that checks whether X and Y are independent (given Z, if provided) based on resampling from a null distribution assumed to represent independence between the variables. The null distribution is generated by repeatedly shuffling the input data in some way that is intended to break any dependence between x and y, but preserve dependencies between x and z.

The algorithm is as follows:

  1. Compute the original conditional independence statistic I(X; Y | Z).
  2. Allocate a scalar valued vector with space for nshuffles elements.
  3. For k ∈ [1, 2, …, nshuffles], repeat
    • For each zᵢ ∈ Z, let nᵢ be the time indices of the kperm nearest neighbors of zᵢ, excluding the w nearest neighbors of zᵢ from the neighbor query (i.e. w is the Theiler window).
    • Let xᵢ⋆ = X[j], where j is randomly sampled from nᵢ with replacement. This way, xᵢ is replaced with xⱼ only if zᵢ ≈ zⱼ (zᵢ and zⱼ are close). Repeat for i = 1, 2, …, n and obtain the shuffled X̂ = [x̂₁, x̂₂, …, x̂ₙ].
    • Compute the conditional independence statistic Iₖ(X̂; Y | Z).
    • Let Î[k] = Iₖ(X̂; Y | Z).
  4. Compute the p-value as count(Î .>= I) / nshuffles.

In addition to the conditional variant from Runge (2018), we also provide a pairwise version, where the shuffling procedure is identical, except that neighbors in Y are used instead of Z, and we compute I(X; Y) and Iₖ(X̂; Y) instead of I(X; Y | Z) and Iₖ(X̂; Y | Z).

Compatible measures

| Measure | Requires est | Note |
| --- | --- | --- |
| PartialCorrelation | No | |
| DistanceCorrelation | No | |
| CMIShannon | Yes | |
| TEShannon | Yes | Pairwise tests are not possible with TransferEntropyEstimators, only with lower-level estimators, e.g. FPVP, GaussianMI or Kraskov. |
| PartialMutualInformation | Yes | |

The LocalPermutationTest is only defined for conditional independence testing. Exceptions are for measures like TEShannon, which use conditional measures under the hood even for their pairwise variants, and are therefore compatible with LocalPermutationTest.

The nearest-neighbor approach in Runge (2018) can be reproduced by using the CMIShannon measure with the FPVP estimator.

Examples
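A minimal sketch of a conditional independence test where X and Y are only linked through Z; the (measure, estimator) combination mirrors the defaults used by OCE elsewhere in this documentation, and the data are illustrative.

using CausalityTools
using Random; rng = MersenneTwister(1234)
x = rand(rng, 1000)
z = rand(rng, 1000) .+ x
y = rand(rng, 1000) .+ z
# X ⫫ Y | Z holds by construction, so the test should not reject the null
test = LocalPermutationTest(CMIShannon(), MesnerShalizi(k = 3, w = 3); nshuffles = 30)
independence(test, x, y, z)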

CausalityTools.LocalPermutationTestResultType
LocalPermutationTestResult(m, m_surr, pvalue)

Holds the result of a LocalPermutationTest. m is the measure computed on the original data. m_surr is a vector of the measure computed on permuted data, where m_surr[i] is the measure computed on the i-th permutation. pvalue is the one-sided p-value for the test.

CausalityTools.MCRType
MCR <: AssociationMeasure
MCR(; r, metric = Euclidean())

An association measure based on mean conditional probabilities of recurrence (MCR) introduced by Romano2007.

Usage

  • Use with association to compute the raw MCR for pairwise or conditional association.
  • Use with IndependenceTest to perform a formal hypothesis test for pairwise or conditional association.

Description

r is a mandatory keyword which specifies the recurrence threshold used when constructing recurrence matrices. It can be an instance of any subtype of AbstractRecurrenceType from RecurrenceAnalysis.jl. To use any r that is not a real number, you must first do using RecurrenceAnalysis. The metric is any valid metric from Distances.jl.

For input variables X and Y, the conditional probability of recurrence is defined as

\[M(X | Y) = \dfrac{1}{N} \sum_{i=1}^N p(\bf{y_i} | \bf{x_i}) = \dfrac{1}{N} \sum_{i=1}^N \dfrac{\sum_{j=1}^N J_{R_{i, j}}^{X, Y}}{\sum_{j=1}^N R_{i, j}^X},\]

where $R_{i, j}^X$ is the recurrence matrix and $J_{R_{i, j}}^{X, Y}$ is the joint recurrence matrix, constructed using the given metric. The measure $M(Y | X)$ is defined analogously.

Romano2007's interpretation of this quantity is that if X drives Y, then M(X|Y) > M(Y|X), if Y drives X, then M(Y|X) > M(X|Y), and if coupling is symmetric, then M(Y|X) = M(X|Y).

Input data

X and Y can be either both univariate timeseries, or both multivariate StateSpaceSets.

Estimation
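A minimal sketch using only the mandatory recurrence threshold r; the parameter values and data are illustrative.

using CausalityTools
using Random; rng = MersenneTwister(1234)
x, y = rand(rng, 500), rand(rng, 500)
m = MCR(r = 0.5) # the recurrence threshold r is mandatory
association(m, x, y) # raw MCR statistic for the pair (x, y)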

CausalityTools.MIDecompositionType
MIDecomposition(definition::MultivariateInformationMeasure, 
    est::MutualInformationEstimator)

Estimate the MultivariateInformationMeasure specified by definition by decomposing the measure, if possible, into a combination of mutual information terms. These terms are individually estimated using the given MutualInformationEstimator est, and are finally combined to form the value of the measure.

Usage

Examples

See also: MultivariateInformationMeasureEstimator.

CausalityTools.MIRenyiJizbaType
MIRenyiJizba <: BivariateInformationMeasure
MIRenyiJizba(; q = 1.5, base = 2)

The Rényi mutual information $I_q^{R_{J}}(X; Y)$ defined in Jizba2012.

Usage

  • Use with association to compute the raw Rényi-Jizba mutual information from input data using one of the estimators listed below.
  • Use with independence to perform a formal hypothesis test for pairwise dependence using the Rényi-Jizba mutual information.

Compatible estimators

Definition

\[I_q^{R_{J}}(X; Y) = H_q^{R}(X) + H_q^{R}(Y) - H_q^{R}(X, Y),\]

where $H_q^{R}(\cdot)$ is the Rényi entropy.

Estimation

CausalityTools.MIRenyiSarbuType
MIRenyiSarbu <: BivariateInformationMeasure
MIRenyiSarbu(; base = 2, q = 1.5)

The discrete Rényi mutual information from Sarbu2014.

Usage

  • Use with association to compute the raw Rényi-Sarbu mutual information from input data using one of the estimators listed below.
  • Use with independence to perform a formal hypothesis test for pairwise dependence using the Rényi-Sarbu mutual information.

Compatible estimators

Description

Sarbu (2014) defines discrete Rényi mutual information as the Rényi $\alpha$-divergence between the conditional joint probability mass function $p(x, y)$ and the product of the conditional marginals, $p(x) \cdot p(y)$:

\[I(X, Y)^R_q = \dfrac{1}{q-1} \log \left( \sum_{x \in X, y \in Y} \dfrac{p(x, y)^q}{\left( p(x)\cdot p(y) \right)^{q-1}} \right)\]

Estimation

CausalityTools.MIShannonType
MIShannon <: BivariateInformationMeasure
MIShannon(; base = 2)

The Shannon mutual information $I_S(X; Y)$.

Usage

  • Use with association to compute the raw Shannon mutual information from input data using one of the estimators listed below.
  • Use with independence to perform a formal hypothesis test for pairwise dependence using the Shannon mutual information.

Compatible estimators

Discrete definition

There are many equivalent formulations of discrete Shannon mutual information, meaning that it can be estimated in several ways, either using JointProbabilities (double-sum formulation), EntropyDecomposition (three-entropies decomposition), or some dedicated estimator.

Double sum formulation

Assume we observe $N$ sample pairs from two discrete random variables $X$ and $Y$ with finite supports $\mathcal{X} = \{ x_1, x_2, \ldots, x_{M_x} \}$ and $\mathcal{Y} = \{ y_1, y_2, \ldots, y_{M_y} \}$. The double-sum estimate is obtained by replacing the probabilities in the double sum

\[\hat{I}_{DS}(X; Y) = \sum_{x_i \in \mathcal{X}, y_j \in \mathcal{Y}} \hat{p}(x_i, y_j) \log \left( \dfrac{\hat{p}(x_i, y_j)}{\hat{p}(x_i)\hat{p}(y_j)} \right),\]

with the relative frequencies $\hat{p}(x_i) = \frac{n(x_i)}{N}$, $\hat{p}(y_j) = \frac{n(y_j)}{N}$ and $\hat{p}(x_i, y_j) = \frac{n(x_i, y_j)}{N}$, where $n(\cdot)$ counts occurrences among the $N$ observed samples. This definition is used by association when called with a JointProbabilities estimator.

Three-entropies formulation

An equivalent formulation of discrete Shannon mutual information is

\[I^S(X; Y) = H^S(X) + H^S(Y) - H^S(X, Y),\]

where $H^S(\cdot)$ and $H^S(\cdot, \cdot)$ are the marginal and joint discrete Shannon entropies. This definition is used by association when called with an EntropyDecomposition estimator and a discretization.

Differential mutual information

One possible formulation of differential Shannon mutual information is

\[I^S(X; Y) = h^S(X) + h^S(Y) - h^S(X, Y),\]

where $h^S(\cdot)$ and $h^S(\cdot, \cdot)$ are the marginal and joint differential Shannon entropies. This definition is used by association when called with an EntropyDecomposition estimator and a DifferentialInfoEstimator.

Estimation
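A minimal sketch of estimating MIShannon from raw data with a nearest-neighbor estimator. KSG2(k = 3, w = 3) mirrors the OCE defaults elsewhere in this documentation; that KSG2 targets MIShannon by default is an assumption here.

using CausalityTools
using Random; rng = MersenneTwister(1234)
x = randn(rng, 1000)
y = x .+ randn(rng, 1000)
association(KSG2(k = 3, w = 3), x, y) # positive for dependent x and y (KSG2 assumed to default to MIShannon)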

CausalityTools.MITsallisFuruichiType
MITsallisFuruichi <: BivariateInformationMeasure
MITsallisFuruichi(; base = 2, q = 1.5)

The discrete Tsallis mutual information from Furuichi2006, which in that paper is called the mutual entropy.

Usage

  • Use with association to compute the raw Tsallis-Furuichi mutual information from input data using one of the estimators listed below.
  • Use with independence to perform a formal hypothesis test for pairwise dependence using the Tsallis-Furuichi mutual information.

Compatible estimators

Description

Furuichi's Tsallis mutual entropy between variables $X \in \mathbb{R}^{d_X}$ and $Y \in \mathbb{R}^{d_Y}$ is defined as

\[I_q^T(X; Y) = H_q^T(X) - H_q^T(X | Y) = H_q^T(X) + H_q^T(Y) - H_q^T(X, Y),\]

where $H_q^T(\cdot)$ and $H_q^T(\cdot, \cdot)$ are the marginal and joint Tsallis entropies, and $q$ is the Tsallis parameter.

Estimation

CausalityTools.MITsallisMartinType
MITsallisMartin <: BivariateInformationMeasure
MITsallisMartin(; base = 2, q = 1.5)

The discrete Tsallis mutual information from Martin2004.

Usage

  • Use with association to compute the raw Tsallis-Martin mutual information from input data using one of the estimators listed below.
  • Use with independence to perform a formal hypothesis test for pairwise dependence using the Tsallis-Martin mutual information.

Compatible estimators

Description

Martin et al.'s Tsallis mutual information between variables $X \in \mathbb{R}^{d_X}$ and $Y \in \mathbb{R}^{d_Y}$ is defined as

\[I_{\text{Martin}}^T(X, Y, q) := H_q^T(X) + H_q^T(Y) - (1 - q) H_q^T(X) H_q^T(Y) - H_q^T(X, Y),\]

where $H_q^T(\cdot)$ and $H_q^T(\cdot, \cdot)$ are the marginal and joint Tsallis entropies, and $q$ is the Tsallis parameter.

Estimation

CausalityTools.MMeasureType
MMeasure <: ClosenessMeasure
MMeasure(; K::Int = 2, dx = 2, dy = 2, τx = - 1, τy = -1, w = 0)

The MMeasure Andrzejak2003 is a pairwise association measure. It quantifies the probability with which close states of a target timeseries/embedding are mapped to close states of a source timeseries/embedding.

Note that τx and τy are negative by convention. See docstring for SMeasure for an explanation.

Usage

  • Use with association to compute the raw m-measure statistic.
  • Use with independence to perform a formal hypothesis test for directional dependence.

Description

The MMeasure is based on SMeasure and HMeasure. It is given by

\[M^{(k)}(x|y) = \dfrac{1}{N} \sum_{i=1}^{N} \log \left( \dfrac{R_i(x) - R_i^{(k)}(x|y)}{R_i(x) - R_i^k(x)} \right),\]

where $R_i(x)$ is computed as for HMeasure, while $R_i^k(x)$ and $R_i^{(k)}(x|y)$ are computed as for SMeasure. Parameters also have the same meaning as for SMeasure/HMeasure.

See also: ClosenessMeasure.

CausalityTools.MesnerShaliziType
MesnerShalizi <: ConditionalMutualInformationEstimator
MesnerShalizi(definition = CMIShannon(); k = 1, w = 0)

The MesnerShalizi ConditionalMutualInformationEstimator is designed for data that can be mixtures of discrete and continuous data Mesner2020.

k is the number of nearest neighbors. w is the Theiler window, which controls the number of temporal neighbors that are excluded during neighbor searches.

Compatible definitions

Usage

Examples

using CausalityTools
using Random; rng = MersenneTwister(1234)
x = rand(rng, 10000)
y = rand(rng, 10000) .+ x
z = rand(rng, 10000) .+ y
association(MesnerShalizi(; k = 10), x, z, y) # should be near 0 (and can be negative)
CausalityTools.MultivariateInformationMeasureType
MultivariateInformationMeasure <: AssociationMeasure

The supertype for all multivariate information-based measure definitions.

Definition

Following Datseris2024, we define a multivariate information measure as any functional of a multidimensional probability mass function (PMF) or probability density.

Implementations

JointEntropy definitions:

ConditionalEntropy definitions:

DivergenceOrDistance definitions:

MutualInformation definitions:

ConditionalMutualInformation definitions:

TransferEntropy definitions:

Other definitions:

CausalityTools.MultivariateInformationMeasureEstimatorType
CausalityTools.OCEType
OCE <: GraphAlgorithm
OCE(; utest::IndependenceTest = SurrogateAssociationTest(MIShannon(), KSG2(k = 3, w = 3)),
      ctest::C = LocalPermutationTest(CMIShannon(), MesnerShalizi(k = 3, w = 3)),
      τmax::T = 1, α = 0.05)

The optimal causation entropy (OCE) algorithm for causal discovery Sun2015.

Description

The OCE algorithm has three steps to determine the parents of a variable xᵢ.

  1. Perform pairwise independence tests using utest and select the variable xⱼ(-τ) that has the highest significant (i.e. with associated p-value below α) association with xᵢ(0). Assign it to the set of selected parents P.
  2. Perform conditional independence tests using ctest, finding the parent Pₖ that has the highest association with xᵢ given the already selected parents, and add it to P. Repeat until no more variables with significant association are found.
  3. Backwards elimination of parents Pₖ of xᵢ(0) for which xᵢ(0) ⫫ Pₖ | P - {Pₖ}, where P is the set of parent nodes found in the previous steps.

τmax indicates the maximum lag τ between the target variable xᵢ(0) and its potential parents xⱼ(-τ). Sun et al. 2015's method is based on τmax = 1.

Returns

When used with infer_graph, it returns a vector p, where p[i] are the parents for each input variable. This result can be converted to a SimpleDiGraph from Graphs.jl (see example).

Usage

OCE is used with infer_graph to infer the parents of the input data. Input data must either be a Vector{Vector{<:Real}}, or a StateSpaceSet.

Examples
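A minimal sketch using the default pairwise/conditional tests on a toy chain x → y → z; the data-generating model is illustrative only.

using CausalityTools
using Random; rng = MersenneTwister(1234)
x = rand(rng, 500)
y = [0.0; 0.6 .* x[1:end-1]] .+ 0.2 .* rand(rng, 500)
z = [0.0; 0.6 .* y[1:end-1]] .+ 0.2 .* rand(rng, 500)
parents = infer_graph(OCE(τmax = 1), [x, y, z]) # Vector{OCESelectedParents}, one per variable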

CausalityTools.OCESelectedParentsType
OCESelectedParents

A simple struct for storing the parents of a single variable xᵢ inferred by the OCE algorithm. When using OCE with infer_graph, a Vector{OCESelectedParents} is returned - one per variable in the input data.

Assumptions and notation

Assumes the input x is a Vector{Vector{<:Real}} or a StateSpaceSet (for which each column is treated as a variable). It contains the following fields, where we use the notation xₖ(τ) to indicate the k-th variable lagged by time-lag τ. For example, x₂(-3) is the variable x[2] lagged by 3 time steps.

Fields

  • i: The index of the target variable (i.e. xᵢ(0) is the target).
  • all_idxs: The possible variable indices of parent variables (i.e. 1:M, where M is the number of input variables).
  • parents_js: The variable indices of the selected parent variables, one per selected parent.
  • parents_τs: The lags for the selected parent variables, one per selected parent.
  • parents: A vector containing the raw, time-lagged data for each selected parent variable. Let τ = parents_τs[k] and j = parents_js[k]. Then parents[k] is the raw data for the variable xⱼ(-τ).
CausalityTools.OptimiseTraditionalType
OptimiseTraditional(; η = 1, maxlag = 50, maxdim = 10,
    method = delay_ifnn, dmethod = "mi_min")

Optimize embedding parameters using traditional delay embedding optimization methods with a maximum lag of maxlag and a maximum dimension of maxdim. method can be either delay_ifnn, delay_fnn or delay_f1nn.

CausalityTools.PCType
PC <: GraphAlgorithm
PC(pairwise_test, conditional_test;
    α = 0.05, max_depth = Inf, maxiters_orient = Inf)

The PC algorithm Spirtes2000, named after the first names of its authors, Peter Spirtes and Clark Glymour, implemented as described in Kalisch2008.

Arguments

Keyword arguments

  • α::Real. The significance level of the test.
  • max_depth. The maximum level of conditional independence tests to be performed. By default, there is no limit (i.e. max_depth = Inf), meaning that the maximum depth is N - 2, where N is the number of input variables.
  • maxiters_orient::Real. The maximum number of times to apply the orientation rules. By default, there is no limit (i.e. maxiters_orient = Inf).
Directional measures will not give meaningful answers

During the skeleton search phase, if a significant association between two nodes is found, then a bidirectional edge is drawn between them. The generic implementation of PC therefore doesn't currently handle directional measures such as TEShannon. The reason is that if a directional relationship X → Y exists between two nodes X and Y, then the algorithm would first draw a bidirectional arrow between X and Y when analysing the direction X → Y, and then remove it again when analysing the direction Y → X (a similar situation would also occur during the conditional stage). This will be fixed in a future release. For now, use nondirectional measures, e.g. MIShannon and CMIShannon!

Description

When used with infer_graph on some input data x, the PC algorithm performs the following steps:

  1. Initialize an empty fully connected graph g with N nodes, where N is the number of variables and x[i] is the data for the i-th node.
  2. Reduce the fully connected g to a skeleton graph by performing pairwise independence tests between all vertices using pairwise_test. Remove any edges where adjacent vertices are found to be independent according to the test (i.e. the null hypothesis of independence cannot be rejected at significance level 1 - α).
  3. Thin the skeleton g by conditional independence testing. If x[i] ⫫ x[j] | x[Z] for some set of variables Z (not including i and j) according to conditional_test (i.e. the null hypothesis of conditional independence cannot be rejected at significance level 1 - α), then the edge between i and j is removed, and we record the separating set S(i, j) = Z, which stores the variables in the conditioning set that rendered variables i and j independent. Independence tests are first performed for conditioning sets of size 1, and then repeated for conditioning sets of increasing size, which in most cases limits the number of tests needed. If max_depth is an integer, this procedure is performed for conditioning sets of sizes 1:max_depth; if there is no limit (the default max_depth = Inf), then all possible conditioning set sizes are potentially used.
  4. Create a directed graph dg from g by replacing every undirected edge X - Y in g by the bidirectional edge X ↔ Y (i.e. construct two directional edges X → Y and Y → X). Orientation rules 0-3 are then repeatedly applied to dg until no more edges can be oriented:
    • Rule 0 (orients v-structures): X ↔ Y ↔ Z becomes X → Y ← Z if Y is not in the separating set S(X, Z).
    • Rule 1 (prevents new v-structures): X → Y ↔ Z becomes X → Y → Z if X and Z are not adjacent.
    • Rule 2 (avoids cycles): X → Y → Z ↔ X becomes X → Y → Z ← X.
    • Rule 3: To avoid creating cycles or new v-structures, whenever X - Y → Z, X - W → Z, and X - Z but there is no edge between Y and W, turn the undirected X - Z edge into the directed edge X → Z.

The resulting directed graph (a SimpleDiGraph from Graphs.jl) is then returned.

Examples
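A minimal sketch with a hypothetical choice of tests (surrogate tests with Pearson/partial correlation); any compatible nondirectional pairwise/conditional independence tests can be substituted, and the data are illustrative.

using CausalityTools
using Random; rng = MersenneTwister(1234)
n = 300
x = randn(rng, n)
y = x .+ 0.3 .* randn(rng, n)
z = y .+ 0.3 .* randn(rng, n)
pairwise_test = SurrogateAssociationTest(PearsonCorrelation(); nshuffles = 50)
conditional_test = SurrogateAssociationTest(PartialCorrelation(); nshuffles = 50)
g = infer_graph(PC(pairwise_test, conditional_test; α = 0.05), [x, y, z]) # SimpleDiGraph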

CausalityTools.PairwiseAsymmetricInferenceType
PairwiseAsymmetricInference <: CrossmapMeasure
PairwiseAsymmetricInference(; d::Int = 2, τ::Int = -1, w::Int = 0,
    f = Statistics.cor, embed_warn = true)

The pairwise asymmetric inference (PAI) measure McCracken2014 is a version of ConvergentCrossMapping that searches for neighbors in mixed embeddings (i.e. both source and target variables included); otherwise, the algorithms are identical.

Usage

  • Use with association to compute the pairwise asymmetric inference measure between variables.

Compatible estimators

Description

The Theiler window w controls how many temporal neighbors are excluded during neighbor searches (w = 0 means that only the point itself is excluded). f is a function that computes the agreement between observations and predictions (the default, f = Statistics.cor, gives the Pearson correlation coefficient).

Embedding

There are many possible ways of defining the embedding for PAI. Currently, we only implement the "add one non-lagged source timeseries to an embedding of the target" approach, which is used as an example in McCracken & Weigel's paper. Specifically: Let S(i) be the source time series variable and T(i) be the target time series variable. PairwiseAsymmetricInference produces regular embeddings with fixed dimension d and embedding lag τ as follows:

\[(S(i), T(i+(d-1)\tau), \ldots, T(i+2\tau), T(i+\tau), T(i))_{i=1}^{N-(d-1)\tau}.\]

In this joint embedding, neighbor searches are performed in the subspace spanned by the first D variables, while the last variable is to be predicted.

With this convention, τ < 0 implies "past/present values of source used to predict target", and τ > 0 implies "future/present values of source used to predict target". The latter case may not be meaningful for many applications, so by default, a warning will be given if τ > 0 (embed_warn = false turns off warnings).

Estimation
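A minimal sketch that cross maps over randomly sampled point libraries with a RandomVectors estimator; the library sizes and coupling are illustrative assumptions.

using CausalityTools
using Random; rng = MersenneTwister(1234)
x = rand(rng, 400)
y = 0.8 .* circshift(x, 1) .+ 0.1 .* rand(rng, 400)
m = PairwiseAsymmetricInference(d = 3, τ = -1)
est = RandomVectors(m; libsizes = 100:100:300, replace = true, rng = rng)
association(est, x, y) # one cross-map estimate per library size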

CausalityTools.PartialCorrelationType
PartialCorrelation <: AssociationMeasure

The correlation of two variables, with the effect of a set of conditioning variables removed.

Usage

  • Use with association to compute the raw partial correlation coefficient.
  • Use with independence to perform a formal hypothesis test for correlation-based conditional independence.

Description

There are several ways of estimating the partial correlation. We follow the matrix inversion method, because for StateSpaceSets, we can very efficiently compute the required joint covariance matrix $\Sigma$ for the random variables.

Formally, let $X_1, X_2, \ldots, X_n$ be a set of $n$ real-valued random variables. Consider the joint precision matrix, $P = (p_{ij}) = \Sigma^{-1}$. The partial correlation of any pair of variables $(X_i, X_j)$, given the remaining variables $\bf{Z} = \{X_k\}_{k=1, k \neq i, j}^n$, is defined as

\[\rho_{X_i X_j | \bf{Z}} = -\dfrac{p_{ij}}{\sqrt{ p_{ii} p_{jj} }}.\]

In practice, we compute the estimate

\[\hat{\rho}_{X_i X_j | \bf{Z}} = -\dfrac{\hat{p}_{ij}}{\sqrt{ \hat{p}_{ii} \hat{p}_{jj} }},\]

where $\hat{P} = \hat{\Sigma}^{-1}$ is the sample precision matrix.
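For instance, conditioning on a common driver removes a spurious pairwise correlation:

using CausalityTools
using Random; rng = MersenneTwister(1234)
z = randn(rng, 1000)
x = z .+ 0.3 .* randn(rng, 1000)
y = z .+ 0.3 .* randn(rng, 1000)
association(PearsonCorrelation(), x, y) # strongly positive: x and y share the driver z
association(PartialCorrelation(), x, y, z) # near zero once z is conditioned out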

CausalityTools.PartialMutualInformationType
PartialMutualInformation <: MultivariateInformationMeasure
PartialMutualInformation(; base = 2)

The partial mutual information (PMI) measure of conditional association Zhao2016.

Definition

For variables $X$, $Y$ and $Z$, PMI is defined as

\[PMI(X; Y | Z) = D(p(x, y, z) || p^{*}(x|z) p^{*}(y|z) p(z)),\]

where $p(x, y, z)$ is the joint distribution for $X$, $Y$ and $Z$, and $D(\cdot, \cdot)$ is the extended Kullback-Leibler divergence from $p(x, y, z)$ to $p^{*}(x|z) p^{*}(y|z) p(z)$. See Zhao2016 for details.

Estimation

The PMI is estimated by first estimating a 3D probability mass function using probabilities, then computing $PMI(X; Y | Z)$ from those probabilities.
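A minimal sketch, assuming that the JointProbabilities estimator accepts a measure definition together with a CodifyVariables discretization (as suggested by the discretization API described later in this documentation); the data are illustrative.

using CausalityTools
using Random; rng = MersenneTwister(1234)
x = rand(rng, 2000)
z = rand(rng, 2000) .+ x
y = rand(rng, 2000) .+ z
# assumed constructor: JointProbabilities(definition, discretization)
est = JointProbabilities(PartialMutualInformation(), CodifyVariables(OrdinalPatterns(m = 3)))
association(est, x, y, z) # should be close to 0, since X ⫫ Y | Z by construction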

Properties

For the discrete case, the following identities hold in theory (when estimating PMI, they may not).

  • PMI(X, Y, Z) >= CMI(X, Y, Z) (where CMI is the Shannon CMI). Holds in theory, but when estimating PMI, the identity may not hold.
  • PMI(X, Y, Z) >= 0. Holds both in theory and when estimating using discrete estimators.
  • X ⫫ Y | Z => PMI(X, Y, Z) = CMI(X, Y, Z) = 0 (in theory, but not necessarily for estimation).
CausalityTools.PearsonCorrelationType
PearsonCorrelation

The Pearson correlation of two variables.

Usage

  • Use with association to compute the raw Pearson correlation coefficient.
  • Use with independence to perform a formal hypothesis test for pairwise dependence using the Pearson correlation coefficient.

Description

The sample Pearson correlation coefficient for real-valued random variables $X$ and $Y$ with associated samples $\{x_i\}_{i=1}^N$ and $\{y_i\}_{i=1}^N$ is defined as

\[\rho_{xy} = \dfrac{\sum_{i=1}^N (x_i - \bar{x})(y_i - \bar{y}) }{\sqrt{\sum_{i=1}^N (x_i - \bar{x})^2}\sqrt{\sum_{i=1}^N (y_i - \bar{y})^2}},\]

where $\bar{x}$ and $\bar{y}$ are the means of the observations $x_i$ and $y_i$, respectively.

CausalityTools.PhaseType
Phase <: InstantaneousSignalProperty

Indicates that the instantaneous phases of a signal should be used.

CausalityTools.PoczosSchneiderCMIType
PoczosSchneiderCMI <: ConditionalMutualInformationEstimator
PoczosSchneiderCMI(definition = CMIRenyiPoczos(); k = 1, w = 0)

The PoczosSchneiderCMI ConditionalMutualInformationEstimator computes conditional mutual informations using a k-th nearest neighbor approach Poczos2012.

k is the number of nearest neighbors. w is the Theiler window, which controls the number of temporal neighbors that are excluded during neighbor searches.

Compatible definitions

Usage

Examples

using CausalityTools
using Random; rng = MersenneTwister(1234)
x = rand(rng, 10000)
y = rand(rng, 10000) .+ x
z = rand(rng, 10000) .+ y
association(PoczosSchneiderCMI(CMIRenyiPoczos(), k = 10), x, z, y) # should be near 0 (and can be negative)
CausalityTools.RMCDType
RMCD <: AssociationMeasure
RMCD(; r, metric = Euclidean(), base = 2)

The recurrence measure of conditional dependence, or RMCD Ramos2017, is a recurrence-based measure that mimics the conditional mutual information, but uses recurrence probabilities.

Usage

  • Use with association to compute the raw RMCD for pairwise or conditional association.
  • Use with IndependenceTest to perform a formal hypothesis test for pairwise or conditional association.

Description

r is a mandatory keyword which specifies the recurrence threshold used when constructing recurrence matrices. It can be an instance of any subtype of AbstractRecurrenceType from RecurrenceAnalysis.jl. To use any r that is not a real number, you must first do using RecurrenceAnalysis. The metric is any valid metric from Distances.jl.

Both the pairwise and conditional RMCD are non-negative, but due to round-off error, negative values may occur. If that happens, an RMCD value of 0.0 is returned.

Description

The RMCD measure is defined by

\[I_{RMCD}(X; Y | Z) = \dfrac{1}{N} \sum_{i} \left[ \dfrac{1}{N} \sum_{j} R_{ij}^{X, Y, Z} \log \left( \dfrac{\sum_{j} R_{ij}^{X, Y, Z} \sum_{j} R_{ij}^{Z} }{\sum_{j} R_{ij}^{X, Z} \sum_{j} R_{ij}^{Y, Z}} \right) \right],\]

where base controls the base of the logarithm. $I_{RMCD}(X; Y | Z)$ is zero when $Z = X$, $Z = Y$ or when $X$, $Y$ and $Z$ are mutually independent.

Our implementation allows dropping the third/last argument, in which case the following mutual information-like quantity is computed (not discussed in Ramos2017):

\[I_{RMCD}(X; Y) = \dfrac{1}{N} \sum_{i} \left[ \dfrac{1}{N} \sum_{j} R_{ij}^{X, Y} \log \left( \dfrac{\sum_{j} R_{ij}^{X} R_{ij}^{Y} }{\sum_{j} R_{ij}^{X, Y}} \right) \right]\]

Estimation
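A minimal sketch of the pairwise and conditional variants; the recurrence threshold and data are illustrative.

using CausalityTools
using Random; rng = MersenneTwister(1234)
x, y, z = rand(rng, 300), rand(rng, 300), rand(rng, 300)
m = RMCD(r = 0.5)
association(m, x, y) # pairwise, mutual information-like quantity
association(m, x, y, z) # conditional version I_RMCD(X; Y | Z)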

CausalityTools.RahimzamaniType
Rahimzamani <: ConditionalMutualInformationEstimator
Rahimzamani(k = 1, w = 0)

The Rahimzamani ConditionalMutualInformationEstimator is designed for data that can be mixtures of discrete and continuous data Rahimzamani2018.

Compatible definitions

Usage

Description

This estimator is very similar to the GaoKannanOhViswanath mutual information estimator, but has been expanded to the conditional mutual information case.

k is the number of nearest neighbors. w is the Theiler window, which controls the number of temporal neighbors that are excluded during neighbor searches.

Examples

using CausalityTools
using Random; rng = MersenneTwister(1234)
x = rand(rng, 10000)
y = rand(rng, 10000) .+ x
z = rand(rng, 10000) .+ y
association(Rahimzamani(; k = 10), x, z, y) # should be near 0 (and can be negative)
CausalityTools.RandomSegmentType
RandomSegment <: CrossmapEstimator
RandomSegment(definition::CrossmapMeasure; libsizes::Int, rng = Random.default_rng())

Cross map once over N = length(libsizes) different "point libraries", where point indices are selected as time-contiguous segments with random starting points.

This is method 2 from Luo2015. See CrossmapEstimator for an in-depth explanation of what "library" means in this context.

Description

The cardinalities of the point index segments are given by libsizes. One segment with a randomly selected starting point is picked per L ∈ libsizes, and the i-th point index segment has cardinality k = libsizes[i].

The starting point for each library is selected independently of the other libraries, and a user-specified rng may be given for reproducibility. If the time series you're cross mapping between have length M, and Lᵢ > M for any Lᵢ ∈ libsizes, then an error will be thrown.

Returns

The return type when used with association depends on the type of libsizes.

  • If libsizes is an Int (a single library), then a single cross-map estimate is returned.
  • If libsizes is an AbstractVector{Int} (multiple libraries), then a vector of cross-map estimates is returned, one per library.

See also: CrossmapEstimator.

CausalityTools.RandomVectorsType
RandomVectors <: CrossmapEstimator
RandomVectors(definition::CrossmapMeasure; libsizes, replace = false, 
    rng = Random.default_rng())

Cross map once over N = length(libsizes) different "point libraries", where point indices are selected randomly (not considering time ordering).

This is method 3 from Luo2015. See CrossmapEstimator for an in-depth explanation of what "library" means in this context.

Description

The cardinality of the point libraries are given by libsizes. One set of random point indices is selected per L ∈ libsizes, and the i-th library has cardinality k = libsizes[i].

Point indices within each library are selected randomly, independently of the other libraries. A user-specified rng may be given for reproducibility. The replace argument controls whether sampling is done with or without replacement. If the time series you're cross mapping between have length M, and Lᵢ > M for any Lᵢ ∈ libsizes, then you must set replace = true.

Returns

The return type when used with association depends on the type of libsizes.

  • If libsizes is an Int (a single library), then a single cross-map estimate is returned.
  • If libsizes is an AbstractVector{Int} (multiple libraries), then a vector of cross-map estimates is returned, one per library.

See also: CrossmapEstimator.

CausalityTools.RenyiDivergenceType
RenyiDivergence <: DivergenceOrDistance
RenyiDivergence(q; base = 2)

The Rényi divergence of positive order q.

Usage

  • Use with association to compute the Rényi divergence between two pre-computed probability distributions, or from raw data using one of the estimators listed below.

Compatible estimators

Description

The Rényi divergence between two probability distributions $P_X = (p_x(\omega_1), \ldots, p_x(\omega_n))$ and $P_Y = (p_y(\omega_1), \ldots, p_y(\omega_n))$, both defined over the same OutcomeSpace $\Omega = \{\omega_1, \ldots, \omega_n \}$, is defined vanErven2014 as

\[D_{q}(P_X(\Omega) || P_Y(\Omega)) = \dfrac{1}{q - 1} \log \sum_{\omega \in \Omega}p_x(\omega)^{q}p_y(\omega)^{1-q}.\]

Implements

Note

Distances.jl also defines RenyiDivergence. Qualify it if you're loading both packages, i.e. do association(CausalityTools.RenyiDivergence(q), x, y).

Estimation
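A minimal sketch using two pre-computed probability distributions over the same outcomes; Probabilities comes from ComplexityMeasures.jl and is assumed to be re-exported.

using CausalityTools
p1 = Probabilities([0.1, 0.4, 0.5])
p2 = Probabilities([0.3, 0.3, 0.4])
association(RenyiDivergence(2), p1, p2)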

CausalityTools.SMeasureType
SMeasure <: ClosenessMeasure
SMeasure(; K::Int = 2, dx = 2, dy = 2, τx = - 1, τy = -1, w = 0)

SMeasure is a bivariate association measure from Arnhold1999 and Quiroga2000 that measures directional dependence between two (potentially multivariate) input time series.

Note that τx and τy are negative; see explanation below.

Usage

  • Use with association to compute the raw s-measure statistic.
  • Use with independence to perform a formal hypothesis test for directional dependence.

Description

The steps of the algorithm are:

  1. From input time series $x(t)$ and $y(t)$, construct the delay embeddings (note the positive sign in the embedding lags; therefore inputs parameters τx and τy are by convention negative).

\[\begin{align*} \{\bf{x}_i \} &= \{(x_i, x_{i+\tau_x}, \ldots, x_{i+(d_x - 1)\tau_x}) \} \\ \{\bf{y}_i \} &= \{(y_i, y_{i+\tau_y}, \ldots, y_{i+(d_y - 1)\tau_y}) \} \\ \end{align*}\]

  2. Let $r_{i,j}$ and $s_{i,j}$ be the indices of the $K$ nearest neighbors of $\bf{x}_i$ and $\bf{y}_i$, respectively. Neighbors closer than w time indices are excluded during searches (i.e. w is the Theiler window).

  3. Compute the mean squared Euclidean distance to the $K$ nearest neighbors for each $x_i$, using the indices $r_{i, j}$.

\[R_i^{(k)}(x) = \dfrac{1}{K} \sum_{j=1}^{K}\left(\bf{x}_i - \bf{x}_{r_{i,j}}\right)^2\]

  4. Compute the y-conditioned mean squared Euclidean distance to the $K$ nearest neighbors for each $x_i$, now using the indices $s_{i,j}$.

\[R_i^{(k)}(x|y) = \dfrac{1}{K} \sum_{j=1}^{K}\left(\bf{x}_i - \bf{x}_{s_{i,j}}\right)^2\]

  5. Define the following measure of independence, where $0 \leq S \leq 1$. Low values indicate independence, and values close to one occur for synchronized signals.

\[S^{(k)}(x|y) = \dfrac{1}{N} \sum_{i=1}^{N} \dfrac{R_i^{(k)}(x)}{R_i^{(k)}(x|y)}\]

Input data

The algorithm is slightly modified from Arnhold1999 to allow univariate timeseries as input.

  • If x and y are StateSpaceSets then use x and y as is and ignore the parameters dx/τx and dy/τy.
  • If x and y are scalar time series, then create dx- and dy-dimensional embeddings of x and y, respectively, resulting in N embedding points $X = \{x_1, x_2, \ldots, x_N \}$ and $Y = \{y_1, y_2, \ldots, y_N \}$. τx and τy control the embedding lags for x and y.
  • If x is a scalar-valued vector and y is a StateSpaceSet, or vice versa, then create an embedding of the scalar timeseries using parameters dx/τx or dy/τy.

In all three cases, input StateSpaceSets are length-matched by eliminating points at the end of the longest StateSpaceSet (after the embedding step, if relevant) before analysis.

See also: ClosenessMeasure.

CausalityTools.SurrogateAssociationTestType
SurrogateAssociationTest <: IndependenceTest
SurrogateAssociationTest(est_or_measure;
    nshuffles::Int = 100,
    surrogate = RandomShuffle(),
    rng = Random.default_rng(),
    show_progress = false,
)

A generic surrogate-data based (conditional) independence test for assessing whether two variables X and Y are independent, potentially conditioned on a third variable Z.

Compatible estimators and measures

  • Compatible with AssociationMeasures that measure some sort of pairwise or conditional association.
Note

You must yourself determine whether using a particular measure is meaningful, and what it means.

Note

If used with a TransferEntropy measure such as TEShannon, then the source variable is always shuffled, and the target and conditional variable are left unshuffled.

Usage

Description

This is a generic one-sided hypothesis test that checks whether x and y are independent (given z, if provided) based on resampling from a null distribution assumed to represent independence between the variables. The null distribution is generated by repeatedly shuffling the input data in some way that is intended to break any dependence between the input variables.

The test first estimates the desired statistic using est_or_measure on the input data. Then, the first input variable is shuffled nshuffles times according to the given surrogate method (each type of surrogate represents a distinct null hypothesis). For each shuffle, est_or_measure is recomputed and the results are stored.

Examples
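A minimal sketch; the two-argument (measure, estimator) form mirrors the OCE defaults elsewhere in this documentation, and the data are illustrative.

using CausalityTools
using Random; rng = MersenneTwister(1234)
x = rand(rng, 1000)
y = rand(rng, 1000) .+ x
test = SurrogateAssociationTest(MIShannon(), KSG2(k = 3, w = 3); nshuffles = 100, rng = rng)
independence(test, x, y) # the null hypothesis of independence should be rejected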

CausalityTools.SurrogateAssociationTestResultType
SurrogateAssociationTestResult(m, m_surr, pvalue)

Holds the result of a SurrogateAssociationTest. m is the measure computed on the original data. m_surr is a vector of the measure computed on permuted data, where m_surr[i] is the measure computed on the i-th permutation. pvalue is the one-sided p-value for the test.

CausalityTools.SymbolicTransferEntropyType
SymbolicTransferEntropy <: TransferEntropyEstimator
SymbolicTransferEntropy(definition = TEShannon(); m = 3, τ = 1, 
    lt = ComplexityMeasures.isless_rand)

A convenience estimator for symbolic transfer entropy Staniek2008.

Compatible measures

Description

Symbolic transfer entropy consists of two simple steps. First, the input time series are encoded using codify with the CodifyVariables discretization and the OrdinalPatterns outcome space. This transforms the input time series into integer time series.

Transfer entropy is then estimated as usual on the encoded timeseries, with the embedding dictated by definition, using the JointProbabilities estimator.

Examples
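A minimal sketch on a toy unidirectionally coupled pair; the argument order (source, target) and the data-generating model are assumptions for illustration.

using CausalityTools
using Random; rng = MersenneTwister(1234)
x = rand(rng, 2000)
y = [0.0; 0.7 .* x[1:end-1]] .+ 0.1 .* rand(rng, 2000)
est = SymbolicTransferEntropy(TEShannon(); m = 3)
association(est, x, y) # TE with x as source and y as target (argument order assumed)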

CausalityTools.TERenyiJizbaType
TERenyiJizba() <: TransferEntropy

The Rényi transfer entropy from Jizba2012.

Usage

  • Use with association to compute the raw transfer entropy.
  • Use with an IndependenceTest to perform a formal hypothesis test for pairwise and conditional dependence.

Description

The transfer entropy from source $S$ to target $T$, potentially conditioned on $C$ is defined as

\[\begin{align*} TE(S \to T) &:= I_q^{R_J}(T^+; S^- | T^-) \\ TE(S \to T | C) &:= I_q^{R_J}(T^+; S^- | T^-, C^-), \end{align*},\]

where $I_q^{R_J}(T^+; S^- | T^-)$ is Jizba et al. (2012)'s definition of conditional mutual information (CMIRenyiJizba). The - and + superscripts on the marginal variables $T^+$, $T^-$, $S^-$ and $C^-$ indicate that the embedding vectors for that marginal are constructed using present/past values and future values, respectively.

Estimation

Estimating Jizba's Rényi transfer entropy is a bit complicated, since it doesn't have a dedicated estimator. Instead, we re-write the Rényi transfer entropy as a Rényi conditional mutual information, and estimate it using an EntropyDecomposition with a suitable discrete/differential Rényi entropy estimator from the list below as its input.

| Estimator | Sub-estimator | Principle |
| --- | --- | --- |
| EntropyDecomposition | LeonenkoProzantoSavani | Four-entropies decomposition |
| EntropyDecomposition | ValueBinning | Four-entropies decomposition |
| EntropyDecomposition | Dispersion | Four-entropies decomposition |
| EntropyDecomposition | OrdinalPatterns | Four-entropies decomposition |
| EntropyDecomposition | UniqueElements | Four-entropies decomposition |
| EntropyDecomposition | TransferOperator | Four-entropies decomposition |

Any of these estimators must be given as input to a CMIDecomposition estimator.

Estimation

CausalityTools.TEShannonType
TEShannon <: TransferEntropy
TEShannon(; base = 2, embedding = EmbeddingTE())

The Shannon-type transfer entropy measure.

Usage

  • Use with association to compute the raw transfer entropy.
  • Use with an IndependenceTest to perform a formal hypothesis test for pairwise and conditional dependence.

Description

The transfer entropy from source $S$ to target $T$, potentially conditioned on $C$ is defined as

\[\begin{align*} TE(S \to T) &:= I^S(T^+; S^- | T^-) \\ TE(S \to T | C) &:= I^S(T^+; S^- | T^-, C^-) \end{align*}\]

where $I^S(T^+; S^- | T^-)$ is the Shannon conditional mutual information (CMIShannon). The - and + superscripts on the marginal variables $T^+$, $T^-$, $S^-$ and $C^-$ indicate that the embedding vectors for that marginal are constructed using present/past values and future values, respectively.

Estimation

CausalityTools.TEVarsType
TEVars(Tf::Vector{Int}, T::Vector{Int}, S::Vector{Int})
TEVars(Tf::Vector{Int}, T::Vector{Int}, S::Vector{Int}, C::Vector{Int})
TEVars(;Tf = Int[], T = Int[], S = Int[], C = Int[]) -> TEVars

Which axes of the state space correspond to the future of the target (Tf), the present/past of the target (T), the present/past of the source (S), and the present/past of any conditioned variables (C)? This information is used by the transfer entropy estimators to ensure that marginal distributions are computed correctly.

Indices correspond to variables of the embedding or, equivalently, columns of a StateSpaceSet.

  • The three-argument constructor assumes there will be no conditional variables.
  • The four-argument constructor assumes there will be conditional variables.
CausalityTools.VariationDistanceType
VariationDistance <: DivergenceOrDistance

The variation distance.

Usage

  • Use with association to compute the variation distance between two pre-computed probability distributions, or from raw data using one of the estimators listed below.

Compatible estimators

Description

The variation distance between two probability distributions $P_X = (p_x(\omega_1), \ldots, p_x(\omega_n))$ and $P_Y = (p_y(\omega_1), \ldots, p_y(\omega_n))$, both defined over the same OutcomeSpace $\Omega = \{\omega_1, \ldots, \omega_n \}$, is defined as

\[D_{V}(P_X(\Omega) || P_Y(\Omega)) = \dfrac{1}{2} \sum_{\omega \in \Omega} | p_x(\omega) - p_y(\omega) |\]

Examples
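A minimal sketch with two pre-computed probability distributions, where the result can be checked by hand; Probabilities comes from ComplexityMeasures.jl and is assumed to be re-exported.

using CausalityTools
p1 = Probabilities([0.1, 0.4, 0.5])
p2 = Probabilities([0.3, 0.3, 0.4])
association(VariationDistance(), p1, p2) # 0.5 * (0.2 + 0.1 + 0.1) = 0.2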

CausalityTools.Zhu1Type
Zhu1 <: TransferEntropyEstimator
Zhu1(k = 1, w = 0, base = MathConstants.e)

The Zhu1 transfer entropy estimator Zhu2015 for normalized input data (as described in Zhu2015), applicable to both pairwise and conditional transfer entropy.

Compatible definitions

Usage

Description

This estimator approximates probabilities within hyperrectangles surrounding each point xᵢ ∈ x using k nearest neighbor searches. However, it also considers the number of neighbors falling on the borders of these hyperrectangles. This estimator is an extension to the entropy estimator in Singh2003.

w is the Theiler window, which determines if temporal neighbors are excluded during neighbor searches (defaults to 0, meaning that only the point itself is excluded when searching for neighbours).

Description

For a given point jᵢ in the joint embedding space, this estimator first computes the distance dᵢ from jᵢ to its k-th nearest neighbor. Then, in each of the marginal spaces, it counts the number of points mₖ[i] within radius dᵢ of the corresponding marginal point.

The Shannon transfer entropy is then computed as

\[TE_S(X \to Y) = \psi(k) + \dfrac{1}{N} \sum_{i}^n \left[ \sum_{k=1}^3 \left( \psi(m_k[i] + 1) \right) \right],\]

where the index k references the three marginal subspaces T, TTf and ST for which neighbor searches are performed. Here this estimator has been modified to allow for conditioning too (a simple modification to Lindner2011's equation 5 and 6).

Example

using CausalityTools
using Random; rng = MersenneTwister(1234)
x = rand(rng, 10000)
y = rand(rng, 10000) .+ x
z = rand(rng, 10000) .+ y
est = Zhu1(TEShannon(), k = 10)
association(est, x, z, y) # should be near 0 (and can be negative)
CausalityTools._convert_logunitMethod
_convert_logunit(h_a::Real, a, to) → h_b

Convert a number h_a computed with logarithms to base a to an entropy h_b computed with logarithms to base b. This can be used to convert the "unit" of e.g. an entropy.

CausalityTools.associationMethod
association(estimator::AssociationMeasureEstimator, x, y, [z, ...]) → r
association(definition::AssociationMeasure, x, y, [z, ...]) → r

Estimate the (conditional) association between input variables x, y, z, … using the given estimator (an AssociationMeasureEstimator) or definition (an AssociationMeasure).

Info

The type of the return value r depends on the measure/estimator. The interpretation of the returned value also depends on the specific measure and estimator used.

Examples

The examples section of the online documentation has numerous examples of using association.
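For instance, measures that require no separate estimator can be passed directly as the first argument:

using CausalityTools
using Random; rng = MersenneTwister(1234)
x = randn(rng, 1000)
y = x .+ randn(rng, 1000)
association(PearsonCorrelation(), x, y) # a definition given directly
association(DistanceCorrelation(), x, y) # another measure needing no separate estimator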

CausalityTools.backwards_eliminate!Method
backwards_eliminate!(alg::OCE, parents::OCESelectedParents, x, i; verbose)

Algorithm 2.2 in Sun et al. (2015). Perform backward elimination for the i-th variable in x, given the previously inferred parents, which were deduced using parameters in alg. Modifies parents in-place.

CausalityTools.codified_marginalsFunction
codified_marginals(o::OutcomeSpace, x::VectorOrStateSpaceSet...)

Encode/discretize each input vector (e.g. timeseries) xᵢ ∈ x according to a procedure determined by o.

For some outcome spaces, the encoding is sequential (i.e. time ordering matters). Any xᵢ ∈ x that are multidimensional (StateSpaceSets) will be encoded column-wise, i.e. each column of xᵢ is treated as a timeseries and is encoded separately.

This is useful for discretizing input data when computing some MultivariateInformationMeasure. This method is used internally by both the JointProbabilities and EntropyDecomposition estimators to handle discretization.

Supported estimators

  • ValueBinning. Bin visitation frequencies are counted in the joint space XY, then marginal visitations are obtained from the joint bin visits. This behaviour is the same for both FixedRectangularBinning and RectangularBinning (which adapts the grid to the data). When using FixedRectangularBinning, the range along the first dimension is used as a template for all other dimensions.
  • OrdinalPatterns. Each timeseries is separately codify-ed by embedding the timeseries, then sequentially encoding the ordinal patterns of the embedding vectors.
  • Dispersion. Each timeseries is separately codify-ed by embedding the timeseries, then sequentially encoding the embedding vectors according to their dispersion pattern (which for each embedding vector is computed relative to all other embedding vectors).
  • CosineSimilarityBinning. Each timeseries is separately codify-ed by embedding the timeseries, then encoding the embedding points in a sequential manner according to the cosine similarity of the embedding vectors.
  • UniqueElements. Each timeseries is codify-ed according to its unique values (i.e. each unique element gets assigned a specific integer).

More implementations are possible.

CausalityTools.cpdagMethod
cpdag(alg::PC, skeleton::SimpleDiGraph, separating_sets::Dict{Edge, Vector{Int}}) → dg::SimpleDiGraph

Orient edges in the skeleton graph using the given separating_sets using algorithm 2 in Kalisch2008 and return the directed graph cpdag.

Description

First, a directed graph dg is constructed from skeleton, such that every undirected edge X - Y in skeleton is replaced by the bidirectional edge X ↔ Y in dg. In practice, for each X ↔ Y, we construct two directional edges X → Y and Y → X.

Orientation rules 0-3 are then applied to dg. We use the rules as stated in Colombo & Maathuis, 2014.

  • Rule 0 (orients v-structures): X ↔ Y ↔ Z becomes X → Y ← Z if Y is not in the separating set S(X, Z).
  • Rule 1 (prevents new v-structures): X → Y ↔ Z becomes X → Y → Z if X and Z are not adjacent.
  • Rule 2 (avoids cycles): X → Y → Z ↔ X becomes X → Y → Z ← X.
  • Rule 3: To avoid creating cycles or new v-structures, whenever X - Y → Z, X - W → Z, and X - Z but there is no edge between Y and W, orient the undirected edge X - Z as X → Z.
CausalityTools.crossmapFunction
crossmap(measure::CrossmapMeasure, t::AbstractVector, s::AbstractVector) → ρ::Real
crossmap(measure::CrossmapMeasure, est, t::AbstractVector, s::AbstractVector) → ρ::Vector
crossmap(measure::CrossmapMeasure, t̄::AbstractVector, S̄::AbstractStateSpaceSet) → ρ

Compute the cross map estimates between raw time series t and s (and return the real-valued cross-map statistic ρ). If a CrossmapEstimator est is provided, cross mapping is done on random subsamples of the data, where subsampling is dictated by est (a vector of values for ρ is returned).

Alternatively, cross-map between a time-aligned target time series and a source embedding that have been constructed by jointly (pre-)embedding some input data.

This is just a wrapper around predict that simply returns the correspondence measure between the source and the target.

CausalityTools.distance_covarianceMethod
distance_covariance(x, y) → dcov::Real
distance_covariance(x, y, z) → pdcov::Real

Compute the empirical/sample distance covariance Székely2007 between StateSpaceSets x and y. Alternatively, compute the partial distance covariance pdcov.

CausalityTools.eliminate_loop!Method
eliminate_loop!(alg::OCE, parents::OCESelectedParents, xᵢ; verbose = false)

Inner portion of algorithm 2.2 in Sun et al. (2015). This method is called in an external while-loop that handles the variable elimination step in their line 3.

CausalityTools.estimator_with_overridden_parametersMethod
estimator_with_overridden_parameters(definition, lower_level_estimator) → e::typeof(lower_level_estimator)

Given some higher-level definition of an information measure, which is to be estimated using some lower_level_estimator, return a modified version of the estimator in which its parameters have been overridden by any overlapping parameters from the definition.

This method is explicitly extended for each possible decomposition.

CausalityTools.fishers_zMethod
fishers_z(p̂)

Compute Fisher's z-transform on the sample partial correlation coefficient (computed as the correlation between variables i and j, given the remaining variables):

\[Z(V_i, V_j | \bf{S}) = \dfrac{1}{2} \log{\left( \dfrac{1 + \hat{p}(V_i, V_j | \bf{S})}{1 - \hat{p}(V_i, V_j | \bf{S})} \right) }\]

CausalityTools.infer_graphFunction
infer_graph(algorithm::GraphAlgorithm, x) → g

Infer graph from input data x using the given algorithm.

Returns g, whose type depends on algorithm.

CausalityTools.library_indicesFunction
library_indices(measure::CCMLike, est::CrossmapEstimator, i::Int,  target, source)

Produce (randomly, if relevant) the i-th subset of indices for a CrossmapEstimator that is being applied to target and source.

CausalityTools.marginalMethod
marginal(p::Probabilities; dims = 1:ndims(p))
marginal(c::Counts; dims = 1:ndims(c))

Given a set of counts c (a contingency table), or a multivariate probability mass function p, return the marginal counts/probabilities along the given dims.
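A minimal sketch that builds a contingency table with counts and UniqueElements; the variable names and data are illustrative.

using CausalityTools
x = rand(["a", "b"], 100)
y = rand(["p", "q", "r"], 100)
c = counts(UniqueElements(), x, y) # 2×3 contingency table of co-occurrence counts
marginal(c; dims = 1) # marginal counts for the first variable (x)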

CausalityTools.marginal_indicesMethod
marginal_indices(x)

Returns a column vector v with the same number of elements as there are unique elements in x. v[i] contains the indices of the elements in x that match the i-th unique element of x.

For example, if the third unique element in x, u₃ = unique(x)[3], appears four times in x, then v[3] is a vector of four integers indicating the positions of the elements matching u₃.

CausalityTools.marginal_probs_from_μMethod
marginal_probs_from_μ(seleced_axes, visited_bins, iv::InvariantMeasure, inds_non0measure)

Estimate marginal probabilities from a pre-computed invariant measure, given a set of visited bins, an invariant measure and the indices of the positive-measure bins. The indices in selected_axes determine which marginals are selected.

CausalityTools.max_inputs_varsMethod
max_inputs_vars(m::AssociationMeasure) → nmax::Int

Return the maximum number of variables that the measure can be computed for.

For example, MIShannon cannot be computed for more than 2 variables.

CausalityTools.max_segmentlengthFunction
max_segmentlength(x::AbstractVector, measure::CrossmapMeasure)

Given an input vector x, which is representative of the size of the other input vectors too, compute the maximum segment/library length that can be used for predictions.

CausalityTools.min_inputs_varsMethod
min_inputs_vars(m::AssociationMeasure) → nmin::Int

Return the minimum number of variables that the measure can be computed for.

For example, CMIShannon requires 3 input variables.

CausalityTools.optimize_marginals_teFunction
optimize_marginals_te([scheme = OptimiseTraditional()], s, t, [c]) → EmbeddingTE

Optimize marginal embeddings for transfer entropy computation from source time series s to target time series t, conditioned on c if c is given, using the provided optimization scheme.

CausalityTools.optimize_marginals_teMethod
optimize_marginals_te(opt::OptimiseTraditional, s, t, [c]; exclude_source = false) → EmbeddingTE

Optimise the marginals for a transfer entropy analysis from source time series s to target time series t, potentially given a conditional time series c.

If exclude_source == true, then no optimisation is done for the source. This is useful for SurrogateAssociationTest, because most surrogate methods accept univariate time series, and if we embed the source and it becomes multidimensional, then we can't create surrogates. A future optimization is to do column-wise surrogate generation.

CausalityTools.pairwise_testMethod
pairwise_test(p::OCESelectedParents) → pairwise::Bool

If the parent set is empty, return true (a pairwise test should be performed). If the parent set is nonempty, return false (a conditional test should be performed).

CausalityTools.predictMethod
predict(measure::CrossmapEstimator, t::AbstractVector, s::AbstractVector) → t̂ₛ, t̄, ρ
predict(measure::CrossmapEstimator, t̄::AbstractVector, S̄::AbstractStateSpaceSet) → t̂ₛ

Perform point-wise cross mappings between source embeddings and target time series according to the algorithm specified by the given cross-map measure (e.g. ConvergentCrossMapping or PairwiseAsymmetricInference).

  • First method: Jointly embeds the target t and source s time series (according to measure) to obtain a time-index aligned target timeseries and source embedding (which is now a StateSpaceSet). Then calls predict(measure, t̄, S̄) (the second method), and returns the predictions t̂ₛ, the observations t̄, and their correspondence ρ according to measure.
  • Second method: Returns a vector of predictions t̂ₛ (t̂ₛ := "predictions of t̄ based on the source embedding S̄"), where t̂ₛ[i] is the prediction for t̄[i]. It assumes pre-embedded data which have been correctly time-aligned using a joint embedding, i.e. such that t̄[i] and S̄[i] correspond to the same time index.

Description

For each i ∈ {1, 2, …, N} where N = length(t) == length(s), we make the prediction t̂[i] (an estimate of t[i]) based on a linear combination of D + 1 other points in t, where the selection of points and weights for the linear combination are determined by the D+1 nearest neighbors of the point S̄[i]. The details of point selection and weights depend on measure.

Note: Some CrossmapMeasures may define more general mapping procedures. If so, the algorithm is described in their docstring.

CausalityTools.rank_transformationMethod
rank_transformation(x::AbstractVector)
rank_transformation(x::AbstractStateSpaceSet) → ranks::NTuple{D, Vector}

Rank-transform each variable/column of the length-n D-dimensional StateSpaceSet x and return the rank-transformed variables as a D-tuple of length-n vectors.

Returns the unscaled ranks. Divide by n to get an approximation to the empirical cumulative distribution function (ECDF) of x.

Description

Modulo division by n, rank_transformation does roughly the same as naively computing the ECDF as

[count(x .<= xᵢ)  for xᵢ in x] / length(x)

but an order of magnitude faster and with roughly three orders of magnitude fewer allocations. The increased efficiency of this function relative to naively computing the ECDF is because it uses sorting of the input data to determine ranks, arbitrarily breaking ties according to the sorting algorithm. Rank ties can therefore never occur, and equal values are assigned different but close ranks. To preserve ties, which you might want to do for example when dealing with categorical or integer-valued data, use (the much slower) empcdf.

CausalityTools.s_measureMethod
s_measure(measure::SMeasure, x::VectorOrStateSpaceSet, y::VectorOrStateSpaceSet) → s ∈ [0, 1]

Compute the given measure to quantify the directional dependence between univariate/multivariate time series x and y.

Returns a scalar s where s = 0 indicates independence between x and y, and higher values indicate synchronization between x and y, with complete synchronization for s = 1.0.

Example

using CausalityTools

x, y = rand(1000), rand(1000)

# 4-dimensional embedding for `x`, 5-dimensional embedding for `y`
m = SMeasure(dx = 4, τx = -3, dy = 5, τy = -1)
association(m, x, y)
CausalityTools.select_parentsMethod
select_parents(alg::OCE, x)

The parent selection step of the OCE algorithm, which identifies the parents of each xᵢ ∈ x. x must be integer-indexable, i.e. x[i] yields the i-th variable.

CausalityTools.skeletonMethod
skeleton(algorithm::PC, x) → (g, s)

Infer the skeleton graph for the variables x using the provided algorithm. x must be some iterable where the i-th variable can be accessed as x[i].

Return a tuple of the undirected skeleton graph g::SimpleGraph and the separating sets s::Dict{SimpleEdge, Vector{Int}}.

CausalityTools.skeleton_conditional!Method
skeleton_conditional!(alg::PC, graph, x, conditional_test::IndependenceTest)

Thin the skeleton graph, where each vertex is represented by the data x[i], by using conditional_test. Whenever x[i] ⫫ x[j] | x[S] for some set of variables not including i and j, the edge between i and j is removed, and S is stored in the separating set for i and j. This is essentially algorithm 3.2 in Colombo2014, for the cases 𝓁 >= 1.

Modifies graph in-place.

CausalityTools.skeleton_unconditional!Method
skeleton_unconditional!(alg::PC, g::SimpleGraph, x)

Perform pairwise independence tests between all vertices in the graph, where each vertex is represented by the data x[i], and remove any edges in graph where adjacent vertices are found to be independent according to the given independence test. The null hypothesis of independence is rejected whenever the p-value is below α.

This is essentially algorithm 3.2 in Colombo2014, but only considering the case 𝓁 = 0.

Modifies graph in-place.

If alg.pairwise_test is a directed test, then edges are considered one-by-one. If alg.pairwise_test is not a directed test, then the edge pairs (X → Y, Y → X) are considered simultaneously.

CausalityTools.te_embedMethod
te_embed(source::VectorOr1DDataset, target::VectorOr1DDataset, p::EmbeddingTE) → (points, vars, τs)
te_embed(source::VectorOr1DDataset, target::VectorOr1DDataset, cond::VectorOr1DDataset, p::EmbeddingTE) → (points, vars, τs)

Generalised delay reconstruction of source and target (and cond if provided) for transfer entropy computation using embedding parameters provided by the EmbeddingTE instance p.

Returns a tuple of the embedded points; vars, a TEVars instance that keeps track of which variables of the embedding belong to which marginals of the reconstruction (indices are: source = 1, target = 2, cond = 3); and a tuple τs, which stores the lags for each variable of the reconstruction.
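
A minimal, hedged sketch (it assumes the argument order shown in the signature above and that `EmbeddingTE()` constructs default embedding parameters):

using CausalityTools

s, t = rand(500), rand(500)                          # source and target time series
p = EmbeddingTE()                                    # default embedding parameters (assumed constructor)
points, vars, τs = CausalityTools.te_embed(s, t, p)  # embedded points, variable bookkeeping, and lags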

CausalityTools.tolevels!Method
tolevels!(levels, x) → levels, dict
tolevels(x) → levels, dict

Apply the bijective map $f : \mathcal{Q} \to \mathbb{N}^+$ to each x[i] and store the result in levels[i], where levels is a pre-allocated integer vector such that length(x) == length(levels).

$\mathcal{Q}$ can be any space, and each $q \in \mathcal{Q}$ is mapped to a unique integer in the range 1, 2, …, length(unique(x)). This is useful for integer-encoding categorical data such as strings, or other complex discrete data structures.

The single-argument method allocates the levels vector internally.

dict gives the inverse mapping, i.e. from integers back to the original values.
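
A minimal, hedged sketch (it assumes `tolevels` is accessed through the module, since it may not be exported; the exact integer assignment shown is an example only):

using CausalityTools

x = ["yes", "no", "yes", "maybe"]
levels, dict = CausalityTools.tolevels(x)  # e.g. levels == [1, 2, 1, 3]; `dict` maps integers back to values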

CausalityTools.verify_decomposition_entropy_typeMethod
verify_decomposition_entropy_type(
    definition::MultivariateInformationMeasure, 
    est::Union{DiscreteInfoEstimator, DifferentialInfoEstimator}
)

Check that the measure given by definition can actually be decomposed in terms of est.definition. The default is to do nothing; certain definitions may override this check (e.g. CMIRenyiJizba does so).

ComplexityMeasures.codifyMethod
codify(encoding::CodifyPoints{N}, x::Vararg{<:AbstractStateSpaceSet, N})

Codify each timeseries xᵢ ∈ x according to the given encoding.

Examples

using CausalityTools

x = StateSpaceSet(rand(10000, 2))
y = StateSpaceSet(rand(10000, 3))
z = StateSpaceSet(rand(10000, 2))

# For `x`, we use a relative mean encoding.
ex = RelativeMeanEncoding(0.0, 1.0, n = 3)
# For `y`, we use a combination encoding.
ey = CombinationEncoding(
    RelativeMeanEncoding(0.0, 1.0, n = 2), 
    OrdinalPatternEncoding(3)
)
# For `z`, we use ordinal patterns to encode.
ez = OrdinalPatternEncoding(2)

# Codify two input datasets gives a 2-tuple of Vector{Int}
codify(CodifyPoints(ex, ey), x, y)

# Codify three input datasets gives a 3-tuple of Vector{Int}
codify(CodifyPoints(ex, ey, ez), x, y, z)
ComplexityMeasures.codifyMethod
codify(d::CodifyVariables, x::Vararg{<:AbstractStateSpaceSet, N})
codify(d::CodifyPoints, x::Vararg{<:AbstractStateSpaceSet, N})

Codify each timeseries xᵢ ∈ x according to the given encoding/discretization d.

Compatible discretizations

  • CodifyVariables (sliding-window encoding of each variable)
  • CodifyPoints (point-by-point encoding)

Examples

using CausalityTools

# Sliding window encoding
x = [0.1, 0.2, 0.3, 0.2, 0.1, 0.0, 0.5, 0.3, 0.5]
xc1 = codify(CodifyVariables(OrdinalPatterns(m=2)), x) # should give [1, 1, 2, 2, 2, 1, 2, 1]
xc2 = codify(OrdinalPatterns(m=2), x) # equivalent
length(xc1) < length(x) # should be true, because `OrdinalPatterns` delay embeds.  

# Point-by-point encoding
x, y = StateSpaceSet(rand(100, 3)), StateSpaceSet(rand(100, 3))
cx, cy = codify(CodifyPoints(OrdinalPatternEncoding(3)), x, y)
ComplexityMeasures.countsMethod
counts(o::UniqueElements, x₁, x₂, ..., xₙ) → Counts{N}
counts(encoding::CodifyPoints, x₁, x₂, ..., xₙ) → Counts{N}
counts(encoding::CodifyVariables, x₁, x₂, ..., xₙ) → Counts{N}

Construct an N-dimensional contingency table from the input iterables x₁, x₂, ..., xₙ which are such that length(x₁) == length(x₂) == ⋯ == length(xₙ).

If x₁, x₂, ..., xₙ are already discrete, then use UniqueElements as the first argument to directly construct the joint contingency table.

If x₁, x₂, ..., xₙ need to be discretized, provide as the first argument

  • CodifyPoints (encodes every point in each of the input variables xᵢs individually)
  • CodifyVariables (encodes every xᵢ individually using a sliding window encoding). NB: If using different OutcomeSpaces for the different xᵢ, then total_outcomes must be the same for every outcome space.

Examples

using CausalityTools

# Discretizing some non-discrete data using a sliding-window encoding for each variable
x, y = rand(100), rand(100)
c = CodifyVariables(OrdinalPatterns(m = 4))
counts(c, x, y)

# Discretizing the data by binning each individual data point
binning = RectangularBinning(3)
encoding = RectangularBinEncoding(binning, [x; y]) # give input values to ensure binning covers all data
c = CodifyPoints(encoding)
counts(c, x, y)

# Counts table for already discrete data
n = 50 # all variables must have the same number of elements
x = rand(["dog", "cat", "mouse"], n)
y = rand(1:3, n)
z = rand([(1, 2), (2, 1)], n)

counts(UniqueElements(), x, y, z)

See also: CodifyPoints, CodifyVariables, UniqueElements, OutcomeSpace, probabilities.

ComplexityMeasures.probabilitiesMethod
probabilities(o::UniqueElements, x₁, x₂, ..., xₙ) → Probabilities{N}
probabilities(encoding::CodifyPoints, x₁, x₂, ..., xₙ) → Probabilities{N}
probabilities(encoding::CodifyVariables, x₁, x₂, ..., xₙ) → Probabilities{N}

Construct an N-dimensional Probabilities array from the input iterables x₁, x₂, ..., xₙ which are such that length(x₁) == length(x₂) == ⋯ == length(xₙ).

Description

Probabilities are computed by first constructing a joint contingency matrix in the form of a Counts instance.

If x₁, x₂, ..., xₙ are already discrete, then use UniqueElements as the first argument to directly construct the joint contingency table.

If x₁, x₂, ..., xₙ need to be discretized, provide as the first argument

  • CodifyPoints (encodes every point in each of the input variables xᵢs individually)
  • CodifyVariables (encodes every xᵢ individually using a sliding window encoding).

Examples

using CausalityTools

# Discretizing some non-discrete data using a sliding-window encoding for each variable
x, y = rand(100), rand(100)
c = CodifyVariables(OrdinalPatterns(m = 4))
probabilities(c, x, y)

# Discretizing the data by binning each individual data point
binning = RectangularBinning(3)
encoding = RectangularBinEncoding(binning, [x; y]) # give input values to ensure binning covers all data
c = CodifyPoints(encoding)
probabilities(c, x, y)

# Joint probabilities for already discretized data
n = 50 # all variables must have the same number of elements
x = rand(["dog", "cat", "mouse"], n)
y = rand(1:3, n)
z = rand([(1, 2), (2, 1)], n)

probabilities(UniqueElements(), x, y, z)

See also: CodifyPoints, CodifyVariables, UniqueElements, OutcomeSpace.

DelayEmbeddings.embedMethod
embed(measure::CrossmapMeasure, target::AbstractVector,
    sources::AbstractVector...) → emb::StateSpaceSet{D}

Jointly embed the input vector target, together with one or more vectors s ∈ sources, according to the given CrossmapMeasure.

This produces emb, a D-dimensional StateSpaceSet where

  • The last column is always the non-lagged target variable. Typically, this is the variable we're trying to predict.
  • The first D-1 columns are the (possibly lagged) versions of each source time series s ∈ sources. Typically, emb[:, 1:D-1] is the subspace in which neighborhood searches are done, which forms the basis of cross-map predictions (see the sketch below).
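
A minimal, hedged sketch of the joint embedding (the ConvergentCrossMapping keyword arguments used here are assumptions):

using CausalityTools

t, s = rand(300), rand(300)                      # target and source time series
measure = ConvergentCrossMapping(d = 3, τ = -1)  # embedding dimension and lag (assumed keywords)
emb = embed(measure, t, s)                       # StateSpaceSet; last column is the non-lagged target
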
StatsAPI.pvalueMethod
pvalue(test::CorrTest, z, c::Int, n::Int)

Compute the two-sided p-value for the test of the partial correlation coefficient being zero, where c is the cardinality of the conditioning set and n is the number of samples.
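
For orientation, a hedged sketch of the standard Fisher z-transform p-value formula for (partial) correlation; that this is exactly what the method computes, and that z is already Fisher-transformed, are assumptions:

using Distributions  # hypothetical dependency in this sketch, for the standard normal CDF

# Two-sided p-value under the null of zero partial correlation, using the
# approximate standard normality of sqrt(n - c - 3) * z under the null.
corrtest_pvalue(z, c, n) = 2 * (1 - cdf(Normal(), sqrt(n - c - 3) * abs(z)))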
