ParetoSmooth

Documentation for ParetoSmooth.

ParetoSmooth.AbstractCV
ParetoSmooth.AbstractCVMethod
ParetoSmooth.ModelComparison
ParetoSmooth.Psis
ParetoSmooth.PsisLoo
ParetoSmooth.PsisLooMethod
ParetoSmooth._assume_one_chain
ParetoSmooth._check_input_validity_psis
ParetoSmooth._check_tail
ParetoSmooth._convert_to_array
ParetoSmooth._def_tail_length
ParetoSmooth._do_psis_i!
ParetoSmooth._generate_r_eff
ParetoSmooth._invalid_number
ParetoSmooth._psis_smooth_tail!
ParetoSmooth._throw_pareto_k_warning
ParetoSmooth.gpd_quantile
ParetoSmooth.gpdfit
ParetoSmooth.loo
ParetoSmooth.loo_compare
ParetoSmooth.loo_from_psis
ParetoSmooth.naive_lpd
ParetoSmooth.pointwise_log_likelihoods
ParetoSmooth.psis
ParetoSmooth.psis_ess
ParetoSmooth.psis_loo
ParetoSmooth.relative_eff
ParetoSmooth.sup_ess

ParetoSmooth.AbstractCV — Type

AbstractCV

An abstract type used in cross-validation.

ParetoSmooth.AbstractCVMethod — Type

AbstractCVMethod

An abstract type used to dispatch the correct method for cross validation.

ParetoSmooth.ModelComparison — Type

ModelComparison

A struct containing the results of model comparison.

Fields

pointwise::KeyedArray: An array containing
estimates::KeyedArray: A table containing the results of model comparison, with the following columns –
- cv_elpd: The difference in total leave-one-out cross validation scores between models.
- cv_avg: The difference in average LOO-CV scorees between models.
- weight: A set of Akaike-like weights assigned to each model, which can be used in pseudo-Bayesian model averaging.
std_err::NamedTuple: A named tuple containing the standard error of cv_elpd. Note that these estimators (incorrectly) assume all folds are independent, despite their substantial overlap, which creates a downward biased estimator. LOO-CV differences are not asymptotically normal, so these standard errors cannot be used to calculate a confidence interval.
gmpd::NamedTuple: The geometric mean of the predictive distribution. It equals the geometric mean of the probability assigned to each data point by the model, that is, exp(cv_avg). This measure is only meaningful for classifiers (variables with discrete outcomes). We can think of it as measuring how often the model was right: A model that always predicts incorrectly will have a GMPD of 0, while a model that always predicts correctly will have a GMPD of 1. However, the GMPD gives a model "Partial points" between 0 and 1 whenever the model assigns a probability other than 0 or 1 to the outcome that actually happened.

See also: PsisLoo

ParetoSmooth.Psis — Type

Psis{R<:Real, AT<:AbstractArray{R, 3}, VT<:AbstractVector{R}}

A struct containing the results of Pareto-smoothed importance sampling.

Fields

weights: A vector of smoothed, truncated, and normalized importance sampling weights.
pareto_k: Estimates of the shape parameter k of the generalized Pareto distribution.
ess: Estimated effective sample size for each LOO evaluation, based on the variance of the weights.
sup_ess: Estimated effective sample size for each LOO evaluation, based on the supremum norm, i.e. the size of the largest weight. More likely than ess to warn when importance sampling has failed. However, it can have a high variance.
r_eff: The relative efficiency of the MCMC chain, i.e. ESS / posterior sample size.
tail_len: Vector indicating how large the "tail" is for each observation.
posterior_sample_size: How many draws from an MCMC chain were used for PSIS.
data_size: How many data points were used for PSIS.

ParetoSmooth.PsisLoo — Type

PsisLoo <: AbstractCV

A struct containing the results of leave-one-out cross validation using Pareto smoothed importance sampling.

Fields

estimates::KeyedArray: A KeyedArray with columns :total, :se_total, :mean, :se_mean, and rows :cv_elpd, :naive_lpd, :p_eff. See # Extended help for more.
- :cv_elpd contains estimates for the out-of-sample prediction error, as estimated using leave-one-out cross validation.
- :naive_lpd contains estimates of the in-sample prediction error.
- :p_eff is the effective number of parameters – a model with a p_eff of 2 is "about as overfit" as a model with 2 parameters and no regularization.
pointwise::KeyedArray: A KeyedArray of pointwise estimates with 5 columns –
- :cv_elpd contains the estimated out-of-sample error for this point, as measured
using leave-one-out cross validation.
- :naive_lpd contains the in-sample estimate of error for this point.
- :p_eff is the difference in the two previous estimates.
- :ess is the L2 effective sample size, which estimates the simulation error caused by using Monte Carlo estimates. It does not measure model performance.
- :inf_ess is the supremum-based effective sample size, which estimates the simulation error caused by using Monte Carlo estimates. It is more robust than :ess and should therefore be preferred. It does not measure model performance.
- :pareto_k is the estimated value for the parameter ξ of the generalized Pareto distribution. Values above .7 indicate that PSIS has failed to approximate the true distribution.
psis_object::Psis: A Psis object containing the results of Pareto-smoothed importance sampling.
gmpd: The geometric mean of the predictive density. It is defined as the geometric mean of the probability assigned to each data point by the model, i.e. exp(cv_avg). This measure is only interpretable for classifiers (variables with discrete outcomes). We can think of it as measuring how often the model was right: A model that always predicts incorrectly will have a GMPD of 0, while a model that always predicts correctly will have a GMPD of 1. However, the GMPD gives a model "Partial points" between 0 and 1 whenever the model assigns a probability other than 0 or 1 to the outcome that actually happened, making it a fully Bayesian measure of model quality.
mcse: A float containing the estimated Monte Carlo standard error for the total cross-validation estimate.

Extended help

The total score depends on the sample size, and summarizes the weight of evidence for or against a model. Total scores are on an interval scale, meaning that only differences of scores are meaningful. It is not possible to interpret a total score by looking at it. The total score is not a goodness-of-fit statistic (for this, see the average score).

The average score is the total score, divided by the sample size. It estimates the expected log score, i.e. the expectation of the log probability density of observing the next point. The average score is a relative goodness-of-fit statistic which does not depend on sample size.

Unlike for chi-square goodness of fit tests, models do not have to be nested for model comparison using cross-validation methods.

See also: [loo]@ref, [bayes_cv]@ref, [psis_loo]@ref, [Psis]@ref

ParetoSmooth.PsisLooMethod — Type

PsisLooMethod

Use Pareto-smoothed importance sampling together with leave-one-out cross validation to estimate the out-of-sample predictive accuracy.

ParetoSmooth._assume_one_chain — Method

Assume that all objects belong to a single chain if chain index is missing. Inform user.

ParetoSmooth._check_input_validity_psis — Method

Make sure all inputs to psis are valid.

ParetoSmooth._check_tail — Method

Check the tail to make sure a GPD fit is possible.

ParetoSmooth._convert_to_array — Method

Convert a matrix+chain_index representation to a 3d array representation.

ParetoSmooth._def_tail_length — Method

_def_tail_length(log_ratios::AbstractVector, r_eff::Real) -> tail_len::Integer

Define the tail length as in Vehtari et al. (2019).

ParetoSmooth._do_psis_i! — Method

_do_psis_i!(is_ratios::AbstractVector{Real}, tail_length::Integer) -> T

Do PSIS on a single vector, smoothing its tail values.

Arguments

is_ratios::AbstractVector{Real}: A vector of importance sampling ratios, scaled to have a maximum of 1.

Returns

T<:Real: ξ, the shape parameter for the GPD; big numbers indicate thick tails.

ParetoSmooth._generate_r_eff — Method

Generate the relative effective sample size if not provided by the user.

ParetoSmooth._invalid_number — Method

Check if an input is invalid.

ParetoSmooth._psis_smooth_tail! — Method

_psis_smooth_tail!(tail::AbstractVector{T}, cutoff::T) where {T<:Real} -> ξ::T

Takes an already sorted vector of observations from the tail and smooths it in place with PSIS before returning shape parameter ξ.

ParetoSmooth._throw_pareto_k_warning — Method

Throw a warning if any pareto_k values exceed 0.7.

ParetoSmooth.gpd_quantile — Method

gpd_quantile(p::T, k::T, sigma::T) where {T<:Real} -> T

Compute the p quantile of the Generalized Pareto Distribution (GPD).

Arguments

p: A scalar between 0 and 1.
ξ: A scalar shape parameter.
σ: A scalar scale parameter.

Returns

A quantile of the Generalized Pareto Distribution.

ParetoSmooth.gpdfit — Method

gpdfit(
    sample::AbstractVector{T<:Real}; 
    wip::Bool=true, 
    min_grid_pts::Integer=30, 
    sort_sample::Bool=false
) -> (ξ::T, σ::T)

Return a named list of estimates for the parameters ξ (shape) and σ (scale) of the generalized Pareto distribution (GPD), assuming the location parameter is 0.

Arguments

sample::AbstractVector: A numeric vector. The sample from which to estimate the parameters.
wip::Bool = true: Logical indicating whether to adjust ξ based on a weakly informative Gaussian prior centered on 0.5. Defaults to true.
min_grid_pts::Integer = 30: The minimum number of grid points used in the fitting algorithm. The actual number used is min_grid_pts + ⌊sqrt(length(sample))⌋.

Note

Estimation method taken from Zhang, J. and Stephens, M.A. (2009). The parameter ξ is the negative of k.

ParetoSmooth.loo — Method

function loo(args...; method=PsisLooMethod(), kwargs...) -> PsisLoo

Compute the approximate leave-one-out cross-validation score using the specified method.

Currently, this function only serves to call psis_loo, but this could change in the future. The default methods or return type may change without warning; thus, we recommend using psis_loo instead if reproducibility is required.

See also: psis, loo, PsisLoo.

ParetoSmooth.naive_lpd — Method

naive_lpd(log_likelihood::AbstractArray{var"#s9376", 3} where var"#s9376"<:Real) -> Any

Calculate the naive (in-sample) estimate of the expected log probability density, otherwise known as the in-sample Bayes score. Not recommended for most uses.

ParetoSmooth.pointwise_log_likelihoods — Method

pointwise_log_likelihoods(
    ll_fun::Function, samples::AbstractArray{<:Real,3}, data;
    splat::Bool=true[, chain_index::Vector{<:Integer}]
)

Compute the pointwise log likelihoods.

Arguments

ll_fun::Function: A function taking a single data point and returning the log-likelihood

of that point. This function must take the form f(θ[1], ..., θ[n], data), where θ is the parameter vector. See also the splat keyword argument.

samples::AbstractArray: A three dimensional array of MCMC samples. Here, the first dimension should indicate the step of the MCMC algorithm; the second dimension should indicate the parameter; and the third should indicate the chain.
data: An array of data points used to estimate the parameters of the model.
splat: If true (default), f must be a function of n different parameters. Otherwise, f is assumed to be a function of a single parameter vector.
chain_index::Vector: An optional vector of integers specifying which chain each step

belongs to. For instance, chain_index[step] should return 2 if log_likelihood[:, step] belongs to the second chain.

Returns

Array: A three dimensional array of pointwise log-likelihoods.

ParetoSmooth.psis — Method

psis(
    log_ratios::AbstractArray{T<:Real}, 
    r_eff::AbstractVector; 
    source::String="mcmc", 
    log_weights::Bool=false
) -> Psis

Implements Pareto-smoothed importance sampling (PSIS).

Arguments

Positional Arguments

log_ratios::AbstractArray: A 2d or 3d array of (unnormalized) importance ratios on the log scale. Indices must be ordered as [data, step, chain]. The chain index can be left off if there is only one chain, or if keyword argument chain_index is provided.
r_eff::AbstractVector: An (optional) vector of relative effective sample sizes used in ESS

calculations. If left empty, calculated automatically using the FFTESS method from InferenceDiagnostics.jl. See relative_eff to calculate these values.

Keyword Arguments

chain_index::Vector: An optional vector of integers specifying which chain each step

belongs to. For instance, chain_index[step] should return 2 if log_likelihood[:, step] belongs to the second chain.

source::String="mcmc": A string or symbol describing the source of the sample being used. If "mcmc", adjusts ESS for autocorrelation. Otherwise, samples are assumed to be independent. Currently permitted values are ["mcmc", "vi", "other"].
log_weights::Bool=false: Return the log weights, rather than the PSIS weights.

See also: [relative_eff]@ref, [psis_loo]@ref, [psis_ess]@ref.

ParetoSmooth.psis_ess — Method

function psis_ess(
    weights::AbstractVector{T},
    r_eff::AbstractVector{T}
) -> AbstractVector{T}

Calculate the (approximate) effective sample size of a PSIS sample, using the correction in Vehtari et al. 2019. This uses the variance-based definition of ESS, and measures the L2 distance of the proposal and target distributions.

Arguments

weights: A set of importance sampling weights derived from PSIS.
r_eff: The relative efficiency of the MCMC chains from which PSIS samples were derived.

See ?relative_eff to calculate r_eff.

ParetoSmooth.psis_loo — Method

function psis_loo(
    log_likelihood::Array{Real} [, args...];
    [, chain_index::Vector{Integer}, kwargs...]
) -> PsisLoo

Use Pareto-Smoothed Importance Sampling to calculate the leave-one-out cross validation score.

Arguments

log_likelihood::Array: A matrix or 3d array of log-likelihood values indexed as [data, step, chain]. The chain argument can be left off if chain_index is provided or if all posterior samples were drawn from a single chain.
args...: Positional arguments to be passed to psis.
chain_index::Vector: An optional vector of integers specifying which chain each step

belongs to. For instance, chain_index[step] should return 2 if log_likelihood[:, step] belongs to the second chain.

kwargs...: Keyword arguments to be passed to psis.