ParetoSmooth

Documentation for ParetoSmooth.

ParetoSmooth.Psis
ParetoSmooth._generate_r_eff
ParetoSmooth.calc_ξ
ParetoSmooth.check_input_validity_psis
ParetoSmooth.check_tail
ParetoSmooth.def_tail_length
ParetoSmooth.do_psis_i!
ParetoSmooth.gpd_quantile
ParetoSmooth.gpdfit
ParetoSmooth.psis
ParetoSmooth.psis_n_eff
ParetoSmooth.psis_smooth_tail!
ParetoSmooth.relative_eff

ParetoSmooth.Psis — Type

Psis{V<:AbstractVector{F},I<:Integer} where {F<:AbstractFloat}

A struct containing the results of Pareto-smoothed improtance sampling.

Fields

weights: A vector of smoothed, truncated, and normalized importance sampling weights.
pareto_k: Estimates of the shape parameter $k$ of the generalized Pareto distribution.
ess: Estimated effective sample size for each LOO evaluation.
tail_len: Vector of tail lengths used for smoothing the generalized Pareto distribution.
dims: Named tuple of length 2 containing s (posterior sample size) and n (number of

observations).

ParetoSmooth._generate_r_eff — Method

Generate the relative effective sample size if not provided by the user.

ParetoSmooth.calc_ξ — Method

calc_ξ(sample, θ_hat)

Calculate ξ, the parameter for the GPD.

ParetoSmooth.check_input_validity_psis — Method

Make sure all inputs to psis are valid.

ParetoSmooth.check_tail — Method

Check the tail to make sure a GPD fit is possible.

ParetoSmooth.def_tail_length — Method

def_tail_length(log_ratios::AbstractVector, r_eff::AbstractFloat) -> tail_len::Integer

Define the tail length as in Vehtari et al. (2019).

ParetoSmooth.do_psis_i! — Method

do_psis_i!(is_ratios::AbstractVector{AbstractFloat}, tail_length::Integer)::T

Do PSIS on a single vector, smoothing its tail values.

Arguments

is_ratios::AbstractVector{AbstractFloat}: A vector of importance sampling ratios,

scaled to have a maximum of 1.

Returns

T<:AbstractFloat: ξ, the shape parameter for the GPD; big numbers indicate thick tails.

Extended help

Additional information can be found in the LOO package from R.

ParetoSmooth.gpd_quantile — Method

gpd_quantile(p::T, k::T, sigma::T) where {T<:AbstractFloat} -> T

Compute the p quantile of the Generalized Pareto Distribution (GPD).

Arguments

p: A scalar between 0 and 1.
ξ: A scalar shape parameter.
σ: A scalar scale parameter.

Returns

A quantile of the Generalized Pareto Distribution.

ParetoSmooth.gpdfit — Method

gpdfit(
    sample::AbstractVector{T<:AbstractFloat}; 
    wip::Bool=true, 
    min_grid_pts::Integer=30, 
    sort_sample::Bool=false
    ) -> (ξ::T, σ::T)

Return a named list of estimates for the parameters ξ (shape) and σ (scale) of the generalized Pareto distribution (GPD), assuming the location parameter is 0.

Arguments

sample::AbstractVector: A numeric vector. The sample from which to estimate the parameters.
wip::Bool = true: Logical indicating whether to adjust ξ based on a weakly informative Gaussian prior centered on 0.5. Defaults to true.
min_grid_pts::Integer = 30: The minimum number of grid points used in the fitting algorithm. The actual number used is min_grid_pts + ⌊sqrt(length(sample))⌋.

Note

Estimation method taken from Zhang, J. and Stephens, M.A. (2009). The parameter ξ is the negative of $k$.

ParetoSmooth.psis — Method

psis(
    log_ratios::AbstractArray{T:>AbstractFloat}, 
    r_eff; 
    source::String="mcmc", 
    log_weights::Bool=false
    ) -> Psis

Implements Pareto-smoothed importance sampling (PSIS).

Arguments

Positional Arguments

log_ratios::AbstractArray{T}: An array of importance ratios on the log scale (for

PSIS-LOO these are negative log-likelihood values). Indices must be ordered as [data, draw, chain]: log_ratios[1, 2, 3] should be the log-likelihood of the first data point, evaluated at the second iteration of the third chain. Chain indices can be left off if there is only a single chain, or if keyword argument chain_index is provided.

r_eff::AbstractArray{T}: An (optional) vector of relative effective sample sizes used

in ESS calculations. If left empty, calculated automatically using the FFTESS method from InferenceDiagnostics.jl. See relative_eff to calculate these values.

Keyword Arguments

chain_index::Vector{Integer}: An (optional) vector of integers indicating which chain

each sample belongs to.

source::String="mcmc": A string or symbol describing the source of the sample being

used. If "mcmc", adjusts ESS for autocorrelation. Otherwise, samples are assumed to be independent. Currently permitted values are ["mcmc", "vi", "other"].

log_weights::Bool=false: Return log weights, rather than the PSIS weights.

ParetoSmooth.psis_n_eff — Method

function psis_n_eff(
    weights::AbstractVector{T},
    r_eff::AbstractVector{T}
) -> AbstractVector{T}

ParetoSmooth.psis_smooth_tail! — Method

psis_smooth_tail!(tail::AbstractVector{T}, cutoff::T) where {T<:AbstractFloat} -> ξ::T

Takes an already sorted vector of observations from the tail and smooths it in place with PSIS before returning shape parameter ξ.

ParetoSmooth.relative_eff — Method

relative_eff(sample::AbstractArray{AbstractFloat, 3}; method=FFTESSMethod())

Compute the MCMC effective sample size divided by the nominal sample size.