Docstrings · Extremes.jl

Extremes.AbstractExtremeValueModel — Type

AbstractExtremeValueModel

Abstract type containing the extreme value model types.

BlockMaxima
ThresholdExceedance

Extremes.AbstractFittedExtremeValueModel — Type

AbstractFittedExtremeValueModel{T<:AbstractExtremeValueModel}

Abstract type containing the fitted extreme value model types.

BayesianAbstractExtremeValueModel
MaximumLikelihoodAbstractExtremeValueModel
pwmAbstractExtremeValueModel

Extremes.BlockMaxima — Method

BlockMaxima{GeneralizedExtremeValue}(data::Vector{<:Real};
    locationcov::Vector{Variable} = Vector{Variable}(),
    logscalecov::Vector{Variable} = Vector{Variable}(),
    shapecov::Vector{Variable} = Vector{Variable}())::BlockMaxima

Creates a BlockMaxima structure.

Extremes.BlockMaxima — Method

BlockMaxima{Gumbel}(data::Vector{<:Real};
    locationcov::Vector{Variable} = Vector{Variable}(),
    logscalecov::Vector{Variable} = Vector{Variable}())::BlockMaxima

Creates a BlockMaxima{Gumbel} structure.

Extremes.Cluster — Type

Cluster(u₁::Real,u₂::Real,position::Vector{<:Int},value::Vector{<:Real})

Cluster type.

Extremes.Flat — Type

Flat()

Construct a Flat <: ContinuousUnivariateDistribution object representing an improper uniform distribution on the real line.

Extremes.ReturnLevel — Type

ReturnLevel

ReturnLevel type constructed by the function returnlevel.

Extremes.ThresholdExceedance — Method

ThresholdExceedance(exceedances::Vector{<:Real};
    logscalecov::Vector{<:DataItem} = Vector{Variable}(),
    shapecov::Vector{<:DataItem} = Vector{Variable}())::ThresholdExceedance

Creates a ThresholdExceedance structure.

Extremes.Variable — Type

Variable(name::String, value :: Vector{<:Real})

Construct a Variable type

Extremes.VariableStd — Method

VariableStd(name::String, z::Vector{<:Real})::VariableStd

Construct a VariableStd type from the standardized vector z with the name name.

Base.length — Method

length(c::Cluster)

Compute the cluster length.

Base.maximum — Method

max(c::Cluster)

Compute the cluster maximum.

Base.show — Method

Base.show(io::IO, obj::AbstractExtremeValueModel)

Override of the show function for the objects of type AbstractExtremeValueModel.

Base.show — Method

Base.show(io::IO, obj::AbstractFittedExtremeValueModel)

Override of the show function for the objects of type AbstractFittedExtremeValueModel.

Base.show — Method

Base.show(io::IO, obj::Cluster)

Override of the show function for the objects of type AbstractExtremeValueModel.

Base.show — Method

Base.show(io::IO, obj::ReturnLevel)

Override of the show function for the objects of type ReturnLevel.

Base.sum — Method

sum(c::Cluster)

Compute the cluster sum.

Distributions.location — Method

location(fm::AbstractFittedExtremeValueModel)

Return the location parameters of the fitted model.

Distributions.scale — Method

scale(fm::AbstractFittedExtremeValueModel)

Return the scale parameters of the fitted model.

Distributions.shape — Method

shape(fm::AbstractFittedExtremeValueModel)

Return the shape parameters of the fitted model.

Distributions.shape — Method

shape(pd::Gumbel)

Return the Gumbel distribution shape parameter value, i.e. 0.

Extremes.aic — Method

aic(fm:::MaximumLikelihoodAbstractExtremeValueModel)

Compute the Akaike information criterion (AIC) of the fitted model by maximum likelihood method.

Details

The AIC is defined as follows:

$AIC = 2 k - 2 \log \hat{L};$

where $k$ is the number of estimated parameters and $\hat{L}$ is the maximized value of the likelihood function for the model.

Extremes.bic — Method

bic(fm:::MaximumLikelihoodAbstractExtremeValueModel)

Compute the Bayesian information criterion (BIC) of the fitted model by maximum likelihood method.

Details

The BIC is defined as follows:

$BIC = k \log n - 2 \log \hat{L};$

where $k$ is the number of estimated parameters, $n$ is the number of data and $\hat{L}$ is the maximized value of the likelihood function for the model.

Extremes.buildVariables — Method

buildVariables(df::DataFrame, ids::Vector{Symbol})::Vector{Variable}

Build the Variable type from the columns ids of the DataFrame df.

Example

julia> df = Extremes.dataset("fremantle")
julia> Extremes.buildVariables(df, [:Year, :SOI])

Extremes.checknonstationarity — Method

checknonstationarity(model::AbstractExtremeValueModel)

Check if the extreme value model model is nonstationary.

Extremes.checkstationarity — Method

checkstationarity(model::AbstractExtremeValueModel)

Check if the extreme value model model is stationary.

Extremes.cint — Function

cint(..., confidencelevel::Real=.95)

Compute confidence interval or credible interval in the case of Bayesian estimation.

The function can be applied on any AbstractFittedExtremeValueModel subtype to obtain a confidence interval on the model parameters. It can also be applied on ReturnLevel type to obtain a confidence interval on the return level.

Implementation

The method used for computing the interval depends on the estimation method. In the case of maximum likelihood estimation, the confidence intervals are computed using the Wald approximation based on the approximate parameter estimates covariance matrix. In the case of Bayesian estimation, the return interval is the highest posterior density estimate based on the MCMC sample. In the case of probability weighted moment estimation, the intervals are computed using a boostrap procedure.

Extremes.computeparamfunction — Method

computeparamfunction(covariates::Vector{Variable})

Establish the parameter as function of the corresponding covariates.

Extremes.dataset — Method

dataset(name::String)::DataFrame

Load the dataset associated with name.

Some datasets used by Coles (2001) are available using the following names:

portpirie: annual maximum sea-levels in Port Pirie,
glass: breaking strengths of glass fibers
fremantle: annual maximum sea-levels in Fremantle
rain: daily rainfall accumulations in south-west England
wooster: daily minimum temperatures recorded in Wooster
dowjones: daily closing prices of the Dow Jones Index

These datasets have been retrieved using the R package ismev.

Examples

julia> Extremes.dataset("portpirie")

Extremes.delta — Method

delta(g::Function, θ̂::AbstractVector{<:Real}, H::AbstractPDMat)

Compute the variance of the function g of estimated paramters θ̂ with negative observed information matrix H.

Detail

Hcorresponds to the Hessian matrix of the negative log likelihood.

Extremes.diagnosticplots — Method

diagnosticplots(fm::AbstractFittedExtremeValueModel)

Diagnostic plots

Extremes.ecdf — Method

ecdf(y::Vector{<:Real})::Tuple{Vector{<:Real}, Vector{<:Real}}

Compute the empirical cumulative distribution function using the Gumbel formula.

The empirical quantiles are computed using the Gumbel plotting positions as as recommended by Makkonen (2006).

Example

julia> (x, F̂) = Extremes.ecdf(y)

Reference

Makkonen, L. (2006). Plotting positions in extreme value analysis. Journal of Applied Meteorology and Climatology, 45(2), 334-340.

Extremes.findposteriormode — Method

findposteriormode(fm::BayesianAbstractExtremeValueModel)::Vector{<:Real}

Find the maximum a posteriori probability (MAP) estimate.

Extremes.fit — Method

fit(model::AbstractExtremeValueModel; initialvalues::Vector{<:Real})::MaximumLikelihoodAbstractExtremeValueModel

Fit the extreme value model by maximum likelihood.

Extremes.fit — Method

fit(model::AbstractExtremeValueModel)::MaximumLikelihoodAbstractExtremeValueModel

Fit the extreme value model by maximum likelihood.

Extremes.fitbayes — Method

fitbayes(model::AbstractExtremeValueModel; niter::Int=5000, warmup::Int=2000)::BayesianAbstractExtremeValueModel

Fit the extreme value model under the Bayesian paradigm.

Extremes.fitpwmfunction — Method

fitpwmfunction(fm::pwmAbstractExtremeValueModel{BlockMaxima{GeneralizedExtremeValue})::Function

Returns the corresponding fitpwm function.

Extremes.fitpwmfunction — Method

fitpwmfunction(fm::pwmAbstractExtremeValueModel{BlockMaxima{Gumbel}})::Function

Returns the corresponding fitpwm function.

Extremes.fitpwmfunction — Method

fitpwmfunction(fm::pwmAbstractExtremeValueModel{ThresholdExceedance})::Function

Returns the corresponding fitpwm function.

Extremes.getcluster — Method

getcluster(y::Vector{<:Real}, u₁::Real, u₂::Real)

Extract the clusters from vector y.

A cluster is defined as a sequence of values higher than threshold u₂ with at least a value higher than threshold u₁.

See also Cluster.

Extremes.getcovariatenumber — Function

getcovariatenumber(model::AbstractExtremeValueModel)::Int

Return the number of covariates.

Extremes.getdistribution — Function

getdistribution(model::AbstractExtremeValueModel, θ::Vector{<:Real})
getdistribution(fm::AbstractFittedExtremeValueModel)

Return the distributions corresponding to the model or the fitted model.

If an extreme value model is provided, the distributions corresponding to the parameter vector θ are returned. If a fitted extreme value model is provident, the distributions corresponding to the parameter estimates are returned.

Implementation

In the stationary case, a single extreme value distribution is returned.

In the non-stationary case, a vector of extreme value distributions is returned, one for each data value.

In the Bayesian fitted model case, a array of distributions is returned where each column corresponds to a MCMC iteration.

Extremes.getdistribution — Method

getdistribution(fittedmodel::MaximumLikelihoodAbstractExtremeValueModel)::Vector{<:Distribution}

Return the fitted distribution in case of stationarity or the vector of fitted distribution in case of non-stationarity.

Extremes.getdistribution — Method

getdistribution(fittedmodel::pwmAbstractExtremeValueModel)::Vector{<:Distribution}

Return the fitted distribution for the model fitted with the probability weigthed moments.

Extremes.getinitialvalue — Function

getinitialvalue(model::AbstractExtremeValueModel)

Get an initial estimates of the model parameters.

Extremes.getinitialvalue — Method

getinitialvalue(::Type{GeneralizedExtremeValue},y::Vector{<:Real})::Vector{<:Real}

Compute the initial values of the GEV parameters given the data y.

If the probability weighted moment estimations are valid, then those values are returned. Otherwise, the probability weighted moment estimations of the Gumbel distribution are returned.

Example

julia> Extremes.getinitialvalue(GeneralizedExtremeValue, y)

Extremes.getinitialvalue — Method

getinitialvalue(::Type{GeneralizedPareto},y::Vector{<:Real})::Vector{<:Real}

Compute the initial values of the GP distribution parameters given the data y.

If the probability weighted moment estimations are valid, then those values are returned. Otherwise, the moment estimation of the exponential distribution is returned.

Example

julia> Extremes.getinitialvalue(GeneralizedPareto, y)

Extremes.gevfit — Function

gevfit()

Estimate the GEV parameters by maximum likelihood.

Implementation

The function uses Nelder-Mead solver implemented in the Optim.jl package to find the point where the log-likelihood is maximal.

The GEV parameters can be modeled as function of covariates as follows:

\[μ = X₁ × β₁,\]

\[ϕ = X₂ × β₂,\]

\[ξ = X₃ × β₃.\]

The covariates are standardized before estimating the parameters to help fit the model. They are transformed back on their original scales before returning the fitted model.

See also gevfit for the other methods, gevfitpwm, gevfitbayes and BlockMaxima.

Extremes.gevfit — Method

gevfit(model::{BlockMaxima{GeneralizedExtremeValue}}, initialvalues::Vector{<:Real})

Estimate the parameters of the BlockMaxima model using the given initialvalues.

Extremes.gevfit — Method

gevfit(df::DataFrame,
    datacol::Symbol,
    locationcovid = Vector{Symbol}(),
    logscalecovid = Vector{Symbol}(),
    shapecovid = Vector{Symbol}()
    )

Estimate the GEV parameters.

Arguments

df::DataFrame: The dataframe containing the data.
datacol::Symbol: The symbol of the column of df containing the block maxima data.
initialvalues::Vector{<:Real}: Vector of parameters initial values.
locationcovid::Vector{Symbol} = Vector{Symbol}(): The symbols of the columns of df containing the covariates of the location parameter.
logscalecovid::Vector{Symbol} = Vector{Symbol}(): The symbols of the columns of df containing the covariates of the log-scale parameter.
shapecovid::Vector{Symbol} = Vector{Symbol}(): The symbols of the columns of df containing the covariates of the shape parameter.

Extremes.gevfit — Method

gevfit(df::DataFrame,
    datacol::Symbol,
    locationcovid = Vector{Symbol}(),
    logscalecovid = Vector{Symbol}(),
    shapecovid = Vector{Symbol}()
    )

Estimate the GEV parameters.

Arguments

df::DataFrame: The dataframe containing the data.
datacol::Symbol: The symbol of the column of df containing the block maxima data.
locationcovid::Vector{Symbol} = Vector{Symbol}(): The symbols of the columns of df containing the covariates of the location parameter.
logscalecovid::Vector{Symbol} = Vector{Symbol}(): The symbols of the columns of df containing the covariates of the log-scale parameter.
shapecovid::Vector{Symbol} = Vector{Symbol}(): The symbols of the columns of df containing the covariates of the shape parameter.

Extremes.gevfit — Method

gevfit(y,
    initialvalues,
    locationcov = Vector{Variable}(),
    logscalecov = Vector{Variable}(),
    shapecov = Vector{Variable}()
    )

Estimate the GEV parameters.

Arguments

y::Vector{<:Real}: the vector of block maxima.
initialvalues::Vector{<:Real}: Vector of parameters initial values.
locationcov::Vector{<:DataItem} = Vector{Variable}(): The covariates of the location parameter.
logscalecov::Vector{<:DataItem} = Vector{Variable}(): The covariates of the log-scale parameter.
shapecov::Vector{<:DataItem} = Vector{Variable}(): The covariates of the shape parameter.

Extremes.gevfit — Method

gevfit(y,
    locationcov = Vector{Variable}(),
    logscalecov = Vector{Variable}(),
    shapecov = Vector{Variable}()
    )

Estimate the GEV parameters.

Arguments

y::Vector{<:Real}: The vector of block maxima.
locationcov::Vector{<:DataItem} = Vector{Variable}(): The covariates of the location parameter.
logscalecov::Vector{<:DataItem} = Vector{Variable}(): The covariates of the log-scale parameter.
shapecov::Vector{<:DataItem} = Vector{Variable}(): The covariates of the shape parameter.

Extremes.gevfitbayes — Function

gevfitbayes(..., niter::Int=5000, warmup::Int=2000)

Generate a sample from the GEV parameters' posterior distribution.

Arguments

niter::Int = 5000: The total number of MCMC iterations.
warmup::Int = 2000: The number of warmup iterations (burn-in).

Implementation

The function uses the No-U-Turn Sampler (NUTS; Hoffman and Gelman, 2014) implemented in the Mamba.jl package to generate a random sample from the posterior distribution.

Currently, only the improper uniform prior is implemented, i.e.

\[f_{(β₁,β₂,β₃)}(β₁,β₂,β₃) ∝ 1,\]

where

\[μ = X₁ × β₁,\]

\[ϕ = X₂ × β₂,\]

\[ξ = X₃ × β₃.\]

In the stationary case, this improper prior yields to a proper posterior if the sample size is larger than 3 (Northrop and Attalides, 2016).

The covariates are standardized before estimating the parameters to help fit the model. They are transformed back on their original scales before returning the fitted model.

See also gevfitbayes for the other methods, gevfitpwm, gevfit and BlockMaxima.

References

Hoffman M. D. and Gelman A. (2014). The No-U-Turn sampler: adaptively setting path lengths in Hamiltonian Monte Carlo. Journal of Machine Learning Research, 15:1593–1623.

Paul J. Northrop P. J. and Attalides N. (2016). Posterior propriety in Bayesian extreme value analyses using reference priors. Statistica Sinica, 26:721-743.

Extremes.gevfitbayes — Method

gevfitbayes(model::BlockMaxima{GeneralizedExtremeValue};
    niter::Int=5000,
    warmup::Int=2000)

Generate a sample from the BlockMaxima model parameters' posterior distribution.

Extremes.gevfitbayes — Method

gevfitbayes(df::DataFrame,
    datacol::Symbol,
    locationcovid = Vector{Symbol}(),
    logscalecovid = Vector{Symbol}(),
    shapecovid = Vector{Symbol}(),
    niter::Int=5000,
    warmup::Int=2000)

Generate a sample from the GEV parameters' posterior distribution.

Arguments

df::DataFrame: The dataframe containing the data.
datacol::Symbol: The symbol of the column of df containing the block maxima data.
locationcovid::Vector{Symbol} = Vector{Symbol}(): The symbols of the columns of df containing the covariates of the location parameter.
logscalecovid::Vector{Symbol} = Vector{Symbol}(): The symbols of the columns of df containing the covariates of the log-scale parameter.
shapecovid::Vector{Symbol} = Vector{Symbol}(): The symbols of the columns of df containing the covariates of the shape parameter.

Extremes.gevfitbayes — Method

gevfitbayes(y,
    locationcov = Vector{Variable}(),
    logscalecov = Vector{Variable}(),
    shapecov = Vector{Variable}(),
    niter::Int=5000,
    warmup::Int=2000
    )

Generate a sample from the GEV parameters' posterior distribution.

Arguments

y::Vector{<:Real}: The vector of block maxima.
locationcov::Vector{<:DataItem} = Vector{Variable}(): The covariates of the location parameter.
logscalecov::Vector{<:DataItem} = Vector{Variable}(): The covariates of the log-scale parameter.
shapecov::Vector{<:DataItem} = Vector{Variable}(): The covariates of the shape parameter.

Extremes.gevfitpwm — Function

gevfitpwm(...)

Estimate the GEV parameters with the probability weighted moments.

Implementation

Estimation with the probability weighted moments, as described by Hosking et al. (1985), is only possible in the stationary case.

See also gevfitpwm for the other methods, gevfit, gevfitbayes and BlockMaxima.

Reference

Hosking, J. R. M., Wallis, J. R. and Wood, E. F. (1985). Estimation of the generalized extreme-value distribution by the method of probability-weighted moments. Technometrics, 27:251-261.

Extremes.gevfitpwm — Method

gevfitpwm(model::BlockMaxima{GeneralizedExtremeValue})

Estimate the GEV parameters with the probability weighted moments.

Extremes.gevfitpwm — Method

gevfitpwm(df::DataFrame, datacol::Symbol)

Estimate the GEV parameters with the probability weighted moments.

Block maxima data are in the column datacol of the dataframe df.

Extremes.gevfitpwm — Method

gevfitpwm(y::Vector{<:Real})

Estimate the GEV parameters with the probability weighted moments.

Extremes.gpfit — Function

gpfit(...)

Estimate the GP parameters by maximum likelihood.

Data provided must be the exceedances above the threshold, i.e. the data above the threshold minus the threshold.

Implementation

The function uses Nelder-Mead solver implemented in the Optim.jl package to find the point where the log-likelihood is maximal.

The GP parameters can be modeled as function of covariates as follows:

\[ϕ = X₂ × β₂,\]

\[ξ = X₃ × β₃.\]

The covariates are standardized before estimating the parameters to help fit the model. They are transformed back on their original scales before returning the fitted model.

See also gpfit for the other methods, gpfitpwm, gpfitbayes and ThresholdExceedance.

Extremes.gpfit — Method

gpfit(df::DataFrame,
    datacol::Symbol,
    logscalecovid = Vector{Symbol}(),
    shapecovid = Vector{Symbol}()
    )

Estimate the GP parameters

Arguments

df::DataFrame: The dataframe containing the data.
datacol::Symbol: The symbol of the column of df containing the exceedances.
initialvalues::Vector{<:Real}: Vector of parameters initial values.
logscalecovid::Vector{Symbol} = Vector{Symbol}(): The symbols of the columns of df containing the covariates of the log-scale parameter.
shapecovid::Vector{Symbol} = Vector{Symbol}(): The symbols of the columns of df containing the covariates of the shape parameter.

Extremes.gpfit — Method

gpfit(df::DataFrame,
    datacol::Symbol,
    logscalecovid = Vector{Symbol}(),
    shapecovid = Vector{Symbol}()
    )

Estimate the GP parameters

Arguments

df::DataFrame: The dataframe containing the data.
datacol::Symbol: The symbol of the column of df containing the exceedances.
logscalecovid::Vector{Symbol} = Vector{Symbol}(): The symbols of the columns of df containing the covariates of the log-scale parameter.
shapecovid::Vector{Symbol} = Vector{Symbol}(): The symbols of the columns of df containing the covariates of the shape parameter.

Extremes.gpfit — Method

gpfit(model::ThresholdExceedance, initialvalues::Vector{<:Real})

Estimate the parameters of the ThresholdExceedance model using the given initialvalues.

Extremes.gpfit — Method

gpfit(y,
    initialvalues;
    logscalecov = Vector{Variable}(),
    shapecov = Vector{Variable}()
    )

Estimate the GP parameters

Arguments

y::Vector{<:Real}: The vector of exceedances.
initialvalues::Vector{<:Real}: The vector of parameters initial values.
logscalecov::Vector{<:DataItem} = Vector{Variable}(): The covariates of the log-scale parameter.
shapecov::Vector{<:DataItem} = Vector{Variable}(): The covariates of the shape parameter.

Extremes.gpfit — Method

gpfit(y,
    logscalecov = Vector{Variable}(),
    shapecov = Vector{Variable}()
    )

Estimate the GP parameters

Arguments

y::Vector{<:Real}: The vector of exceedances.
logscalecov::Vector{<:DataItem} = Vector{Variable}(): The covariates of the log-scale parameter.
shapecov::Vector{<:DataItem} = Vector{Variable}(): The covariates of the shape parameter.

Extremes.gpfitbayes — Function

gpfitbayes(..., niter::Int=5000, warmup::Int=2000)

Generate a sample from the GP parameters' posterior distribution.

Data provided must be the exceedances above the threshold, i.e. the data above the threshold minus the threshold.

Arguments

niter::Int = 5000: The total number of MCMC iterations
warmup::Int = 2000: The number of warmup iterations (burn-in).

Implementation

The function uses the No-U-Turn Sampler (NUTS; Hoffman and Gelman, 2014) implemented in the Mamba.jl package to generate a random sample from the posterior distribution.

Currently, only the improper uniform prior is implemented, i.e.

\[f_{(β₂,β₃)}(β₂,β₃) ∝ 1,\]

where

\[ϕ = X₂ × β₂,\]

\[ξ = X₃ × β₃.\]

In the stationary case, this improper prior yields to a proper posterior if the sample size is larger than 2 (Northrop and Attalides, 2016).

The covariates are standardized before estimating the parameters to help fit the model. They are transformed back on their original scales before returning the fitted model.

See also gpfitbayes for the other methods, gpfitpwm, gpfit and ThresholdExceedance.

Reference

Hoffman M. D. and Gelman A. (2014). The No-U-Turn sampler: adaptively setting path lengths in Hamiltonian Monte Carlo. Journal of Machine Learning Research, 15:1593–1623.

Paul J. Northrop P. J. and Attalides N. (2016). Posterior propriety in Bayesian extreme value analyses using reference priors. Statistica Sinica, 26:721-743.

Extremes.gpfitbayes — Method

gpfitbayes(df::DataFrame,
    datacol::Symbol,
    logscalecovid = Vector{Symbol}(),
    shapecovid = Vector{Symbol}(),
    niter::Int=5000,
    warmup::Int=2000)

Generate a sample from the GP parameters' posterior distribution.

Arguments

df::DataFrame: The dataframe containing the data.
datacol::Symbol: The symbol of the column of df containing the exceedances.
logscalecovid::Vector{Symbol} = Vector{Symbol}(): The symbols of the columns of df containing the covariates of the log-scale parameter.
shapecovid::Vector{Symbol} = Vector{Symbol}(): The symbols of the columns of df containing the covariates of the shape parameter.

Extremes.gpfitbayes — Method

gpfitbayes(model::ThresholdExceedance;
    niter::Int=5000,
    warmup::Int=2000)

Generate a sample from the GP parameters' posterior distribution.

Extremes.gpfitbayes — Method

gpfitbayes(y,
    logscalecov = Vector{Variable}(),
    shapecov = Vector{Variable}(),
    niter::Int=5000,
    warmup::Int=2000
    )

Generate a sample from the GP parameters' posterior distribution.

Arguments

y::Vector{<:Real}: The vector of exceedances.
logscalecov::Vector{<:DataItem} = Vector{Variable}(): The covariates of the log-scale parameter.
shapecov::Vector{<:DataItem} = Vector{Variable}(): The covariates of the shape parameter.

Extremes.gpfitpwm — Function

gpfitpwm(...)

Estimate the GP parameters with the probability weighted moments.

Implementation

Estimation with the probability weighted moments, as described by Hosking and Wallis (1987), is only possible in the stationary case.

See also gpfitpwm for the other methods, gpfit, gpfitbayes and ThresholdExceedance.

Reference

Hosking, J. R. M. and Wallis, J. R. (1987). Parameter and quantile estimation for the Generalized Pareto distribution, Technometrics, 29:339-349.

Extremes.gpfitpwm — Method

gpfitpwm(df::DataFrame, datacol::Symbol)

Estimate the GP parameters with the probability weighted moments.

Block maxima data are in the column datacol of the dataframe df.

Extremes.gpfitpwm — Method

gpfitpwm(model::ThresholdExceedance)

Estimate the GP parameters with the probability weighted moments.

Extremes.gpfitpwm — Method

gpfitpwm(y::Vector{<:Real})

Estimate the GP parameters with the probability weighted moments.

Extremes.gumbelfit — Function

gumbelfit()

Estimate the Gumbel parameters by maximum likelihood.

Details

The Gumbel distribution is a particular case of the generalized extreme value distribution when the shape parameter is zero.

Extreme value theory

In extreme value theory, it is best to avoid the choice of a sub-family of extreme value familu as the Gumbel. This is because the choice of family is made with the data at hand, and when extrapolating to large quantiles, i.e. larger than the range of the data, the uncertainty associated with this choice is not taken into account. If the data suggest that the Gumbel family is the best one, this does not imply that the other families are not plausible. In applications, the confidence intervals on the shape parameter are often wide, representing the difficulty of discriminating the tail behavior using only the limited number of data. Therefore, the use of the GEV distribution for the block maxima model makes more sense. As Coles (2001) also argued in Page 64, this is "...the safest option is to accept there is uncertainty about the value of the shape parameter ... and to prefer the inference based on the GEV model. The larger measures of uncertainty generated by the GEV model then provide a more realistic quantification of the genuine uncertainties involved in model extrapolation."

Implementation

The function uses Nelder-Mead solver implemented in the Optim.jl package to find the point where the log-likelihood is maximal.

The Gumbel parameters can be modeled as function of covariates as follows:

\[μ = X₁ × β₁,\]

\[ϕ = X₂ × β₂,\]

The covariates are standardized before estimating the parameters to help fit the model. They are transformed back on their original scales before returning the fitted model.

See also gumbelfit for the other methods and gevfit for estiming the parameters of the GEV distribution.

Extremes.gumbelfit — Method

gumbelfit(model::{BlockMaxima{Gumbel}}, initialvalues::Vector{<:Real})

Estimate the parameters of the BlockMaxima model using the given initialvalues.

Extremes.gumbelfit — Method

gumbelfit(df::DataFrame,
    datacol::Symbol,
    locationcovid = Vector{Symbol}(),
    logscalecovid = Vector{Symbol}()
    )

Estimate the Gumbel parameters.

Arguments

df::DataFrame: The dataframe containing the data.
datacol::Symbol: The symbol of the column of df containing the block maxima data.
initialvalues::Vector{<:Real}: Vector of parameters initial values.
locationcovid::Vector{Symbol} = Vector{Symbol}(): The symbols of the columns of df containing the covariates of the location parameter.
logscalecovid::Vector{Symbol} = Vector{Symbol}(): The symbols of the columns of df containing the covariates of the log-scale parameter.

Extremes.gumbelfit — Method

gumbelfit(df::DataFrame,
    datacol::Symbol,
    locationcovid = Vector{Symbol}(),
    logscalecovid = Vector{Symbol}(),
    shapecovid = Vector{Symbol}()
    )

Estimate the Gumbel parameters.

Arguments

df::DataFrame: The dataframe containing the data.
datacol::Symbol: The symbol of the column of df containing the block maxima data.
locationcovid::Vector{Symbol} = Vector{Symbol}(): The symbols of the columns of df containing the covariates of the location parameter.
logscalecovid::Vector{Symbol} = Vector{Symbol}(): The symbols of the columns of df containing the covariates of the log-scale parameter.

Extremes.gumbelfit — Method

gumbelfit(y,
    initialvalues,
    locationcov = Vector{Variable}(),
    logscalecov = Vector{Variable}()
    )

Estimate the Gumbel parameters.

Arguments

y::Vector{<:Real}: the vector of block maxima.
initialvalues::Vector{<:Real}: Vector of parameters initial values.
locationcov::Vector{<:DataItem} = Vector{Variable}(): The covariates of the location parameter.
logscalecov::Vector{<:DataItem} = Vector{Variable}(): The covariates of the log-scale parameter.

Extremes.gumbelfit — Method

gumbelfit(y,
    locationcov = Vector{Variable}(),
    logscalecov = Vector{Variable}()
    )

Estimate the Gumbel parameters.

Arguments

y::Vector{<:Real}: The vector of block maxima.
locationcov::Vector{<:DataItem} = Vector{Variable}(): The covariates of the location parameter.
logscalecov::Vector{<:DataItem} = Vector{Variable}(): The covariates of the log-scale parameter.

Extremes.gumbelfitbayes — Function

gumbelfitbayes(..., niter::Int=5000, warmup::Int=2000)

Generate a sample from the Gumbel parameters' posterior distribution.

Arguments

niter::Int = 5000: The total number of MCMC iterations.
warmup::Int = 2000: The number of warmup iterations (burn-in).

Implementation

The function uses the No-U-Turn Sampler (NUTS; Hoffman and Gelman, 2014) implemented in the Mamba.jl package to generate a random sample from the posterior distribution.

Currently, only the improper uniform prior is implemented, i.e.

\[f_{(β₁,β₂,β₃)}(β₁,β₂,β₃) ∝ 1,\]

where

\[μ = X₁ × β₁,\]

\[ϕ = X₂ × β₂,\]

In the stationary case, this improper prior yields to a proper posterior if the sample size is larger than 3 (Northrop and Attalides, 2016).

The covariates are standardized before estimating the parameters to help fit the model. They are transformed back on their original scales before returning the fitted model.

See also gevfitbayes for the other methods and BlockMaxima.

References

Hoffman M. D. and Gelman A. (2014). The No-U-Turn sampler: adaptively setting path lengths in Hamiltonian Monte Carlo. Journal of Machine Learning Research, 15:1593–1623.

Paul J. Northrop P. J. and Attalides N. (2016). Posterior propriety in Bayesian extreme value analyses using reference priors. Statistica Sinica, 26:721-743.

Extremes.gumbelfitbayes — Method

gumbelfitbayes(model::BlockMaxima{Gumbel};
    niter::Int=5000,
    warmup::Int=2000)

Generate a sample from the BlockMaxima model parameters' posterior distribution.

Extremes.gumbelfitbayes — Method

gumbelfitbayes(df::DataFrame,
    datacol::Symbol,
    locationcovid = Vector{Symbol}(),
    logscalecovid = Vector{Symbol}(),
    niter::Int=5000,
    warmup::Int=2000)

Generate a sample from the Gumbel parameters' posterior distribution.

Arguments

df::DataFrame: The dataframe containing the data.
datacol::Symbol: The symbol of the column of df containing the block maxima data.
locationcovid::Vector{Symbol} = Vector{Symbol}(): The symbols of the columns of df containing the covariates of the location parameter.
logscalecovid::Vector{Symbol} = Vector{Symbol}(): The symbols of the columns of df containing the covariates of the log-scale parameter.

Extremes.gumbelfitbayes — Method

gumbelfitbayes(y,
    locationcov = Vector{Variable}(),
    logscalecov = Vector{Variable}(),
    niter::Int=5000,
    warmup::Int=2000
    )

Generate a sample from the Gumbel parameters' posterior distribution.

Arguments

y::Vector{<:Real}: The vector of block maxima.
locationcov::Vector{<:DataItem} = Vector{Variable}(): The covariates of the location parameter.
logscalecov::Vector{<:DataItem} = Vector{Variable}(): The covariates of the log-scale parameter.

Extremes.gumbelfitpwm — Function

gumbelfitpwm(...)

Estimate the Gumbel parameters with the probability weighted moments.

Implementation

Estimation with the probability weighted moments, as described by Landwehr *et al. (1979), is only possible in the stationary case.

See also gumbelfitpwm for the other methods, gevfit, gevfitbayes and BlockMaxima.

Reference

Landwehr, J. M., Matalas, N. C. and Wallis, J. R. (1979). Probability weighted moments compared with some traditional techniques in estimating Gumbel parameters and quantiles. Water Resources Research, 15:1055–1064.

Extremes.gumbelfitpwm — Method

gumbelfitpwm(model::BlockMaxima{Gumbel})

Estimate the Gumbel parameters with the probability weighted moments.

Extremes.gumbelfitpwm — Method

gumbelfitpwm(df::DataFrame, datacol::Symbol)::pwmAbstractExtremeValueModel

Estimate the Gumbel parameters with the probability weighted moments.

Block maxima data are in the column datacol of the dataframe df.

Extremes.gumbelfitpwm — Method

gumbelfitpwm(y::Vector{<:Real})

Estimate the Gumbel parameters with the probability weighted moments.

Extremes.hessian — Method

hessian(model::MaximumLikelihoodAbstractExtremeValueModel)::PDMat{Float64, Matrix{Float64}}

Calculates the Hessian matrix associated with the MaximumLikelihoodAbstractExtremeValueModel model.

Extremes.hisplot_data — Function

histplot_data(fm::fittedModel)

Return the histogram plot data in a Dictionary.

Extremes.histplot — Method

histplot(fm::AbstractFittedExtremeValueModel)

Histogram plot

Extremes.loglike — Method

loglike(model::AbstractExtremeValueModel, θ::Vector{<:Real})

Compute the model loglikelihood AbstractExtremeValueModelluated at θ.

Extremes.loglike — Method

loglike(fd::MaximumLikelihoodAbstractExtremeValueModel)::Real

Compute the model loglikelihood AbstractExtremeValueModelluated at θ̂ if the maximum likelihood method has been used.

Extremes.merge — Method

merge(c₁::Cluster, c₂::Cluster)

Merge cluster c₁ and c₂ into a single cluster.

Extremes.mrlplot — Function

mrlplot(y::Vector{<:Real}, steps::Int = 100)

Mean residual plot