L-moment

In statistics, L-moments are a sequence of statistics used to summarize the shape of a probability distribution.^[1]^[2]^[3]^[4] They are linear combinations of order statistics (L-statistics) analogous to conventional moments, and can be used to calculate quantities analogous to standard deviation, skewness and kurtosis, termed the L-scale, L-skewness and L-kurtosis respectively (the L-mean is identical to the conventional mean). Standardized L-moments are called L-moment ratios and are analogous to standardized moments. Just as for conventional moments, a theoretical distribution has a set of population L-moments. Sample L-moments can be defined for a sample from the population, and can be used as estimators of the population L-moments.

Population L-moments

For a random variable $X$, the $r$th population L-moment is^[1]

$\lambda _{r}={\frac {1}{r}}\sum _{k=0}^{r-1}(-1)^{k}{\binom {r-1}{k}}\operatorname {\mathbb {E} } [X_{r-k:r}]\,,$

where $X_{k:n}$ denotes the $k$th order statistic ($k$th smallest value) in an independent sample of size $n$ from the distribution of $X$ and $\mathbb {E}$ denotes the expected value operator. In particular, the first four population L-moments are

${\begin{aligned}\lambda _{1}&=\operatorname {\mathbb {E} } [X]\\[4pt]\lambda _{2}&={\tfrac {1}{2}}\left(\operatorname {\mathbb {E} } [X_{2:2}]-\operatorname {\mathbb {E} } [X_{1:2}]\right)\\[4pt]\lambda _{3}&={\tfrac {1}{3}}\left(\operatorname {\mathbb {E} } [X_{3:3}]-2\operatorname {\mathbb {E} } [X_{2:3}]+\operatorname {\mathbb {E} } [X_{1:3}]\right)\\[4pt]\lambda _{4}&={\tfrac {1}{4}}\left(\operatorname {\mathbb {E} } [X_{4:4}]-3\operatorname {\mathbb {E} } [X_{3:4}]+3\operatorname {\mathbb {E} } [X_{2:4}]-\operatorname {\mathbb {E} } [X_{1:4}]\right).\end{aligned}}$

Note that the coefficients of the $r$th L-moment are the same as in the $r$th term of the binomial transform, as used in the finite difference of order $r-1$ (the finite analog of the derivative).

The first two of these L-moments have conventional names:

$\lambda _{1}$ is the "mean", "L-mean", or "L-location",
$\lambda _{2}$ is the "L-scale".

The L-scale is equal to half the mean absolute difference.^[5]

Analytic calculation

Expectations are often defined in terms of probability density functions, but in those terms the connection between the order statistics $X_{r:n}$ and their underlying random variable $X$ is rather remote. A closer connection can be found in terms of cumulative distribution functions (CDFs), since the CDF of an order statistic satisfies

$F_{X_{r:n}}(x)=\sum _{j=r}^{n}{\binom {n}{j}}F_{X}(x)^{j}{\bigl (}1-F_{X}(x){\bigr )}^{n-j}.$

In particular, one may define polynomials $b_{r:n}(y)=\sum _{j=r}^{n}{\binom {n}{j}}y^{j}(1-y)^{n-j}$ and express $F_{X_{r:n}}=b_{r:n}\circ F_{X}$.

Given a CDF $F_{X}$, the expectation $\operatorname {\mathbb {E} } [X]$ may be expressed using a Stieltjes integral as $\operatorname {\mathbb {E} } [X]=\int _{\mathbb {R} }x\,dF_{X}(x)$; thus

$\operatorname {\mathbb {E} } [X_{r:n}]=\int _{\mathbb {R} }x\,d(b_{r:n}\circ F_{X})(x)=\int _{\mathbb {R} }x\,b_{r:n}'{\bigl (}F_{X}(x){\bigr )}\,dF_{X}(x),$

where $b_{r:n}'$ is simply the derivative of $b_{r:n}$. This integral can often be made more tractable by introducing the quantile function $Q_{X}$ via the change of variables $y=F_{X}(x)$, $x=Q_{X}(y)$:

$\operatorname {\mathbb {E} } [X_{r:n}]=\int _{\mathbb {R} }x\,b_{r:n}'{\bigl (}F_{X}(x){\bigr )}\,dF_{X}(x)=\int _{0}^{1}Q_{X}(y)\,b_{r:n}'(y)\,dy.$

Since the L-moments are linear combinations of such expectations, the corresponding integrals can be combined into one for each moment, where the integrand is $Q_{X}(y)$ times a polynomial. We have^[1]

$\lambda _{n}=\int _{0}^{1}Q_{X}(y){\widetilde {P}}_{n-1}(y)\,dy,$

where ${\widetilde {P}}_{m}(y)=\sum _{k=0}^{m}(-1)^{m-k}{\binom {m}{k}}{\binom {m+k}{k}}y^{k}$ are the shifted Legendre polynomials, orthogonal on $[0,1]$.

In particular ${\begin{aligned}\lambda _{1}&=\int _{0}^{1}Q_{X}(y)\,dy,\\[2pt]\lambda _{2}&=\int _{0}^{1}Q_{X}(y)\left(2y-1\right)dy,\\[2pt]\lambda _{3}&=\int _{0}^{1}Q_{X}(y)\left(6y^{2}-6y+1\right)dy,\\[2pt]\lambda _{4}&=\int _{0}^{1}Q_{X}(y)\left(20y^{3}-30y^{2}+12y-1\right)dy.\end{aligned}}$
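
The four integrals above can be checked numerically for any distribution whose quantile function is available. The following is a minimal sketch, assuming NumPy is installed; the function name `population_l_moments` and the choice of Gauss-Legendre quadrature are illustrative assumptions, not part of the cited references.

```python
import numpy as np

# Shifted Legendre polynomials of degree 0..3 on [0, 1], as listed above.
SHIFTED_LEGENDRE = [
    lambda y: np.ones_like(y),
    lambda y: 2 * y - 1,
    lambda y: 6 * y**2 - 6 * y + 1,
    lambda y: 20 * y**3 - 30 * y**2 + 12 * y - 1,
]

def population_l_moments(quantile, n_nodes=200):
    """Approximate lambda_1..lambda_4 by integrating Q(y) * P~_{r-1}(y) over [0, 1]."""
    nodes, weights = np.polynomial.legendre.leggauss(n_nodes)  # rule on [-1, 1]
    y = 0.5 * (nodes + 1.0)   # map the nodes into the open interval (0, 1)
    w = 0.5 * weights         # rescale the weights for the shorter interval
    q = quantile(y)
    return [float(np.sum(w * q * p(y))) for p in SHIFTED_LEGENDRE]

# Exponential distribution with rate 1: Q(y) = -log(1 - y).
lam = population_l_moments(lambda y: -np.log1p(-y))
tau3 = lam[2] / lam[1]  # L-skewness
tau4 = lam[3] / lam[1]  # L-kurtosis
print(lam, tau3, tau4)
```

The quadrature nodes lie strictly inside $(0,1)$, so the quantile function is never evaluated at an endpoint where it may be infinite. Accuracy degrades for very heavy tails, where more nodes or a different rule may be needed.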

Sillitto's Theorem

The above integral formula for $\lambda _{n}$ has the form of a generalized Fourier coefficient, and they appeared as such in the literature years before being named moments. In the notation of this article, Sillitto^[6] proved

Theorem—Let $X$ be a real-valued continuous random variable with finite variance, quantile function $Q_{X}(y)$ and L-moments $\{\lambda _{r}\}_{r=1}^{\infty }$ . Then the representation $Q_{X}(y)=\sum _{r=1}^{\infty }(2r-1)\lambda _{r}{\widetilde {P}}_{r-1}(y)\qquad {\text{for }}0<y<1$ is convergent in $L^{2}$ norm.

However, Hosking^[1] cautions that partial sums of this series tend to give poor approximations for the tails of the distribution, and need not be monotonic. Similar problems arise with the Cornish–Fisher expansion of $Q_{X}$ in terms of the cumulants of $X$.
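
The behaviour of the partial sums can be explored numerically. The sketch below, assuming NumPy, evaluates a truncated version of the series in the theorem; the helper names and the use of quadrature to obtain the L-moments are assumptions made for illustration.

```python
import numpy as np
from numpy.polynomial import legendre

def shifted_legendre(m, y):
    """Shifted Legendre polynomial of degree m at y in [0, 1], i.e. P_m(2y - 1)."""
    coeffs = np.zeros(m + 1)
    coeffs[m] = 1.0
    return legendre.legval(2.0 * np.asarray(y) - 1.0, coeffs)

def l_moments_from_quantile(quantile, r_max, n_nodes=400):
    """lambda_1..lambda_{r_max} by Gauss-Legendre quadrature of Q(y) * P~_{r-1}(y)."""
    nodes, weights = legendre.leggauss(n_nodes)
    y, w = 0.5 * (nodes + 1.0), 0.5 * weights
    q = quantile(y)
    return [float(np.sum(w * q * shifted_legendre(r - 1, y)))
            for r in range(1, r_max + 1)]

def sillitto_partial_sum(lams, y):
    """Truncated series: sum over r of (2r - 1) * lambda_r * P~_{r-1}(y)."""
    return sum((2 * r - 1) * lam * shifted_legendre(r - 1, y)
               for r, lam in enumerate(lams, start=1))

# Uniform(2, 5) has a linear quantile function, so two terms already suffice.
y = np.array([0.1, 0.5, 0.9])
uniform_lams = l_moments_from_quantile(lambda u: 2.0 + 3.0 * u, r_max=4)
uniform_approx = sillitto_partial_sum(uniform_lams, y)

# Exponential(1): compare the error in the centre with the error in the tail.
exp_q = lambda u: -np.log1p(-u)
exp_lams = l_moments_from_quantile(exp_q, r_max=8)
centre_error = abs(sillitto_partial_sum(exp_lams, 0.5) - exp_q(0.5))
tail_error = abs(sillitto_partial_sum(exp_lams, 0.999) - exp_q(0.999))
print(uniform_approx, centre_error, tail_error)
```

For the exponential case the eight-term partial sum is much closer to the true quantile at the median than at the 0.999 quantile, in line with the caution about tails.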

Sample L-moments

The sample L-moments can be computed as the population L-moments of the sample: sum over all $r$-element subsets $\left\{x_{1}<\cdots <x_{j}<\cdots <x_{r}\right\}$ of the sample, then average by dividing by the binomial coefficient $\tbinom {n}{r}$: $\ell _{r}={\frac {1}{r\cdot {\tbinom {n}{r}}}}\,\sum _{x_{1}<\cdots <x_{j}<\cdots <x_{r}}\;\sum _{j=1}^{r}(-1)^{r-j}{\binom {r-1}{j-1}}\,x_{j}\,.$

Grouping these by order statistic counts the number of ways an element of an $n$-element sample can be the $j$th element of an $r$-element subset, and yields formulas of the form below. Direct estimators for the first four L-moments in a finite sample of $n$ observations are:^[7]

${\begin{aligned}\ell _{1}&={\frac {1}{\tbinom {n}{1}}}\sum _{i=1}^{n}x_{(i)}\\[1ex]\ell _{2}&={\frac {1}{2{\tbinom {n}{2}}}}\sum _{i=1}^{n}\left[{\tbinom {i-1}{1}}-{\tbinom {n-i}{1}}\right]x_{(i)}\\[1ex]\ell _{3}&={\frac {1}{3{\tbinom {n}{3}}}}\sum _{i=1}^{n}\left[{\tbinom {i-1}{2}}-2{\tbinom {i-1}{1}}{\tbinom {n-i}{1}}+{\tbinom {n-i}{2}}\right]x_{(i)}\\[1ex]\ell _{4}&={\frac {1}{4{\tbinom {n}{4}}}}\sum _{i=1}^{n}\left[{\tbinom {i-1}{3}}-3{\tbinom {i-1}{2}}{\tbinom {n-i}{1}}+3{\tbinom {i-1}{1}}{\tbinom {n-i}{2}}-{\tbinom {n-i}{3}}\right]x_{(i)}\end{aligned}}$

where $x_{(i)}$ is the $i$th order statistic and ${\tbinom {\boldsymbol {\cdot }}{\boldsymbol {\cdot }}}$ is a binomial coefficient. Sample L-moments can also be defined indirectly in terms of probability weighted moments,^[1]^[8]^[9] which leads to a more efficient algorithm for their computation.^[7]^[10]
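
The direct estimators translate almost line by line into code. The following is a minimal sketch using only the Python standard library (Python 3.8 or later for `math.comb`); the function name `sample_l_moments` is an illustrative choice, and the sketch assumes at least four observations.

```python
from math import comb

def sample_l_moments(data):
    """Direct estimators l_1..l_4 from the order statistics of a sample."""
    x = sorted(data)
    n = len(x)
    if n < 4:
        raise ValueError("at least four observations are needed for l_4")
    s1 = s2 = s3 = s4 = 0.0
    for i, xi in enumerate(x, start=1):
        below, above = i - 1, n - i  # observations below and above x_(i)
        s1 += xi
        s2 += (comb(below, 1) - comb(above, 1)) * xi
        s3 += (comb(below, 2) - 2 * comb(below, 1) * comb(above, 1)
               + comb(above, 2)) * xi
        s4 += (comb(below, 3) - 3 * comb(below, 2) * comb(above, 1)
               + 3 * comb(below, 1) * comb(above, 2) - comb(above, 3)) * xi
    return (s1 / comb(n, 1),
            s2 / (2 * comb(n, 2)),
            s3 / (3 * comb(n, 3)),
            s4 / (4 * comb(n, 4)))

l1, l2, l3, l4 = sample_l_moments([5, 3, 1, 4, 2])
print(l1, l2, l3, l4)
```

`math.comb(k, r)` returns 0 when `r > k`, which matches the convention needed for the smallest and largest order statistics.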

L-moment ratios

A set of L-moment ratios, or scaled L-moments, is defined by $\tau _{r}=\lambda _{r}/\lambda _{2},\qquad r=3,4,\dots ~.$ The most useful of these are $\tau _{3},$ called the L-skewness, and $\tau _{4},$ the L-kurtosis.

L-moment ratios lie within the interval $(-1, 1)$. Tighter bounds can be found for some specific L-moment ratios; in particular, the L-kurtosis $\tau _{4}$ lies in $[-{\tfrac {1}{4}}, 1)$, and^[1] ${\tfrac {1}{4}}\left(5\tau _{3}^{2}-1\right)\leq \tau _{4}<1\,.$

A quantity analogous to the coefficient of variation, but based on L-moments, can also be defined: $\tau =\lambda _{2}/\lambda _{1}\,,$ which is called the "coefficient of L-variation", or "L-CV". For a non-negative random variable, this lies in the interval $(0, 1)$ ^[1] and is identical to the Gini coefficient.^[11]
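
The link between the L-CV, the mean absolute difference, and the Gini coefficient can be illustrated on a small sample. This is a sketch under one stated assumption: the Gini coefficient is computed from the mean absolute difference over distinct pairs (the version normalized by $n^2$ differs by a factor $(n-1)/n$). The function names and the data are hypothetical.

```python
from itertools import combinations

def sample_l_cv(data):
    """Sample L-CV, l_2 / l_1, with l_2 taken from the order statistics."""
    x = sorted(data)
    n = len(x)
    l1 = sum(x) / n
    # (i - 1) - (n - i) = 2i - n - 1 is the weight of x_(i) in l_2.
    l2 = sum((2 * i - n - 1) * xi for i, xi in enumerate(x, start=1)) / (n * (n - 1))
    return l2 / l1

def sample_gini(data):
    """Gini coefficient as mean absolute difference over distinct pairs / (2 * mean)."""
    x = list(data)
    pairs = list(combinations(x, 2))
    mean_abs_diff = sum(abs(a - b) for a, b in pairs) / len(pairs)
    return mean_abs_diff / (2 * (sum(x) / len(x)))

incomes = [12.0, 3.0, 7.5, 40.0, 9.0, 1.5]  # made-up non-negative data
print(sample_l_cv(incomes), sample_gini(incomes))
```

Both functions return the same value up to rounding, because summing $x_{(j)}-x_{(i)}$ over all pairs $i<j$ gives each order statistic the weight $(i-1)-(n-i)$.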

Related quantities

L-moments are statistical quantities that are derived from probability weighted moments^[12] (PWM), which were defined earlier (1979).^[8] PWMs are used to efficiently estimate the parameters of distributions expressible in inverse form, such as the Gumbel,^[9] the Tukey lambda, and the Wakeby distributions.
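
As a sketch of the PWM route, the code below computes sample PWMs and converts them to the first four sample L-moments. The text above does not spell out the formulas, so two assumptions are made here: the unbiased estimator $b_{r}=n^{-1}\sum _{i}{\tbinom {i-1}{r}}x_{(i)}/{\tbinom {n-1}{r}}$, and conversion coefficients equal to those of the shifted Legendre polynomials. The function names are illustrative.

```python
from math import comb

def sample_pwms(data, r_max=3):
    """Unbiased sample PWMs b_0..b_{r_max}; requires len(data) > r_max."""
    x = sorted(data)
    n = len(x)
    if n <= r_max:
        raise ValueError("need more observations than the highest PWM order")
    return [
        sum(comb(i - 1, r) * xi for i, xi in enumerate(x, start=1))
        / (n * comb(n - 1, r))
        for r in range(r_max + 1)
    ]

def l_moments_from_pwms(b):
    """First four L-moments as linear combinations of b_0..b_3."""
    b0, b1, b2, b3 = b
    return (
        b0,
        2 * b1 - b0,
        6 * b2 - 6 * b1 + b0,
        20 * b3 - 30 * b2 + 12 * b1 - b0,
    )

pwms = sample_pwms([5, 3, 1, 4, 2])
lmoms = l_moments_from_pwms(pwms)
print(pwms, lmoms)
```

Each PWM is a single weighted sum over the sorted sample, so higher L-moments reuse the same few sums rather than requiring a separate set of binomial weights per moment.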

Usage

There are two common ways that L-moments are used, in both cases analogously to the conventional moments:

As summary statistics for data.
To derive estimators for the parameters of probability distributions, applying the method of moments to the L-moments rather than conventional moments.

Both uses are also possible with standard moments, and parameter estimation is more commonly done using maximum likelihood methods; however, using L-moments provides a number of advantages. Specifically, L-moments are more robust than conventional moments, and the existence of higher L-moments only requires that the random variable have a finite mean. One disadvantage of L-moment ratios for estimation is their typically lower sensitivity. For instance, the Laplace distribution has a kurtosis of 6 and weak exponential tails, but a larger fourth L-moment ratio than, e.g., Student's t distribution with 3 degrees of freedom, which has infinite kurtosis and much heavier tails.
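
As an illustration of the second use, the method of L-moments can be sketched for the Gumbel distribution by equating the first two sample L-moments to $\lambda _{1}=\mu +\gamma _{e}\beta$ and $\lambda _{2}=\beta \ln 2$ and solving for the parameters. The function names are hypothetical, and the synthetic data are a deterministic grid of Gumbel quantiles chosen so that the result is reproducible.

```python
import math

EULER_GAMMA = 0.5772156649015329  # Euler-Mascheroni constant

def first_two_sample_l_moments(data):
    """Sample l_1 and l_2 from the order statistics."""
    x = sorted(data)
    n = len(x)
    l1 = sum(x) / n
    l2 = sum((2 * i - n - 1) * xi for i, xi in enumerate(x, start=1)) / (n * (n - 1))
    return l1, l2

def fit_gumbel_by_l_moments(data):
    """Solve lambda_1 = mu + gamma_e * beta and lambda_2 = beta * ln 2 for (mu, beta)."""
    l1, l2 = first_two_sample_l_moments(data)
    beta = l2 / math.log(2.0)
    mu = l1 - EULER_GAMMA * beta
    return mu, beta

# Synthetic data: Gumbel(mu=10, beta=2) quantiles on an evenly spaced grid.
n = 20000
data = [10.0 - 2.0 * math.log(-math.log((i - 0.5) / n)) for i in range(1, n + 1)]
mu_hat, beta_hat = fit_gumbel_by_l_moments(data)
print(mu_hat, beta_hat)
```

Because both equations are linear in the parameters, no iterative optimisation is needed, in contrast to a typical maximum likelihood fit.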

As an example, consider a dataset with a few data points and one outlying value. The ordinary standard deviation of this data set is highly influenced by that one point, whereas the L-scale is less sensitive to it. Consequently, L-moments are more meaningful than conventional moments when dealing with outliers in data. However, other methods are better suited to achieving even higher robustness than simply replacing moments by L-moments. The use of L-moments as summary statistics in extreme value theory (EVT) shows their limited robustness: L-statistics are not resistant statistics, as a single extreme value can throw them off, but because they are only linear (not higher-order statistics), they are less affected by extreme values than conventional moments.
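
The effect of a single outlier can be seen in a small experiment. The sketch below uses made-up data and compares how much the sample standard deviation and the sample L-scale grow when one extreme value is appended; the function name `l_scale` is an illustrative choice.

```python
import statistics

def l_scale(data):
    """Sample L-scale l_2 computed from the order statistics."""
    x = sorted(data)
    n = len(x)
    return sum((2 * i - n - 1) * xi for i, xi in enumerate(x, start=1)) / (n * (n - 1))

clean = [1.0, 2.0, 3.0, 4.0, 5.0]
contaminated = clean + [100.0]  # one outlying value added

# Inflation factor of each dispersion measure caused by the outlier.
sd_ratio = statistics.stdev(contaminated) / statistics.stdev(clean)
l2_ratio = l_scale(contaminated) / l_scale(clean)
print(sd_ratio, l2_ratio)
```

Both measures increase, which reflects that the L-scale is not a resistant statistic, but the standard deviation is inflated by a larger factor because the outlier enters it through a squared deviation.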

Another advantage L-moments have over conventional moments is that their existence only requires the random variable to have finite mean, so the L-moments exist even if the higher conventional moments do not exist (for example, for Student's t distribution with low degrees of freedom). A finite variance is required in addition in order for the standard errors of estimates of the L-moments to be finite.^[1]

Some appearances of L-moments in the statistical literature include the book by David & Nagaraja (2003, Section 9.9)^[13] and a number of papers.^[11]^[14]^[15]^[16]^[17]^[18] A number of favourable comparisons of L-moments with ordinary moments have been reported.^[19]^[20]

Values for some common distributions

The table below gives expressions for the first two L-moments and numerical values of the first two L-moment ratios of some common continuous probability distributions with constant L-moment ratios.^[1]^[5] More complex expressions have been derived for some further distributions for which the L-moment ratios vary with one or more of the distributional parameters, including the log-normal, Gamma, generalized Pareto, generalized extreme value, and generalized logistic distributions.^[1]

Distribution	Parameters	Mean, $\lambda _{1}$	L-scale, $\lambda _{2}$	L-skewness, $\tau _{3}$	L-kurtosis, $\tau _{4}$
Uniform	$a$, $b$	${\tfrac {1}{2}}(a+b)$	${\tfrac {1}{6}}(b-a)$	$0$	$0$
Logistic	$\mu$, $s$	$\mu$	$s$	$0$	${\tfrac {1}{6}}=0.1667$
Normal	$\mu$, $\sigma ^{2}$	$\mu$	$\sigma /{\sqrt {\pi }}$	$0$	$30\,\theta _{m}/\pi -9=0.1226$
Laplace	$\mu$, $b$	$\mu$	${\tfrac {3}{4}}b$	$0$	${\tfrac {17}{72}}=0.2361$
Student's t, 2 d.f.	$\nu =2$	$0$	$\pi /(2{\sqrt {2}})=1.111$	$0$	${\tfrac {3}{8}}=0.375$
Student's t, 4 d.f.	$\nu =4$	$0$	${\tfrac {15}{64}}\pi =0.7363$	$0$	${\tfrac {111}{512}}=0.2168$
Exponential	$\lambda$	$1/\lambda$	$1/(2\lambda )$	${\tfrac {1}{3}}=0.3333$	${\tfrac {1}{6}}=0.1667$
Gumbel	$\mu$, $\beta$	$\mu +\gamma _{e}\beta$	$\beta \ln 2$	$2\log _{2}3-3=0.1699$	$16-10\log _{2}3=0.1504$

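The notation for the parameters of each distribution is the same as that used in the linked article. In the expression for the mean of the Gumbel distribution, $\gamma _{e}$ is the Euler–Mascheroni constant $0.5772\,1566\,4901\ldots$. In the expression for the L-kurtosis of the normal distribution, $\theta _{m}=\arctan {\sqrt {2}}$ is the magic angle.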

Extensions

Trimmed L-moments are generalizations of L-moments that give zero weight to extreme observations. They are therefore more robust to the presence of outliers, and unlike L-moments they may be well-defined for distributions for which the mean does not exist, such as the Cauchy distribution.^[21]
