Nonparametric regression

Definition

Nonparametric regression assumes the following relationship, given the random variables $X$ and $Y$ :

\mathbb {E} [Y\mid X=x]=m(x),

where $m(x)$ is some deterministic function. Linear regression is a restricted case of nonparametric regression where $m(x)$ is assumed to be a linear function of the data. Sometimes a slightly stronger assumption of additive noise is used:

Y=m(X)+U,

where the random variable $U$ is the `noise term', with mean 0. Without the assumption that $m$ belongs to a specific parametric family of functions it is impossible to get an unbiased estimate for $m$ , however most estimators are consistent under suitable conditions.

Remove ads

Examples

Summarize

Perspective

Gaussian process regression or Kriging

In Gaussian process regression, also known as Kriging, a Gaussian prior is assumed for the regression curve. The errors are assumed to have a multivariate normal distribution and the regression curve is estimated by its posterior mode. The Gaussian prior may depend on unknown hyperparameters, which are usually estimated via empirical Bayes. The hyperparameters typically specify a prior covariance kernel. In case the kernel should also be inferred nonparametrically from the data, the critical filter can be used.

Smoothing splines have an interpretation as the posterior mode of a Gaussian process regression.

Kernel regression

Thumb — Example of a curve (red line) fit to a small data set (black points) with nonparametric regression using a Gaussian kernel smoother. The pink shaded area illustrates the kernel function applied to obtain an estimate of y for a given value of x. The kernel function defines the weight given to each data point in producing the estimate for a target point.

This section does not cite any sources. (August 2020)

Kernel regression estimates the continuous dependent variable from a limited set of data points by convolving the data points' locations with a kernel function—approximately speaking, the kernel function specifies how to "blur" the influence of the data points so that their values can be used to predict the value for nearby locations.

Regression trees

Decision tree learning algorithms can be applied to learn to predict a dependent variable from data.^[2] Although the original Classification And Regression Tree (CART) formulation applied only to predicting univariate data, the framework can be used to predict multivariate data, including time series.^[3]

Nonparametric regression

Definition

Common nonparametric regression algorithms

Examples

Gaussian process regression or Kriging

Kernel regression

Regression trees

See also

References

Further reading

External links

Wikiwand - on