fbpx
Wikipedia

Kernel (statistics)

The term kernel is used in statistical analysis to refer to a window function. The term "kernel" has several distinct meanings in different branches of statistics.

Bayesian statistics edit

In statistics, especially in Bayesian statistics, the kernel of a probability density function (pdf) or probability mass function (pmf) is the form of the pdf or pmf in which any factors that are not functions of any of the variables in the domain are omitted.[1] Note that such factors may well be functions of the parameters of the pdf or pmf. These factors form part of the normalization factor of the probability distribution, and are unnecessary in many situations. For example, in pseudo-random number sampling, most sampling algorithms ignore the normalization factor. In addition, in Bayesian analysis of conjugate prior distributions, the normalization factors are generally ignored during the calculations, and only the kernel considered. At the end, the form of the kernel is examined, and if it matches a known distribution, the normalization factor can be reinstated. Otherwise, it may be unnecessary (for example, if the distribution only needs to be sampled from).

For many distributions, the kernel can be written in closed form, but not the normalization constant.

An example is the normal distribution. Its probability density function is

 

and the associated kernel is

 

Note that the factor in front of the exponential has been omitted, even though it contains the parameter   , because it is not a function of the domain variable   .

Pattern analysis edit

The kernel of a reproducing kernel Hilbert space is used in the suite of techniques known as kernel methods to perform tasks such as statistical classification, regression analysis, and cluster analysis on data in an implicit space. This usage is particularly common in machine learning.

Nonparametric statistics edit

In nonparametric statistics, a kernel is a weighting function used in non-parametric estimation techniques. Kernels are used in kernel density estimation to estimate random variables' density functions, or in kernel regression to estimate the conditional expectation of a random variable. Kernels are also used in time-series, in the use of the periodogram to estimate the spectral density where they are known as window functions. An additional use is in the estimation of a time-varying intensity for a point process where window functions (kernels) are convolved with time-series data.

Commonly, kernel widths must also be specified when running a non-parametric estimation.

Definition edit

A kernel is a non-negative real-valued integrable function K. For most applications, it is desirable to define the function to satisfy two additional requirements:

 
  • Symmetry:
 

The first requirement ensures that the method of kernel density estimation results in a probability density function. The second requirement ensures that the average of the corresponding distribution is equal to that of the sample used.

If K is a kernel, then so is the function K* defined by K*(u) = λKu), where λ > 0. This can be used to select a scale that is appropriate for the data.

Kernel functions in common use edit

 
All of the kernels below in a common coordinate system.

Several types of kernel functions are commonly used: uniform, triangle, Epanechnikov,[2] quartic (biweight), tricube,[3] triweight, Gaussian, quadratic[4] and cosine.

In the table below, if   is given with a bounded support, then   for values of u lying outside the support.

Kernel Functions, K(u)     Efficiency[5] relative to the Epanechnikov kernel
Uniform ("rectangular window")  

Support:  

 

"Boxcar function"

        92.9%
Triangular  

Support:  

          98.6%
Epanechnikov

(parabolic)

 

Support:  

          100%
Quartic
(biweight)
 

Support:  

          99.4%
Triweight  

Support:  

          98.7%
Tricube  

Support:  

          99.8%
Gaussian             95.1%
Cosine  

Support:  

          99.9%
Logistic             88.7%
Sigmoid function             84.3%
Silverman kernel[6]             not applicable

See also edit

References edit

  1. ^ Schuster, Eugene (August 1969). "Estimation of a probability density function and its derivatives". The Annals of Mathematical Statistics. 40 (4): 1187-1195. doi:10.1214/aoms/1177697495.
  2. ^ Named for Epanechnikov, V. A. (1969). "Non-Parametric Estimation of a Multivariate Probability Density". Theory Probab. Appl. 14 (1): 153–158. doi:10.1137/1114019.
  3. ^ Altman, N. S. (1992). "An introduction to kernel and nearest neighbor nonparametric regression". The American Statistician. 46 (3): 175–185. doi:10.1080/00031305.1992.10475879. hdl:1813/31637.
  4. ^ Cleveland, W. S.; Devlin, S. J. (1988). "Locally weighted regression: An approach to regression analysis by local fitting". Journal of the American Statistical Association. 83 (403): 596–610. doi:10.1080/01621459.1988.10478639.
  5. ^ Efficiency is defined as  .
  6. ^ Silverman, B. W. (1986). Density Estimation for Statistics and Data Analysis. Chapman and Hall, London.
  • Li, Qi; Racine, Jeffrey S. (2007). Nonparametric Econometrics: Theory and Practice. Princeton University Press. ISBN 978-0-691-12161-1.
  • Zucchini, Walter. "APPLIED SMOOTHING TECHNIQUES Part 1: Kernel Density Estimation" (PDF). Retrieved 6 September 2018.
  • Comaniciu, D; Meer, P (2002). "Mean shift: A robust approach toward feature space analysis". IEEE Transactions on Pattern Analysis and Machine Intelligence. 24 (5): 603–619. CiteSeerX 10.1.1.76.8968. doi:10.1109/34.1000236.

kernel, statistics, other, uses, kernel, disambiguation, this, article, needs, additional, citations, verification, please, help, improve, this, article, adding, citations, reliable, sources, unsourced, material, challenged, removed, find, sources, kernel, sta. For other uses see Kernel disambiguation This article needs additional citations for verification Please help improve this article by adding citations to reliable sources Unsourced material may be challenged and removed Find sources Kernel statistics news newspapers books scholar JSTOR May 2012 Learn how and when to remove this template message The term kernel is used in statistical analysis to refer to a window function The term kernel has several distinct meanings in different branches of statistics Contents 1 Bayesian statistics 2 Pattern analysis 3 Nonparametric statistics 3 1 Definition 3 2 Kernel functions in common use 4 See also 5 ReferencesBayesian statistics editIn statistics especially in Bayesian statistics the kernel of a probability density function pdf or probability mass function pmf is the form of the pdf or pmf in which any factors that are not functions of any of the variables in the domain are omitted 1 Note that such factors may well be functions of the parameters of the pdf or pmf These factors form part of the normalization factor of the probability distribution and are unnecessary in many situations For example in pseudo random number sampling most sampling algorithms ignore the normalization factor In addition in Bayesian analysis of conjugate prior distributions the normalization factors are generally ignored during the calculations and only the kernel considered At the end the form of the kernel is examined and if it matches a known distribution the normalization factor can be reinstated Otherwise it may be unnecessary for example if the distribution only needs to be sampled from For many distributions the kernel can be written in closed form but not the normalization constant An example is the normal distribution Its probability density function is p x m s2 12ps2e x m 22s2 displaystyle p x mu sigma 2 frac 1 sqrt 2 pi sigma 2 e frac x mu 2 2 sigma 2 nbsp and the associated kernel is p x m s2 e x m 22s2 displaystyle p x mu sigma 2 propto e frac x mu 2 2 sigma 2 nbsp Note that the factor in front of the exponential has been omitted even though it contains the parameter s2 displaystyle sigma 2 nbsp because it is not a function of the domain variable x displaystyle x nbsp Pattern analysis editThe kernel of a reproducing kernel Hilbert space is used in the suite of techniques known as kernel methods to perform tasks such as statistical classification regression analysis and cluster analysis on data in an implicit space This usage is particularly common in machine learning Nonparametric statistics editFurther information Kernel smoothing In nonparametric statistics a kernel is a weighting function used in non parametric estimation techniques Kernels are used in kernel density estimation to estimate random variables density functions or in kernel regression to estimate the conditional expectation of a random variable Kernels are also used in time series in the use of the periodogram to estimate the spectral density where they are known as window functions An additional use is in the estimation of a time varying intensity for a point process where window functions kernels are convolved with time series data Commonly kernel widths must also be specified when running a non parametric estimation Definition edit Further information Integral kernel A kernel is a non negative real valued integrable function K For most applications it is desirable to define the function to satisfy two additional requirements Normalization K u du 1 displaystyle int infty infty K u du 1 nbsp Symmetry K u K u for all values of u displaystyle K u K u mbox for all values of u nbsp The first requirement ensures that the method of kernel density estimation results in a probability density function The second requirement ensures that the average of the corresponding distribution is equal to that of the sample used If K is a kernel then so is the function K defined by K u lK lu where l gt 0 This can be used to select a scale that is appropriate for the data Kernel functions in common use edit nbsp All of the kernels below in a common coordinate system Several types of kernel functions are commonly used uniform triangle Epanechnikov 2 quartic biweight tricube 3 triweight Gaussian quadratic 4 and cosine In the table below if K displaystyle K nbsp is given with a bounded support then K u 0 displaystyle K u 0 nbsp for values of u lying outside the support Kernel Functions K u u2K u du displaystyle textstyle int u 2 K u du nbsp K u 2du displaystyle textstyle int K u 2 du nbsp Efficiency 5 relative to the Epanechnikov kernelUniform rectangular window K u 12 displaystyle K u frac 1 2 nbsp Support u 1 displaystyle u leq 1 nbsp nbsp Boxcar function 13 displaystyle frac 1 3 nbsp 12 displaystyle frac 1 2 nbsp 92 9 Triangular K u 1 u displaystyle K u 1 u nbsp Support u 1 displaystyle u leq 1 nbsp nbsp 16 displaystyle frac 1 6 nbsp 23 displaystyle frac 2 3 nbsp 98 6 Epanechnikov parabolic K u 34 1 u2 displaystyle K u frac 3 4 1 u 2 nbsp Support u 1 displaystyle u leq 1 nbsp nbsp 15 displaystyle frac 1 5 nbsp 35 displaystyle frac 3 5 nbsp 100 Quartic biweight K u 1516 1 u2 2 displaystyle K u frac 15 16 1 u 2 2 nbsp Support u 1 displaystyle u leq 1 nbsp nbsp 17 displaystyle frac 1 7 nbsp 57 displaystyle frac 5 7 nbsp 99 4 Triweight K u 3532 1 u2 3 displaystyle K u frac 35 32 1 u 2 3 nbsp Support u 1 displaystyle u leq 1 nbsp nbsp 19 displaystyle frac 1 9 nbsp 350429 displaystyle frac 350 429 nbsp 98 7 Tricube K u 7081 1 u 3 3 displaystyle K u frac 70 81 1 left u right 3 3 nbsp Support u 1 displaystyle u leq 1 nbsp nbsp 35243 displaystyle frac 35 243 nbsp 175247 displaystyle frac 175 247 nbsp 99 8 Gaussian K u 12pe 12u2 displaystyle K u frac 1 sqrt 2 pi e frac 1 2 u 2 nbsp nbsp 1 displaystyle 1 nbsp 12p displaystyle frac 1 2 sqrt pi nbsp 95 1 Cosine K u p4cos p2u displaystyle K u frac pi 4 cos left frac pi 2 u right nbsp Support u 1 displaystyle u leq 1 nbsp nbsp 1 8p2 displaystyle 1 frac 8 pi 2 nbsp p216 displaystyle frac pi 2 16 nbsp 99 9 Logistic K u 1eu 2 e u displaystyle K u frac 1 e u 2 e u nbsp nbsp p23 displaystyle frac pi 2 3 nbsp 16 displaystyle frac 1 6 nbsp 88 7 Sigmoid function K u 2p1eu e u displaystyle K u frac 2 pi frac 1 e u e u nbsp nbsp p24 displaystyle frac pi 2 4 nbsp 2p2 displaystyle frac 2 pi 2 nbsp 84 3 Silverman kernel 6 K u 12e u 2 sin u 2 p4 displaystyle K u frac 1 2 e frac u sqrt 2 cdot sin left frac u sqrt 2 frac pi 4 right nbsp nbsp 0 displaystyle 0 nbsp 3216 displaystyle frac 3 sqrt 2 16 nbsp not applicableSee also editKernel density estimation Kernel smoother Stochastic kernel Positive definite kernel Density estimation Multivariate kernel density estimation Kernel methodThis article includes a list of general references but it lacks sufficient corresponding inline citations Please help to improve this article by introducing more precise citations May 2012 Learn how and when to remove this template message References edit Schuster Eugene August 1969 Estimation of a probability density function and its derivatives The Annals of Mathematical Statistics 40 4 1187 1195 doi 10 1214 aoms 1177697495 Named for Epanechnikov V A 1969 Non Parametric Estimation of a Multivariate Probability Density Theory Probab Appl 14 1 153 158 doi 10 1137 1114019 Altman N S 1992 An introduction to kernel and nearest neighbor nonparametric regression The American Statistician 46 3 175 185 doi 10 1080 00031305 1992 10475879 hdl 1813 31637 Cleveland W S Devlin S J 1988 Locally weighted regression An approach to regression analysis by local fitting Journal of the American Statistical Association 83 403 596 610 doi 10 1080 01621459 1988 10478639 Efficiency is defined as u2K u du K u 2du displaystyle sqrt int u 2 K u du int K u 2 du nbsp Silverman B W 1986 Density Estimation for Statistics and Data Analysis Chapman and Hall London Li Qi Racine Jeffrey S 2007 Nonparametric Econometrics Theory and Practice Princeton University Press ISBN 978 0 691 12161 1 Zucchini Walter APPLIED SMOOTHING TECHNIQUES Part 1 Kernel Density Estimation PDF Retrieved 6 September 2018 Comaniciu D Meer P 2002 Mean shift A robust approach toward feature space analysis IEEE Transactions on Pattern Analysis and Machine Intelligence 24 5 603 619 CiteSeerX 10 1 1 76 8968 doi 10 1109 34 1000236 Retrieved from https en wikipedia org w index php title Kernel statistics amp oldid 1184079234, wikipedia, wiki, book, books, library,

article

, read, download, free, free download, mp3, video, mp4, 3gp, jpg, jpeg, gif, png, picture, music, song, movie, book, game, games.