fbpx
Wikipedia

Bhattacharyya distance

In statistics, the Bhattacharyya distance measures the similarity of two probability distributions. It is closely related to the Bhattacharyya coefficient which is a measure of the amount of overlap between two statistical samples or populations.

It is not a metric, despite named a "distance", since it does not obey the triangle inequality.

Definition

For probability distributions   and   on the same domain  , the Bhattacharyya distance is defined as

 

where

 

is the Bhattacharyya coefficient for discrete probability distributions.

For continuous probability distributions, with   and   where   and   are the probability density functions, the Bhattacharyya coefficient is defined as

 .

More generally, given two probability measures   on a measurable space  , let   be a (sigma finite) measure such that   and   are absolutely continuous with respect to   i.e. such that  , and   for probability density functions   with respect to   defined  -almost everywhere. Such a measure, even such a probability measure, always exists, e.g.  . Then define the Bhattacharrya measure on   by

 

It does does not depend on the measure  , for if we choose a measure   such that   and an other measure choice   are absolutely continuous i.e.   and  , then

 ,

and similarly for  . We then have

 .

We finally define the Bhattacharyya coefficient

 .

By the above, the quantity   does not depend on  , and by the Cauchy inequality  . In particular if   is absolutely continuous wrt to   with Radon Nikodym derivative  , then

 

Properties

  and  .

  does not obey the triangle inequality, though the Hellinger distance   does.

Let  ,  , where   is the normal distribution with mean   and variance  , then

 

And in general, given two multivariate normal distributions  ,

 

where   Note that the first term is a squared Mahalanobis distance.

Applications

The Bhattacharyya coefficient quantifies the "closeness" of two random statistical samples.

Given two sequences from distributions  , bin them into   buckets, and let the frequency of samples from   in bucket   be  , and similarly for  , then the sample Bhattacharyya coefficient is

 

which is an estimator of  . The quality of estimation depends on the choice of buckets; too few buckets would overestimate  , while too many would underestimate..

A common task in classification is estimating the separability of classes. Up to a multiplicative factor, the squared Mahalanobis distance is a special case of the Bhattacharyya distance when the two classes are normally distributed with the same variances. When two classes have similar means but significantly different variances, the Mahalanobis distance would be close to zero, while the Bhattacharyya distance would not be.

The Bhattacharyya coefficient is used in the construction of polar codes.[1]

The Bhattacharyya distance is used in feature extraction and selection,[2] image processing,[3] speaker recognition,[4] and phone clustering.[5]

A "Bhattacharyya space" has been proposed as a feature selection technique that can be applied to texture segmentation.[6]

History

Both the Bhattacharyya distance and the Bhattacharyya coefficient are named after Anil Kumar Bhattacharyya, a statistician who worked in the 1930s at the Indian Statistical Institute.[7] He developed the method to measure the distance between two non-normal distributions and illustrated this with the classical multinomial populations[8] as well as probability distributions that are absolutely continuous with respect to the Lebesgue measure.[9] The latter work appeared partly in 1943 in the Bulletin of the Calcutta Mathematical Society,[9] while the former part, despite being submitted for publication in 1941, appeared almost five years later in Sankhya.[8][7]

See also

References

  1. ^ Arıkan, Erdal (July 2009). "Channel polarization: A method for constructing capacity-achieving codes for symmetric binary-input memoryless channels". IEEE Transactions on Information Theory. 55 (7): 3051–3073. arXiv:0807.3917. doi:10.1109/TIT.2009.2021379. S2CID 889822.
  2. ^ Euisun Choi, Chulhee Lee, "Feature extraction based on the Bhattacharyya distance", Pattern Recognition, Volume 36, Issue 8, August 2003, Pages 1703–1709
  3. ^ François Goudail, Philippe Réfrégier, Guillaume Delyon, "Bhattacharyya distance as a contrast parameter for statistical processing of noisy optical images", JOSA A, Vol. 21, Issue 7, pp. 1231−1240 (2004)
  4. ^ Chang Huai You, "An SVM Kernel With GMM-Supervector Based on the Bhattacharyya Distance for Speaker Recognition", Signal Processing Letters, IEEE, Vol 16, Is 1, pp. 49-52
  5. ^ Mak, B., "Phone clustering using the Bhattacharyya distance", Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on, Vol 4, pp. 2005–2008 vol.4, 3−6 Oct 1996
  6. ^ Reyes-Aldasoro, C.C., and A. Bhalerao, "The Bhattacharyya space for feature selection and its application to texture segmentation", Pattern Recognition, (2006) Vol. 39, Issue 5, May 2006, pp. 812–826
  7. ^ a b Sen, Pranab Kumar (1996). "Anil Kumar Bhattacharyya (1915-1996): A Reverent Remembrance". Calcutta Statistical Association Bulletin. 46 (3–4): 151–158. doi:10.1177/0008068319960301. S2CID 164326977.
  8. ^ a b Bhattacharyya, A. (1946). "On a Measure of Divergence between Two Multinomial Populations". Sankhyā. 7 (4): 401–406. JSTOR 25047882.
  9. ^ a b Bhattacharyya, A. (March 1943). "On a measure of divergence between two statistical populations defined by their probability distributions". Bulletin of the Calcutta Mathematical Society. 35: 99–109. MR 0010358.
  • Nielsen, F.; Boltz, S. (2010). "The Burbea–Rao and Bhattacharyya centroids". IEEE Transactions on Information Theory. 57 (8): 5455–5466. arXiv:1004.5049. doi:10.1109/TIT.2011.2159046. S2CID 14238708.
  • Kailath, T. (1967). "The Divergence and Bhattacharyya Distance Measures in Signal Selection". IEEE Transactions on Communication Technology. 15 (1): 52–60. doi:10.1109/TCOM.1967.1089532.

External links

bhattacharyya, distance, statistics, measures, similarity, probability, distributions, closely, related, bhattacharyya, coefficient, which, measure, amount, overlap, between, statistical, samples, populations, metric, despite, named, distance, since, does, obe. In statistics the Bhattacharyya distance measures the similarity of two probability distributions It is closely related to the Bhattacharyya coefficient which is a measure of the amount of overlap between two statistical samples or populations It is not a metric despite named a distance since it does not obey the triangle inequality Contents 1 Definition 2 Properties 3 Applications 4 History 5 See also 6 References 7 External linksDefinition EditFor probability distributions P displaystyle P and Q displaystyle Q on the same domain X displaystyle mathcal X the Bhattacharyya distance is defined as D B P Q ln B C P Q displaystyle D B P Q ln left BC P Q right where B C P Q x X P x Q x displaystyle BC P Q sum x in mathcal X sqrt P x Q x is the Bhattacharyya coefficient for discrete probability distributions For continuous probability distributions with P d x p x d x displaystyle P dx p x dx and Q d x q x d x displaystyle Q dx q x dx where p x displaystyle p x and q x displaystyle q x are the probability density functions the Bhattacharyya coefficient is defined as B C P Q X p x q x d x displaystyle BC P Q int mathcal X sqrt p x q x dx More generally given two probability measures P Q displaystyle P Q on a measurable space X B displaystyle mathcal X mathcal B let l displaystyle lambda be a sigma finite measure such that P displaystyle P and Q displaystyle Q are absolutely continuous with respect to l displaystyle lambda i e such that P d x p x l d x displaystyle P dx p x lambda dx and Q d x q x l d x displaystyle Q dx q x lambda dx for probability density functions p q displaystyle p q with respect to l displaystyle lambda defined l displaystyle lambda almost everywhere Such a measure even such a probability measure always exists e g l 1 2 P Q displaystyle lambda tfrac 1 2 P Q Then define the Bhattacharrya measure on X B displaystyle mathcal X mathcal B by b c d x P Q p x q x l d x P d x l d x x Q d x l d x x l d x displaystyle bc dx P Q sqrt p x q x lambda dx sqrt frac P dx lambda dx x frac Q dx lambda dx x lambda dx It does does not depend on the measure l displaystyle lambda for if we choose a measure m displaystyle mu such that l displaystyle lambda and an other measure choice l displaystyle lambda are absolutely continuous i e l l x m displaystyle lambda l x mu and l l x m displaystyle lambda l x mu then P d x p x l d x p x l d x p x l x m d x p x l x m d x displaystyle P dx p x lambda dx p x lambda dx p x l x mu dx p x l x mu dx and similarly for Q displaystyle Q We then have b c d x P Q p x q x l d x p x q x l x m x p x l x q x l x m d x p x l x q x l x m d x p x q x l d x displaystyle bc dx P Q sqrt p x q x lambda dx sqrt p x q x l x mu x sqrt p x l x q x l x mu dx sqrt p x l x q x l x mu dx sqrt p x q x lambda dx We finally define the Bhattacharyya coefficient B C P Q X b c d x P Q X p x q x l d x displaystyle BC P Q int mathcal X bc dx P Q int mathcal X sqrt p x q x lambda dx By the above the quantity B C P Q displaystyle BC P Q does not depend on l displaystyle lambda and by the Cauchy inequality 0 B C P Q 1 displaystyle 0 leq BC P Q leq 1 In particular if P d x p x Q d x displaystyle P dx p x Q dx is absolutely continuous wrt to Q displaystyle Q with Radon Nikodym derivative p x P d x Q d x x displaystyle p x frac P dx Q dx x thenB C P Q X p x Q d x X P d x Q d x Q d x E Q P d x Q d x displaystyle BC P Q int mathcal X sqrt p x Q dx int mathcal X sqrt frac P dx Q dx Q dx E Q left sqrt frac P dx Q dx right Properties Edit0 B C 1 displaystyle 0 leq BC leq 1 and 0 D B displaystyle 0 leq D B leq infty D B displaystyle D B does not obey the triangle inequality though the Hellinger distance 1 B C p q displaystyle sqrt 1 BC p q does Let p N m p s p 2 displaystyle p sim mathcal N mu p sigma p 2 q N m q s q 2 displaystyle q sim mathcal N mu q sigma q 2 where N m s 2 displaystyle mathcal N mu sigma 2 is the normal distribution with mean m displaystyle mu and variance s 2 displaystyle sigma 2 then D B p q 1 4 m p m q 2 s p 2 s q 2 1 2 ln s p 2 s q 2 2 s p s q displaystyle D B p q frac 1 4 frac mu p mu q 2 sigma p 2 sigma q 2 frac 1 2 ln left frac sigma p 2 sigma q 2 2 sigma p sigma q right And in general given two multivariate normal distributions p i N m i S i displaystyle p i mathcal N boldsymbol mu i boldsymbol Sigma i D B p 1 p 2 1 8 m 1 m 2 T S 1 m 1 m 2 1 2 ln det S det S 1 det S 2 displaystyle D B p 1 p 2 1 over 8 boldsymbol mu 1 boldsymbol mu 2 T boldsymbol Sigma 1 boldsymbol mu 1 boldsymbol mu 2 1 over 2 ln left det boldsymbol Sigma over sqrt det boldsymbol Sigma 1 det boldsymbol Sigma 2 right where S S 1 S 2 2 displaystyle boldsymbol Sigma boldsymbol Sigma 1 boldsymbol Sigma 2 over 2 Note that the first term is a squared Mahalanobis distance Applications EditThe Bhattacharyya coefficient quantifies the closeness of two random statistical samples Given two sequences from distributions P Q displaystyle P Q bin them into n displaystyle n buckets and let the frequency of samples from P displaystyle P in bucket i displaystyle i be p i displaystyle p i and similarly for q i displaystyle q i then the sample Bhattacharyya coefficient is B C p q i 1 n p i q i displaystyle BC mathbf p mathbf q sum i 1 n sqrt p i q i which is an estimator of B C P Q displaystyle BC P Q The quality of estimation depends on the choice of buckets too few buckets would overestimate B C P Q displaystyle BC P Q while too many would underestimate A common task in classification is estimating the separability of classes Up to a multiplicative factor the squared Mahalanobis distance is a special case of the Bhattacharyya distance when the two classes are normally distributed with the same variances When two classes have similar means but significantly different variances the Mahalanobis distance would be close to zero while the Bhattacharyya distance would not be The Bhattacharyya coefficient is used in the construction of polar codes 1 The Bhattacharyya distance is used in feature extraction and selection 2 image processing 3 speaker recognition 4 and phone clustering 5 A Bhattacharyya space has been proposed as a feature selection technique that can be applied to texture segmentation 6 History EditBoth the Bhattacharyya distance and the Bhattacharyya coefficient are named after Anil Kumar Bhattacharyya a statistician who worked in the 1930s at the Indian Statistical Institute 7 He developed the method to measure the distance between two non normal distributions and illustrated this with the classical multinomial populations 8 as well as probability distributions that are absolutely continuous with respect to the Lebesgue measure 9 The latter work appeared partly in 1943 in the Bulletin of the Calcutta Mathematical Society 9 while the former part despite being submitted for publication in 1941 appeared almost five years later in Sankhya 8 7 See also EditBhattacharyya angle Kullback Leibler divergence Hellinger distance Mahalanobis distance Chernoff bound Renyi entropy F divergence Fidelity of quantum statesReferences Edit Arikan Erdal July 2009 Channel polarization A method for constructing capacity achieving codes for symmetric binary input memoryless channels IEEE Transactions on Information Theory 55 7 3051 3073 arXiv 0807 3917 doi 10 1109 TIT 2009 2021379 S2CID 889822 Euisun Choi Chulhee Lee Feature extraction based on the Bhattacharyya distance Pattern Recognition Volume 36 Issue 8 August 2003 Pages 1703 1709 Francois Goudail Philippe Refregier Guillaume Delyon Bhattacharyya distance as a contrast parameter for statistical processing of noisy optical images JOSA A Vol 21 Issue 7 pp 1231 1240 2004 Chang Huai You An SVM Kernel With GMM Supervector Based on the Bhattacharyya Distance for Speaker Recognition Signal Processing Letters IEEE Vol 16 Is 1 pp 49 52 Mak B Phone clustering using the Bhattacharyya distance Spoken Language 1996 ICSLP 96 Proceedings Fourth International Conference on Vol 4 pp 2005 2008 vol 4 3 6 Oct 1996 Reyes Aldasoro C C and A Bhalerao The Bhattacharyya space for feature selection and its application to texture segmentation Pattern Recognition 2006 Vol 39 Issue 5 May 2006 pp 812 826 a b Sen Pranab Kumar 1996 Anil Kumar Bhattacharyya 1915 1996 A Reverent Remembrance Calcutta Statistical Association Bulletin 46 3 4 151 158 doi 10 1177 0008068319960301 S2CID 164326977 a b Bhattacharyya A 1946 On a Measure of Divergence between Two Multinomial Populations Sankhya 7 4 401 406 JSTOR 25047882 a b Bhattacharyya A March 1943 On a measure of divergence between two statistical populations defined by their probability distributions Bulletin of the Calcutta Mathematical Society 35 99 109 MR 0010358 Nielsen F Boltz S 2010 The Burbea Rao and Bhattacharyya centroids IEEE Transactions on Information Theory 57 8 5455 5466 arXiv 1004 5049 doi 10 1109 TIT 2011 2159046 S2CID 14238708 Kailath T 1967 The Divergence and Bhattacharyya Distance Measures in Signal Selection IEEE Transactions on Communication Technology 15 1 52 60 doi 10 1109 TCOM 1967 1089532 Djouadi A Snorrason O Garber F 1990 The quality of Training Sample estimates of the Bhattacharyya coefficient IEEE Transactions on Pattern Analysis and Machine Intelligence 12 1 92 97 doi 10 1109 34 41388 For a short list of properties see http www mtm ufsc br taneja book node20 htmlExternal links Edit Bhattacharyya distance Encyclopedia of Mathematics EMS Press 2001 1994 Bhattacharyya s distance measure as a precursor of genetic distance measures Journal of Biosciences 2004 Statistical Intuition of Bhattacharyya s distance Retrieved from https en wikipedia org w index php title Bhattacharyya distance amp oldid 1124772472, wikipedia, wiki, book, books, library,

article

, read, download, free, free download, mp3, video, mp4, 3gp, jpg, jpeg, gif, png, picture, music, song, movie, book, game, games.