fbpx
Wikipedia

Pólya urn model

In statistics, a Pólya urn model (also known as a Pólya urn scheme or simply as Pólya's urn), named after George Pólya, is a family of urn models that can be used to interpret many commonly used statistical models.

The model represents objects of interest (such as atoms, people, cars, etc.) as colored balls in an urn. In the basic Pólya urn model, the experimenter puts x white and y black balls into an urn. At each step, one ball is drawn uniformly at random from the urn, and its color observed; it is then returned in the urn, and an additional ball of the same color is added to the urn.

If by random chance, more black balls are drawn than white balls in the initial few draws, it would make it more likely for more black balls to be drawn later. Similarly for the white balls. Thus the urn has a self-reinforcing property ("the rich get richer"). It is the opposite of sampling without replacement, where every time a particular value is observed, it is less likely to be observed again, whereas in a Pólya urn model, an observed value is more likely to be observed again. In a Pólya urn model, successive acts of measurement over time have less and less effect on future measurements, whereas in sampling without replacement, the opposite is true: After a certain number of measurements of a particular value, that value will never be seen again.

It is also different from sampling with replacement, where the ball is returned to the urn but without adding new balls. In this case, there is neither self-reinforcing nor anti-self-reinforcing.

Basic results edit

Questions of interest are the evolution of the urn population and the sequence of colors of the balls drawn out.

After   draws, the probability that the urn contains   white balls and   black balls is  , where the overbar denotes rising factorial. This can be proved by drawing the Pascal's triangle of all possible configurations.

More generally, if the urn starts with   balls of color  , with  , then after   draws, the probability that the urn contains   balls of color   is

 
where we use the multinomial coefficient.

Conditional on the urn ending up with   balls of color   after   draws, there are   different trajectories that could have led to such an end-state. The conditional probability of each trajectory is the same:  .

Interpretation edit

One of the reasons for interest in this particular rather elaborate urn model (i.e. with duplication and then replacement of each ball drawn) is that it provides an example in which the count (initially x black and y white) of balls in the urn is not concealed, which is able to approximate the correct updating of subjective probabilities appropriate to a different case in which the original urn content is concealed while ordinary sampling with replacement is conducted (without the Pólya ball-duplication). Because of the simple "sampling with replacement" scheme in this second case, the urn content is now static, but this greater simplicity is compensated for by the assumption that the urn content is now unknown to an observer. A Bayesian analysis of the observer's uncertainty about the urn's initial content can be made, using a particular choice of (conjugate) prior distribution. Specifically, suppose that an observer knows that the urn contains only identical balls, each coloured either black or white, but they do not know the absolute number of balls present, nor the proportion that are of each colour. Suppose that they hold prior beliefs about these unknowns: for them the probability distribution of the urn content is well approximated by some prior distribution for the total number of balls in the urn, and a beta prior distribution with parameters (x,y) for the initial proportion of these which are black, this proportion being (for them) considered approximately independent of the total number. Then the process of outcomes of a succession of draws from the urn (with replacement but without the duplication) has approximately the same probability law as does the above Pólya scheme in which the actual urn content was not hidden from them. The approximation error here relates to the fact that an urn containing a known finite number m of balls of course cannot have an exactly beta-distributed unknown proportion of black balls, since the domain of possible values for that proportion are confined to being multiples of  , rather than having the full freedom to assume any value in the continuous unit interval, as would an exactly beta distributed proportion. This slightly informal account is provided for reason of motivation, and can be made more mathematically precise.

This basic Pólya urn model has been generalized in many ways.

Distributions related to the Pólya urn edit

  • beta-binomial distribution: The distribution of the number of successful draws (trials), e.g. number of extractions of white ball, given   draws from a Pólya urn.
  • Beta negative binomial distribution: The distribution of number of white balls observed until a fixed number black balls are observed.
  • Dirichlet-multinomial distribution (also known as the multivariate Pólya distribution): The distribution over the number of balls of each color, given   draws from a Pólya urn where there are   different colors instead of only two.
  • Dirichlet negative multinomial distribution: The distribution over the number of balls of each color until a fixed number of stopping colored balls are observed.
  • Martingales, the Beta-binomial distribution and the beta distribution: Let w and b be the number of white and black balls initially in the urn, and   the number of white balls currently in the urn after n draws. Then the sequence of values   for   is a normalized version of the Beta-binomial distribution. It is a martingale and converges to the beta distribution when n → ∞.
  • Dirichlet process, Chinese restaurant process, Hoppe urn: Imagine a modified Pólya urn scheme as follows. We start with an urn with   black balls. When drawing a ball from the urn, if we draw a black ball, put the ball back along with a new ball of a new non-black color randomly generated from a uniform distribution over an infinite set of available colours, and consider the newly generated color to be the "value" of the draw. Otherwise, put the ball back along with another ball of the same color, as for the standard Pólya urn scheme. The colors of an infinite sequence of draws from this modified Pólya urn scheme follow a Chinese restaurant process. If, instead of generating a new color, we draw a random value from a given base distribution and use that value to label the ball, the labels of an infinite sequence of draws follow a Dirichlet process.[1]
  • Moran model: An urn model used to model genetic drift in theoretical population genetics. This is closely similar to the Pólya urn model except that, in addition to adding a new ball of the same color, a randomly drawn ball is removed from the urn. The number of balls in the urn thus remains constant. Continued sampling then leads ultimately to an urn with all balls of one color, the probability of each color being the proportion of that color in the original urn. There are variants of the Moran model that insist that the ball removed from the urn be a different ball from one originally sampled in that step, and variants that do the removal of a ball immediately after the new ball is placed in the urn, so that the new ball is one of the balls available to be removed. This makes a small difference in the time taken to reach the state in which all balls are the same color. The Moran process models genetic drift in a population with overlapping generations.

Exchangeability edit

Polya's Urn is a quintessential example of an exchangeable process.

Suppose we have an urn containing   white balls and   black balls. We proceed to draw balls at random from the urn. On the  -th draw, we define a random variable,  , by   if the ball is black and   otherwise. We then return the ball to the urn, with an additional ball of the same colour. For a given  , if we have that   for many  , then it is more likely that  , because more black balls have been added to the urn. Therefore, these variables are not independent of each other.

The sequence   does, however, exhibit the weaker property of exchangeability.[2] Recall that a (finite or infinite) sequence of random variables is called exchangeable if its joint distribution is invariant under permutations of indices.

To show exchangeability of the sequence  , assume that   balls are picked from the urn, and out of these   balls,   balls are black and   are white. On the first draw the number of balls in the urn is  ; on the second draw it is   and so on. On the  -th draw, the number of balls will be  . The probability that we draw all   black balls first, and then all   white balls is given by 

Now we must show that if the order of black and white balls is permuted, there is no change to the probability. As in the expression above, even after permuting the draws, the  th denominator will always be  , since this is the number of balls in the urn at that round.

If we see  -th black ball in round  , the probability   will be equal to  , i.e. the numerator will be equal to  . With the same argument, we can calculate the probability for white balls. Therefore, for any sequence   in which   occurs   times and   occurs   times (i.e. a sequence with   black balls and   white balls drawn in some order) the final probability will be equal to the following expression, where we take advantage of commutativity of multiplication in the numerator:

 

This probability is not related to the order of seeing black and white balls and only depends on the total number of white balls and the total number of black balls.[2]

According to the De Finetti's theorem, there must be a unique prior distribution such that the joint distribution of observing the sequence is a Bayesian mixture of the Bernoulli probabilities. It can be shown that this prior distribution is a beta distribution with parameters  . In De Finetti's theorem, if we replace   with  , then we get the previous equation:[2]


 


In this equation  .

See also edit

References edit

  1. ^ Hoppe, Fred (1984). "Pólya-like urns and the Ewens' sampling formula". Journal of Mathematical Biology. 20: 91. doi:10.1007/BF00275863. hdl:2027.42/46944. S2CID 122994288.
  2. ^ a b c Hoppe, Fred M (1984). "Polya-like urns and the Ewens' sampling formula". Journal of Mathematical Biology. 20 (1): 91–94. doi:10.1007/bf00275863. hdl:2027.42/46944. ISSN 0303-6812. S2CID 122994288.[dead link]

Further reading edit

  • F. Alajaji and T. Fuja, "A Communication Channel Modeled on Contagion," IEEE Transactions on Information Theory, Vol. 40, pp. 2035–2041, November 1994.
  • A. Banerjee, P. Burlina and F. Alajaji, "Image Segmentation and Labeling Using the Pólya Urn Model," IEEE Transactions on Image Processing, Vol. 8, No. 9, pp. 1243–1253, September 1999.

Bibliography edit

  • N.L. Johnson and S.Kotz, (1977) "Urn Models and Their Application." John Wiley.
  • Hosam Mahmoud, (2008) "Pólya Urn Models." Chapman and Hall/CRC. ISBN 978-1420059830.

pólya, model, statistics, also, known, pólya, scheme, simply, pólya, named, after, george, pólya, family, models, that, used, interpret, many, commonly, used, statistical, models, model, represents, objects, interest, such, atoms, people, cars, colored, balls,. In statistics a Polya urn model also known as a Polya urn scheme or simply as Polya s urn named after George Polya is a family of urn models that can be used to interpret many commonly used statistical models The model represents objects of interest such as atoms people cars etc as colored balls in an urn In the basic Polya urn model the experimenter puts x white and y black balls into an urn At each step one ball is drawn uniformly at random from the urn and its color observed it is then returned in the urn and an additional ball of the same color is added to the urn If by random chance more black balls are drawn than white balls in the initial few draws it would make it more likely for more black balls to be drawn later Similarly for the white balls Thus the urn has a self reinforcing property the rich get richer It is the opposite of sampling without replacement where every time a particular value is observed it is less likely to be observed again whereas in a Polya urn model an observed value is more likely to be observed again In a Polya urn model successive acts of measurement over time have less and less effect on future measurements whereas in sampling without replacement the opposite is true After a certain number of measurements of a particular value that value will never be seen again It is also different from sampling with replacement where the ball is returned to the urn but without adding new balls In this case there is neither self reinforcing nor anti self reinforcing Contents 1 Basic results 2 Interpretation 3 Distributions related to the Polya urn 4 Exchangeability 5 See also 6 References 7 Further reading 8 BibliographyBasic results editQuestions of interest are the evolution of the urn population and the sequence of colors of the balls drawn out After n displaystyle n nbsp draws the probability that the urn contains x n 1 displaystyle x n 1 nbsp white balls and y n 2 displaystyle y n 2 nbsp black balls is n n 1 x n 1 y n 2 x y n displaystyle binom n n 1 frac x bar n 1 y bar n 2 x y bar n nbsp where the overbar denotes rising factorial This can be proved by drawing the Pascal s triangle of all possible configurations More generally if the urn starts with a i displaystyle a i nbsp balls of color i displaystyle i nbsp with i 1 2 k displaystyle i 1 2 k nbsp then after n displaystyle n nbsp draws the probability that the urn contains a i n i displaystyle a i n i nbsp balls of color i displaystyle i nbsp is n n 1 n k i 1 k a i n i i a i n displaystyle binom n n 1 cdots n k frac prod i 1 k a i bar n i sum i a i bar n nbsp where we use the multinomial coefficient Conditional on the urn ending up with a i n i displaystyle a i n i nbsp balls of color i displaystyle i nbsp after n displaystyle n nbsp draws there are n n 1 n k displaystyle binom n n 1 cdots n k nbsp different trajectories that could have led to such an end state The conditional probability of each trajectory is the same n n 1 n k 1 displaystyle binom n n 1 cdots n k 1 nbsp Interpretation editThis section does not cite any sources Please help improve this section by adding citations to reliable sources Unsourced material may be challenged and removed January 2024 Learn how and when to remove this template message One of the reasons for interest in this particular rather elaborate urn model i e with duplication and then replacement of each ball drawn is that it provides an example in which the count initially x black and y white of balls in the urn is not concealed which is able to approximate the correct updating of subjective probabilities appropriate to a different case in which the original urn content is concealed while ordinary sampling with replacement is conducted without the Polya ball duplication Because of the simple sampling with replacement scheme in this second case the urn content is now static but this greater simplicity is compensated for by the assumption that the urn content is now unknown to an observer A Bayesian analysis of the observer s uncertainty about the urn s initial content can be made using a particular choice of conjugate prior distribution Specifically suppose that an observer knows that the urn contains only identical balls each coloured either black or white but they do not know the absolute number of balls present nor the proportion that are of each colour Suppose that they hold prior beliefs about these unknowns for them the probability distribution of the urn content is well approximated by some prior distribution for the total number of balls in the urn and a beta prior distribution with parameters x y for the initial proportion of these which are black this proportion being for them considered approximately independent of the total number Then the process of outcomes of a succession of draws from the urn with replacement but without the duplication has approximately the same probability law as does the above Polya scheme in which the actual urn content was not hidden from them The approximation error here relates to the fact that an urn containing a known finite number m of balls of course cannot have an exactly beta distributed unknown proportion of black balls since the domain of possible values for that proportion are confined to being multiples of 1 m displaystyle 1 m nbsp rather than having the full freedom to assume any value in the continuous unit interval as would an exactly beta distributed proportion This slightly informal account is provided for reason of motivation and can be made more mathematically precise This basic Polya urn model has been generalized in many ways Distributions related to the Polya urn editbeta binomial distribution The distribution of the number of successful draws trials e g number of extractions of white ball given n displaystyle n nbsp draws from a Polya urn Beta negative binomial distribution The distribution of number of white balls observed until a fixed number black balls are observed Dirichlet multinomial distribution also known as the multivariate Polya distribution The distribution over the number of balls of each color given n displaystyle n nbsp draws from a Polya urn where there are k displaystyle k nbsp different colors instead of only two Dirichlet negative multinomial distribution The distribution over the number of balls of each color until a fixed number of stopping colored balls are observed Martingales the Beta binomial distribution and the beta distribution Let w and b be the number of white and black balls initially in the urn and w n w displaystyle w n w nbsp the number of white balls currently in the urn after n draws Then the sequence of values w n w w b n displaystyle frac w n w w b n nbsp for n 1 2 3 displaystyle n 1 2 3 dots nbsp is a normalized version of the Beta binomial distribution It is a martingale and converges to the beta distribution when n Dirichlet process Chinese restaurant process Hoppe urn Imagine a modified Polya urn scheme as follows We start with an urn with a displaystyle alpha nbsp black balls When drawing a ball from the urn if we draw a black ball put the ball back along with a new ball of a new non black color randomly generated from a uniform distribution over an infinite set of available colours and consider the newly generated color to be the value of the draw Otherwise put the ball back along with another ball of the same color as for the standard Polya urn scheme The colors of an infinite sequence of draws from this modified Polya urn scheme follow a Chinese restaurant process If instead of generating a new color we draw a random value from a given base distribution and use that value to label the ball the labels of an infinite sequence of draws follow a Dirichlet process 1 Moran model An urn model used to model genetic drift in theoretical population genetics This is closely similar to the Polya urn model except that in addition to adding a new ball of the same color a randomly drawn ball is removed from the urn The number of balls in the urn thus remains constant Continued sampling then leads ultimately to an urn with all balls of one color the probability of each color being the proportion of that color in the original urn There are variants of the Moran model that insist that the ball removed from the urn be a different ball from one originally sampled in that step and variants that do the removal of a ball immediately after the new ball is placed in the urn so that the new ball is one of the balls available to be removed This makes a small difference in the time taken to reach the state in which all balls are the same color The Moran process models genetic drift in a population with overlapping generations Exchangeability editPolya s Urn is a quintessential example of an exchangeable process Suppose we have an urn containing g displaystyle gamma nbsp white balls and a displaystyle alpha nbsp black balls We proceed to draw balls at random from the urn On the i displaystyle i nbsp th draw we define a random variable X i displaystyle X i nbsp by X i 1 displaystyle X i 1 nbsp if the ball is black and X i 0 displaystyle X i 0 nbsp otherwise We then return the ball to the urn with an additional ball of the same colour For a given i displaystyle i nbsp if we have that X j 1 displaystyle X j 1 nbsp for many j lt i displaystyle j lt i nbsp then it is more likely that X i 1 displaystyle X i 1 nbsp because more black balls have been added to the urn Therefore these variables are not independent of each other The sequence X 1 X 2 X 3 displaystyle X 1 X 2 X 3 dots nbsp does however exhibit the weaker property of exchangeability 2 Recall that a finite or infinite sequence of random variables is called exchangeable if its joint distribution is invariant under permutations of indices To show exchangeability of the sequence X 1 X 2 X 3 displaystyle X 1 X 2 X 3 dots nbsp assume that n displaystyle n nbsp balls are picked from the urn and out of these n displaystyle n nbsp balls k displaystyle k nbsp balls are black and n k displaystyle n k nbsp are white On the first draw the number of balls in the urn is g a displaystyle gamma alpha nbsp on the second draw it is g a 1 displaystyle gamma alpha 1 nbsp and so on On the i displaystyle i nbsp th draw the number of balls will be g a i 1 displaystyle gamma alpha i 1 nbsp The probability that we draw all k displaystyle k nbsp black balls first and then all n k displaystyle n k nbsp white balls is given byP X 1 1 X k 1 X k 1 0 X n 0 a g a a 1 g a 1 a k 1 g a k 1 g g a k g 1 g a k 1 g n k 1 g a n 1 displaystyle mathbb P left X 1 1 dots X k 1 X k 1 0 dots X n 0 right frac alpha gamma alpha times frac alpha 1 gamma alpha 1 times cdots times frac alpha k 1 gamma alpha k 1 times frac gamma gamma alpha k times frac gamma 1 gamma alpha k 1 times cdots times frac gamma n k 1 gamma alpha n 1 nbsp Now we must show that if the order of black and white balls is permuted there is no change to the probability As in the expression above even after permuting the draws the i displaystyle i nbsp th denominator will always be g a i 1 displaystyle gamma alpha i 1 nbsp since this is the number of balls in the urn at that round If we see j displaystyle j nbsp th black ball in round t displaystyle t nbsp the probability X t 1 displaystyle X t 1 nbsp will be equal to a j 1 g a t 1 displaystyle frac alpha j 1 gamma alpha t 1 nbsp i e the numerator will be equal to a j 1 displaystyle alpha j 1 nbsp With the same argument we can calculate the probability for white balls Therefore for any sequence x 1 x 2 x 3 displaystyle x 1 x 2 x 3 dots nbsp in which 1 displaystyle 1 nbsp occurs k displaystyle k nbsp times and 0 displaystyle 0 nbsp occurs n k displaystyle n k nbsp times i e a sequence with k displaystyle k nbsp black balls and n k displaystyle n k nbsp white balls drawn in some order the final probability will be equal to the following expression where we take advantage of commutativity of multiplication in the numerator P X 1 x 1 X 2 x 2 X n x n i 1 k a i 1 i 1 n k g i 1 i 1 n g a i 1 a k 1 g n k 1 a g 1 a 1 g 1 a g n 1 displaystyle begin aligned mathbb P X 1 x 1 X 2 x 2 X n x n amp frac prod i 1 k left alpha i 1 right times prod i 1 n k left gamma i 1 right prod i 1 n left gamma alpha i 1 right amp frac left alpha k 1 right times left gamma n k 1 right times left alpha gamma 1 right left alpha 1 right times left gamma 1 right left alpha gamma n 1 right end aligned nbsp This probability is not related to the order of seeing black and white balls and only depends on the total number of white balls and the total number of black balls 2 According to the De Finetti s theorem there must be a unique prior distribution such that the joint distribution of observing the sequence is a Bayesian mixture of the Bernoulli probabilities It can be shown that this prior distribution is a beta distribution with parameters b a g displaystyle beta left cdot alpha gamma right nbsp In De Finetti s theorem if we replace p displaystyle pi cdot nbsp with b a g displaystyle beta left cdot alpha gamma right nbsp then we get the previous equation 2 p X 1 x 1 X 2 x 2 X n x n 8 i 1 n x i 1 8 n i 1 n x i b 8 a g d 8 8 i 1 n x i 1 8 n i 1 n x i a g 1 a 1 g 1 8 a 1 1 8 g 1 d 8 8 a 1 i 1 n x i 1 8 n g 1 i 1 n x i a g 1 a 1 g 1 d 8 8 a k 1 1 8 n k 1 g a g 1 a 1 g 1 d 8 a g 1 a 1 g 1 8 a k 1 1 8 n k g 1 d 8 a g 1 a 1 g 1 G g n k G a k G a g n a k 1 g n k 1 a g 1 a 1 g 1 a g n 1 displaystyle begin aligned p X 1 x 1 X 2 x 2 X n x n amp int theta left sum i 1 n x i right times left 1 theta right left n sum i 1 n x i right beta left theta alpha gamma right d left theta right amp int theta left sum i 1 n x i right times left 1 theta right left n sum i 1 n x i right dfrac alpha gamma 1 alpha 1 gamma 1 theta alpha 1 1 theta gamma 1 d left theta right amp int theta left alpha 1 sum i 1 n x i right times left 1 theta right left n gamma 1 sum i 1 n x i right dfrac alpha gamma 1 alpha 1 gamma 1 d left theta right amp int theta left alpha k 1 right times left 1 theta right left n k 1 gamma right dfrac alpha gamma 1 alpha 1 gamma 1 d left theta right amp dfrac alpha gamma 1 alpha 1 gamma 1 int theta left alpha k 1 right times left 1 theta right left n k gamma 1 right d left theta right amp dfrac alpha gamma 1 alpha 1 gamma 1 dfrac Gamma gamma n k Gamma alpha k Gamma alpha gamma n amp dfrac left alpha k 1 right times left gamma n k 1 right times left alpha gamma 1 right left alpha 1 right times left gamma 1 right left alpha gamma n 1 right end aligned nbsp In this equation k i 1 n x i displaystyle k sum i 1 n x i nbsp See also editPitman Yor process Moran process Yule process De Finetti s theorem Chinese restaurant processReferences edit Hoppe Fred 1984 Polya like urns and the Ewens sampling formula Journal of Mathematical Biology 20 91 doi 10 1007 BF00275863 hdl 2027 42 46944 S2CID 122994288 a b c Hoppe Fred M 1984 Polya like urns and the Ewens sampling formula Journal of Mathematical Biology 20 1 91 94 doi 10 1007 bf00275863 hdl 2027 42 46944 ISSN 0303 6812 S2CID 122994288 dead link Further reading editF Alajaji and T Fuja A Communication Channel Modeled on Contagion IEEE Transactions on Information Theory Vol 40 pp 2035 2041 November 1994 A Banerjee P Burlina and F Alajaji Image Segmentation and Labeling Using the Polya Urn Model IEEE Transactions on Image Processing Vol 8 No 9 pp 1243 1253 September 1999 Bibliography editN L Johnson and S Kotz 1977 Urn Models and Their Application John Wiley Hosam Mahmoud 2008 Polya Urn Models Chapman and Hall CRC ISBN 978 1420059830 Retrieved from https en wikipedia org w index php title Polya urn model amp oldid 1202925726, wikipedia, wiki, book, books, library,

article

, read, download, free, free download, mp3, video, mp4, 3gp, jpg, jpeg, gif, png, picture, music, song, movie, book, game, games.