fbpx
Wikipedia

Poisson regression

In statistics, Poisson regression is a generalized linear model form of regression analysis used to model count data and contingency tables.[1] Poisson regression assumes the response variable Y has a Poisson distribution, and assumes the logarithm of its expected value can be modeled by a linear combination of unknown parameters. A Poisson regression model is sometimes known as a log-linear model, especially when used to model contingency tables.

Negative binomial regression is a popular generalization of Poisson regression because it loosens the highly restrictive assumption that the variance is equal to the mean made by the Poisson model. The traditional negative binomial regression model is based on the Poisson-gamma mixture distribution. This model is popular because it models the Poisson heterogeneity with a gamma distribution.

Poisson regression models are generalized linear models with the logarithm as the (canonical) link function, and the Poisson distribution function as the assumed probability distribution of the response.

Regression models edit

If   is a vector of independent variables, then the model takes the form

 

where   and  . Sometimes this is written more compactly as

 

where   is now an (n + 1)-dimensional vector consisting of n independent variables concatenated to the number one. Here   is simply   concatenated to  .

Thus, when given a Poisson regression model   and an input vector  , the predicted mean of the associated Poisson distribution is given by

 

If   are independent observations with corresponding values   of the predictor variables, then   can be estimated by maximum likelihood. The maximum-likelihood estimates lack a closed-form expression and must be found by numerical methods. The probability surface for maximum-likelihood Poisson regression is always concave, making Newton–Raphson or other gradient-based methods appropriate estimation techniques.

Interpretation of coefficients edit

Suppose we have a model with a single predictor, that is,  :

 

Suppose we compute the predicted values at point   and  :

 
 

By substracting the first from the second:

 

Suppose now that  . We obtain:

 

So the coefficient of the model is to be interpreted as the increase in the logarithm of the count of the outcome variable when the independent variable increases by 1.

By applying the rules of logarithms:

 
 
 

That is, when the independent variable increases by 1, the outcome variable is multiplied by the exponentiated coefficient.

The exponentiated coefficient is also called the incidence ratio.

Average partial effect edit

Often, the object of interest is the average partial effect or average marginal effect  , which is interpreted as the change in the outcome   for a one unit change in the independent variable  . The average partial effect in the Poisson model for a continuous   can be shown to be:[2]

 

This can be estimated using the coefficient estimates from the Poisson model   with the observed values of  .

Maximum likelihood-based parameter estimation edit

Given a set of parameters θ and an input vector x, the mean of the predicted Poisson distribution, as stated above, is given by

 

and thus, the Poisson distribution's probability mass function is given by

 

Now suppose we are given a data set consisting of m vectors  , along with a set of m values  . Then, for a given set of parameters θ, the probability of attaining this particular set of data is given by

 

By the method of maximum likelihood, we wish to find the set of parameters θ that makes this probability as large as possible. To do this, the equation is first rewritten as a likelihood function in terms of θ:

 

Note that the expression on the right hand side has not actually changed. A formula in this form is typically difficult to work with; instead, one uses the log-likelihood:

 

Notice that the parameters θ only appear in the first two terms of each term in the summation. Therefore, given that we are only interested in finding the best value for θ we may drop the yi! and simply write

 

To find a maximum, we need to solve an equation   which has no closed-form solution. However, the negative log-likelihood,  , is a convex function, and so standard convex optimization techniques such as gradient descent can be applied to find the optimal value of θ.

Poisson regression in practice edit

Poisson regression may be appropriate when the dependent variable is a count, for instance of events such as the arrival of a telephone call at a call centre.[3] The events must be independent in the sense that the arrival of one call will not make another more or less likely, but the probability per unit time of events is understood to be related to covariates such as time of day.

"Exposure" and offset edit

Poisson regression may also be appropriate for rate data, where the rate is a count of events divided by some measure of that unit's exposure (a particular unit of observation).[4] For example, biologists may count the number of tree species in a forest: events would be tree observations, exposure would be unit area, and rate would be the number of species per unit area. Demographers may model death rates in geographic areas as the count of deaths divided by person−years. More generally, event rates can be calculated as events per unit time, which allows the observation window to vary for each unit. In these examples, exposure is respectively unit area, person−years and unit time. In Poisson regression this is handled as an offset. If the rate is count/exposure, multiplying both sides of the equation by exposure moves it to the right side of the equation. When both sides of the equation are then logged, the final model contains log(exposure) as a term that is added to the regression coefficients. This logged variable, log(exposure), is called the offset variable and enters on the right-hand side of the equation with a parameter estimate (for log(exposure)) constrained to 1.

 

which implies

 

Offset in the case of a GLM in R can be achieved using the offset() function:

glm(y ~ offset(log(exposure)) + x, family=poisson(link=log) ) 

Overdispersion and zero inflation edit

A characteristic of the Poisson distribution is that its mean is equal to its variance. In certain circumstances, it will be found that the observed variance is greater than the mean; this is known as overdispersion and indicates that the model is not appropriate. A common reason is the omission of relevant explanatory variables, or dependent observations. Under some circumstances, the problem of overdispersion can be solved by using quasi-likelihood estimation or a negative binomial distribution instead.[5][6]

Ver Hoef and Boveng described the difference between quasi-Poisson (also called overdispersion with quasi-likelihood) and negative binomial (equivalent to gamma-Poisson) as follows: If E(Y) = μ, the quasi-Poisson model assumes var(Y) = θμ while the gamma-Poisson assumes var(Y) = μ(1 + κμ), where θ is the quasi-Poisson overdispersion parameter, and κ is the shape parameter of the negative binomial distribution. For both models, parameters are estimated using iteratively reweighted least squares. For quasi-Poisson, the weights are μ/θ. For negative binomial, the weights are μ/(1 + κμ). With large μ and substantial extra-Poisson variation, the negative binomial weights are capped at 1/κ. Ver Hoef and Boveng discussed an example where they selected between the two by plotting mean squared residuals vs. the mean.[7]

Another common problem with Poisson regression is excess zeros: if there are two processes at work, one determining whether there are zero events or any events, and a Poisson process determining how many events there are, there will be more zeros than a Poisson regression would predict. An example would be the distribution of cigarettes smoked in an hour by members of a group where some individuals are non-smokers.

Other generalized linear models such as the negative binomial model or zero-inflated model may function better in these cases.

On the contrary, underdispersion may pose an issue for parameter estimation.[8]

Use in survival analysis edit

Poisson regression creates proportional hazards models, one class of survival analysis: see proportional hazards models for descriptions of Cox models.

Extensions edit

Regularized Poisson regression edit

When estimating the parameters for Poisson regression, one typically tries to find values for θ that maximize the likelihood of an expression of the form

 

where m is the number of examples in the data set, and   is the probability mass function of the Poisson distribution with the mean set to  . Regularization can be added to this optimization problem by instead maximizing[9]

 

for some positive constant  . This technique, similar to ridge regression, can reduce overfitting.

See also edit

References edit

  1. ^ Nelder, J. A. (1974). "Log Linear Models for Contingency Tables: A Generalization of Classical Least Squares". Journal of the Royal Statistical Society, Series C (Applied Statistics). 23 (3): pp. 323–329. doi:10.2307/2347125. JSTOR 2347125.
  2. ^ Wooldridge, Jeffrey (2010). Econometric Analysis of Cross Section and Panel Data (2nd ed.). Cambridge, Massachusetts: The MIT Press. p. 726.
  3. ^ Greene, William H. (2003). Econometric Analysis (Fifth ed.). Prentice-Hall. pp. 740–752. ISBN 978-0130661890.
  4. ^ Frome, Edward L. (1983). "The Analysis of Rates Using Poisson Regression Models". Biometrics. 39 (3): pp. 665–674. doi:10.2307/2531094. JSTOR 2531094.
  5. ^ Paternoster R, Brame R (1997). "Multiple routes to delinquency? A test of developmental and general theories of crime". Criminology. 35: 45–84. doi:10.1111/j.1745-9125.1997.tb00870.x.
  6. ^ Berk R, MacDonald J (2008). "Overdispersion and Poisson regression". Journal of Quantitative Criminology. 24 (3): 269–284. doi:10.1007/s10940-008-9048-4. S2CID 121273486.
  7. ^ Ver Hoef, JAY M.; Boveng, Peter L. (2007-01-01). "Quasi-Poisson vs. Negative Binomial Regression: How should we model overdispersed count data?". Ecology. 88 (11): 2766–2772. Bibcode:2007Ecol...88.2766V. doi:10.1890/07-0043.1. PMID 18051645. Retrieved 2016-09-01.
  8. ^ Schwarzenegger, Rafael; Quigley, John; Walls, Lesley (23 November 2021). "Is eliciting dependency worth the effort? A study for the multivariate Poisson-Gamma probability model". Proceedings of the Institution of Mechanical Engineers, Part O: Journal of Risk and Reliability. 237 (5): 5. doi:10.1177/1748006X211059417.
  9. ^ Perperoglou, Aris (2011-09-08). "Fitting survival data with penalized Poisson regression". Statistical Methods & Applications. 20 (4). Springer Nature: 451–462. doi:10.1007/s10260-011-0172-1. ISSN 1618-2510. S2CID 10883925.

Further reading edit

  • Cameron, A. C.; Trivedi, P. K. (1998). Regression analysis of count data. Cambridge University Press. ISBN 978-0-521-63201-0.
  • Christensen, Ronald (1997). Log-linear models and logistic regression. Springer Texts in Statistics (Second ed.). New York: Springer-Verlag. ISBN 978-0-387-98247-2. MR 1633357.
  • Gouriéroux, Christian (2000). "The Econometrics of Discrete Positive Variables: the Poisson Model". Econometrics of Qualitative Dependent Variables. New York: Cambridge University Press. pp. 270–83. ISBN 978-0-521-58985-7.
  • Greene, William H. (2008). "Models for Event Counts and Duration". Econometric Analysis (8th ed.). Upper Saddle River: Prentice Hall. pp. 906–944. ISBN 978-0-13-600383-0.
  • Hilbe, J. M. (2007). Negative Binomial Regression. Cambridge University Press. ISBN 978-0-521-85772-7.
  • Jones, Andrew M.; et al. (2013). "Models for count data". Applied Health Economics. London: Routledge. pp. 295–341. ISBN 978-0-415-67682-3.
  • Myers, Raymond H.; et al. (2010). "Logistic and Poisson Regression Models". Generalized Linear Models With Applications in Engineering and the Sciences (Second ed.). New Jersey: Wiley. pp. 176–183. ISBN 978-0-470-45463-3.

poisson, regression, statistics, generalized, linear, model, form, regression, analysis, used, model, count, data, contingency, tables, assumes, response, variable, poisson, distribution, assumes, logarithm, expected, value, modeled, linear, combination, unkno. In statistics Poisson regression is a generalized linear model form of regression analysis used to model count data and contingency tables 1 Poisson regression assumes the response variable Y has a Poisson distribution and assumes the logarithm of its expected value can be modeled by a linear combination of unknown parameters A Poisson regression model is sometimes known as a log linear model especially when used to model contingency tables Negative binomial regression is a popular generalization of Poisson regression because it loosens the highly restrictive assumption that the variance is equal to the mean made by the Poisson model The traditional negative binomial regression model is based on the Poisson gamma mixture distribution This model is popular because it models the Poisson heterogeneity with a gamma distribution Poisson regression models are generalized linear models with the logarithm as the canonical link function and the Poisson distribution function as the assumed probability distribution of the response Contents 1 Regression models 2 Interpretation of coefficients 2 1 Average partial effect 3 Maximum likelihood based parameter estimation 4 Poisson regression in practice 4 1 Exposure and offset 4 2 Overdispersion and zero inflation 4 3 Use in survival analysis 5 Extensions 5 1 Regularized Poisson regression 6 See also 7 References 8 Further readingRegression models editIf x R n displaystyle mathbf x in mathbb R n nbsp is a vector of independent variables then the model takes the form log E Y x a b x displaystyle log operatorname E Y mid mathbf x alpha mathbf beta mathbf x nbsp where a R displaystyle alpha in mathbb R nbsp and b R n displaystyle mathbf beta in mathbb R n nbsp Sometimes this is written more compactly as log E Y x 8 x displaystyle log operatorname E Y mid mathbf x boldsymbol theta mathbf x nbsp where x displaystyle mathbf x nbsp is now an n 1 dimensional vector consisting of n independent variables concatenated to the number one Here 8 displaystyle theta nbsp is simply a displaystyle alpha nbsp concatenated to b displaystyle beta nbsp Thus when given a Poisson regression model 8 displaystyle theta nbsp and an input vector x displaystyle mathbf x nbsp the predicted mean of the associated Poisson distribution is given by E Y x e 8 x displaystyle operatorname E Y mid mathbf x e boldsymbol theta mathbf x nbsp If Y i displaystyle Y i nbsp are independent observations with corresponding values x i displaystyle mathbf x i nbsp of the predictor variables then 8 displaystyle theta nbsp can be estimated by maximum likelihood The maximum likelihood estimates lack a closed form expression and must be found by numerical methods The probability surface for maximum likelihood Poisson regression is always concave making Newton Raphson or other gradient based methods appropriate estimation techniques Interpretation of coefficients editSuppose we have a model with a single predictor that is n 1 displaystyle n 1 nbsp log E Y x a b x displaystyle log operatorname E Y mid mathbf x alpha beta x nbsp Suppose we compute the predicted values at point Y 2 x 2 displaystyle Y 2 x 2 nbsp and Y 1 x 1 displaystyle Y 1 x 1 nbsp log E Y 2 x 2 a b x 2 displaystyle log operatorname E Y 2 mid x 2 alpha beta x 2 nbsp log E Y 1 x 1 a b x 1 displaystyle log operatorname E Y 1 mid x 1 alpha beta x 1 nbsp By substracting the first from the second log E Y 2 x 2 log E Y 1 x 1 b x 2 x 1 displaystyle log operatorname E Y 2 mid x 2 log operatorname E Y 1 mid x 1 beta x 2 x 1 nbsp Suppose now that x 2 x 1 1 displaystyle x 2 x 1 1 nbsp We obtain log E Y 2 x 2 log E Y 1 x 1 b displaystyle log operatorname E Y 2 mid x 2 log operatorname E Y 1 mid x 1 beta nbsp So the coefficient of the model is to be interpreted as the increase in the logarithm of the count of the outcome variable when the independent variable increases by 1 By applying the rules of logarithms log E Y 2 x 2 E Y 1 x 1 b displaystyle log left dfrac operatorname E Y 2 mid x 2 operatorname E Y 1 mid x 1 right beta nbsp E Y 2 x 2 E Y 1 x 1 e b displaystyle dfrac operatorname E Y 2 mid x 2 operatorname E Y 1 mid x 1 e beta nbsp E Y 2 x 2 e b E Y 1 x 1 displaystyle operatorname E Y 2 mid x 2 e beta operatorname E Y 1 mid x 1 nbsp That is when the independent variable increases by 1 the outcome variable is multiplied by the exponentiated coefficient The exponentiated coefficient is also called the incidence ratio Average partial effect edit Often the object of interest is the average partial effect or average marginal effect E Y x x displaystyle frac partial E Y x partial x nbsp which is interpreted as the change in the outcome Y displaystyle Y nbsp for a one unit change in the independent variable x displaystyle x nbsp The average partial effect in the Poisson model for a continuous x displaystyle x nbsp can be shown to be 2 E Y x x e x p 8 x b displaystyle frac partial E Y x partial x exp theta mathbb x beta nbsp This can be estimated using the coefficient estimates from the Poisson model 8 a b displaystyle hat theta hat alpha hat beta nbsp with the observed values of x displaystyle mathbb x nbsp Maximum likelihood based parameter estimation editGiven a set of parameters 8 and an input vector x the mean of the predicted Poisson distribution as stated above is given by l E Y x e 8 x displaystyle lambda operatorname E Y mid x e theta x nbsp and thus the Poisson distribution s probability mass function is given by p y x 8 l y y e l e y 8 x e e 8 x y displaystyle p y mid x theta frac lambda y y e lambda frac e y theta x e e theta x y nbsp Now suppose we are given a data set consisting of m vectors x i R n 1 i 1 m displaystyle x i in mathbb R n 1 i 1 ldots m nbsp along with a set of m values y 1 y m N displaystyle y 1 ldots y m in mathbb N nbsp Then for a given set of parameters 8 the probability of attaining this particular set of data is given by p y 1 y m x 1 x m 8 i 1 m e y i 8 x i e e 8 x i y i displaystyle p y 1 ldots y m mid x 1 ldots x m theta prod i 1 m frac e y i theta x i e e theta x i y i nbsp By the method of maximum likelihood we wish to find the set of parameters 8 that makes this probability as large as possible To do this the equation is first rewritten as a likelihood function in terms of 8 L 8 X Y i 1 m e y i 8 x i e e 8 x i y i displaystyle L theta mid X Y prod i 1 m frac e y i theta x i e e theta x i y i nbsp Note that the expression on the right hand side has not actually changed A formula in this form is typically difficult to work with instead one uses the log likelihood ℓ 8 X Y log L 8 X Y i 1 m y i 8 x i e 8 x i log y i displaystyle ell theta mid X Y log L theta mid X Y sum i 1 m left y i theta x i e theta x i log y i right nbsp Notice that the parameters 8 only appear in the first two terms of each term in the summation Therefore given that we are only interested in finding the best value for 8 we may drop the yi and simply write ℓ 8 X Y i 1 m y i 8 x i e 8 x i displaystyle ell theta mid X Y sum i 1 m left y i theta x i e theta x i right nbsp To find a maximum we need to solve an equation ℓ 8 X Y 8 0 displaystyle frac partial ell theta mid X Y partial theta 0 nbsp which has no closed form solution However the negative log likelihood ℓ 8 X Y displaystyle ell theta mid X Y nbsp is a convex function and so standard convex optimization techniques such as gradient descent can be applied to find the optimal value of 8 Poisson regression in practice editPoisson regression may be appropriate when the dependent variable is a count for instance of events such as the arrival of a telephone call at a call centre 3 The events must be independent in the sense that the arrival of one call will not make another more or less likely but the probability per unit time of events is understood to be related to covariates such as time of day Exposure and offset edit Poisson regression may also be appropriate for rate data where the rate is a count of events divided by some measure of that unit s exposure a particular unit of observation 4 For example biologists may count the number of tree species in a forest events would be tree observations exposure would be unit area and rate would be the number of species per unit area Demographers may model death rates in geographic areas as the count of deaths divided by person years More generally event rates can be calculated as events per unit time which allows the observation window to vary for each unit In these examples exposure is respectively unit area person years and unit time In Poisson regression this is handled as an offset If the rate is count exposure multiplying both sides of the equation by exposure moves it to the right side of the equation When both sides of the equation are then logged the final model contains log exposure as a term that is added to the regression coefficients This logged variable log exposure is called the offset variable and enters on the right hand side of the equation with a parameter estimate for log exposure constrained to 1 log E Y x 8 x displaystyle log operatorname E Y mid x theta x nbsp which implies log E Y x exposure log E Y x log exposure 8 x log exposure displaystyle log left frac operatorname E Y mid x text exposure right log operatorname E Y mid x log text exposure theta x log text exposure nbsp Offset in the case of a GLM in R can be achieved using the offset function glm y offset log exposure x family poisson link log Overdispersion and zero inflation edit A characteristic of the Poisson distribution is that its mean is equal to its variance In certain circumstances it will be found that the observed variance is greater than the mean this is known as overdispersion and indicates that the model is not appropriate A common reason is the omission of relevant explanatory variables or dependent observations Under some circumstances the problem of overdispersion can be solved by using quasi likelihood estimation or a negative binomial distribution instead 5 6 Ver Hoef and Boveng described the difference between quasi Poisson also called overdispersion with quasi likelihood and negative binomial equivalent to gamma Poisson as follows If E Y m the quasi Poisson model assumes var Y 8m while the gamma Poisson assumes var Y m 1 km where 8 is the quasi Poisson overdispersion parameter and k is the shape parameter of the negative binomial distribution For both models parameters are estimated using iteratively reweighted least squares For quasi Poisson the weights are m 8 For negative binomial the weights are m 1 km With large m and substantial extra Poisson variation the negative binomial weights are capped at 1 k Ver Hoef and Boveng discussed an example where they selected between the two by plotting mean squared residuals vs the mean 7 Another common problem with Poisson regression is excess zeros if there are two processes at work one determining whether there are zero events or any events and a Poisson process determining how many events there are there will be more zeros than a Poisson regression would predict An example would be the distribution of cigarettes smoked in an hour by members of a group where some individuals are non smokers Other generalized linear models such as the negative binomial model or zero inflated model may function better in these cases On the contrary underdispersion may pose an issue for parameter estimation 8 Use in survival analysis edit Poisson regression creates proportional hazards models one class of survival analysis see proportional hazards models for descriptions of Cox models Extensions editRegularized Poisson regression edit When estimating the parameters for Poisson regression one typically tries to find values for 8 that maximize the likelihood of an expression of the form i 1 m log p y i e 8 x i displaystyle sum i 1 m log p y i e theta x i nbsp where m is the number of examples in the data set and p y i e 8 x i displaystyle p y i e theta x i nbsp is the probability mass function of the Poisson distribution with the mean set to e 8 x i displaystyle e theta x i nbsp Regularization can be added to this optimization problem by instead maximizing 9 i 1 m log p y i e 8 x i l 8 2 2 displaystyle sum i 1 m log p y i e theta x i lambda left theta right 2 2 nbsp for some positive constant l displaystyle lambda nbsp This technique similar to ridge regression can reduce overfitting See also editZero inflated model Poisson distribution Fixed effect Poisson model Partial likelihood methods for panel data Pooled QMLE for Poisson models Control function econometrics Endogeneity in Poisson regressionReferences edit Nelder J A 1974 Log Linear Models for Contingency Tables A Generalization of Classical Least Squares Journal of the Royal Statistical Society Series C Applied Statistics 23 3 pp 323 329 doi 10 2307 2347125 JSTOR 2347125 Wooldridge Jeffrey 2010 Econometric Analysis of Cross Section and Panel Data 2nd ed Cambridge Massachusetts The MIT Press p 726 Greene William H 2003 Econometric Analysis Fifth ed Prentice Hall pp 740 752 ISBN 978 0130661890 Frome Edward L 1983 The Analysis of Rates Using Poisson Regression Models Biometrics 39 3 pp 665 674 doi 10 2307 2531094 JSTOR 2531094 Paternoster R Brame R 1997 Multiple routes to delinquency A test of developmental and general theories of crime Criminology 35 45 84 doi 10 1111 j 1745 9125 1997 tb00870 x Berk R MacDonald J 2008 Overdispersion and Poisson regression Journal of Quantitative Criminology 24 3 269 284 doi 10 1007 s10940 008 9048 4 S2CID 121273486 Ver Hoef JAY M Boveng Peter L 2007 01 01 Quasi Poisson vs Negative Binomial Regression How should we model overdispersed count data Ecology 88 11 2766 2772 Bibcode 2007Ecol 88 2766V doi 10 1890 07 0043 1 PMID 18051645 Retrieved 2016 09 01 Schwarzenegger Rafael Quigley John Walls Lesley 23 November 2021 Is eliciting dependency worth the effort A study for the multivariate Poisson Gamma probability model Proceedings of the Institution of Mechanical Engineers Part O Journal of Risk and Reliability 237 5 5 doi 10 1177 1748006X211059417 Perperoglou Aris 2011 09 08 Fitting survival data with penalized Poisson regression Statistical Methods amp Applications 20 4 Springer Nature 451 462 doi 10 1007 s10260 011 0172 1 ISSN 1618 2510 S2CID 10883925 Further reading editCameron A C Trivedi P K 1998 Regression analysis of count data Cambridge University Press ISBN 978 0 521 63201 0 Christensen Ronald 1997 Log linear models and logistic regression Springer Texts in Statistics Second ed New York Springer Verlag ISBN 978 0 387 98247 2 MR 1633357 Gourieroux Christian 2000 The Econometrics of Discrete Positive Variables the Poisson Model Econometrics of Qualitative Dependent Variables New York Cambridge University Press pp 270 83 ISBN 978 0 521 58985 7 Greene William H 2008 Models for Event Counts and Duration Econometric Analysis 8th ed Upper Saddle River Prentice Hall pp 906 944 ISBN 978 0 13 600383 0 Hilbe J M 2007 Negative Binomial Regression Cambridge University Press ISBN 978 0 521 85772 7 Jones Andrew M et al 2013 Models for count data Applied Health Economics London Routledge pp 295 341 ISBN 978 0 415 67682 3 Myers Raymond H et al 2010 Logistic and Poisson Regression Models Generalized Linear Models With Applications in Engineering and the Sciences Second ed New Jersey Wiley pp 176 183 ISBN 978 0 470 45463 3 Retrieved from https en wikipedia org w index php title Poisson regression amp oldid 1221373274, wikipedia, wiki, book, books, library,

article

, read, download, free, free download, mp3, video, mp4, 3gp, jpg, jpeg, gif, png, picture, music, song, movie, book, game, games.