fbpx
Wikipedia

Loss function

In mathematical optimization and decision theory, a loss function or cost function (sometimes also called an error function) [1] is a function that maps an event or values of one or more variables onto a real number intuitively representing some "cost" associated with the event. An optimization problem seeks to minimize a loss function. An objective function is either a loss function or its opposite (in specific domains, variously called a reward function, a profit function, a utility function, a fitness function, etc.), in which case it is to be maximized. The loss function could include terms from several levels of the hierarchy.

In statistics, typically a loss function is used for parameter estimation, and the event in question is some function of the difference between estimated and true values for an instance of data. The concept, as old as Laplace, was reintroduced in statistics by Abraham Wald in the middle of the 20th century.[2] In the context of economics, for example, this is usually economic cost or regret. In classification, it is the penalty for an incorrect classification of an example. In actuarial science, it is used in an insurance context to model benefits paid over premiums, particularly since the works of Harald Cramér in the 1920s.[3] In optimal control, the loss is the penalty for failing to achieve a desired value. In financial risk management, the function is mapped to a monetary loss.

Comparison of common loss functions used for regression

Examples edit

Regret edit

Leonard J. Savage argued that using non-Bayesian methods such as minimax, the loss function should be based on the idea of regret, i.e., the loss associated with a decision should be the difference between the consequences of the best decision that could have been made had the underlying circumstances been known and the decision that was in fact taken before they were known.

Quadratic loss function edit

The use of a quadratic loss function is common, for example when using least squares techniques. It is often more mathematically tractable than other loss functions because of the properties of variances, as well as being symmetric: an error above the target causes the same loss as the same magnitude of error below the target. If the target is t, then a quadratic loss function is

 

for some constant C; the value of the constant makes no difference to a decision, and can be ignored by setting it equal to 1. This is also known as the squared error loss (SEL).[1]

Many common statistics, including t-tests, regression models, design of experiments, and much else, use least squares methods applied using linear regression theory, which is based on the quadratic loss function.

The quadratic loss function is also used in linear-quadratic optimal control problems. In these problems, even in the absence of uncertainty, it may not be possible to achieve the desired values of all target variables. Often loss is expressed as a quadratic form in the deviations of the variables of interest from their desired values; this approach is tractable because it results in linear first-order conditions. In the context of stochastic control, the expected value of the quadratic form is used. The quadratic loss assigns more importance to outliers than to the true data due to its square nature, so alternatives like the Huber, Log-Cosh and SMAE losses are used when the data has many large outliers.

 
Effect of using different loss functions, when the data has outliers.

0-1 loss function edit

In statistics and decision theory, a frequently used loss function is the 0-1 loss function

 

using Iverson bracket notation, i.e. it evaluates to 1 when  , and 0 otherwise.

Constructing loss and objective functions edit

In many applications, objective functions, including loss functions as a particular case, are determined by the problem formulation. In other situations, the decision maker’s preference must be elicited and represented by a scalar-valued function (called also utility function) in a form suitable for optimization — the problem that Ragnar Frisch has highlighted in his Nobel Prize lecture.[4] The existing methods for constructing objective functions are collected in the proceedings of two dedicated conferences.[5][6] In particular, Andranik Tangian showed that the most usable objective functions — quadratic and additive — are determined by a few indifference points. He used this property in the models for constructing these objective functions from either ordinal or cardinal data that were elicited through computer-assisted interviews with decision makers.[7][8] Among other things, he constructed objective functions to optimally distribute budgets for 16 Westfalian universities[9] and the European subsidies for equalizing unemployment rates among 271 German regions.[10]

Expected loss edit

In some contexts, the value of the loss function itself is a random quantity because it depends on the outcome of a random variable X.

Statistics edit

Both frequentist and Bayesian statistical theory involve making a decision based on the expected value of the loss function; however, this quantity is defined differently under the two paradigms.

Frequentist expected loss edit

We first define the expected loss in the frequentist context. It is obtained by taking the expected value with respect to the probability distribution, Pθ, of the observed data, X. This is also referred to as the risk function[11][12][13][14] of the decision rule δ and the parameter θ. Here the decision rule depends on the outcome of X. The risk function is given by:

 

Here, θ is a fixed but possibly unknown state of nature, X is a vector of observations stochastically drawn from a population,   is the expectation over all population values of X, dPθ is a probability measure over the event space of X (parametrized by θ) and the integral is evaluated over the entire support of X.

Bayes Risk edit

In a Bayesian approach, the expectation is calculated using the prior distribution π* of the parameter θ:

 

where m(x) is known as the predictive likelihood wherein θ has been "integrated out," π* (θ | x) is the posterior distribution, and the order of integration has been changed. One then should choose the action a* which minimises this expected loss, which is referred to as Bayes Risk [12]. In the latter equation, the integrand inside dx is known as the Posterior Risk, and minimising it with respect to decision a also minimizes the overall Bayes Risk. This optimal decision, a* is known as the Bayes (decision) Rule - it minimises the average loss over all possible states of nature θ, over all possible (probability-weighted) data outcomes. One advantage of the Bayesian approach is to that one need only choose the optimal action under the actual observed data to obtain a uniformly optimal one, whereas choosing the actual frequentist optimal decision rule as a function of all possible observations, is a much more difficult problem. Of equal importance though, the Bayes Rule reflects consideration of loss outcomes under different states of nature, θ.

Examples in statistics edit

  • For a scalar parameter θ, a decision function whose output   is an estimate of θ, and a quadratic loss function (squared error loss)
     
    the risk function becomes the mean squared error of the estimate,
     
    An Estimator found by minimizing the Mean squared error estimates the Posterior distribution's mean.
  • In density estimation, the unknown parameter is probability density itself. The loss function is typically chosen to be a norm in an appropriate function space. For example, for L2 norm,
     
    the risk function becomes the mean integrated squared error
     

Economic choice under uncertainty edit

In economics, decision-making under uncertainty is often modelled using the von Neumann–Morgenstern utility function of the uncertain variable of interest, such as end-of-period wealth. Since the value of this variable is uncertain, so is the value of the utility function; it is the expected value of utility that is maximized.

Decision rules edit

A decision rule makes a choice using an optimality criterion. Some commonly used criteria are:

  • Minimax: Choose the decision rule with the lowest worst loss — that is, minimize the worst-case (maximum possible) loss:
     
  • Invariance: Choose the decision rule which satisfies an invariance requirement.
  • Choose the decision rule with the lowest average loss (i.e. minimize the expected value of the loss function):
     

Selecting a loss function edit

Sound statistical practice requires selecting an estimator consistent with the actual acceptable variation experienced in the context of a particular applied problem. Thus, in the applied use of loss functions, selecting which statistical method to use to model an applied problem depends on knowing the losses that will be experienced from being wrong under the problem's particular circumstances.[15]

A common example involves estimating "location". Under typical statistical assumptions, the mean or average is the statistic for estimating location that minimizes the expected loss experienced under the squared-error loss function, while the median is the estimator that minimizes expected loss experienced under the absolute-difference loss function. Still different estimators would be optimal under other, less common circumstances.

In economics, when an agent is risk neutral, the objective function is simply expressed as the expected value of a monetary quantity, such as profit, income, or end-of-period wealth. For risk-averse or risk-loving agents, loss is measured as the negative of a utility function, and the objective function to be optimized is the expected value of utility.

Other measures of cost are possible, for example mortality or morbidity in the field of public health or safety engineering.

For most optimization algorithms, it is desirable to have a loss function that is globally continuous and differentiable.

Two very commonly used loss functions are the squared loss,  , and the absolute loss,  . However the absolute loss has the disadvantage that it is not differentiable at  . The squared loss has the disadvantage that it has the tendency to be dominated by outliers—when summing over a set of  's (as in  ), the final sum tends to be the result of a few particularly large a-values, rather than an expression of the average a-value.

The choice of a loss function is not arbitrary. It is very restrictive and sometimes the loss function may be characterized by its desirable properties.[16] Among the choice principles are, for example, the requirement of completeness of the class of symmetric statistics in the case of i.i.d. observations, the principle of complete information, and some others.

W. Edwards Deming and Nassim Nicholas Taleb argue that empirical reality, not nice mathematical properties, should be the sole basis for selecting loss functions, and real losses often are not mathematically nice and are not differentiable, continuous, symmetric, etc. For example, a person who arrives before a plane gate closure can still make the plane, but a person who arrives after can not, a discontinuity and asymmetry which makes arriving slightly late much more costly than arriving slightly early. In drug dosing, the cost of too little drug may be lack of efficacy, while the cost of too much may be tolerable toxicity, another example of asymmetry. Traffic, pipes, beams, ecologies, climates, etc. may tolerate increased load or stress with little noticeable change up to a point, then become backed up or break catastrophically. These situations, Deming and Taleb argue, are common in real-life problems, perhaps more common than classical smooth, continuous, symmetric, differentials cases.[17]

See also edit

References edit

  1. ^ a b Hastie, Trevor; Tibshirani, Robert; Friedman, Jerome H. (2001). The Elements of Statistical Learning. Springer. p. 18. ISBN 0-387-95284-5.
  2. ^ Wald, A. (1950). Statistical Decision Functions. Wiley.
  3. ^ Cramér, H. (1930). On the mathematical theory of risk. Centraltryckeriet.
  4. ^ Frisch, Ragnar (1969). "From utopian theory to practical applications: the case of econometrics". The Nobel Prize–Prize Lecture. Retrieved 15 February 2021.
  5. ^ Tangian, Andranik; Gruber, Josef (1997). Constructing Scalar-Valued Objective Functions. Proceedings of the Third International Conference on Econometric Decision Models: Constructing Scalar-Valued Objective Functions, University of Hagen, held in Katholische Akademie Schwerte September 5–8, 1995. Lecture Notes in Economics and Mathematical Systems. Vol. 453. Berlin: Springer. doi:10.1007/978-3-642-48773-6. ISBN 978-3-540-63061-6.
  6. ^ Tangian, Andranik; Gruber, Josef (2002). Constructing and Applying Objective Functions. Proceedings of the Fourth International Conference on Econometric Decision Models Constructing and Applying Objective Functions, University of Hagen, held in Haus Nordhelle, August, 28 — 31, 2000. Lecture Notes in Economics and Mathematical Systems. Vol. 510. Berlin: Springer. doi:10.1007/978-3-642-56038-5. ISBN 978-3-540-42669-1.
  7. ^ Tangian, Andranik (2002). "Constructing a quasi-concave quadratic objective function from interviewing a decision maker". European Journal of Operational Research. 141 (3): 608–640. doi:10.1016/S0377-2217(01)00185-0. S2CID 39623350.
  8. ^ Tangian, Andranik (2004). "A model for ordinally constructing additive objective functions". European Journal of Operational Research. 159 (2): 476–512. doi:10.1016/S0377-2217(03)00413-2. S2CID 31019036.
  9. ^ Tangian, Andranik (2004). "Redistribution of university budgets with respect to the status quo". European Journal of Operational Research. 157 (2): 409–428. doi:10.1016/S0377-2217(03)00271-6.
  10. ^ Tangian, Andranik (2008). "Multi-criteria optimization of regional employment policy: A simulation analysis for Germany". Review of Urban and Regional Development. 20 (2): 103–122. doi:10.1111/j.1467-940X.2008.00144.x.
  11. ^ Nikulin, M.S. (2001) [1994], "Risk of a statistical procedure", Encyclopedia of Mathematics, EMS Press
  12. ^ Berger, James O. (1985). Statistical decision theory and Bayesian Analysis (2nd ed.). New York: Springer-Verlag. Bibcode:1985sdtb.book.....B. ISBN 978-0-387-96098-2. MR 0804611.
  13. ^ DeGroot, Morris (2004) [1970]. Optimal Statistical Decisions. Wiley Classics Library. ISBN 978-0-471-68029-1. MR 2288194.
  14. ^ Robert, Christian P. (2007). The Bayesian Choice. Springer Texts in Statistics (2nd ed.). New York: Springer. doi:10.1007/0-387-71599-1. ISBN 978-0-387-95231-4. MR 1835885.
  15. ^ Pfanzagl, J. (1994). Parametric Statistical Theory. Berlin: Walter de Gruyter. ISBN 978-3-11-013863-4.
  16. ^ Detailed information on mathematical principles of the loss function choice is given in Chapter 2 of the book Klebanov, B.; Rachev, Svetlozat T.; Fabozzi, Frank J. (2009). Robust and Non-Robust Models in Statistics. New York: Nova Scientific Publishers, Inc. (and references there).
  17. ^ Deming, W. Edwards (2000). Out of the Crisis. The MIT Press. ISBN 9780262541152.

Further reading edit

  • Aretz, Kevin; Bartram, Söhnke M.; Pope, Peter F. (April–June 2011). "Asymmetric Loss Functions and the Rationality of Expected Stock Returns" (PDF). International Journal of Forecasting. 27 (2): 413–437. doi:10.1016/j.ijforecast.2009.10.008. SSRN 889323.
  • Berger, James O. (1985). Statistical decision theory and Bayesian Analysis (2nd ed.). New York: Springer-Verlag. Bibcode:1985sdtb.book.....B. ISBN 978-0-387-96098-2. MR 0804611.
  • Cecchetti, S. (2000). "Making monetary policy: Objectives and rules". Oxford Review of Economic Policy. 16 (4): 43–59. doi:10.1093/oxrep/16.4.43.
  • Horowitz, Ann R. (1987). "Loss functions and public policy". Journal of Macroeconomics. 9 (4): 489–504. doi:10.1016/0164-0704(87)90016-4.
  • Waud, Roger N. (1976). "Asymmetric Policymaker Utility Functions and Optimal Policy under Uncertainty". Econometrica. 44 (1): 53–66. doi:10.2307/1911380. JSTOR 1911380.

loss, function, mathematical, optimization, decision, theory, loss, function, cost, function, sometimes, also, called, error, function, function, that, maps, event, values, more, variables, onto, real, number, intuitively, representing, some, cost, associated,. In mathematical optimization and decision theory a loss function or cost function sometimes also called an error function 1 is a function that maps an event or values of one or more variables onto a real number intuitively representing some cost associated with the event An optimization problem seeks to minimize a loss function An objective function is either a loss function or its opposite in specific domains variously called a reward function a profit function a utility function a fitness function etc in which case it is to be maximized The loss function could include terms from several levels of the hierarchy In statistics typically a loss function is used for parameter estimation and the event in question is some function of the difference between estimated and true values for an instance of data The concept as old as Laplace was reintroduced in statistics by Abraham Wald in the middle of the 20th century 2 In the context of economics for example this is usually economic cost or regret In classification it is the penalty for an incorrect classification of an example In actuarial science it is used in an insurance context to model benefits paid over premiums particularly since the works of Harald Cramer in the 1920s 3 In optimal control the loss is the penalty for failing to achieve a desired value In financial risk management the function is mapped to a monetary loss Comparison of common loss functions used for regression Contents 1 Examples 1 1 Regret 1 2 Quadratic loss function 1 3 0 1 loss function 2 Constructing loss and objective functions 3 Expected loss 3 1 Statistics 3 1 1 Frequentist expected loss 3 1 2 Bayes Risk 3 1 3 Examples in statistics 3 2 Economic choice under uncertainty 4 Decision rules 5 Selecting a loss function 6 See also 7 References 8 Further readingExamples editRegret edit Main article Regret decision theory Leonard J Savage argued that using non Bayesian methods such as minimax the loss function should be based on the idea of regret i e the loss associated with a decision should be the difference between the consequences of the best decision that could have been made had the underlying circumstances been known and the decision that was in fact taken before they were known Quadratic loss function edit The use of a quadratic loss function is common for example when using least squares techniques It is often more mathematically tractable than other loss functions because of the properties of variances as well as being symmetric an error above the target causes the same loss as the same magnitude of error below the target If the target is t then a quadratic loss function is l x C t x 2 displaystyle lambda x C t x 2 nbsp for some constant C the value of the constant makes no difference to a decision and can be ignored by setting it equal to 1 This is also known as the squared error loss SEL 1 Many common statistics including t tests regression models design of experiments and much else use least squares methods applied using linear regression theory which is based on the quadratic loss function The quadratic loss function is also used in linear quadratic optimal control problems In these problems even in the absence of uncertainty it may not be possible to achieve the desired values of all target variables Often loss is expressed as a quadratic form in the deviations of the variables of interest from their desired values this approach is tractable because it results in linear first order conditions In the context of stochastic control the expected value of the quadratic form is used The quadratic loss assigns more importance to outliers than to the true data due to its square nature so alternatives like the Huber Log Cosh and SMAE losses are used when the data has many large outliers nbsp Effect of using different loss functions when the data has outliers 0 1 loss function edit In statistics and decision theory a frequently used loss function is the 0 1 loss function L y y y y displaystyle L hat y y left hat y neq y right nbsp using Iverson bracket notation i e it evaluates to 1 when y y displaystyle hat y neq y nbsp and 0 otherwise Constructing loss and objective functions editSee also Scoring rule In many applications objective functions including loss functions as a particular case are determined by the problem formulation In other situations the decision maker s preference must be elicited and represented by a scalar valued function called also utility function in a form suitable for optimization the problem that Ragnar Frisch has highlighted in his Nobel Prize lecture 4 The existing methods for constructing objective functions are collected in the proceedings of two dedicated conferences 5 6 In particular Andranik Tangian showed that the most usable objective functions quadratic and additive are determined by a few indifference points He used this property in the models for constructing these objective functions from either ordinal or cardinal data that were elicited through computer assisted interviews with decision makers 7 8 Among other things he constructed objective functions to optimally distribute budgets for 16 Westfalian universities 9 and the European subsidies for equalizing unemployment rates among 271 German regions 10 Expected loss editSee also Empirical risk minimization In some contexts the value of the loss function itself is a random quantity because it depends on the outcome of a random variable X Statistics edit Both frequentist and Bayesian statistical theory involve making a decision based on the expected value of the loss function however this quantity is defined differently under the two paradigms Frequentist expected loss edit We first define the expected loss in the frequentist context It is obtained by taking the expected value with respect to the probability distribution P8 of the observed data X This is also referred to as the risk function 11 12 13 14 of the decision rule d and the parameter 8 Here the decision rule depends on the outcome of X The risk function is given by R 8 d E 8 L 8 d X X L 8 d x d P 8 x displaystyle R theta delta operatorname E theta L big theta delta X big int X L big theta delta x big mathrm d P theta x nbsp Here 8 is a fixed but possibly unknown state of nature X is a vector of observations stochastically drawn from a population E 8 displaystyle operatorname E theta nbsp is the expectation over all population values of X dP8 is a probability measure over the event space of X parametrized by 8 and the integral is evaluated over the entire support of X Bayes Risk edit In a Bayesian approach the expectation is calculated using the prior distribution p of the parameter 8 r p a 8 X L 8 a x d P x 8 d p 8 X 8 L 8 a x d p 8 x d M x displaystyle rho pi a int Theta int mathbf X L theta a mathbf x mathrm d P mathbf x vert theta mathrm d pi theta int mathbf X int Theta L theta a mathbf x mathrm d pi theta vert mathbf x mathrm d M mathbf x nbsp where m x is known as the predictive likelihood wherein 8 has been integrated out p 8 x is the posterior distribution and the order of integration has been changed One then should choose the action a which minimises this expected loss which is referred to as Bayes Risk 12 In the latter equation the integrand inside dx is known as the Posterior Risk and minimising it with respect to decision a also minimizes the overall Bayes Risk This optimal decision a is known as the Bayes decision Rule it minimises the average loss over all possible states of nature 8 over all possible probability weighted data outcomes One advantage of the Bayesian approach is to that one need only choose the optimal action under the actual observed data to obtain a uniformly optimal one whereas choosing the actual frequentist optimal decision rule as a function of all possible observations is a much more difficult problem Of equal importance though the Bayes Rule reflects consideration of loss outcomes under different states of nature 8 Examples in statistics edit For a scalar parameter 8 a decision function whose output 8 displaystyle hat theta nbsp is an estimate of 8 and a quadratic loss function squared error loss L 8 8 8 8 2 displaystyle L theta hat theta theta hat theta 2 nbsp the risk function becomes the mean squared error of the estimate R 8 8 E 8 8 8 2 displaystyle R theta hat theta operatorname E theta theta hat theta 2 nbsp An Estimator found by minimizing the Mean squared error estimates the Posterior distribution s mean In density estimation the unknown parameter is probability density itself The loss function is typically chosen to be a norm in an appropriate function space For example for L2 norm L f f f f 2 2 displaystyle L f hat f f hat f 2 2 nbsp the risk function becomes the mean integrated squared error R f f E f f 2 displaystyle R f hat f operatorname E f hat f 2 nbsp Economic choice under uncertainty edit In economics decision making under uncertainty is often modelled using the von Neumann Morgenstern utility function of the uncertain variable of interest such as end of period wealth Since the value of this variable is uncertain so is the value of the utility function it is the expected value of utility that is maximized Decision rules editA decision rule makes a choice using an optimality criterion Some commonly used criteria are Minimax Choose the decision rule with the lowest worst loss that is minimize the worst case maximum possible loss a r g m i n d max 8 8 R 8 d displaystyle underset delta operatorname arg min max theta in Theta R theta delta nbsp Invariance Choose the decision rule which satisfies an invariance requirement Choose the decision rule with the lowest average loss i e minimize the expected value of the loss function a r g m i n d E 8 8 R 8 d a r g m i n d 8 8 R 8 d p 8 d 8 displaystyle underset delta operatorname arg min operatorname E theta in Theta R theta delta underset delta operatorname arg min int theta in Theta R theta delta p theta d theta nbsp Selecting a loss function editSound statistical practice requires selecting an estimator consistent with the actual acceptable variation experienced in the context of a particular applied problem Thus in the applied use of loss functions selecting which statistical method to use to model an applied problem depends on knowing the losses that will be experienced from being wrong under the problem s particular circumstances 15 A common example involves estimating location Under typical statistical assumptions the mean or average is the statistic for estimating location that minimizes the expected loss experienced under the squared error loss function while the median is the estimator that minimizes expected loss experienced under the absolute difference loss function Still different estimators would be optimal under other less common circumstances In economics when an agent is risk neutral the objective function is simply expressed as the expected value of a monetary quantity such as profit income or end of period wealth For risk averse or risk loving agents loss is measured as the negative of a utility function and the objective function to be optimized is the expected value of utility Other measures of cost are possible for example mortality or morbidity in the field of public health or safety engineering For most optimization algorithms it is desirable to have a loss function that is globally continuous and differentiable Two very commonly used loss functions are the squared loss L a a 2 displaystyle L a a 2 nbsp and the absolute loss L a a displaystyle L a a nbsp However the absolute loss has the disadvantage that it is not differentiable at a 0 displaystyle a 0 nbsp The squared loss has the disadvantage that it has the tendency to be dominated by outliers when summing over a set of a displaystyle a nbsp s as in i 1 n L a i textstyle sum i 1 n L a i nbsp the final sum tends to be the result of a few particularly large a values rather than an expression of the average a value The choice of a loss function is not arbitrary It is very restrictive and sometimes the loss function may be characterized by its desirable properties 16 Among the choice principles are for example the requirement of completeness of the class of symmetric statistics in the case of i i d observations the principle of complete information and some others W Edwards Deming and Nassim Nicholas Taleb argue that empirical reality not nice mathematical properties should be the sole basis for selecting loss functions and real losses often are not mathematically nice and are not differentiable continuous symmetric etc For example a person who arrives before a plane gate closure can still make the plane but a person who arrives after can not a discontinuity and asymmetry which makes arriving slightly late much more costly than arriving slightly early In drug dosing the cost of too little drug may be lack of efficacy while the cost of too much may be tolerable toxicity another example of asymmetry Traffic pipes beams ecologies climates etc may tolerate increased load or stress with little noticeable change up to a point then become backed up or break catastrophically These situations Deming and Taleb argue are common in real life problems perhaps more common than classical smooth continuous symmetric differentials cases 17 See also editBayesian regret Loss functions for classification Discounted maximum loss Hinge loss Scoring rule Statistical riskReferences edit a b Hastie Trevor Tibshirani Robert Friedman Jerome H 2001 The Elements of Statistical Learning Springer p 18 ISBN 0 387 95284 5 Wald A 1950 Statistical Decision Functions Wiley Cramer H 1930 On the mathematical theory of risk Centraltryckeriet Frisch Ragnar 1969 From utopian theory to practical applications the case of econometrics The Nobel Prize Prize Lecture Retrieved 15 February 2021 Tangian Andranik Gruber Josef 1997 Constructing Scalar Valued Objective Functions Proceedings of the Third International Conference on Econometric Decision Models Constructing Scalar Valued Objective Functions University of Hagen held in Katholische Akademie Schwerte September 5 8 1995 Lecture Notes in Economics and Mathematical Systems Vol 453 Berlin Springer doi 10 1007 978 3 642 48773 6 ISBN 978 3 540 63061 6 Tangian Andranik Gruber Josef 2002 Constructing and Applying Objective Functions Proceedings of the Fourth International Conference on Econometric Decision Models Constructing and Applying Objective Functions University of Hagen held in Haus Nordhelle August 28 31 2000 Lecture Notes in Economics and Mathematical Systems Vol 510 Berlin Springer doi 10 1007 978 3 642 56038 5 ISBN 978 3 540 42669 1 Tangian Andranik 2002 Constructing a quasi concave quadratic objective function from interviewing a decision maker European Journal of Operational Research 141 3 608 640 doi 10 1016 S0377 2217 01 00185 0 S2CID 39623350 Tangian Andranik 2004 A model for ordinally constructing additive objective functions European Journal of Operational Research 159 2 476 512 doi 10 1016 S0377 2217 03 00413 2 S2CID 31019036 Tangian Andranik 2004 Redistribution of university budgets with respect to the status quo European Journal of Operational Research 157 2 409 428 doi 10 1016 S0377 2217 03 00271 6 Tangian Andranik 2008 Multi criteria optimization of regional employment policy A simulation analysis for Germany Review of Urban and Regional Development 20 2 103 122 doi 10 1111 j 1467 940X 2008 00144 x Nikulin M S 2001 1994 Risk of a statistical procedure Encyclopedia of Mathematics EMS Press Berger James O 1985 Statistical decision theory and Bayesian Analysis 2nd ed New York Springer Verlag Bibcode 1985sdtb book B ISBN 978 0 387 96098 2 MR 0804611 DeGroot Morris 2004 1970 Optimal Statistical Decisions Wiley Classics Library ISBN 978 0 471 68029 1 MR 2288194 Robert Christian P 2007 The Bayesian Choice Springer Texts in Statistics 2nd ed New York Springer doi 10 1007 0 387 71599 1 ISBN 978 0 387 95231 4 MR 1835885 Pfanzagl J 1994 Parametric Statistical Theory Berlin Walter de Gruyter ISBN 978 3 11 013863 4 Detailed information on mathematical principles of the loss function choice is given in Chapter 2 of the book Klebanov B Rachev Svetlozat T Fabozzi Frank J 2009 Robust and Non Robust Models in Statistics New York Nova Scientific Publishers Inc and references there Deming W Edwards 2000 Out of the Crisis The MIT Press ISBN 9780262541152 Further reading editAretz Kevin Bartram Sohnke M Pope Peter F April June 2011 Asymmetric Loss Functions and the Rationality of Expected Stock Returns PDF International Journal of Forecasting 27 2 413 437 doi 10 1016 j ijforecast 2009 10 008 SSRN 889323 Berger James O 1985 Statistical decision theory and Bayesian Analysis 2nd ed New York Springer Verlag Bibcode 1985sdtb book B ISBN 978 0 387 96098 2 MR 0804611 Cecchetti S 2000 Making monetary policy Objectives and rules Oxford Review of Economic Policy 16 4 43 59 doi 10 1093 oxrep 16 4 43 Horowitz Ann R 1987 Loss functions and public policy Journal of Macroeconomics 9 4 489 504 doi 10 1016 0164 0704 87 90016 4 Waud Roger N 1976 Asymmetric Policymaker Utility Functions and Optimal Policy under Uncertainty Econometrica 44 1 53 66 doi 10 2307 1911380 JSTOR 1911380 Retrieved from https en wikipedia org w index php title Loss function amp oldid 1223442039, wikipedia, wiki, book, books, library,

article

, read, download, free, free download, mp3, video, mp4, 3gp, jpg, jpeg, gif, png, picture, music, song, movie, book, game, games.