Modified Pert Simulation
Modified Pert Simulation
I) Introduction
The Pert Distribution is one of the most important distributions for practical use in business, because it is widely used to generate random values within a range in the financial models and simulations in the area of processes and analysis in general. Distribution Pert, as well as a variant called Modified Pert Distribution, is a particular case of Generalized Beta Distribution, encompassing a wide range of distributions with values within the defined range. Initially this paper establish the relationship between the Modified Pert Distribution parameters and Generalized Beta Distribution parameters, in a complete and intuitive way, with a display of Pert families in graphical mode. The article follows showing that the form normally used for simulation of the Pert Distribution is not well suited to real situations. The article concludes by proposing an alternative way to specify the Modified Pert and develops a numerical method to proceed as suggested.
This distribution has the property to display a frequency of values around the mode (most frequent value) that drops softly in both directions, in a configurable way, unlike the Triangular Distribution, which decreases sharply from mode to extremes values. This feature makes Beta to be more realistic in practical applications than Triangular Distribution, because it is rare that a practical situation shows this kind of discontinuity. The standard deviation (SD), mean and shape parameters in Beta are calculated from the following formulas: Mean = min + (max - min) * (a/(a+b)) (1) SD = Raiz(a/(a+b) * b/(a+b) * (Max-Min)2 / (a+b+1)) (2) Mode = Min + (a-1)/(a+b-2) * (Max - Min) (3) The minimum and maximum parameters are intuitive since they represent the possible interval of occurrence of values. To make sense a practical use in business, it is necessary that the shape parameters (a and b) are both above 1. However, Beta has its practical use hampered because it is not intuitive directly estimate the shape parameters a and b.
It is easy to see in this formula, the higher lambda (), the steeper the function in the mode neighbour (higher kurtosis), this feature becomes smaller the distance between mean and mode. it also makes the curve near the ends (minimum and maximum) less important and frequent. Obviously Modified Pert becomes Pert when lambda () is worth 4.
similarity between the different curves that share the same lambda, as we will see at graphic in the next section. Calling a+b as SS (shape sum) and substituting in (10) and (11) we obtain: a=(Mean-Min)/(Max-Min)*SS (13) b=(Max-Mean)/(Max-Min)*SS (14)
0,1
0,2
0,3
0,4
0,5
0,6
0,7
0,8
0,9
Note the smoothness of distribution values near mode. This is a realistic behaviour of Beta, because natural phenomena do not have, in general, sharp changes. See also as at the least central curves, there is practically no values near the end in the longer tail. When the mode is close to the middle between the two ends, the curve is symmetrical. Moving to the sides, there is a increasingly asymmetrical feature. This versatility, different than symmetrical Normal distribution, is what becomes Beta Distribution as a so useful distribution to model many metrics from business world. It's a very common situation in which one needs assign a variable within a specified range, where the mode is closer to one of the two ends. For example, we can estimate an expected sales growth
It only makes sense talk in interval probability, ie probability that the distribution takes a value within a range which is given by the integral of the curve between two points from PDF. The integral of all PDF curve must always be 1, ie 100%, that covers all possibilities
between 20% and 40%, being 25% the most likely growth ( because a greater growth would depend on more uncertain actions). It is shown for lambda values above 4 that it is always possible to find a Normal Distribution fitting a symmetrical Beta (Mode = Mean). The approximation becomes accurate, the higher lambda value. However, the standard deviation is greater than the normal Beta and increasingly moves further Beta standard deviation.2 The mean, in turn, except when the mode is equidistant from the extremes, is always shifted toward the longer tail, which "attract" the mean. Below we plot variations of two curve families. For lambda = 2, we have observed the flattening of the curves, resembling a horse saddle, with lower kurtosis. By other side, when lambda = 8, the curves are steeper, expressed by a higher kurtosis, making mode surroundings more abrupt. Note, in that case, as the middle valley is deeper than the curve with lambda = 4.
0,1
0,2
0,3
0,4
0,5
0,6
0,7
0,8
0,9
0,1
0,2
0,3
0,4
0,5
0,6
0,7
0,8
0,9
The Normal Distribution is not a proper distribution to express variables that have a limited range of values. Some authors use Truncated Normal Distribution for this, but it creates a discontinuity in the ends that is not very realistic.
DP = (Max - Min)/6 (19) In Modified Pert the above formula is: DP = (Max - Min)/(Lambda+2) (19a) It is easily shown that this is a bad approximation: Ron Davis (2008) uses the approximate formula in (19) to demonstrate a wrong relationship to the shape parameter calculation of Beta. Pleguezuele (1999) focuses a lot on above (19) approximation to deduce other properties, including until a cubic equation.
A Xo. percentile means that the stochastic variable is taked where X% of the theoretically possible values of this variable are below this value.
We've used to LABFIT curve fitting tool (www.labfit.net), a brazilian commercial product that completely automates the curve fitting from hundreds of functions in its internal database. We've taken care to avoid deformation at the fitting extremes, by considering only the points where mininum(a, b) is greater than 1.2. The tool have generated the Harris curve, function that has obtained the best fitting, according to the statistical error adopted: Y = 1/(A+B*X**C)+D (23) The statistic parameter that the tool uses to sort the fitting functions is the reduced chisquared, whose value was less than 1:1,000,000: Where O is the observed data, E is the expected or theoric data, is the standard deviation of observed data and n is the number of observations. For lambda = 6, we have A=1,6056, B=1,6056, C=1,3985 e D=0,18859 For lambda = 8, we have A=1,4257, B=1,4257, C=1,5843 e D=0,1493 Based on this parameter, the steps of determining the parameters of Beta are: 1) Calculate Factor of Beta specified by (21): Factor = ( P95 - Mode ) / (Mode - P5) 2) Find AShare based on Factor in accordance with A, B, C and D parameters obtained for the aimed lambda. If lambda = 4, the equation (23) becomes: AShare = 1/ (1,6056 + 1,6056 * Fator ^1,3985) + 0,18859 3) Calculate a and b: a = AShare * SS b = SS - b 4) Calculate Mode from Standard Beta Padro: Mode' = (a-1)/lambda 5) Calculate P5' from Standard Beta, using the inverse of the cumulative distribution function, which corresponds to BETAINV Excel function P5' = BETAINV(0,05;a;b) 6) Calculate the scale (Max-Min): Scale = (P5 - Mode ) /(P5' - Mode') 7) Calculate minimum: Min = Mode - Mode' * Scale 8) Calculate maximum Max = Min + Scale The above process was used for lambda = 4 case, to compare with the tool @Risk. Mode = 15 P5=15 P95=30
The chi-squared error was reduced by less than 0.2%, except for the maximum (0.36%), what shows to be very applicable in practical situations, with the advantage that it can be calibrated for any lambda value.
VI) Conclusion
This article fulfills the mission of explaining Pert Distribution, justifying the logic of its generalization (Pert Distribution Modified) and also clarify its relationship with the Generalized Beta Distribution, highlighting the elegance of lambda ()parameter which added with 2 equals to the sum of Beta shape parameters. The practical importance of the Beta Distribution or Pert Distribution is so great that professionals should use it with full awareness of what they are doing. This justifies the focus of this paper to give an intuitive and graphical view of Modified Pert, where each value of lambda defines a family of curves with similar shape. In follow-up it has showed the weaknesses of conventional parametric Pert specification, demonstrating the difficulty of the user select minimum and maximum values, ending with a suggestion that replaces minimum and maximum with percentiles, maintaining the Mode specification. The article concludes showing how to implement this new proposed specification using techniques of numerical analysis, that can be applied to any development environment. We have achieved our goal if the reader gets a clearer view about Beta Pert family, reducing the dry and "black box" sensation, when one studies traditional statistics textbooks.
VII) Bibliography
DAVIS, Ron. "Teaching Project Simulation in Excel Using PERT-Beta Distributions", Informs - Transactions on Education, Vol. 8, No. 3, pp. 139148, 2008 EVANS, Merran. Statistical Distributions. 3.ed. New York - John Wiley & Sons, 2000 JENSEN, Paul. "Beta Distribution", Operations Management/Industrial Engineering, 2004 http://www.me.utexas.edu/~jensen/ORMM/omie/computation/unit/project/beta.html LAB Fit Curve Fitting Software (Nonlinear Regression and Treatment of Data Program) 1999-2011 PALISADE. Risk Analysis and Simulation Add-In for Microsoft Excel, 2010 PLEGUEZUELO, Rafael et alli. "The Parameters of the Classical Pert and Assessesment of its Success", Central European Journal of Operations Research, v.7, issue 3, pp 159175, 1999 VOSE Software "Modified PERT distribution" - www.vosesoftware.com WIKIPEDIA. "Beta Distribution" - http://en.wikipedia.org/wiki/Beta_distribution WOLFRAM "Pert Distribution" http://reference.wolfram.com/mathematica/ref/PERTDistribution.html