CHAPTER-6 FORECASTING TECHNIQUES - Formatted PDF
CHAPTER-6 FORECASTING TECHNIQUES - Formatted PDF
CHAPTER-6 FORECASTING TECHNIQUES - Formatted PDF
6 : FORECASTING TECHNIQUES
What people do
examples: Testing Marketing Reaction tests
The data from past activities are cheapest to collect but may be outdated and past behavior is not necessarily indicative of future behavior.
Data derived from surveys are more expensive to obtain and needs critical appraisal - intentions as expressed in surveys and questionnaires are not always translated into action.
Finally, the data derived from recording what people actually do are the most reliable but also the most expensive and occasionally it is not feasible for the data to be obtained. Forecasting is a process of estimating a future event by casting forward past data. The past data are systematically combined in a predetermined way to obtain the estimate of the future. Prediction is a process of estimating a future event based on subjective considerations other than just past data; these subjective considerations need not be combined in a predetermined way. Thus forecast is an estimate of future values of certain specified indicators relating to a decisional/planning situation, In some situations forecast regarding single indicator is sufficient, where as, in some other situations
142
forecast regarding several indicators is necessary. The number of indicators and the degree of detail required in the forecast depends on the intended use of the forecast. There are two basic reasons for the need for forecast in any field. 1. Purpose Any action devised in the PRESENT to take care of some contingency accruing out of a situation or set of conditions set in future. These future conditions offer a purpose / target to be achieved so as to take advantage of or to minimize the impact of (if the foreseen conditions are adverse in nature) these future conditions. 2. Time To prepare plan, to organize resources for its implementation, to implement; and complete the plan; all these need time as a resource. Some situations need very little time, some other situations need several years of time. Therefore, if future forecast is available in advance, appropriate actions can be planned and implemented intime. 6.2 Some Applications of Forecasting: Forecasts are vital to every business organization and for every significant management decision.
We now will discuss some areas in which forecasting is widely used. Sales Forecasting
Any company in selling goods needs to forecast the demand for those goods. Manufactures need to know how much to produce. Wholesalers and retailers need to know now much to stock. Substantially understanding demand is likely to lead to many lost sales, unhappy customers, and perhaps allowing the competition to gain the upper hand in the marketplace. On the other hand, significantly overestimating demand also is very costly due to (1) excessive inventory costs, (2) forced price reductions, (3) unneeded production or storage capacity, and (4) lost opportunities to market more profitable goods. Successful marketing and production managers understand very well the importance of obtaining good sales forecasts. For the production managers these sales forecast are essential to help trigger the forecast for production which in turn triggers the forecasting of the raw materials needed for production. Forecasting the need for raw materials and spare parts Although effective sales forecasting is a key for virtually any company, some organizations must rely on other types of forecasts as well. A prime example involves forecasts of the need for raw materials and spare parts. Many companies need to maintain an inventory of spare parts to enable them to quickly repair either own equipment or their products sold or leased to customers.
143
Ministry of Petroleum
The officials of this crucial ministry have to make decisions on the quantum of purchase to be made for various types of crude oils and petroleum products from different sources across the oil-exporting nations for the next few years. They also have to decide as to how much money has to be spent on development of indigenous sources. These decisions involve/need information on the future demand of different types of petroleum products and the likely change in the prices and the availability of crude oil and petroleum products in the country and the oil-exporting nations. This takes us back to the filed of forecasting.
Department of Technology
The top officials of this department want to make decisions on the type of information technology to recommend to the union government for the next decade. But they are not very clear on the directions which will be taken by this year rapidly changing field. They decided to entrust this task to the information system group of a national management institute. The team leader decided to forecast the changing technology in this area with the help of a team of information technology experts throughout the country. This is again a forecasting problem although of a much different type. This field of forecasting is known as technological forecasting. Forecasting is the basis of corporate long-run planning. In the functional areas of finance and accounting, forecasts provide the basis for budgetary planning and cost control. Marketing relies on sales forecasting to plan new products, compensate sales personnel, and make other key decisions. Productions and operations personnel use forecasts to make periodic decisions involving process selection, capacity planning, and facility layout, as well as for continual decisions about production planning, scheduling, and inventory. As we have observed in the aforementioned examples, forecasting forms an important input in many business and social science-related situations.
144
145
Forecasting Techniques
Qualitative Methods
Time Series Methods Nave Methods Moving Average Exponential Smoothing Trend Projections
Grass Roots Market Research Panel Consensus Historical Analogy Delphi Method
Fig. 6.1 Different Forecasting Methods 6.4 General Steps In The Forecasting Process
The general steps in the forecasting process are as follows: 1) Identify the general need 2) Select the Period (Time Horizon) of Forecast 3) Select Forecast Model to be used: For this, knowledge of various forecasting models, in which situations these are applicable, how reliable each one of them is; what type of data is required. On these considerations; one or more models can be selected. 4) Data Collection: With reference to various indicators identified-collect data from various appropriate sources-data which is compatible with the model(s) selected in step(3). Data should also go back that much in past, which meets the requirements of the model. 5) Prepare forecast: Apply the model using the data collected and calculate the value of the forecast. 6) Evaluate: The forecast obtained through any of the model should not be used, as it is, blindly. It should be evaluated in terms of confidence interval usually all good forecast models have methods of calculating upper value and the lower value within which the given forecast is expected to remain with
146
certain specified level of probability. It can also be evaluated from logical point of view whether the value obtained is logically feasible? It can also be evaluated against some related variable or phenomenon. Thus, it is possible, some times advisable to modify the statistically forecasted value based on evaluation.
Market Research: Firms often hire outside companies that specialize in market research to conduct this type of forecasting. You may have been involved in market surveys through a marketing class. Certainly you have not escaped telephone calls asking you about product preferences, your income, habits, and so on. Market research is used mostly for product research in the sense of looking for new product ideas, likes and dislikes about existing products, which competitive products within a particular class are preferred, and so on. Again, the data collection methods are primarily surveys and interviews.
Panel Consensus: In a panel consensus, the idea that two heads are better than one is extrapolated to the idea that a panel of people from a variety of positions can develop a more reliable forecast than a narrower group. Panel forecasts are developed through open meetings with free exchange of ideas form all levels of management and individuals. The difficulty with this open style is that lower employee levels are intimidated by higher levels of management. For example, a salesperson in a particular product line may have a good estimate of future product demand but may not speak up to refute a much different estimate given by the vice president of marketing. The Delphi technique (which we discuss shortly) was developed to try to correct this impairment to free exchange. When decisions in forecasting are at a broader, higher level (as when introducing a new product line or concerning strategic product decisions such as new marketing areas) the term executive judgment is generally used. The term is self-explanatory: a higher level of management is involved.
Historical Analogy: The historical analogy method is used for forecasting the demand for a product or service under the circumstances that no past demand data are available. This may specially be true if the product happens to be new for the organization. However, the organization may have marketed product(s) earlier which may be similar in some features to the new product. In such circumstances, the marketing personnel use the historical analogy between the two products and derive the demand for the new product using the historical data of the earlier
147
product. The limitations of this method are quite apparent. They include the questionable assumption of the similarity of demand behaviors, the changed marketing conditions, and the impact of the substitutability factor on the demand. Delphi Method: As we mentioned under panel consensus, a statement or opinion of a higher-level person will likely be weighted more than that of a lower-level person. The worst case in where lower level people feel threatened and do not contribute their true beliefs. To prevent this problem, the Delphi method conceals the identity of the individuals participating in the study. Everyone has the same weight. A moderator creates a questionnaire and distributes it to participants. Their responses are summed and given back to the entire group along with a new set of questions. The Delphi method was developed by the Rand Corporation in the 1950s. The step-by-step procedure is 1) Choose the experts to participate. There should be a variety of knowledgeable people in different areas. 2) Through a questionnaire (or e-mail), obtain forecasts (and any premises or qualification captions for the forecasts) from all participants. 3) Summarize the results and redistribute them to the participants along with appropriate new questions. 4) Summarize again, refining forecasts and conditions, and again develop new questions. 5) Repeat Step 4 if necessary. Distribute the final results to all participants. The Delphi technique can usually achieve satisfactory results in three rounds. The time required is a function of the number of participants, how much work is involved for them to develop their forecasts, and their speed in responding. We now discuss the quantitative methods of forecasting
148
1) Trends: These relate to the long-term persistent movements/tendencies/changes in data like price increases, population growth, and decline in market shares. An example of a decreasing linear trend is shown in Fig. 6.2
Market share
Market share
Time
Time
Market share
Market share
Time
(2) Seasonal variations: There could be periodic, repetitive variations in time-series which occur because of buying or consuming patterns and social habits, during different times of a year. The demand for products like soft drinks, woolens and refrigerators, also exhibits seasonal variations. An illustration of seasonal variations is provided in Fig. 6.3
Sales
Sales
Time
(3) Cyclical variations: These refer to the variations in time series which arise out of the phenomenon of business cycles. The business cycle refers to the periods of expansion followed by periods of contraction.
149
The period of a business cycle may vary from one year to thirty years. The duration and the level of resulting demand variation due to business cycles are quite difficult to predict. (4) Random or irregular variations: These refer to the erratic fluctuations in the data which cannot be attributed to the trend, seasonal or cyclical factors. In many cases, the root cause of these variations can be isolated only after a detailed analysis of the data and the accompanying explanations, if any. Such variations can be due to a wide variety of factors like sudden weather changes, strike or a communal clash. Since these are truly random in nature, their future occurrence and the resulting impact on demand are difficult to predict. The effect of these events can be eliminated by smoothing the time series data. A graphical example of the random variations is given in Fig. 6.4.
Demand
Time Fig. 6.4 The historical time series, as obtained from the past records, contains all the four factors described earlier. One of the major tasks is to isolate each of the components, as elegantly as possible. This process of desegregating the time series is called decomposition. The main objective here is to isolate the trend in time series by eliminating the other components. The trend line can then be used for projecting into the future. The effect of the other components on the forecast can be brought about by adding the corresponding cyclical, seasonal and irregular variations. In most short-term forecasting situations the elimination of the cyclical component is not attempted. Also, it is assumed that the irregular variations are small and tend to cancel each other out over time. Thus, the major objective, in most cases, is to seek the removal of seasonal variations from the time series. This process is known as deseasonalization of the time series data. There are a number of time-series-based methods. Not all of them involve explicit decomposition of the data. The methods extend from mathematically very simple to fairly complicated ones. Let us also see some of the time series models are based on the trend lines of the data. The constant-level models assume no trend at all in the data. The time series is assumed to have a relatively constant mean. The forecast for any period in the future is a horizontal line. Linear trend models forecast a straight-line trend for any period in the future. Refer Fig. 6.2 Exponential trends forecast that the amount of growth will increase continuously. At long horizons, these trends become unrealistic. Thus models with a damped trend have been developed for longer-range forecasting. The amount of trend extrapolated declines each period in a damped trend model. Eventually, the trend dies out and the forecasts become a horizontal line. Refer Fig 6.2 The additive seasonal pattern assumes that the seasonal fluctuations are of constant size.
150
The multiplicative pattern assumes that the seasonal fluctuations are proportional to the data. As the trend increases, the seasonal fluctuations get larger. Refer Fig 6.3 6.6.1 The Naive Methods The forecasting methods covered under this category are mathematically very simple. The simplest of them uses the most recently observed value in the time series as the forecast for the next period. Effectively, this implies that all prior observations are not considered. Another method of this type is the free-hand projection method. This includes the plotting of the data series on a graph paper and fitting a free-hand curve to it. This curve is extended into the future for deriving the forecasts. The semi-average projection method is another naive method. Here, the time-series is divided into two equal halves, averages calculated for both, and a line drawn connecting the two semi averages. This line is projected into the future and the forecasts are developed. Illustration 6.1: Consider the demand data for 8 years as given. Use these data for forecasting the demand for the year 1991 using the three nave methods described earlier. Year 1983 1984 1985 1986 1987 1988 1989 1990 100 105 103 107 109 110 115 117 Actual sales
Solution: The forecasted demand for 1991, using the last period method = actual sales in 1990 = 117 units. The forecasted demand for 1991, using the free-hand projection method = 119 units. (Please check the results using a graph papers!) The semi-averages for this problem will be calculated for the periods 1983-86 and 1987-90. The resultant semiaverages are 103.75 and 112.75. A straight line joining these points would lead to a forecast for the year 1991. The value of this forecast will be = 120 units
151
Year 83 84 85 86 87 88 89 90 91
Fig. 6.5 6.6.2 Simple Moving Average Method When demand for a product is neither growing nor declining rapidly, and if it does not have seasonal characteristics, a moving average can be useful can be useful in removing the random fluctuations for forecasting. Although moving averages are frequently centered, it is more convenient to use past data to predict the following period directly. To illustrate, a centered five-month average of January, February, March, April and May gives an average centered on March. However, all five months of data must already exist. If our objective is to forecast for June, we must project our moving average- by some means- from March to June. If the average is not centered but is at forward end, we can forecast more easily, through we may lose some accuracy. Thus, if we want to forecast June with a five-month moving average, we can take the average of January, February, March, April and May. When June passes, the forecast for July would be the average of February, March, April, May and June. Although it is important to select the best period for the moving average, there are several conflicting effects of different period lengths. The longer the moving average period, the more the random elements are smoothed (which may be desirable in many cases). But if there is a trend in the data-either increasing or decreasing-the moving average has the adverse characteristic of lagging the trend. Therefore, while a shorter time span produces more oscillation, there is a closer following of the trend. Conversely, a longer time span gives a smoother response but lags the trend. The formula for a simple moving average is
+A +A + ...... + A A t2 t 3 t n F = t 1 t n where, Ft = Forecast for the coming period, n = Number of period to be averaged and At-1, At-2, At-3 and so on are the actual occurrences in the in the past period, two periods ago, three periods ago and so on respectively.
152
Illustration 6.2: The data in the first two columns of the following table depict the sales of a company. The first
two columns show the month and the sales. The forecasts based on 3, 6 and 12 month moving average and shown in the next three columns. The 3 month moving average of a month is the average of sales of the preceding three months. The reader is asked to verify the calculations himself.
Forecasts produced by
3 month moving average 6 month moving average 12 month moving average
January February March April May June July August September October November December January
450 440 460 410 380 400 370 360 410 450 470 490 460 (450+440+460)/3 = 450 (440+460+410)/3 = 437 (460+410+380)/3 = 417 397 383 377 380 407 443 470 423 410 397 388 395 410 425 424
The 6 month moving average is given by the average of the preceding 6 months actual sales. For the month of July it is calculated as Julys forecast = ( Sum of the actual sales from January to June ) / 6 = ( 450 + 440 + 460 + 410 + 380 + 400 ) / 6 = 423 ( rounded ) For the forecast of January by the 12 month moving average we sum up the actual sales from January to December of the preceding year and divide it by 12.
Note:
1. A moving average can be used as a forecast as shown above but when graphing moving averages it is important to realize, that being averages, they must be plotted at the mid point of the period to which they relate. 2. Twelve-monthly moving averages or moving annual totals form part of a commonly used diagram, called the Z chart. It is called a Z chart because the completed diagram is shaped like a Z. The top part of the Z is formed by the moving annual total, the bottom part by the individual monthly figures and the sloping line by the cumulative monthly figures.
153
Illustration 6.3: Using the data given in the Illustration 1 forecast the demand for the period 1987 to 1991 using
a. b. 3- year moving average and 5- year moving average
If we want to check the error in our forecast as Error = Actual observed value Forecasted value find which one gives a lower error in the forecast.
Year
Demand
Here we observe that the forecast always lags behind the actual values. The lag is greater for the 5-year moving average based forecasts.
154
c. d.
The moving average calculation takes no account of data outside the period of average, so full use is not made of all the data available. The use of the unadjusted moving average as a forecast can cause misleading results when there is an underlying seasonal variation.
Month 1 100
the forecast for month 5 would be
Month 2 90
Month 3 105
Month 4 95
Month 5 ?
F =w A +w A +w A + ........ + w A t 1 t 1 2 t 2 3 t 3 n tn
Where Ft = Forecast for the coming period, n = the total number of periods in the forecast.
wi = the weight to be given to the actual occurrence for the period t-i Ai = the actual occurrence for the period t-i
Although many periods may be ignored (that is, their weights are zero) and the weighting scheme may be in any order (for example, more distant data may have greater weights than more recent data), the sum of all the weights must equal 1.
n w =1 i i =1
Suppose sales for month 5 actually turned out to be 110. Then the forecast for month 6 would be
Choosing Weights : Experience and trial and error are the simplest ways to choose weights. As a general rule,
the most recent past is the most important indicator of what to expect in the future, and, therefore, it should get higher weighting. The past month's revenue or plant capacity, for example, would be a better estimate for the coming month than the revenue or plant capacity of several months ago.
155
However, if the data are seasonal, for example, weights should be established accordingly. For example, sales of air conditioners in May of last year should be weighted more heavily than sales of air conditioners in December. The weighted moving average has a definite advantage over the simple moving average in being able to vary the effects of past data. However, it is more inconvenient and costly to use than the exponential smoothing method, which we examine next.
The method involves the automatic weighting of past data with weights that decrease exponentially with time,
Ft
Ft-1 = The exponentially smoothed forecast made for the prior period At-1 = The actual demand in the prior period
= The desired response rate, or smoothing constant
156
The total of the weights of observations contributing to the new forecast is 1 and the weight reduces exponentially progressively from the alpha value for the latest observation to smaller value for the older observations. For example, if the alpha value was 0.3 and Junes sales were being forecast, then Junes forecast is produced from averaging past sales weighted as follows. 0.3 (Mays Sales) + 0.21 (Aprils Sales) + 0.147 (Marchs Sales) + 0.1029 (February Sales) + 0.072 (January Sales) + 0.050 (December Sales), etc. In the above calculation, the reader will observe that (1- ) = 0.3, (1- ) = 0.21, (1- ) = 0.147
0 1 2
In the exponential smoothing method, only three pieces of data are needed to forecast the future: the most recent forecast, the actual demand that occurred for that forecast period and a smoothing constant alpha (). This smoothing constant determines the level of smoothing and the speed of reaction to differences between forecasts and actual occurrences. The value for the constant is determined both by the nature of the product and by the managers sense of what constitutes a good response rate. For example, if a firm produced a standard item with relatively stable demand, the reaction rate to difference between actual and forecast demand would tend to be
157
small, perhaps just 5 or 10 percentage points. However, if the firm were experiencing growth, it would be desirable to have a higher reaction rate, perhaps 15 to 30 percentage points, to give greater importance to recent growth experience. The more rapid the growth, the higher the reaction rate should be. Sometimes users of the simple moving average switch to exponential smoothing but like to keep the forecasts about the same as the simple moving average. In this case, is approximated by 2 (n+1), where n is the number of time periods. To demonstrate the method once again, assume that the long-run demand for the product under study is relatively stable and a smoothing constant () of 0.05 is considered appropriate. If the exponential method were used as a continuing policy, forecast would have been made for last month. Assume that last months forecast (Ft-1) was 1,050 units. If 1,000 actually were demanded, rather than 1,050, the forecast for this month would be
Illustration 6.4: The data are given in the first two columns and the forecasts have been prepared using the
values of as 0.2 and 0.8.
Month January February March April May June July August September October November December January
Exponential Forecasts
= 0.2 450** 450 + 0.8(440-450) =442 442 + 0.8(460-442) =456.4 456.4 + 0.8(410-456.4) =419.28 387.86 397.57 375.51 363.102 400.62 440.12 464.02 484.80 = 0.8
450 + 0.2 ( 440-450) = 448 448 + 0.2 ( 460-448) = 450.4 450.4 + 0.2 (410 450.4) = 442.32 429.86 423.89 413.11 402.49 403.99 413.19 424.55 437.64
** In the above example, no previous forecast was available. So January sales were used as Februarys forecast. The reader is advised to check the calculations for himself
158
It is apparent that the higher value, 0.8, produces a forecast which adjusts more readily to the most recent sales.
Illustration 6.5: Data on production of an item for 30 periods are tabulated below. Determine which value
of the smoothing constant (), out of 0.1 and 0.3, will lead to the best simple exponential smoothing model. The first 15 values can be used for initialization of the model and then check the error in the forecast as asked after the table.
Month 1 2 3 4 5 6 7 8 9 10
Use the
Production (in tones) 30.50 28.80 31.50 29.90 31.40 33.50 25.70 32.10 29.10 30.80
Month
11 12 13 14 15 16 17 18 19 20
Production (in tones) 25.70 30.90 31.50 28.10 30.80 29.50 29.80 30.00 29.90 31.50
Month
21 22 23 24 25 26 27 28 29 30
Production (in tones) 27.60 29.90 30.20 35.50 30.80 28.80 30.80 32.20 31.20 29.80
total
squared error or the mean squared error as the basis of comparison of the performances. The total squared error is the sum of the squares of all the error figures for the period selected (here only the last 15 figures because the first 15 periods will initialize the forecast). Their mean is the mean squared error.
159
Solution: The following table give the forecasted values for the two different values of the smoothing constant for the first 15 periods. = 0.1 Forecast 30.50 30.33 30.45 30.39 30.49 30.79 30.28 30.47 30.33 30.38 29.91 30.01 30.16 29.95 30.04 30.50 29.90 30.44 30.28 3.62 31.48 29.75 30.45 30.05 30.27 28.90 29.50 30.10 29.50 29.89 = 0.3 Forecast
Month 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15
Production ( in tonnes) 30.50 28.80 31.50 29.90 31.40 33.50 25.70 32.10 29.10 30.80 25.70 30.90 31.50 28.10 30.80
The following table now gives the forecasted values and also checks the errors in the forecast for the last 15 periods.
Month
Production ( in tonnes)
Forecast 30.04 29.98 29.96 29.97 29.96 30.12 29.86 29.87 29.90 30.46 30.49 30.32 30.37 30.56 30.62
= 0.1 error -0.54 -0.18 0.04 -0.07 1.54 -2.52 0.04 0.33 5.60 0.34 -1.69 0.48 1.83 0.64 -0.82 Squared error 0.29 0.03 0.0 0.0 2.37 6.33 0.0 0.11 31.35 0.12 2.87 0.23 3.34 0.42 0.67 48.13 =48.13/15 =3.20 forecast 29.89 29.77 29.78 29.85 29.86 30.35 29.53 29.64 29.81 31.52 31.30 30.55 30.63 31.10 31.13
= 0.3 Squared error 0.15 0.0 0.05 0.0 2.68 7.58 0.14 0.31 32.40 0.51 6.25 0.06 2.48 0.01 1.76 54.41 =54.41/15 =3.62
16 17 18 19 20 21 22 23 24 25 26 27 28 29 30
29.50 29.80 30.00 29.90 31.50 27.60 29.90 30.20 35.50 30.80 28.80 30.80 32.20 31.20 29.80
160
The results here indicate that the forecast accuracy is better for = 0.1 as compared to 0.3. They also indicate that a search around 0.1 may lead to a more refined single exponential smoothing model.
161
Illustration 6.6: Data regarding the sales of a particular item in the 12 time periods is given below. The
manager wants to forecast 1 time period ahead in order to plan properly. Determine the forecasts using (a) Nave method (b) 3 period moving average (c) simple exponential smoothing taking = 0.1. Also compute the errors MAD, MAPE, and MSE to check the forecasting accuracy for the last six periods. Solution: (a) The following table shows the nave forecasting model. In this model the next periods forecast is the present periods actual observed value
Demand (D)
28 27 33 25 34 33 35 30 33 35 27 29
Forecast (F)
28 27 33 25 34 33 35 30 33 35 27 29
Absolute Error (E = D F )
2 5 3 2 8 2 22
Absolute Percentage error (E/D) 100 5.7% 16.7% 9.1% 5.7% 29.6% 6.9% 73.7%
162
(b) The following table shows the 3 period moving average model.
Demand (D)
28 27 33 25 34
1 2 3 4 5
Forecast by 3 period moving average (F) (28+27+33)/3 = 29.3 (27+33+25)/3 =28.3 30.7 30.7 34.0 32.7 32.7 32.7 31.7 30.3
(c) The simple exponential smoothing model with smoothing constant = 0.1 is presented below.
Demand (D)
28 27 33 25 34 33 35 30 33 35 27 29 -
MSE ( periods 7 to 12 ) =
163
3.
164
Illustration 6.7 : A firms sales for a product line during the 12 quarters of the past three years were as follows. Quarter 1 2 3 4 5 6 Sales
600 1550 1500 1500 2400 3100 7 8 9 10 11 12
Quarter
Sales
2600 2900 3800 4500 4000 4900
Forecast the sales for the 13, 14, 15 and 16th quarters using a hand-fit regression equation.
Solution: The procedure is quite simple: Lay a straightedge across the data points until the line seems to fit
well, and draw the line. This is the regression line. The next step is to determine the intercept a and slope b. The following fig shows a plot of the data and the straight line we drew through the points.
The intercept a, where the line cuts the vertical axis, appears to be about 400. The slope b is the "rise" divided by the "run" (the change in the height of some portion of the line divided by the number of units in the horizontal axis). Any two points can be used, but two points some distance apart give the best accuracy because of the errors in reading values from the graph. We use values for the 1st and 12th quarters. By reading from the points on the line, the Y values for quarter 1 and quarter 12 are about 750 and 4,950. Therefore.
142
Quarter 13 14 15 16
Forecast
400 + 382(13) = 5366 400 + 382(14) = 5748 400 + 382(15) = 6130 400 + 382(16) = 6512
These forecasts are based on the line only and do not identify or adjust for elements such as seasonal or cyclical elements.
Y = Dependent variable computed by the equation y = The actual dependent variable data point (used below) a = y intercept , b = Slope of the line, X = Time period
The least squares method tries to fit the line to the data that minimize the sum of the sum of the squares of the vertical distance between each data point and its corresponding point on the line. If a straight line is drawn through general area of the points, the difference between the point and the line is y -
Y. The sum of the squares of the differences between the plotted data points and the line points is (y1 Y1)2 + (y2 Y2)2 + .. + (y12 Y12)2
The best line to use is the one that minimizes this total. As before, the straight line equation is
Y = a + bX
Previously we determined a and b from the graph. In the least squares method, the equations for a and b are and
a = y bX
Where
b =
Xy n X .y 2 2 X nX
Illustration 6.8: Forecast the sales for the 13, 14, 15 and 16th quarters for the data given in illustration 7 using
the least squares method. Also calculate the standard error of the estimate.
143
Solution: The following table shows the computations carried out for the 12 data points.
X
1 2 3 4 5 6 7 8 9 10 11 12 Total :78
y
600 1550 1500 1500 2400 3100 2600 2900 3800 4500 4000 4900 33350
Xy
600 3100 4500 6000 12000 18600 18200 23200 34200 45000 44000 58800
X2
1 4 9 16 25 36 49 64 81 100 121 144
y2
360000 2402500 2250000 2250000 5760000 9610000 6760000 8410000 14440000 20250000 16000000 24010000 112502500
Y
801.3 1160.9 1520.5 1880.1 2239.7 2599.4 2959.0 3318.6 3678.2 4037.8 4397.4 4757.1
268200
b = 359.61
650
X = 6.5
y = 2779.17
a = 441.66
The reader is advised to verify the calculations of a and b on his own. Note that the final equation for Y shows an intercept of 441.6 and a slope of 359.6. The slope shows that for every unit change in X, Y changes by 359.6. Strictly based on the equation, forecasts for periods 13 through 16 would be
Y13 = 441.6 + 359.6(13) = 5116.4 Y14 = 441.6 + 359.6(14) = 5476.0 Y15 = 441.6 + 359.6(15) = 5835.6 Y16 = 441.6 + 359.6(16) = 6195.2
7000 6000 5000 4000 3000 2000 1000 0 1 3 5 7 9 11 13 15
144
The reader is also advised to verify the results for the forecasts for the above two illustrations 7 and 8. The standard error of the estimate is computed as SyX which is given as follows
We have plotted the graph of the values of the actual demand, forecasted demand values by the two methods of hand fitting and the method of least squares. Following is the graph.
Here, Series 1 is the actual observed demand values, Series 2 are the values based on calculations by the method of least squares and Series 3 are the values by hand fitting the trend line. The following table compares the forecasted values for the 13th, 14th ,15th and the 16th quarters based on illustrations 7 and 8.
quarter
13 14 15 16
Forecasts by the hand fit line (line going above in the figure) Series 3 5366
5748 6130 6512
Forecasts by the least squares method (line beneath the hand fit line) series 2 5116.4
5476.0 5835.6 6195.2
145
X
1989 1990 1991 1992 1993 1994 1995
XX
1989 1992 = 1990 1992 = 1991 1992 = 1992 1992 = 1993 1992 = 1994 1992 = 1995 1992 =
X
1990 1991 1992 1993 1994 1995
XX
1990 1992.5 = 1991 1992.5 = 1992 1992.5 = 1993 1992.5 = 1994 1992.5 = 1995 1992.5 =
XX
2 -2.5 2 = -1.5 2 = -0.5 2 = 0.5 2 = 1.5 2 = 2.5 2 =
Coded time(x) -5 -3 -1 1 3 5
-3 -2 -1 0 1 2 3
X = 13944
X = 1992
X = 11955
X = 1992.5
146
We have two reasons for this translation of time. First, it eliminates the need to square numbers as large as 1992, 1993, 1994, and so on. This method also sets the mean year, x , equal to zero and allows us to simplify the least squares equations of the Y intercept, a and the slope b as follows.
b=
xy 2 x
a= y
where x represents the coded values.
Illustration 6.9: The following table gives the number of items produced in a factory between 1988 and 1995. Year Production 1988
98
1989
105
1990
116
1991
119
1992
135
1993
156
1994
177
1995
208
Determine the equation that will describe the secular trend of production. Also project the production for 1996. Solution: The following table calculates the necessary values.
X
1988 1989 1990 1991 1992 1993 1994 1995
Y
98 105 116 119 135 156 177 208
XX
-3.5 -2.5 -1.5 -0.5 0.5 1.5 2.5 3.5
x = X X 2
-7 -5 -3 -1 1 3 5 7
xY
-686 -525 -348 -119 135 468 885 1456
x2
49 25 9 1 1 9 25 49
2
X = 15932 Y = 1114
X = 1991. 5 Y = 139. 25
xY = 1266 x
= 168
With these values, we can now compute b and a to find the slope and the Y intercept for the line describing the trend in production.
b=
xy 2 x
a= y
147
Thus, the general linear equation describing the secular trend in production is
Y = a + bx = 139.25 + 7.536 x
where Y = dependent variable computed by the equation OR the estimated production calculated
x = coded time value representing the number of half year intervals ( a sign indicates half year intervals
before 1991.5, a + sign indicates half year intervals after 1991.5)
Now suppose we want to estimate the production for the year 1996. First, we must convert 1996 to the value of the coded time ( in half year intervals)
x = 1996 1991.5
= 4.5 years = 9 half year intervals Substituting this value in the equation for the secular trend, we get
Note: If the number of elements in our time series would have been odd, our procedure would have been the
same except that we would have dealt with 1 year intervals.
148
The decomposition methods assume the time series value at time t to be a function of the different components, i.e., where
Dt = f (Tt , St , Ct , Rt ) Tt = trend value at period t St = seasonal component at period t Ct = cyclical component at period t, and Rt = random variation at period t.
To make reasonably accurate forecasts, it is often necessary to separate out the above characteristics (i.e. T, S, C and R) from the raw data. This is known as time series decomposition or often just time series analysis. The separated elements are then combined to produce a forecast. The functional form for the series used may either be additive or multiplicative. The multiplicative form (most commonly used) is written as follows:
Dt = Tt St Ct Rt
Here the components are expressed as percentages or proportions The additive model takes the form
Dt = Tt + St + Ct + Rt
Here the components are expressed as absolute values The multiplicative model is commonly used in practice and is more appropriate if the characteristics interact, e.g. where a higher trend value increases the seasonal variation. The additive model is more suitable if the component factors are independent, e.g. where the amount of seasonal variation is not affected by the value of the trend. Of the four elements the most important are the first two; the trend and seasonal variation, so this book concentrates on these two.
Illustration 6.10: Assume that in the past years, a firm sold an average of 1000 units of air conditioners each
year. On the average, 200 units were sold from the period January to March, 350 in April to June, 300 in July to September and 150 in October to December. Calculate the seasonal indices for each season (quarter). If the expected demand for the next year is believed to be 1100, forecast the demand in each quarter.
149
Solution: First we find the average sales for each season and then divide the sales of each season by that average. Past sales Jan-Mar Apr-June July-Sept Oct-Dec Total 200 350 300 150 1000 Average sales for each season = Total / 4 = 1000 / 4 250 250 250 250 200 / 250 =0.8 350 / 250 = 1.4 300 / 250 = 1.2 150 / 250 = 0.6 Seasonal factor
Using these factors, if we expected demand for next year to be 1100 units, we would forecast the demand to occur as shown in the following table. First we distribute the demand equally among all the quarters and then divide those amounts by the seasonal index obtained in the previous step.
Expected demand For next year Jan-Mar Apr-June July-Sept Oct-Dec Total
1100
Seasonal Factor
0.8 1.4 1.2 0.6 = = = =
The seasonal factors are updated periodically as the new data are available. This is how we can forecast based on the past seasonal data and knowing the future expected demand. One issue in the above illustration was that the expected demand for the next year was known to be 1100. When not known, we can compute the seasonal indices by even using a hand-fir straight line. This procedure is given in the following illustration.
Illustration 6.11: Simply hand fit a straight line through the data points and measure the trend and the intercept
from the graph. The history of the data is
Quarter
1 2 3 4
Sales
300 200 220 530
Year
2005 2005 2005 2005
Quarter
1 2 3 4
Sales
520 420 400 700
150
Find the seasonal factors for the quarters and using them forecast the sales in the quarters of 2006. Solution: First we plot the data points on the graph and then fit a straight line. (The reader can fit a straight line in a manner different from this. Naturally then his estimates are likely to vary a bit. You can also fit a straight line by the method of least squares).
The equation for the trend line is Tt = 170 + 55t , which is derived from the intercept 170 and a slope of (610170) / 8 = 55. Next we derive a seasonal index by comparing the actual data with the trend line as shown below.
Quarter
Actual
170 + 55t
2004
1 2 3 4
2005
= 170 + 55(1) = 225 = 170 + 55(2) = 280 = 170 + 55(3) = 335 = 170 + 55(4) = 390 = 170 + 55(5) = 445 = 170 + 55(6) = 500 = 170 + 55(7) = 555 = 170 + 55(8) = 610
Q-1 : (1.33 + 1.17)/2 = 1.25 Q-2 : (0.71 + 0.84)/2 = 0.78 Q-3 : (0.66 + 0.72)/2 = 0.69 Q-4 : (1.36 + 1.15)/2 = 1.25
1 2 3 4
151
We can now compute the forecast for the quarters in 2006 including trend and the seasonal factors using the following equation: Forecast = Trend value seasonal factor
Year 2006 Q1 Q2 Q3 Q4
Tt
= 170 + 55(9) = 665 = 170 + 55(10) = 720 = 170 + 55(11) = 775 = 170 + 55(12) = 830 1.25 0.78 0.69 1.25
St
831 562 535 1038
Tt St
With this knowledge of seasonal factors we now go on and study the decomposition of time series and forecast into the future. The following illustration shows how the trend (T) and seasonal variation (S) are separated out from a time series and how the seasonal indices are calculated. We can calculate the seasonal indices for the time series data by two methods Method 1. By using a trend line from the data by the least squares method and Method 2. By using the method of ratio to moving average. We illustrate the first method in the following illustration.
Illustration 6.12: Estimate the sales of air conditioners for the quarters 17, 18, 19, 20 i.e the four quarters of the
year 2005 using the following data by the method of decomposition of time series.
It is apparent that there is a strong seasonal element in the above data (low in Quarter 1 and high in Quarter 3) and that there is a generally upward trend. The steps in analyzing the data and preparing a forecast are:
Step 1: Calculate the trend in the data using the least squares method. Step 2: Estimate the sales for each quarter using the regression formula established in Step 1. Step 3: Calculate the percentage variation of each quarter's actual sales from the estimates, obtained in
Step 2.
Step 4: Average the percentage variations from Step 3 for each quarter. This establishes the average
seasonal variations.
152
Step 1. We use the following least squares linear regression equations to compute the regression line Y = a + bX
y = an + b x xy = a x + b x
X (quarters) Y ( sales ) 20 32 62 29 21 42 75 31 23 39 77 48 27 39 92 53
2
XY 20 64 186 116 105 252 525 248 207 390 847 576 351 546 1380 848 X2 1 4 9 16 25 36 49 64 81 100 121 144 169 196 225 256
2
Year 2001
1 2 3 4 5 6 7 8
Year 2002
Year 2003
9 10 11 12 13 14 15 16
Year 2004
= 1496
Solving them we get a = 28.74 and b =1.84. Thus, the trend line equation is Y = 28.74 + 1.84X
Step 2 and Step 3: Use the trend line to calculate the estimated sales for each quarter.
Express the actual value of sales as a percentage of this estimated sales. ( Remember that this is similar to finding the ratio of actual to trend in the above two illustrations )
153
Y ( sales ) (A) 20
32 62 29 21 42 75 31 23 39 77 48 27 39 92 53
(actual/Estimated)% = (A/B)100
65 99 181 80 55 106 180 71 51 83 157 94 51 72 163 91
Year 2002
Year 2003
Year 2004
Step 4: Average the percentage variations to find the average seasonal variation. In % 2001 2002 2003 2004 Total 4= Q1
65 55 51 51 222 56%
Q2
99 106 83 72 360 90%
Q3
181 180 157 163 681 170%
Q4
80 71 94 91 336 84%
These then are the average variations expected from the trend for each of the quarters; for example, on average the first quarter of each year will be 56% of the value of the trend. Because the variations have been averaged, the amounts over 100% (Q3 in this example) should equal the amounts below 100%. (Ql, Q2 and Q4 in this example); This can be checked by adding the average variations and verifying that they total 400% thus: 56% + 90% + 170% + 84% = 400%. On occasions, roundings in the calculations will make slight adjustments necessary to the average variations. We will discuss this in the next illustration.
154
Step 5. Prepare final forecasts based on the trend line estimates and the averaged seasonal variations.
The seasonally adjusted forecast is calculated thus: Seasonally adjusted forecast = Trend estimate x Seasonal factor
Y ( sales )
20 32 62 29 21 42 75 31 23 39 77 48 27 39 92 53
Year 2002
Year 2003
Year 2004
Year 2005 Q1 Q2 Q3 Q4
Estimated trend Using the trend equation Y=28.74 + 1.84X ( T) = 28.74 + 1.84(17) = 60.02
= 28.74 + 1.84(18) = 61.86 = 28.74 + 1.84(19) = 63.7 = 28.74 + 1.84(20) = 65.54
Forecasted sales
TS 33.61 55.67 108.29 55.05
155
120
100
80
60
40
20
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16
Notes :
(a) Time series decomposition is not an adaptive forecasting system like moving averages and exponential smoothing. (b) Forecasts produced by such an analysis should always be treated with caution. Changing conditions and changing seasonal factors make long term forecasting a difficult task. (c) The above illustration has been an example of a multiplicative model. This is because the seasonal variations were expressed in percentage or proportionate terms. Similar steps would have been necessary if the additive model had been used except that the variations from the trend ( i.e when we compute Actual / estimated in case of the above illustration) would have been the absolute values (i.e we do not compute the ratio now but we compute the absolute variation). For example, the first two variations would have been Q1 : 20 - 30.58 = absolute variation = - 10.58. Q2 : 32 - 32.42 = absolute variation = - 0.42 and so on.
156
The absolute variations would have been averaged in the normal way to find the average absolute variation,( i.e finding the seasonal factor) whether + or -, and these values would have been used to make the final seasonally adjusted forecasts. Now we see another method to find the seasonal indices and forecast the future demand. This method is called the ratio-to-moving average method. This method provides an index that describes the degree of seasonal variation. The index is based on a mean of 100 with the degree of seasonality measured by variations away from the base. This process is the decomposition of a time series into a deseasonalized series. Consider the following illustration.
Illustration 6.13 : A manager wanted to establish the seasonal pattern of the units of a particular product X
demanded by his client. The following table contains the quarterly sales, that is, the average number of units sold during each quarter of the last 5 years.
Calculate the seasonal indices and deseasonalize the time series. Solution: Refer to the following table to demonstrate the five steps required to compute a seasonal index.
Step 1. The first step in computing a seasonal index is to calculate the 4-quarter moving total for the time series.
To do this, we total the values for the quarters during the first year, 2001 as 1,861 + 2,203 + 2,415 + 1,908 = 8,387. A moving total is associated with the middle data point in the set of values from which it was calculated. So this moving total is of the middle part i.e quarter 2 and quarter 3 of year 2001. We then take the mean of this total i.e we compute 8387 / 4 = 2096.75. This value is the 4 quarter moving average and we place it in the next column.
157
Year
Quarter
Sales
(1) 2001
(2)
1 2 3 4 1 2 3 4 1 2 3 4 1 2 3 4 1 2 3 4
(3)
1861 2203 2415 1908 1921 2343 2514 1986 1834 2154 2098 1799 1837 2025 2304 1965 2073 2414 2339 1967 2096.75 2111.75 2146.75 2171.50 2191.0 2169.25 2122.0 2018.0 1971.25 1972.0 1939.75 1991.25 2032.75 2091.75 2189.0 2197.75 2198.25
2104.250 2129.250 2159.125 2181.250 2180.125 2145.625 2070.00 1994.625 1971.625 1955.875 1965.500 2012.00 2062.250 2140.375 2193.375 2198.0
114.8 89.6 89 107.4 115.3 92.6 88.6 108 106.4 92 93.5 100.6 111.7 91.8 94.5 109.8
2002
2003
2004
2005
We now find the next moving total by dropping the 2001 quarter 1 value, 1,861, and adding the new 2002 quarter 1 value, 1,921. By dropping the first value and adding the fifth, we keep four quarters in the total. The four values added now are 2,203 + 2,415 + 1,908 + 1,921 = 8,447. We again find the average of this total giving us the 4 quarter moving average. We again place it in below the 4 quarter moving average we found out earlier. This process is continued over the time series until we have included the last value in the series.
Step 2.
In the second step, we center the 4-quarter moving average. The moving averages in column 4 all fall halfway between the quarters. We would like to have moving averages associated with each quarter. In order to center our moving averages, we associate with each quarter the average of the two 4-quarter moving averages falling just above and just below it. For the 2001 quarter 3, the resulting 4-quarter centered moving average is (2,096.75 + 2,111.75)/2 = 2,104.25. The other entries in column 5 are calculated the same way.
158
The above process when plotted on the graph illustrates how the moving average has smoothed the peaks and troughs of the original time series. The seasonal and irregular components have been smoothed. The smoothed line represents the cyclical and trend components of the series.
3000
2500
2000
1500
1000
500
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
time in quarters
actual
4Q cent. Mov.Avg
(Note: Whenever the number of periods for which we want indices is odd (7 days in a week or 3 four monthly data, etc.), would compute 7-day moving totals and moving averages, and the moving averages would already be centered (because the middle of a 7-day period is the fourth of those 7 days). However, when the number of periods is even (4 quarters, 12 months, 24 hours), then we must centre the moving averages)
Step 3.
Next, we calculate the percentage of the actual value to the moving-average value for each quarter in the time series having a 4-quarter moving-average entry. This step allows us to recover the seasonal component for the quarters. We determine this percentage by dividing each of the actual quarter values in column 3 of the Table by the corresponding 4-quarter centered moving-average values in column 5 and then multiplying the result by 100.
Step 4.
Collect all the percentage of actual to moving-average values in column 6 and arrange them by quarter. Then calculate the modified mean for each quarter. The modified mean is calculated by discarding the highest and lowest values for each quarter and averaging the remaining values. (You may not ignore some of the values but just take the average of all the means for a particular quarter) In the following table the process of finding the modified mean is shown.
159
Year
Modified Mean
Quarter 1 Quarter 2 Quarter 3 Quarter 4 Total 182.5/2 = 91.25 215.4/2 = 107.70 226.5/2 = 113.25 183.8/2 = 91.90
404.1
The seasonal values we recovered for the quarters in column 6 of the table still contain the cyclical and irregular components of variation in the time series. By eliminating the highest and lowest values from each quarter, we reduce the extreme cyclical and irregular variations. When we average the remaining values, we further smooth the cyclical and irregular components. Cyclical and irregular variations tend to be removed by this process, so the modified mean is an index of the seasonality component.
Step 5.
The final step, demonstrated in the following table adjusts the modified mean slightly. Notice that the four indices in Table total 404.1. However, the base for an index is 100. Thus, the four quarterly indices should total 400, and their mean should be 100. To correct for this error, we multiply each of the quarterly indices in previous Table by an adjusting constant. This number is found by dividing the desired sum of the indices (400) by the actual sum (404.1). In this case, the result is 0.9899. The following table shows that multiplying the indices by the adjusting constant brings the quarterly indices to a total of 400. (Sometimes even after this adjustment, the mean of the seasonal indices is not exactly 100 because of accumulated rounding errors. In this case, however, it is exactly 100.)
160
Quarter 1 2 3 4
= seasonal index
90.3 106.6 112.1 91.0 400.0
(NOTE: In the previous illustration the sum of all the seasonal indices added to 400. Here it is not the case.) The ratio-to-moving-average method just explained allows us to identify seasonal variation in a time series. The seasonal indices are used to remove the effects of seasonality from a time series. This is called deseasonalizing a time series. Before we can identify either the trend or cyclical components of a time series, we must eliminate seasonal variation. (NOTE: The reader is reminded that in the earlier method we initially found a trend line and based on those trend values calculated the ratio of actual sales to that of estimated sales and hence found the seasonal index by taking the average of those values. After that we calculated the estimated future sales using the trend equation and adjusted the values obtained with the seasonal index. In this method, we now prepare the deseasonalized trend line and then forecast using the seasonal indices) To deseasonalize a time series, we divide each of the actual values in the series by the appropriate seasonal index (expressed as a fraction of 100). To demonstrate, we shall deseasonalize the value of the first four quarters of the year 2001. In the following Table, we see the deseasonalizing process using the values for the seasonal indices from the previous table. Once the seasonal effect has been eliminated, the deseasonalized values that remain reflect only the trend, cyclical, and irregular components of the time series.
Quarter (2)
1 2 3 4
Once we have removed the seasonal variation, we can compute a deseasonalized trend line, which we can then project into the future. Suppose the manager in our example estimates from a deseasonalized trend line that the deseasonalized average sales for the fourth quarter of the next year 2006 will be 2,121. When this prediction has been obtained, management must then take the seasonality into account. To do this, it multiplies the
161
deseasonalized predicted average sales of 2,121 by the fourth-quarter seasonal index (expressed as a fraction of 100) to obtain a seasonalized estimate of 2121 0.910 = 1,930 units for the fourth-quarter average sales. The reader is given as an exercise to deseasonalize the whole series and then calculate the deseasonalized trend line. Based on that, forecast the sales for the four quarters of the year 2006 and then use the seasonal indices for the quarters we have obtained here to forecast the seasonalized sales.
(3)
For more than one year's data, the seasonality factor is averaged for each of the periods of the seasonal cycle. The factors should total up to the number of periods n per cycle. If they do not, then they should be adjusted so that they add up to n.
(4)
Next, the time series should be deseasonalized. The formula used here is:
D T = t t S t Next, the trend is estimated for the deseasonalized data using regression analysis or by moving averages
(5)
with trend adjustments. In case a linear trend results, then the resulting values are a (intercept) and b (slope). (6) The forecast for period t + m can now be prepared using the relation
162
Data Availability
The choice of forecasting method is often constrained by available data. An econometric model might require data which are simply not available in the short run; therefore another method must be selected. The BoxJenkins time-series method requires about 60 data points (5 years of monthly data).
Data Pattern
The pattern in the data will affect the type of forecasting method selected. If the time-series is flat, a first order method can be used. However, if the data show trends or seasonal patterns, more advanced methods will be needed. The pattern in the data will also determine whether a time-series method will suffice or whether casuals model are needed. If the data pattern is unstable over time, a qualitative method may be selected. Thus the data pattern is one of the most important factors affecting the selection of a forecasting method. Another issue concerning the selection of forecasting methods is the difference between fit and prediction. When different models are tested, it is often thought that the model with the best fit to historical data (least error) is also the best predictive model. This is not true. For example, suppose demand observations are obtained over the last eight time periods and we want to fit the best time-series model to these data. A polynomial model of degree seven can be made to fit exactly through each of the past eight data points. But this model is not necessarily the best predictor of the future.
163
Sources Of Error
Errors can come from a variety of sources. One common source that many forecasters are unaware of is projecting past trends into the future. For example, when we talk about statistical errors in regression analysis, we are referring to the deviations of observations from our regression line. It is common to attach a confidence band (that is, statistical control limits) to the regression line to reduce the unexplained error. But when we then use this regression line as a forecasting device by projecting it into the future, the error may not be correctly defined by the projected confidence band. This is because the confidence interval is based on past data; it may not hold for projected data points and therefore cannot be used with the same confidence. In fact, experience has shown that the actual errors tend to be greater than those predicted from forecast models. Errors can be classified as bias or random. Bias errors occur when a consistent mistake is made. Sources of bias include failing to include the right variables; using the wrong relationships among variables; employing the wrong trend line, mistakenly shifting the seasonal demand from where it normally occurs; and the existence of some undetected secular trend. Random errors can be defined as those that cannot be explained by the forecast model being used.
Measurement Of Error
Several common terms used to describe the degree of error are standard error, mean squared error (or
variance), and mean absolute deviation. This error estimate helps in monitoring erratic demand observations. In
addition, they also help to determine when the forecasting method is no longer tracking actual demand and it need to be reset. For this tracking signals are used to indicate any positive or negative bias in the forecast. Standard error is discussed earlier. It being a square root of a function, it is often more convenient to use the function itself. This is called the mean square error (MSE) or variance. The mean absolute deviation (MAD) is also important because of its simplicity and usefulness in obtaining tracking signals. MAD is the average error in the forecasts, using absolute values. It is valuable because MAD, like the standard deviation, measures the dispersion of some observed value from some expected value. The only difference is that like standard deviation, the errors are not squared. When the errors that occur in the forecast are normally distributed (the usual case), the mean absolute deviation relates to the standard deviation as
1 standard deviation =
Conversely,
The standard deviation is the larger measure. If the MAD of a set of points was found to be 60 units, then the standard deviation would be 75 units. In the usual statistical manner, if control limits were set at plus or minus 3 standard deviations (or 3.75 MADs), then 99.7 percent of the points would fall within these limits.
164
A tracking signal is a measurement that indicates whether the forecast average is keeping pace with any genuine upward or downward changes in demand. As used in forecasting, the tracking signal is the number of mean absolute deviations that the forecast value is above or below the actual occurrence. The following figure shows a normal distribution with a mean of 0 and a MAD equal to 1. Thus, if we compute the tracking signal and find it equal to minus 2, we can see that the forecast model is providing forecasts that are quite a bit above the mean of the actual occurrences. A tracking signal (TS) can be calculated using the arithmetic sum of forecast deviations divided by the mean absolute deviation:
RFSE is the running sum of forecast errors, considering the nature of the error. (For example, negative
errors cancel positive error and vice versa.)
MAD is the average of all the forecast errors (disregarding whether the deviations are positive or
negative). It is the average of the absolute deviations.
For forecast to be in control, 89% of the errors are expected to fall within 2 MADs, 98% within 3 MADs, or 99% within 4 MADs.
4 MAD
4 MAD
3 MAD
3 MAD
2 MAD
2 MAD
1 MAD
1 MAD
M=0
Fig. 6.
The following illustration shows the procedure for computing MAD and the tracking signal.
165
Illustration 6.14: Compute the Mean Absolute deviation (MAD), the running sum of forecast errors (RSFE)
and hence obtain the Tracking Signal for a six month period for the following set of data where the period number, demand forecast ( set at constant 1000) and the actual demand occurrences are given below.
Month 1 2 3 4 5 6
Solution:
Month 1 2 3 4 5 6
Demand Forecast
1,000 1,000 1,000 1,000 1,000 1,000
Actual
950 1,070 1,100 960 1,090 1,050
Deviation RSFE
-50 +70 +100 -40 +90 +50 -50 +20 +120 +80 +170 +220
Abs. Dev.
50 70 100 40 90 50
TS = RSFE / MAD -1
0.33 1.64 1.2 2.4 3.3
Here, we say that the forecast, on the average, is off by 66.7 units and the tracking signal is equal to 3.3 mean absolute deviations.
We can get a better feel for what the MAD and tracking signal mean by plotting the points on a graph.
4 tracking signal 3 2 1 0 -1 -2 Periods (months)
Note that it drifted from minus 1 MAD to plus 3.3 MADs. This happened because actual demand was greater than the forecast in four of the six periods. If the actual demand does not fall below the forecast to offset the
166
continual positive RSFE, the tracking signal would continue to rise and we would conclude that assuming a demand of 1,000 is a bad forecast. Acceptable limits for the tracking signal depend on the size of the demand being forecast (high-volume or highrevenue items should be monitored frequently) and the amount of personnel time available (narrower acceptable limits cause more forecasts to be out of limits and therefore require more time to investigate).
The Percentages of Points included within the Control Limits for a Range of 1 to 4 MADs
Number of MADs +1 +2 +3 +4 Related Number of Standard Deviations 0.798
1.596 2.394 3.192
In a perfect forecasting model, the sum of the actual forecast errors would be 0; the errors that result in overestimates should be offset by errors that are underestimates. The tracking signal would then also be 0, indicating an unbiased model, neither leading nor lagging the actual demands. Exponentially smoothed MAD
Often MAD is used to forecast errors. It might then be desirable to make the MAD more sensitive to recent data. A useful technique to do this is to compute an exponentially smoothed MAD as a forecast for the next periods error range. The procedure is similar to single exponential smoothing, covered earlier in this chapter. The value of the MAD forecast is to provide a range of error. In the case of inventory control, this is useful in setting safety stock levels.
where
MADt = Forecast MAD for the tth period = Smoothing constant (normally in the range of 0.05 to 0.20) A t-1 = Actual demand in the period t 1 Ft-1 = Forecast demand for period t - 1
167
REVIEW EXERCISE Q. Demand for patient surgery at a hospital has increased steadily in the past few years, as seen in the following
table.
year 1 2 3 4 5 6
The director of medical services predicted six years ago that demand in year 1 would be 42 surgeries. Using exponential smoothing with a weight = 0.20, develop forecasts for years 2 through 6. What is the MAD?
Ans: 42.6,44.1,45.7,47.7,49.8, MAD = 7.78 Q. Room registrations in the Park Hotel have been recorded for the past nine years. Management would like to
determine the mathematical trend of guest registration in order to project future occupancy. This estimate would help the hotel determine whether a future expansion will be needed. Given the following time series data, develop a regression equation relating registrations to time. Then forecast year 11s registration. Room registrations are in thousands. Year 1 : 17 , Year 2 : 16 , Year 3 : 16 , Year 4 : 21, Year 5 : 20 , Year 6 : 20 , year 7 : 23 , Year 8 : 25 , Year 9 : 24
Ans: b = 1.135, a = 14.545 , Y = 14.545 + 1.135 X , Reg. for the 11th year = 27030 guests Q. Annual sales of Brand X over the last eleven years have been as follows:
1995 1996 1997 1998 1999 2000 2001 2002 2003 2004 2005 50 59 46 54 65 51 60 70 56 66 76
You are required to (a) calculate a three year moving average, (b) plot a the series and the trend on the same graph (c) produce a sales forecast for 2006 stating assumptions, if any.
Q. The following tabulations are actual sales of units for six months and a starting forecast in January.
(a) Calculate forecasts for the remaining five months using simple exponential smoothing with = 0.2. (b) Calculate MAD for the forecasts
168
Actual
100 94 106 80 68 94
Forecast
80
Ans: 84, 86, 90, 88, 84 Q. Here are the actual tabulated demands for an item for a nine month period (January through September). The
supervisor wants to test forecasting methods to see which method was better over the period.
Actual demand
110 130 150 170 160
Month
June July August September
Actual demand
180 140 130 140
(a) Forecast April through September using a three month moving average. (b) Use simple exponential smoothing with = 0.3 to estimate April through September. (c) Use MAD to decide which method produced the better forecast over the six month period.
Ans: (a) 130, 150, 160, 170, 160, 150. (b) 136, 146, 150, 159, 153, 146 (c) Exponential smoothing performed better Q. A bakery markets cakes through a chain of food stores. It has been experiencing over and under production
because of forecasting errors. The following data are its demand for cakes for the past four weeks. Cakes are made for the following day; for example, Sundayss cakes production is for Mandays sales, Mandays production is for Tuesdays sales and so on. The bakery is closed on Sunday. So Fridays production must satisfy demand for both Saturday and Sunday.
3 weeks ago
2400 2100 2400 1900 1800 2700
2 weeks ago
2300 2200 2300 1800 2100 3000
Last week
2400 2200 2500 2000 2000 2900
169
Make a forecast for this week on the following basis: (a) Daily, using a simple four week moving average. (b) Daily, using a weighted average of 0.40, 0.30, 0.20, and 0.10 for the past four weeks. (c) The bakery is also planning its purchases for bread production. If bread demand had been forecast for last week at 22,000 loaves and only 21,000 loaves were actually demanded, what would the bakerys forecast be for this week using exponential smoothing with = 0.10? (d) Suppose, with the forecast made in (c) above, this weeks demand actually turns out to be 22,500. What would the new forecast be for the next week?
Ans: (a) Monday: 2325, Tuesday: 2125, Wednesday: 2375, Thursday: 1875, Friday: 1950, Sat. and Sun: 2850.
(b) Monday: 2350, Tuesday: 2160, Wednesday: 2400, Thursday: 1900, Friday: 1980, Sat. and Sun: 2880 (c) 21900 (d) 21960
Q. Data collected on the yearly demand for 50 pound bags of fertilizer at ABC Fertilizer Company are shown in
the following table. Develop a three year moving average to forecast sales. Then estimate demand again with a weighted moving average in which sales in the most recent year are given a weight of 2 and sales in the other two years are each given a weight of 1. Which method do you think is best?
Year 1 2 3 4 5 6
Year
7 8 9 10 11
Ans: weighted average is slightly more accurate Q. Sales of COLDWAVE air conditioners have grown steadily during the past five years. Year 1 2 3 4 5 6 Sales
450 495 518 563 584
170
(a) The sales manager has predicted that year 1s sales would be 410 air conditioners. Using exponential smoothing with a weight = 0.3, develop forecasts for years 2 through 6. (b) Using smoothing constants 0.6 and 0.9, develop a forecast for the sales. (c) which of the above three smoothing constants gives the most accurate forecast? (d) Use a three year moving average forecasting model to forecast the sales. (e) Using the trend projection method, develop a forecasting model for the sales.
Ans: (a) Year 1: 410, Year 2: 422, Year 3: 443.9, Year 4: 466.1, Year 5: 495, Year 6: 521.8
(c) MAD for = 0.3 is 74.56, MAD for = 0.6 is 51.8, MAD for = 0.9 is 38.1 (e) If the years are coded -2,-1,0,1 and 2 , Y = 522 + 33.6X; next years sales = 622.8 If the years are coded 1,2,3,4 and 5, Y = 421.2 + 33.6X
Q. In this problem, you are to test the validity of your forecasting model. Here are the forecasts for a model you
have been using and the actual demands that occurred.
Week 1 2 3 4 5 6
Forecast
800 850 950 950 1000 975
Actual
900 1000 1050 900 900 1100
Compute the MAD and tracking signal. Then decide whether the forecasting model you have been using is giving reasonable results.
Ans: MAD : 104, TS = 3.1. The high TS value indicates the model is unacceptable. Q. The following table shows predicted product demand using your particular forecasting method along with
the actual demand that occurred.
Actual
1550 1500 1600 1650 1700
171
(a) Compute the tracking signal using the mean absolute deviation and running sum of forecast errors (b) Discuss whether your forecasting method is giving good predictions.
Ans: (a) MAD = 90, TS = -1.67 (b) Model O.K since TS is -1.67 Q. The following table shows the past two years of quarterly sales information. Assume that there are both trend
and seasonal factors and that the season cycle is one year. Use time series decomposition to forecast quarterly sales for the next year.
Quarter 1 2 3 4
Sales
160 195 150 140
Quarter
5 6 7 8
Sales
215 240 205 190
April-June Q1
49 50 51 50
July-Sept Q2
37 38 40 42
Oct-Dec Q3
58 59 60 61
Jan-Mar Q4
67 68 70 --
Calculate the four quarterly seasonal indices and decompose the time series.
Ans: Note: For the computation of the modified mean, the highest and the lowest values of the means
in a quarter are not ignored. Adjustment factor: 4/3.994 = 1.0015 and the Seasonal indices are Q1: 0.924, Q2: 0.719,Q3: 1.905, Q4: 1.262.
172