Important Announcement
PubHTML5 Scheduled Server Maintenance on (GMT) Sunday, June 26th, 2:00 am - 8:00 am.
PubHTML5 site will be inoperative during the times indicated!

Home Explore BBA102_BCM102_Business Mathematics and Statistics

BBA102_BCM102_Business Mathematics and Statistics

Published by kuljeet.singh, 2021-01-29 07:41:58

Description: BBA102_BCM102_Business Mathematics and Statistics

Search

Read the Text Version

Regression Analysis 345 Illustration: 2: For a given distribution the value of correlation is 0.64 and its probable error is 0.13274. Find the number of items in the series. Solution: P.E. =0.6745 × 1 r2 Given, PE = 0.13274 Correlation (r) = 0.64 n n= ? 1(0.64)2 PE = 0.6745 × n Ÿ 0.13274 = 0.6745(10.4096) n Ÿ n 0.13274 = 0.6745 x 0.5904 0.3982 Ÿ n = 0.13274 Ÿ n = 2.999 ? n = 9 (Approx) Illustration: 3: Compute Karl Pearson’s Co-efficient of Correlation from the following data and calculate its probable error and interpret the result. Marks in Accountancy 77 54 27 52 14 35 90 25 56 60 Marks in English 35 58 60 40 50 40 35 56 34 42 Solution: Calculation of Karl Pearson’s Co-efficient of Correlation X X – 49 x2 Y Y – 45 y2 xy x y 77 + 28 784 35 – 10 100 – 280 + 65 54 + 5 25 58 + 13 169 – 330 – 15 27 – 22 484 60 + 15 225 – 175 + 70 52 + 3 9 40 –5 25 – 410 – 264 14 – 35 1,225 50 + 5 25 – 77 – 33 35 – 14 196 40 –5 25 90 + 41 1,681 35 – 10 100 25 – 24 576 56 + 11 121 56 + 7 49 34 – 11 121 60 + 11 121 42 –3 9 ¦X = 490 ¦x2 = 5,150 ¦Y = 450 ¦y2 = 920 ¦xy = – 1,449 r CU IDOL SELF LEARNING MATERIAL (SLM)

346 Business Mathematics and Statistics = ¦ xy 1449 0.666 ¦x2 u ¦y2 = 5150 u 920 1 r2 1 (0.666)2 0.3753 0.119 PE = 0.6745 = 0.6745 = 3.162 N 10 Illustration 9: Calculate Karl Pearson’s coefficient of correlation between percentage of pass and failures from the following data. Also obtain probable error: No. of Students: 80 60 90 70 50 40 No. of Students passed: 48 30 45 56 45 30 Solution: x=x–a x2 y=y–b y2 xy a = 50 b = 50 % of pass % of fail (x) (y) 60 40 10 100 –10 100 –100 50 50 0 0 0 0 0 50 50 0 0 0 0 0 80 20 30 900 –30 900 –900 90 10 40 1,600 –40 1,600 –1,600 75 25 25 625 –25 625 –625 405 195 105 3,225 -105 3,225 –3,225 405 195 X = 6 = 67.5 = 6 = 32.5 N.6xy6x. 6y r = N6x2 6x)2 N6y2 (6y)2 6(3, 225)(105)(105) = 6(3,225)(105)2 6(3, 225)(105)2 19, 350 11, 025 8,325 8,325 = 19,35011,025 19,35011,025 = (8,325)2 = 8,325 = –1 PE = 0.6745 1 r2 0.6745 1(1)2 0.6745 × 0 = 6 = 6 =0 n CU IDOL SELF LEARNING MATERIAL (SLM)

Regression Analysis 347 15.6 Summary Regression: The term “regression” was used by Sir Francis Galton to describe a hereditary phenomenon that he observed in his study of the heights of sons and fathers. His main observation was that though tall fathers usually had tall sons, the average height of the sons of tall fathers was less than the average height of the fathers. In short, the average height of the sons of tall fathers will regress or go back to the general average height. Galton called this backward or downward tendency in the average height as regression. At phenomena — whether economic business or social. A line that is drawn as close as possible to the plotted points of the scatter diagram shows the average tendency of the plotted points. This line is known as the regression line and its equation is called the regression equation. Unlike the coefficient of correlation, which indicates the extent of the relationship between two sets of figures, a regression equation enables us to calculate the amount of change in one variable corresponding to a change in the other. 15.7 Key Words/Abbreviations Regression Equation, Regression Coefficients 15.8 Learning Activity 1. Form a frequency distribution of the following test scores of 50 individuals using a class interval of 10: 75 130 135 90 118 92 80 142 97 147 98 94 115 109 154 109 111 117 120 91 124 101 97 98 126 94 109 109 94 110 82 96 119 92 99 114 104 169 107 93 102 83 117 98 77 133 87 145 91 CU IDOL SELF LEARNING MATERIAL (SLM)

348 Business Mathematics and Statistics 2. The following data represent weights, recorded to the nearest kilogram, of 30 students selected from a school of 500 students: 21 30 40 25 26 22 39 31 29 36 38 35 34 33 30 23 27 27 29 31 33 22 21 36 40 31 33 30 37 36 In the above data, state what is the: (i) Population (ii) Sample and (iii) Variate. 3. Draft a blank table to show: (a) Sex. (b) Three ranks — supervisors, assistants and clerks. (c) Years 1998 and 2003. (d) Age-groups: 18 years and under, over 18 but less than 55 years, over 55 years. 4. Draw up the proforma of suitable table (complete with title, rulings, columnar heading, sub-headings, source, note etc.) showing the number of students in your university in various classes, classified according to sex, residence, domicile and medium of instruction. 5. Draw up a table to show the distribution of workers in a certain cloth mill according to: (i) Salary: Below ` 150, ` 150 to 300, ` 300 to 450, ` 450 and over, (ii) Three years: 1967, 68, 69. (iii) Age groups: Below 20, 20 and under 30, 30 and under 40, 40 and under 50,50 and over. (iv) Sex. 6. Present the following information in a suitable tabular form, filling the gaps if any. “The total population of India is 3,566 lakhs, of which 2,948 lakhs belong to rural areas. In rural areas, 2,404 lakhs belong to agricultural classes; while in urban areas 531 lakhs belong to the non-agricultural classes. Of the rural agricultural classes, 687 lakhs are self-supporting persons, 1,414 lakhs are non-earning dependents and the remaining CU IDOL SELF LEARNING MATERIAL (SLM)

Regression Analysis 349 are earning dependents. Of the rural non-agricultural population, 170 lakhs are self- supporting, 326 lakhs are non-earning dependants and the remaining are earning dependants. In the urban agricultural classes 23 lakhs are self-supporting, 56 lakhs are non-earning dependants and the remaining are earning dependants. In the urban non- agricultural population, 153 lakhs are self-supporting, 347 lakhs are non-earning dependants and the remaining are earning dependants.” 15.9 Unit End Questions (MCQ and Descriptive) A. Descriptive Type: Short Answer Type Questions 1. Explain the concept of ‘regression’ and comment on its utility. 2. Explain the terms (i) coefficient of regression and (ii) lines of regression. 3. Show that the correlation coefficient is the geometric mean between regression coefficients. 4. Explain the concept of linear regression. Why are there two regression lines? Do they cut each other? 5. Compare and contrast the roles of correlation and regression in studying the interdependence of two variates. 6. Define ‘regression’. Why are there two regression lines? Under what conditions can there be only one regression line? 7. Distinguish between coefficient of correlation and coefficient of regression and state briefly why the latter is often more useful in statistics than the former. 8. Find the regression coefficients and the regression equations for the following data: X : 76 50 61 45 27 49 32 54 70 36 Y : 58 60 72 78 56 40 43 65 22 36 9. From the undermentioned data, find out (i) Karl Pearson’s coefficient of correlation, (ii) the two regression equations, CU IDOL SELF LEARNING MATERIAL (SLM)

350 Business Mathematics and Statistics (iii) the two regression coefficients, (iv) the best mean value of X when Y is 50, (v) the best mean value of Y when X is 45. Y: 55 59 63 68 56 73 82 76 64 74 X : 60 62 55 54 63 72 78 79 65 82 10. Calculate the coefficient of correlation and obtain the lines of regression for the following data: X: 1 2 3 4 5 6 7 8 9 Y: 9 8 10 12 11 13 14 16 15 Obtain an estimate of y which should correspond on the average to x = 6.2 11. In a partially destroyed laboratory record of an analysis of correlation data, the following results only are legible: Variance of x = 9. Regression equations: 8x – 10y + 66 = 0, 40x – 18y = 214. What were the mean values of x and y, the standard deviation of y and the coefficient of correlation between x and y? 12. The following are marks in Statistics (x) and Mathematics (y) of ten students. x : 56 55 58 58 57 56 60 54 59 57 y : 68 67 67 70 65 68 70 66 68 66 Calculate the coefficient of correlation and estimate marks in Mathematics of a student who scores 62 marks in statistics. 13. The following data give the height in inches (X) and the weight in lb (Y) of random sample of 10 students from a large group of students of age 17 years. x : 61 68 68 64 65 70 63 62 64 67 y : 112 123 130 115 110 125 100 113 116 126 Estimate the weight of a student of height 69 inches. 14. From the following data calculate the coefficient of correlation and estimate x when y is equal to 105. CU IDOL SELF LEARNING MATERIAL (SLM)

Regression Analysis 351 x : 44 58 49 46 58 56 48 46 48 47 y : 88 114 102 113 91 89 102 93 114 94 15. For a bivariate data the mean value of x = 53.2 the mean value of y = 27.9 the regression coefficient of x on y = – 0.2 and the regression coefficient of y on x = – 1.5 Find (i) the most probable value of y when x = 60, (ii) r the coefficient of correlation. 16. You are given the following results for the heights (x) and weights (y) of 1000 workers of a factory. x = 68 inches, Vx = 2.5 inches y = 150 lb, Vy = 20 lb rxy = 0.6. Estimate from the above data. (i) The height fo a particular factory worker whose weight is 200 lb (ii) The weight of a particular factory worker who is 5 feet tall. 17. Find the most likely price in Bombay (x) corresponding to the price of Rs.70 at Calcutta (y) from the following data: Average price at Calcutta = 55 Average prie at Bombay = 67 S. D. at Calcutta = 2.5 S. D. at Bombay = 3.5 = + .8 18. The equation of two regerssion lines obtained in a correlation analysis are as follows: 4y = 9x + 15 25x = 6y + 7 CU IDOL SELF LEARNING MATERIAL (SLM)

352 Business Mathematics and Statistics Obtain the value of correlation coefficient, the ratio of the coefficient of variability of x to that of y and also mean values of x and y. 19. Following is the information relating to per capita consumption of wheat per year and the retail price of wheat: Year (Y) (X) Per Capita Consumption Retail Price 1960 1961 (Kg) (Paise) 1962 35 63 1963 34 65 1964 32 68 1965 36 64 1966 37 64 34 68 33 70 1967 30 75 Fit a straight line of per capita consumption (Y) on retail price (X). 20. From the data given below find: (a) The two regression equations. (b) The coefficient of correlation between marks in Economics and Statistics. (c) The most likely marks in Statistics when the marks in Economics are 30. Marks in Economics : 25 28 35 32 31 36 29 38 34 32 Marks in Statistics : 43 46 49 41 36 32 31 30 33 39 B. Multiple Choice/Objective Type Questions 1. Given the 2 Keynesian equation 3x – 2y + 1 = 0 62y – 5x + 3 = 0 (a) 0.45 (b) 0.53 (c) 0.63 (d) None of these CU IDOL SELF LEARNING MATERIAL (SLM)

Regression Analysis 353 2. Given x = 1, y = 43, r = 0.62 Vx = 8.3, Vy = 6/4. like 2 Regression equations are x = 0.8y – 3.57 and y = 0.48 + 28.18 (a) Wrong (b) Correct (c) None of these 3. The term “regression” was used by Sir Francis Galton to describe the average ____________ of father and son. (a) Weight (b) Complexion (c) Height (d) size 4. The equation of regression line is called as __________. (a) Regression equation (b) Coefficient of correlation (c) Correlation (d) All the above Answers: (1) (b); (2) (b); (3) (c); (4) (a) 15.10 References References of this unit have been given at the end of the book. ˆˆˆ CU IDOL SELF LEARNING MATERIAL (SLM)

354 Business Mathematics and Statistics UNIT 16 TIME SERIES ANALYSIS Structure 16.0 Learning Objectives 16.1 Introduction 16.2 Time Series: Its meaning 16.3 Time Series: Analysis 16.4 Measurement of General Trend 16.5 Solved Problems 16.6 Summary 16.7 Key Words/Abbreviations 16.8 Learning Activity 16.9 Unit End Questions (MCQ and Descriptive) 16.10 References 16.0 Learning Objectives After completing this unit, you would be able to: 1. Explain the meaning of a time series and the various types of influences that include, General trend, Seasonal Changes, Cyclical fluctuations and irregular variations. 2. Elaborate the methods of measurement of the general trend that include: Geographical method, Method of moving averages and the method of least squares. CU IDOL SELF LEARNING MATERIAL (SLM)

Time Series Analysis 355 3. The procedure for finding 3 yearly, moving averages as explained in the solved problems 4. Differentiate the Merits and Demerits of Moving Averages. 5. Analyse the method of least-squares with reference to the solved problem. 6. Explain the histogram. 7. Elaborate the Parabolic Trend Equation with reference to the solved problem. 8. Explain the Exponentist Trend Method: and as well as various solved problems. 16.1 Introduction A orderly set of number corresponding to time is a time series. Time series corresponding to variables such as sales, projects, investment etc., show a number of changes done to the influence of various factory. These details have been explained in 16.3. For calculation of these changes Cartesian methods are used. These methods are explained in the pages that follows. 16.2 Time Series: Its Meaning In the world of business and economics we find variables of one kind or the other. For instance, sales, profits, capital, investments, prices, costs, rates of interest, savings, demand for commodities etc. These are said to be variables because they vary over a period of time. Managements are very often concerned in closely examining these variables over different periods of time. An orderly set of numbers written in the order corresponding to time is called a time series. For instance, a series of terms showing the profits of a business concern over a number of years is a time series because the terms of series representing profits are written in relation to time. A careful scrutiny of a time series along with a detailed analysis of condition in the past enables one to forecast future changes and estimates. The following is a time series showing the production of cereals in India in million tonnes. CU IDOL SELF LEARNING MATERIAL (SLM)

356 Business Mathematics and Statistics Net Production (Million Tonnes)* of Cereals in India Year Production 1951 40.01 1952 40.59 1953 45.36 1954 53.44 1955 51.59 1956 50.33 1957 52.67 1958 49.35 1959 57.29 1960 56.76 1961 60.66 1962 62.08 1963* 58.43 * Provisional. Source: Ministry of Food & Agriculture. Published in the Quarterly Bulletin of the Eastern Economist. New Delhi. Vol. 16, No. 2. 16.3 Time Series: Analysis A detailed analysis of the above time series enables us to find the different types of movements influencing its course. Usually in any given time series the different types of movements are characterized by the following component: 1. General Trend 2. Seasonal changes 3. Cyclical Fluctuations 4. Irregular Variations 16.3.1 General Trend The time series of the net productions of cereals in India shows that there is certainly a gradual increase in the production of cereals even though the increase is not regular. This gradual increase is called the “General Trend” or secular trend. This trend, which means here a gradual increase in the production of cereals, is brought about by the proper use of modern agricultural machinery and CU IDOL SELF LEARNING MATERIAL (SLM)

Time Series Analysis 357 techniques, better manure and chemical fertilizers and also the farmer’s efforts. Maximum yields can be expected when farmers know exactly the proper proportions of nutrients that are ideal for crops. An hectare of agricultural land would need several hundred kilograms of nitrogen, phosphorous and potash and a fair sparkling of secondary and micro elements like iron copper. In other words this gradual increase can be attributed to the growing realization among farmers about the advantages of profit-oriented farming, modern equipment, improved seeds, greater use of fertilizers, pesticides and modern methods of farming. The general trend is influenced by a group of factors which include seasonal fluctuations, cyclical fluctuations and also unforeseen events. The trend can therefore be clearly studied when these different influences are clearly eliminated. 16.3.2 Seasonal Fluctuations Eliminating seasonal influences is no easy task. The occurrence of seasons is responsible for these fluctuations. Natural phenomena are always unavoidable beyond any measure of control. For instance, variations in food grain production over time depend on whether the season is favorable or not. It is quite possible that production may be very high and beyond expectations in a certain season and very low at other times. In short seasonal influences are more or less regularly present in any time series. 16.3.3 Cyclical Fluctuations These are due to the occurrence of business or trade cycles in the economic world. For instance; there may be a period of depression followed by a boom after a few years. It is at such times that there is considerable variability in the production of cereals. Trade cycles cause up and down movements in any time series of economic data. 16.3.4 Irregular Variations In many time series of economic data, we observe sudden upward or downward movements in the trend. Such movements are irregular and are brought about by unforeseen events and sudden changes in economic activities and business conditions. Further, political factors too contribute to uncertain and significant changes. Other irregular and unforeseen events may be wars, earthquakes, revolutions, fires and floods, which cause sudden and unexpected increase in prices in general. CU IDOL SELF LEARNING MATERIAL (SLM)

358 Business Mathematics and Statistics Further technological progress may lead to an increase in industrial activities. Sometimes agricultural prosperity might be ushered in by a bounty of nature - favorable rainfall etc. agricultural prosperity in turn may bring about falling prices in foodgrains and other commodities. In short, irregular movements are changes by ‘fits and starts’. 16.4 Measurement of General Trend A general trend can be measured by the following methods: 1. Graphical method-freehand curve. 2. Method of moving averages. 3. Method of least squares. 16.4.1 Graphical Method This method consists in drawing the curve of the time series on graph paper. First of all, certain points representing the time series are plotted on the graph paper and these plotted points are joined by a smooth curve. Such a curve gives a picture of the different up and down movements in the time series. This curve, when it is drawn smoothly, fairly and regularly, shows the general direction of the trend. This method gives a rough indication of the nature of the trend and is especially useful to all those who do not know other techniques of measuring trends. The main drawback of this graphical method of indicating a trend is that different curves may be drawn by different persons for the same data and may provide a misleading picture. 16.4.2 Method of Moving Averages Moving Averages are calculated on the basis of a given period, such as, 3-yearly or 4-yearly or 5-yearly. For instance, 3-yearly moving averages are calculated by taking the average of the values of 3 years beginning from the first year. This method is especially useful in the case of a time series with regular cyclical movements. If the periodicity of a time series is known, moving averages can be calculated easily. The following examples make the method quite clear. 16.4.3 Solved Problems Problem 1: Calculate the three yearly moving average of the net production (million tonnes) of cereals. CU IDOL SELF LEARNING MATERIAL (SLM)

Time Series Analysis 359 Year: 1981 1982 1983 1984 1985 1986 1987 1988 1989 1990 1991 1992 1993 Production: 40.01 40.59 45.36 53.44 51.59 50.33 52.67 49.35 57.29 56.76 60.66 62.08 58.63 Year Production Three Three yearly yearly Total Moving Average 1981 40.01 1982 40.59 — — 1983 45.36 125.96 41.99 1984 53.44 139.39 46.46 1985 51.59 150.39 50.13 1986 50.33 155.36 51.79 1987 52.67 154.59 51.53 1988 49.35 152.35 50.78 1989 57.29 159.31 53.10 1990 56.76 163.40 54.47 1991 60.66 174.71 58.24 1992 62.08 179.50 59.83 1993 58.63 181.37 60.46 — — Problem 2: Calculate four-yearly moving averages and five yearly moving averages for the following data: Year: 1982 1983 1984 1985 1986 1987 1988 1989 1990 1991 1992 1993 1994 Profits: 112 114 119 111 106 101 113 118 121 119 123 124 126 Year Profits Four-yearly Total of Four-yearly Moving Two Moving Average 1982 112 Total 1983 114 4-yearly Centered 1984 1985 119 456 113.25 1986 450 906 110.87 111 108.50 887 106 437 431 868 CU IDOL SELF LEARNING MATERIAL (SLM)

360 Business Mathematics and Statistics 1987 101 869 108.62 438 111.37 1988 113 115.50 891 119.00 1989 118 453 121.00 122.37 1990 121 924 471 1991 119 952 1992 123 481 1993 124 968 1994 126 487 Year Profits 979 492 1982 112 1983 114 Four-yearly Total of Four-yearly 1984 119 Moving Two Moving Average 1985 111 Total 1986 106 4-yearly Centered 1987 101 1988 113 562 906 112.4 1989 118 551 887 110.2 1990 121 550 868 110 1991 119 549 869 1098 1992 123 559 891 111.8 1991 124 572 924 114.4 1994 126 594 952 118.8 605 968 121 613 979 122.6 16.4.4 Merits of the Moving Average Method 1. It is particularly useful in the case of a time series with regular cyclical movements. 2. Moving averages can be easily calculated and readily understood. 3. The trend either increasing or decreasing that may exist in a time series is clearly brought out by this method. CU IDOL SELF LEARNING MATERIAL (SLM)

Time Series Analysis 361 4. It is far better than the freehand curve method. 5. If the period of the moving averages and the duration of the cyclical variation happen to be the same, then the moving average method completely eliminates the variations. 16.4.5 Demerits of the Moving Averages Method 1. It is applicable to only such time series that show periodic fluctuations. 2. The trend values at the two extremities of a time series cannot be determined. 3. It is not useful in the case of a time series with regular variations. 16.4.6 Method of Least Squares Instead of drawing an observation-trend-line for a time series freehand, it is better to fit a straight line trend more accurately using the mathematical method of least squares. This straight line so drawn is called the ‘Line of Best Fit’. It is a straight line drawn so that the sum of the squares of the vertical deviations of the plotted points of the time series is the least. It is for this reason that the method is called the method of least squares. If the equation to the line of best fit is written as Y = a +bx, then the values of the constants a and b are determined by using the following two normal equations: 6y = Na + b6x 6XY = a6X + b6x2 Where X represents the year (time). Y represents the actual values of the terms in the series. N is the number of terms in the time series. This method can be best explained by the following example: Problem 3: Fit a straight line trend using the method of least squares to the/scale^ of X, XZ Industries as given below: CU IDOL SELF LEARNING MATERIAL (SLM)

362 Business Mathematics and Statistics Year Sales in Thousands of Rupees 1988 1989 204 1990 1230 1991 192 1992 ?~n 274 X Year Y XY X2 1 1988 2 204 204 1 1989 3 230 460 4 1990 4 192 576 9 1991 5 250 1000 16 1992 15 274 1370 25 1150 3610 55 Substituting the values N = 5, XY = 1150, 6X = 15, 6XY = 3610, 6X2 = 55 in the two. Normal equations: 6Y = Na + b6x … (1) 6XY = a6X + b6X2 … (2) We get 1150 = 5a + 15b … (1) 3610 = 15a + 55b Now, 3450 = 15a + 45b That is multiplying equation (1) by 3 and subtracting the resulting equation (1) from equation (2) we get: 160 = 10b 160 ? b = 10 = 16 Now 5a = 1150 – 240 910 ? a = 5 = 1182 The equation to the line of best fit is: Y = 182 + 16x CU IDOL SELF LEARNING MATERIAL (SLM)

Time Series Analysis 363 By putting X = 1, 2, 3, 4, 5 in the above equation, we obtain the trend values 198, 214, 230, 246 and 262 respectively. X = 1, Y = 182 + 16 = 198 X = 2,Y = 182 + 32 = 214 X = 3, Y = 182 + 48 = 230 X = 4, Y = 182 + 64 = 246 X = 5, Y = 182 + 80 = 262 Short Cut Method We can simplify above calculations by using a short cut method. This method consists of numbering the years (time) under the column heading X in a different manner. We put zero at the centre and write negative and positive values above and below it — that is, on either side of zero. By doing so, we get X = 0 in the two normal equations we obtain the following simplified equations: 6Y = Na6XY = b6X2 ¦Y ¦ XY a = N b = ¦x2 We now use the short cut method to fit a straight line trend (least squares) to the data given in the previous example. Let the equation to the line of best fit be: Y = a + bX To find the values of the constants a & b we prepare the following table: Year YX X2 XY 1988 204 –2 4 -408 1989 230 –1 1 -230 1990 192 00 1991 250 0 1 250 1992 274 +1 4 548 +2 1150 10 160 For the table we have N = 5, 6x2 = 10, 6X = 1150, 6XY = 1600. CU IDOL SELF LEARNING MATERIAL (SLM)

364 Business Mathematics and Statistics Substituting these values in the two simplified equations it follows that: ¦Y 1150 a = N = 5 = 230 And, ¦ XY 160 b = ¦ x2 = 10 = 16 Therefore the equation to the line of best fit is – Y = 230 + 16X By putting x = –2, –1, 0, 1 and 2 in this equation, at each stage, we get the values of the trend ordinates 198, 214, 230, 246 and 262 respectively. The following graph shows vividly actual sales trend, both in freehand and by the method of least squares. SALES OF X, Y, Z INDUSTRIES 300 250 200 ACTUAL SALES AND FORECAST SALES ACTUAL SALES 150 TREND (FREE·HAND) TREND (LEAST·SQUARES) 100 50 0 1988 1989 1990 1991 1902 CU IDOL SELF LEARNING MATERIAL (SLM)

Time Series Analysis 365 16.5 Solved Problems Problem 5: Fit an exponential trend Y = abx to the following data by the method of least squares and find the trend values for the years 1994-98. Year(X): 1994 1995 1996 1997 1998 1999 2000 Sales(Y) : 87 97 113 129 202 195 193 The exponential curve, is given by the equation Y = abx which can be written in the logarithmic form Log Y = log a + X log b … (1) If the deviations (X) of the given years are calculated from the middle year 1945, then 6X = 0 and the values of log a and log b is given by: ¦log Y ¦(Xlog Y) log a = N and, log b = ¦X2 Year Sales X Log Y XLogY X2 1994 87 –3 1.9395 –5.8185 9 1995 97 –2 1.9868 –3.9736 4 1996 113 –1 2.0531 –2.0531 1 1997 129 0 2.1106 0 1998 202 1 2.3054 0 1 1999 195 2 2.2900 2.3054 4 2000 193 3 2.2856 4.58 9 6.8568 0 14.9710 28 1.8970 N = 7, 6logY = 14.9710, 6(X log Y) = 1.8970, 6X2 =28. ¦log Y 14.9710 log a = N 7 = 2.1387 And 6(X log Y) = log b (6X2) ¦(X log Y) 1.9870 log b = ¦ X2 28 = 0.0678 Substituting the values of log a and log b in equation (I), we obtain the equation for the exponential trend in the logarithmic form CU IDOL SELF LEARNING MATERIAL (SLM)

366 Business Mathematics and Statistics log Y = 2.1387 + 0.0678X. Further a = Antilog (2.1387) = 137.6 b = Antilog (0.0678) = 1.169. Therefore the equation of the exponential curve is: Y = 137.6(1.169)x Substituting the relevant values of X (deviations from 1945 in the equation log Y = 2.1387 + 0.0678 X we get the logarithms of the trend values for the different years. By finding the anti- logarithms of these we obtain the trend values. The trend values for the years 1944 to 1948 are as follows: Year X log Y 1994 –1 2.1387 + (0.0678) (–1) = 2.0709 1995 0 2.1387 + (0.0678) (0) = 2.1387 1996 1 2.1387 + (0.0678) (1) = 2.2065 1997 2 2.1387 + (0.0678) (2) = 2.2743 1998 3 2.1387 + (0.0678) (3) = 2.3421 Year Trend values 1994 Antilog (2.0709) = 117.7 1995 Antilog (2.1387) = 137.6 1996 Antilog (2.2065) = 160.9 1997 Antilog (2.2743) = 188.0 1998 Antilog (2.3421) = 219.9 Problem 6: Use the method of moving averages (three yearly) to determine the trend in the following series showing the index numbers for values of imports into India during 1914-1928. 87, 62, 47, 24, 45, 57, 96, 97, 84, 79, 77, 80, 92, 106, and 113. CU IDOL SELF LEARNING MATERIAL (SLM)

Time Series Analysis 367 Solution: Year Index Three- Three-yearly No. yearly Total Moving Total Average 1914 87 i 1915 62 196 65.33 1916 47 133 44.33 1917 24 116 38.67 1918 45 126 42.00 1919 57 198 66.00 1920 96 250 83.33 1921 97 277 92.33 1922 84 260 86.67 1923 79 240 80.00 1924 77 236 78.67 1925 80 249 83.00 1926 92 278 92.67 1927 106 311 103.67 1928 113 Problem 7: Calculate the five-yearly moving averages of students studying in a commerce college as shown by the following figures. Year: 1991 92 93 94 95 96 97 98 99 2000 No. of students: 332 317 357 392 402 405 410 427 405 438 Draw a graph to represent the moving averages. Solution: Year No. of Five-yearly Five-yearly Students Total Moving Average 1991 332 1800 360.00 1992 317 1873 374.60 1993 357 1966 393.20 1994 392 2036 407.20 1995 402 2049 409.80 1996 405 2085 417.00 1997 410 1998 427 1999 405 2000 438 CU IDOL SELF LEARNING MATERIAL (SLM)

368 Business Mathematics and Statistics Problem 8: The following table shows the yearly available supplies of all cereals per adult equivalent of population in India for a number of years. Draw a graph to represent the data: Year: 1981 1982 1983 1984 1985 1986 1987 1988 1989 1990 1991 1992 Supplies in Kg.: 214 208 204 204 190 203 195 180 191 176 178 179 Calculate three-year moving averages and plot them on the same chart. What is the deviation of the actual value from the moving average for the year 19867 Solutions: Year Supply Three Yearly Three Yearly Total Moving Average 1981 214 1982 208 626 208.67 1983 204 616 205.33 1984 204 598 199.33 1985 190 597 199.00 1986 203 588 196.00 1987 195 578 192.67 1988 180 566 188.67 1989 191 547 182.33 1990 176 545 181.67 1991 178 533 177.67 1992 179 CU IDOL SELF LEARNING MATERIAL (SLM)

Time Series Analysis 369 Problem 9: Calculate four-yearly moving averages for the following data: Year Bank Four Yearly Two Yearly Four Yearly Clearances Total Total of Four Moving 1986 1987 53 Yearly Totals Average 0 1988 79 70.500 1989 76 0 0 74.375 1990 66 274 564 79.875 1991 69 290 595 86.125 1992 94 305 639 90.000 1993 105 334 689 1994 87 355 720 79 365 Problem 10: Calculate five-yearly moving averages for the following time series: Year: 1 2345678 9 Annual figure: 78 67 107 142 152 155 160 177 155 Year Annual Fig. Five Yearly Five Yearly Total Moving Average 1 78 2 67 546 109.2 3 107 623 124.6 4 142 716 143.2 5 152 786 157.2 6 155 799 159.8 7 160 8 177 9 155 CU IDOL SELF LEARNING MATERIAL (SLM)

370 Business Mathematics and Statistics Problem 11: Calculate four-yearly centered moving averages for the production in million tonnes for the following data. Year Production Five-yearly Fived-yearly Total Moving Average 1990 65 1991 62 316 63.2 1992 61 319 63.8 1993 63 320 64 1994 65 326 65.2 1995 68 323 64.6 1996 63 317 63.4 1997 67 1998 60 1999 59 Solution: Production Four-yearly Two-yearly total Four-yearly Year Total of Moving Average Four-yearly Total 1990 68 254 1991 62 251 505 63.125 1992 61 259 508 63.500 1993 63 263 651 64.500 1994 65 258 621 65.250 1995 68 258 521 65.250 1996 63 249 507 63.375 1997 67 1998 60 1999 59 CU IDOL SELF LEARNING MATERIAL (SLM)

Time Series Analysis 371 Problem 12: Fit a straight line trend by the method of least squares to the following data: X (Years): 1991 1992 1993 1994 1995 1996 1997 (Production): 80, 90 92 83 94 99 92 Years XY XY X2 1991 1 80 80 1 1992 2 90 180 4 1993 3 92 276 9 1994 4 83 332 16 1995 5 94 470 25 1996 6 99 594 36 1997 7 92 644 49 28 630 2576 140 Substituting the values N = 7, 6Y = 630, 6X = 28 in the two normal equations: (1) we get 630 = 7a + 28b i.e., 90 = a + 4b ... 2576 = 28a + 140b i.e., 92 = a + 5b... (2) Subtracting (1) from (2) we get b = 2 and a = 90 – 4b = 90 – 8 = 82 The equation to the line of the best fit: Y = 82 + 2x By putting X= 1, 2, 3, 4, 5, 6, 7 in the equation we obtain trend values 84, 86, 88, 90, 92, 92, 94, 96 respectively. x = 1, Y = 82 + 2 = 84 x = 12 Y = 82 + 4 = 86 x = 3, Y = 82 + 6 = 88 x = 4, Y = 82 + 8 = 90 x = 5, Y = 82 + 10 = 92 x = 6, Y = 82 + 12 = 94 x = 7, y = 82 + 14 = 9 6 CU IDOL SELF LEARNING MATERIAL (SLM)

372 Business Mathematics and Statistics Problem 13: The following are the annual profits in thousands of rupees of a certain business. Year: 1991 1992 1993 1994 1995 1996 1997 Profit in thousands of rupees: 60 72 71 65 80 85 95 Use the method of least squares to fit a straight line to the above data. Plot the above figures and draw the line. Also make an estimate of the profits ~ 1998. Solution: Let the equation to the line of best fit be Y = a + sx, where Y represents profits in thousands of rupees and X the years. To find the values of the constants a and b we prepare the following table: Year Y X XY X2 1991 60 –3 –180 9 4 1992 72 –2 –144 1 0 1993 75 –1 –75 1 4 1994 65 0 0 9 1995 80 1 80 1996 85 2 170 1997 95 3 285 Total 528 0 136 28 From the table, we have N = 5, 6X2 = 28, 6Y = 528, 6XY = 136. Substituting these values in the two simplified equations we get: ¦Y 532 a = N 7 = 76 And, ¦XY 136 b = ¦X2 28 = 4.86 Therefore the equation to the line of best fit is: = 76 + 4.86X By putting, X = –3, –2, –1, 0, +1, +2, +3 in this equation, we get the trend values as follows: 61.42, 66.28, 71.14, 76.00, 80.86, 85.72, 90.58. Putting X = –3, y = 76 + (4.86) (–3) = 76 – 14.58 = 61.42 CU IDOL SELF LEARNING MATERIAL (SLM)

Time Series Analysis 373 Putting X = –2, Y = 76 + (4.86) (–2) = 76 – 9.72 = 66.28 Putting X = –1, y = 76 + (4.86) (–1) = 76 – 4.86 = 71.14 Putting X = 0, y = 76 + (4.86) (0) = 76.00 Putting X = 1, y = 76 + (4.86) (1) = 80.86 Putting X = 2, y = 76 + (4.86) (2) = 76 + 9.7285.72 Putting X = 3, y = 76 + (4.86) (3) = 76 + 14.58 = 90.58 By putting X = 4, we the profits for 1958 are: = 76 + (4.86) (4) = 76+19.44 = 95.44 Profits for 1998 = ` 95.44 (in thousands) Problem 14: Given below are the production figures (in thousand maunds) of a sugar factory. Obtain the secular trend by fitting a straight line. Year: 1990 1991 1992 1993 1994 1995 1996 Production: 12 10 14 11 13 15 16 Also plot the trend values to indicate the trend line. CU IDOL SELF LEARNING MATERIAL (SLM)

374 Business Mathematics and Statistics Solution: Let the equation to the line of best fit be Y = a + bX where Y represents the sugar production in thousand maunds. To find the values of the constants a and b we prepare the following table. In this table, X stands for the deviation of the years from 1993. Year Y X XY X2 1990 12 –3 36 9 1991 10 –2 –20 4 1992 14 –1 –14 1 1993 11 0 1994 13 00 1 1995 15 +1 +13 4 1996 16 +2 +30 9 +3 +41 91 28 0 21 From the table, we have N = 7, 6X2 = 28, 6Y = 91, 6XY = 21. Substituting these values in the two simplified equation we get: ¦Y 21 a = N 31 = 13 And, ¦XY 21 b = ¦X2 28 = 0.75 Therefore the equation to the line of best fit is: Y = 13 + 0.75X By putting X = –3, –2, –1, 0, +1, + 2, +3 in this equation, we get the following trend values: 10.75, 11.50, 12.25, 13.00, 13.75, 14.50, 15.25. These trend values may be plotted on the graph and the line of best fit may be drawn. 16.6 Summary An orderly set of numbers written in the order corresponding to time is called a time series. For instance, a series of terms showing the profits of a business concern over a number of years is a time series because the terms of series representing profits are written in relation to time. A CU IDOL SELF LEARNING MATERIAL (SLM)

Time Series Analysis 375 careful scrutiny of a time series along with a detailed analysis of condition in the past enables one to forecast future changes and estimates. The following is a time series showing the production of cereals in India in million tonnes. Net Production (million tonnes)* of Cereals in India Year Production 1951 40.01 1952 40.59 1953 45.36 1954 53.44 1955 51.59 1956 50.33 1957 52.67 1958 49.35 1959 57.29 1960 56.76 1961 60.66 1962 62.08 1963 58.43 A detailed analysis of the above time series enables us to find the different types of movements influencing its course. 1. General Trend 2. Seasonal Changes 3. Cyclical Fluctuations 4. Irregular Variations 1. General Trend: The time series of the net productions of cereals in India shows that there is certainly a gradual increase in the production of cereals even though the increase is not regular. This gradual increase is called the “General Trend” or secular trend. 2. Seasonal Fluctuations: Eliminating seasonal influences is no easy task. The occurrence of seasons is responsible for these fluctuations. Natural phenomena are always unavoidable beyond any measure of control. CU IDOL SELF LEARNING MATERIAL (SLM)

376 Business Mathematics and Statistics 3. Cyclical Fluctuations: These are due to the occurrence of business or trade cycles in the economic world. For instance; there may be a period of depression followed by a boom after a few years. 4. Irregular Variations: In many time series of economic data, we observe sudden upward or downward movements in the trend. Such movements are irregular and are brought about by unforeseen events and sudden changes in economic activities and business conditions. A general trend can be measured by the following methods: A. Graphical method-freehand curve. B. Method of moving averages. C. Method of least squares. Graphical Method: This method consists in drawing the curve of the time series on graph paper. Method of Moving Averages: Moving Averages are calculated on the basis of a given period, such as, 3-yearly or 4-yearly or 5-yearly. Method of Least Squares: Instead of drawing an observation-trend-line for a time series freehand, it is better to fit a straight line trend more accurately using the mathematical method of least squares. This straight line so drawn is called the ‘Line of Best Fit’. If the equation to the line of best fit is written as Y = a + bx, then the values of the constants a and b are determined by using the following two normal equations: 6Y = Na + b6x 6XY = a6X + b6x2 Where X represents the year (time). Y represents the actual values of the terms in the series. N is the number of terms in the time series. The graph that represents a time series is called a ‘Historigram’. It is so called, simply because it represents a historical series which is another name for a time series. CU IDOL SELF LEARNING MATERIAL (SLM)

Time Series Analysis 377 Parabolic Trend Method: The straight line trend shows the increase or decrease in a time series by a constant amount. However, in certain cases the straight line trend may not be adequate and appropriate. In such a case the trend is indicated by a non linear curve and not by a straight line. The equation of the parabola is given by Y = a + bX + CX2 where the values of a, b, and c can be determined by solving the following normal equations: 6Y = Na + b6x + c6X2 6XY = b6x + b6x2 + c6X2 6X2Y = a6X2+ b6X3 + c6X4 When trends are plotted on a semi-logarithmic chart and arc represented by a non-linear curve, then an upward curve would indicate the increase at different rates. The rate of ioncrease depends on the slope-steeper the slope, higher the rate of increase. Using the logarithmic procedure we can calculate the exponential trends. 16.7 Key Words/Abbreviations General trend, Seasonal, Cyclical, Irregular, Variation, Fluctuation, Moving Averages, Least Sequences Method. 16.8 Learning Activity 1. Calculate five-yearly moving averages of students in a commerce college as shown by the following figures: Year No. of Students 1 1402 2 1405 3 1410 4 1427 s 1405 6 1438 7 1445 8 1447 CU IDOL SELF LEARNING MATERIAL (SLM)

378 Business Mathematics and Statistics 9 1480 10 1482 11 1482 12 1500 2. The following are the index numbers of annual production of a certain commodity. Assuming a five-yearly cycle, find the trend values. Year Production 1991 325 1992 310 1993 301, 1994 315 1995 323 1996 345 1997 335 1998 325 1999 333 2000 349 2001 365 3. Draw a graph to represent the following data showing the number of students in a college: Year No. of Students 1985 705 1986 685 1987 703 1988 687 1989 705 1990 689 1991 715 1992 685 1993 725 1994 751 Calculate the five-yearly moving averages of the above data and plot them on the same chart. CU IDOL SELF LEARNING MATERIAL (SLM)

Time Series Analysis 379 4. The following series relate to the profits of a commercial concern for eight years. Year Profits (`) 1995 15420 1996 14470 1997 15520 1998 21020 1999 26120 2000 31950 2001 35370 2002 35670 Find the trend of profits. Assume a three year cycle and ignore decimals. 5. Find the trend of bank clearances by moving averages (assume a five-yearly cycle). Years Bank Clearances (` in crores) yI y2 53 y3 79 y4 76 y5 66 y6 69 y7 94 y8 105 y9 87 y10 79 y11 104 y12 97 y13 92 101 16.9 Unit End Questions (MCQ and Descriptive) A. Descriptive Type: Short Answer Type Questions 1. Explain clearly what is meant by time series analysis. Indicate fully the importance of such analysis in business. CU IDOL SELF LEARNING MATERIAL (SLM)

380 Business Mathematics and Statistics 2. Describe the various types of fluctuations influencing a time series. Indicate fully the procedure of estimating the secular trend. 3. Explain what is meant by long term trend of a time series. Use the method of moving averages (three-yearly) to determine the trend in the following series showing index numbers for values of imports into India during 1984-1998. 87, 62, 47, 24, 45, 57, 96, 97, 84, 79, 77, 80, 92, 106 and 113. 4. What are the components of a time series? Explain the importance of the moving average method in the analysis of time series. 5. Distinguish between regular and irregular fluctuations in a time series. 6. Distinguish between seasonal variations and cyclical fluctuations. 7. Explain how a “growth factor”, a “decline factor”, a “seasonal factor II and a “cyclical factor” affect a variable over a period of time. 8. Write a short essay on “Components of a Time Series”. 9. Explain the meaning and use of a moving average. 10. Discuss the different methods of determining secular trend in a time series with their relative merits and demerits. 11. Write a note on Trend in a Time Series’. 12. Discuss any two methods of determining the secular trend in a time series with their merits and demerits. 13. What is a “historical series”? What are its main components? Discuss briefly anyone method of determining the general trend present in a given historical series. 14. Calculate the five-yearly moving averages of students in a commerce college as shown by the following figures. Year No. of Students 1 402 2 405 3 410 4 427 5 405 CU IDOL SELF LEARNING MATERIAL (SLM)

Time Series Analysis 381 6 438 7 445 8 447 9 480 10 482 11 482 12 500 15. Explain the “moving averages method II to find the secular trend in a time series. Mention its uses and limitations. 16. (a) What is a time series? What is meant by the analysis of time series? Discuss the practical utility of the analysis of a time series. (b) The following are the index numbers of the annual production of a certain commodity. Assuming a five-yearly cycle, find the trend values. Year Index Num 1991 225 1992 210 1993 201 1994 215 1995 223 1996 245 1997 235 1998 225 1999 233 2000 249 2001 265 17. Explain the method of moving averages in finding the trend in a time series data. 18. What is a “secular trend”? State the important method of finding the secular trend, explaining in detail anyone of them. 19. (a) What is a time series? State the various components of a time series? What is the purpose of such an analysis? (b) Determine the trend component for the following time series using the five-yearly moving averages. CU IDOL SELF LEARNING MATERIAL (SLM)

382 Business Mathematics and Statistics Year Index Number (in lakh) 1988 1989 80 1990 79 1991 98 1992 77 1993 64 1994 51 1995 50 1996 54 1997 53 1998 61 1999 123 2000 119 2001 120 98 20. Calculate four-yearly centered moving averages production in million pounds for the following data: Year Prod 1990 68 1991 62 1992 61 1993 63 1994 65 1995 68 1996 63 1997 67 1998 60 1999 59 CU IDOL SELF LEARNING MATERIAL (SLM)

Time Series Analysis 383 B. Multiple Choice/Objective Type Questions: 1. A time series is also called __________. (a) Value series (b) Historical series (c) biographical series (d) None of these. 2. General Trend in a time series is also called _____________. (a) Secular Trend (b) Fine Trend (c) Peculiar Trend (d) None of these 3. The graph that represent a time series is called a __________. (a) Biographical (b) Historical (c) Historigram (d) All the above 4. Parabolic Trend method is the straight line trend shows the increase or decrease in a time series by __________ amount. (a) Varying (b) Constant (c) Recurring (d) All the above Answers: (1) (b); (2) (a); (3) (c); (4) (b) 16.10 References References of this unit have been given at the end of the book. ˆˆˆ CU IDOL SELF LEARNING MATERIAL (SLM)

384 Business Mathematics and Statistics REFERENCES 1. Dr. A.B. Rao, “Business Mathematics”, Himalaya Publishing House Pvt. Ltd. 2. Dr. A.B. Rao, “Business Statistics”, Himalaya Publishing House Pvt. Ltd. 3. Dr. A.B. Rao, “Decision-making”, Himalaya Publishing House Pvt. Ltd. 4. Dr. A.B. Rao, “Operation Research”, Jaico Publishing House Pvt. Ltd. 5. Dr. A.B. Rao, “Quantitative Techniques in Business”, Jaico Publishing House Pvt. Ltd. 6. Lenin Jothi, “Business Mathematics”, Himalaya Publishing House Pvt. Ltd., Mumbai. 7. http://data.conferenceworld.in/SGTB/P726-729.pdf 8. https://www.inc.com/encyclopedia/annuities.html 9. https://blog.myrank.co.in/combination-properties-of-%E2%81%BFcr/ 10. https://blog.myrank.co.in/applications-of-permutation-and-combination/ 11. https://www.hindawi.com/journals/mpe/2010/723402/ ˆˆˆ CU IDOL SELF LEARNING MATERIAL (SLM)


Like this book? You can publish your book online for free in a few minutes!
Create your own flipbook