Dispersion 245 Solution: F xf dA fdA Cf dm fdm 248 X 2 8 5.7 11.4 628 4 11 0 0 4 5 14 3.7 14.8 6 3 14 2 6 8 2 40 1.7 85 16 4 8 10 1 17 6 6 12 4 30 0.3 09 21 8 32 14 16 21 24 2.3 46 68 Total 14 4.3 43 64 6.3 2.2 4 f.7 ¦ xf 204 Mean = n 21 = 9.7 ¦fdA 69.7 Mean deviation from mean = n 21 = 3.3 3.3 Mean coeficient of dispersion = 9.7 = 0.34 § n 1· Median = Value of ©¨ 2 ¸¹th item Median = Value of § 2121¸·¹th item ¨© Median = Value of the 11th item = 8 ¦ fdm 68 Mean Deviation from Median = n 21 3.238 Mean coefficient of dispersion = 8 = 0.40475 CU IDOL SELF LEARNING MATERIAL (SLM)
246 Business Mathematics and Statistics Problem 4: Calculate the mean deviation from mean for the data given below. Class f 0-5 3 5-10 7 10-15 12 15-20 18 20-25 13 25-30 10 30-35 7 70 dA Deviation from Mean 18.86 Solution: Class M.V. fD d fd dA / dA 0-5 2.5 3 –15 –3 –9 16.36 49.08 5-10 7.5 7 –10 –2 –14 11.36 79.52 10-15 12.5 12 –5 –1 –12 6.36 76.52 15-20 [17.5] 18 0 0 0 1.36 24.48 20-25 22.5 13 +5 +1 + 13 3.64 47.32 25-30 27.5 10 + +2 +20 8.64 86.40 30-35 32.5 7+ +3 +21 13.64 95.48 Total 70 + 19 458.60 ¦ fd Mean = a + n * i Mean = 17.5 + 19 * 5 70 ¦ fdA Mean = 17.5 + n 458.60 Mean = 70 MD for Mean = 6.55 CU IDOL SELF LEARNING MATERIAL (SLM)
Dispersion 247 Problem 5: Calculate mean deviation from mean, median and mode from the following data: Class 3 7 0-10 15 10-20 12 20-30 8 30-40 5 40-50 50-60 50 Solution: Class M.V. f cf D d fc\\ 0-10 5 3 3 –20 –2 –6 10-20 15 7 10 –10 –1 –7 20-30 25 15 25 0 0 0 30-40 35 12 37 +10 +1 :t 12 40-50 45 8 45 +20 +2 + 16 50-60 55 5 50 +30 +3 + 15 50 +30 ¦ fd Mean = a n * i 30 Mean = 25 + 50 * 10 Mean = 25 + 6 Mean = 31 §n· Median = Value of ¨© 2 ¹¸th item § 50 · Median = Value of ©¨ 2 ¸¹th item Median = Value of the 25th item Median lies in the class 20-30 CU IDOL SELF LEARNING MATERIAL (SLM)
248 Business Mathematics and Statistics Median = L+ i§ N · f ¨© 2 CF¹¸ 10 Median = 20 + 15 (25 - 10) Median = 20 + 10 Median = 30 F1 F0 Mode = L + 2F1 F0 F2 * l 15 7 Mode = 20 + 30712 =10 8 Mode = 20 + 11 * 10 80 Mode = 20 + 11 Mode = 20 + 7.27 Mode = 27.27 Class M.V. / d a^ dm /dm dm a3 0-10 5 3 CD CM* +78 25 75 22.27 66.81 15 105 12.27 85.89 10-20 15 7 + 16 + 112 5 75 2.27 34.05 20-30 2% 15 +6 -’-90 5 60 7.73 95.76 15 120 17.73 141.84 30-40 35 12 +4 +48 25 125 27.73 138.65 40-50 45 8 + 14 + 112 50-60 55 5 +24 + 120 560 560 ¦ fd MD from Mean n = 11.2 MD from mode and median is also 11.2. CU IDOL SELF LEARNING MATERIAL (SLM)
Dispersion 249 Problem: In the mean deviation room median and mode in the.following data: Age No. of Persons 16 30 17 32 18 35 19 46 20 51 21 60 22 43 23 32 24 19 Solution: By inspection, it follows that the median is 20 and mode is 21. Median Value § n 1· Median = ¨© 2 ¸¹th item Median = § 34821·¸¹th item ¨© Median = Value of the (174-.5)th item = 20 cf (by inspection of the cumulative frequencies). 30 62 Age F 97 143 16 30 194 17 32 254 18 35 297 19 46 329 20 51 348 21 60 22 43 23 32 24 19 CU IDOL SELF LEARNING MATERIAL (SLM)
250 Business Mathematics and Statistics Age f Deviation fdm Deviation room 1 dm room 0 dm 21 dm 16 30 120 150 17 32 4 96 5 128 18 35 3 70 4 105 19 46 2 46 3 20 51 10 0 2 92 21 60 60 1 51 22 43 1 86 0 0 23 32 2 96 1 43 24 19 3 76 2 64 4 3 57 650 690 ¦ fdm Mean Deviation from Median = n 650 Mean Deviation from Median = 348 = Antilog (log 650 – log 348) = Antilog (2.8129 – 2.5416) = 1.867 ¦ fdm Mean Deviation from Median = n 650 Mean Deviation from Median = 348 = Antilog (log 690 - log 348) = Antilog (2.8388 - 2.5416) = Antilog (0.2972) = 1.867 11.2.3 Quartile Deviation This measure also called the semi-inter quartile range is based on the third and the first quartiles. It is easy to calculate, because the deviations of individual items of the data need not be determined as in the case of mean deviation. Further, it provides a rough idea about the dispersion in given CU IDOL SELF LEARNING MATERIAL (SLM)
Dispersion 251 data. Apparently, the greater the distance between the quartiles, the greater is the dispersion. If two diferent frequency distributions have the same values or quartiles, it follows that the quartile deviation or each distribution is also the same. This in turn implies that the dispersion of both distributions is identical. Hence, because of this disadvantage, quartile deviation is not an accurate measure of dispersion. Problem 7: Calculate the quartile deviation and its coeficient or the following data: 8, 3, 15, 4, 18, 6, 11. Solution: Arranging the data in ascending order: 3, 4, 6, 8, 11, 15, 18. n = 7. First Quartile: Ql = Value of the § n 4 1¹·¸th item ¨© § 71· Q1 = Value of ¨© 4 ¹¸th item Q1 = Value of the 2nd item Q1 = 4 § n 1· Third Quartile: Q3 = Value of 3¨© 4 ¸¹th item § 71· Q3 = Value of ¨© 4 ¸¹th item Q3 = Value of 6th item Q3 = 15 Quartile Deviation = Q3 Q1 2 154 11 =2 2 = 5.5 Coefficient of Quartile Deviation is Q3 Q1 154 11 Q3 Q1 254 29 = 0.58 CU IDOL SELF LEARNING MATERIAL (SLM)
252 Business Mathematics and Statistics Problem 8: Calculate the quartile deviation and its coeficient or the following data: Marks No. of Students 10 4 11 6 12 7 13 8 14 6 15 4 35 Marks cf 10 11 44 12 6 10 13 7 17 14 8 25 15 6 31 4 35 Solution: First Quartile: 35 Third Quartile: Q1 = Value of the § n 4 1¹¸·th item ©¨ § 35 1· Q1 = Value of ©¨ 4 ¹¸th Item Ql = Value of the 9th item Q1 = 11 § n 1· Q3 = Value of 3©¨ 4 ¸¹th item § 35 1· Q3 = Value of 3©¨ 4 ¹¸th Item Q3 = Value of 27th item CU IDOL SELF LEARNING MATERIAL (SLM)
Dispersion 253 Q3 = 14 Quartile Deviation = Q3 Q1 2 1411 3 =3 2 = 1.5 Q3 Q1 14 11 4 Q3 Q1 14 11 25 = 0.12 Problem 9: Calculate the quartile deviation of quartile deviation or the following data, and the coeficient. Class Frequency 0-10 3 10-20 7 20-30 15 30-40 12 40-50 50-60 8 5 50 Class f cf 0-10 3 3 10-20 7 10 20-30 15 25 30-40 12 37 40-50 8 45 50-60 5 50 Solution: §n· First Quartile: Ql = Value of the ©¨ 4 ¹¸th item § 50 · Q1 = Value of ©¨ 4 ¸¹th item Q1 = Value of the (12.5)th item CU IDOL SELF LEARNING MATERIAL (SLM)
254 Business Mathematics and Statistics Q1 Lies in the class 20-30 Q1 = L+ i§ N · f ¨© 4 CF¹¸ Where L = Lower limit of the first quartile class (L = 20) i = Length of the class interval (i = 10) F = Frequency of the first quartile class (F = 15) CF = Cumulative Frequency of the class preceding the first quartile class (CF = 10) 10 Q1 = 20 + 15 (12.5 – 10) 10 Q1 = 20 + 15 (2.5) 2 Q1 = 20 + 1 13 Q1 = 20 + 1.67 Q1 = 21.67 Third Quartile: Q3 = Value of ª§ n ·º item «¬3©¨ 4 ¹¸ CF»¼th Q3 = Value of (37.5)th item Q3 = Lies in the class 40-50 Q3 = L iª § N · º f ¬«3©¨ 4 ¹¸ CF»¼ Where L = Lower limit of the third quartile class (L = 40) i = Length of the class interval (i = 10) F = Frequency of the third quartile class (F = 8) CF = Cumulative Frequency the third quartile class (CF = 37) Q3 = L iª § N · CF º f ¬«3©¨ 4 ¹¸ ¼» CU IDOL SELF LEARNING MATERIAL (SLM)
Dispersion 255 Q3 = 4010 37.537 8 Q3 = 40 + 0.625 40.63 approximately Quartile Deviation = Q3 Q1 2 40.6321.67 18.96 =2 2 = 9.48 Q3 Q1 Coefficient of Quartile Deviation = Q3 Q1 40.63 21.67 = 40.6321.67 18.96 = 62.30 = Antilog (log 18.96 – log 62.30) = Antilog (1.2779 – 1.7945) = Antilog (1.4834) = 0.3044. 11.2.4 Standard Deviation This measure measures dispersion absolutely. The method of finding it is similar to that of the mean deviation method, with one difference: here individual deviations from the arithmetic mean are squared. Here’s how you calculate standard deviation: (i) Calculate the deviation of each item from the arithmetic mean and square it. (ii) Divide the sum of these squares by the number of items in the given data. (iii) The square root of this quantity gives you the standard deviation. Its main advantages are that it is rigidly defined and, secondly, individual deviations are squared in its calculations. However, this measure also has certain drawbacks. Since the individual deviations are squared, its value is affected a great deal by the value of each item in the data. Extreme items, CU IDOL SELF LEARNING MATERIAL (SLM)
256 Business Mathematics and Statistics or example, push up its value considerably. But that does not detract from its value as a good dispersion and standard deviation is commonly used. Variance It is the term used to denote the square of the standard deviation. Symbolically a - stands or variance. Coeficient of variation: lt is possible to compare diferent types of data by finding out the percentage of the standard deviation to the arithmetic mean or each type of data. In short the ratio when multiplied by 100 gives a certain percentage this is known as the coeficient of variation. Problem 10: Calculate the standard deviation from the following data: 2, 3, 7, 8, 10. Solution: X d d2 2 –4 16 3 –3 9 7 +1 1 8 +2 4 10 +4 16 30 ¦ x 30 x = 30 AM = 5 5 = 6 ¦d2 46 9.5 = 3.033 S.D. = 5 5 Problem 11: Calculate the standard deviation from the following data: Marks No. of Students 22 53 68 75 92 CU IDOL SELF LEARNING MATERIAL (SLM)
Dispersion 257 Solution: x f xf d d2 fd.2 32 2 2 4 –4 16 3 5 3 15 –1 1 0 6 8 48 0 0 5 7 5 35 +1 1 18 9 2 18 +3 9 58 20 120 27 fd2 8 ¦ xf 120 3 Mean = n 20 = 6 0 5 SD = ¦ d2 58 2.9 = 1.703 12 5 20 Problem 12: Find the standard deviation from the following data: Class f 0-5 2 5-10 3 10-15 7 15-20 5 20-25 3 20 Solution: Class M.V. f D D fd 0-5 2.5 2 –10 –2 –4 5-10 7.5 3 –5 –1 –3 10-15 12.5 7000 15-20 17.5 5 +5 +1 +5 20-25 22.5 3 + +2 +6 20 +4 28 S.D. = ¦d2 §¨© fd ·2 *i n n ¸¹ CU IDOL SELF LEARNING MATERIAL (SLM)
258 Business Mathematics and Statistics S.D. = 28 § 4 ·2 *5 20 ©¨ 20 ¸¹ S.D. = 1.400.04 *5 S.D. = (1.166) * 5 S.D. = 5.830 Problem 13: Calculate the standard deviation and the coeficient of variation from the following data. Marks Obtained No. of Students 0-10 3 10-20 7 20-30 11 30-40 15 40-50 12 50-60 8 60-70 4 60 Solution: Class Mid Frequency Deviation Step - fd fd2 Values (f) D deviation d –9 27 0-10 5 3 –30 –3 –14 28 10-20 15 7 –20 –2 –11 11 20-30 25 11 –10 –1 30-40 35 15 o. 0 40-50 45 12 0 0 + 12 12 50-60 55 8 + 10 +1 + 16 32 60-70 65 4 +20 +2 + 12 36 +30 +3 146 Total 60 +6 SD = fd2 ¨§© fd ·2 *1 n n ¸¹ CU IDOL SELF LEARNING MATERIAL (SLM)
Dispersion 259 S.D. = 146 § 46 ·2 *10 60 ¨© 60 ¹¸ S.D. = 2.430.01 *10 S.D. = (1.556) * 10 S.D. = 15.56 ¦ fd Mean = a + n * 1 6 Mean = 35 + 60 * 10 Mean = 36 S.D. Coeficient of Variation = Mean * 100 15.56 = *100 36 1556 = 36 = 43.22 Problem 14: Which of the following two series ismore variable? Give reasons. X deviation d2 y deviation d2 d d from 96 from 144 136 –8 4 86 –10 100 129 –13 169 120 –15 225 83 –32 1024 117 21 441 –24 576 64 134 –18 324 140 –27 729 75 00 156 165 78 +11 121 170 –10 100 + 15 225 173 +27 729 1440 –4 16 96 +41 1681 +12 144 107 4814 +21 441 111 +26 676 123 +29 841 137 3812 960 CU IDOL SELF LEARNING MATERIAL (SLM)
260 Business Mathematics and Statistics X: 136 129 120 117 134 140 156 165 170 173 Y: 86 83 64 75 78 96 107 111 123 137 Solution: For X series: ¦fx 1440 Mean = n 10 = 144 ¦ fd 2 SD = n 3812 SD = 10 SD = Antilog § 1 log · ¨© 2 381.2¸¹ § 2.5811· SD = Antilog ©¨ 2 ¹¸ SD = Antilog (1.2905) SD = 19.52 SD Coefficient of Variation = *100 Mean 19.52 = 144 * 100 1952 = 144 = Antilog (log 1952 – log 144) = Antilog (3.2904 – 2.1584) = Antilog (1.1320) = 13.55 CU IDOL SELF LEARNING MATERIAL (SLM)
Dispersion 261 For Y Series: ¦ y 960 Mean = n 10 = 96 ¦ fd 2 SD = n 4814 SD = 10 SD = Antilog § 1 log · ¨© 2 481.4¹¸ § 2.6825· SD = Antilog ©¨ 2 ¹¸ SD = Antilog (1.3412) SD = 21.94 SD Coeficient of Variation = Mean * 100 21.94 = 96 * 100 2194 = 96 = Antilog (log 2194 – log 96) = Antilog (3.3412 – 1.9823) = Antilog (1.3589) = 22.85 Comparing the coefficient of variation in X series with that of Y series we find that there is a greater variation in Y series than in X series. Therefore Y series is more variable than X series. CU IDOL SELF LEARNING MATERIAL (SLM)
262 Business Mathematics and Statistics Problem 15: The following is the record of goals scored by team A in a football season: Number of Goals Scored by Number of Matches Team A in a Match 1 0 9 1 7 2 5 3 3 4 For team B, the average number of goals scored per match was 2.5 with a standard deviation of 1.25 goals. Find out which team is more consistent. Solution: ¦fx 50 Mean = n 25 = 2 SD = ¦ fd 2 where d stands for deviation from mean n 30 SD = 25 6 SD = 5 = 1095 Number of goals Number of xf Deviation fd f d2 scored matches from mean 2d X f –1 4 0 –2 –9 9 0 1 9 –1 0 0 1 9 14 0 5 5 2 7 15 +1 +6 12 3 5 12 +2 30 4 3 50 Total 25 SD Coeficient of Variation of Team A = Mean * 100 CU IDOL SELF LEARNING MATERIAL (SLM)
Dispersion 263 1.095 = 2 * 100 109.5 = 96 = 54.75 SD Coefficient of Variation of Team B = Mean * 100 1125 = 2.5 * 100 112.5 = 2.5 The coeficient of variation for team B is less than that for team A. Therefore team B is more consistent. Problem 16: Given below are the share prices of A and B. State which is stable in value. A: 55 54 52 53 56 58 52 50 51 49 B: 108 107 105 105 106 107 104 103 104 101 Solution: deviation d2 B deviation d2 d d A from 533 from 105 55 54 +2 4 108 +3 9 52 +1 1 107 +2 4 53 –1 1 105 0 0 56 0 0 105 0 0 58 +3 9 106 +1 1 52 +5 25 107 +2 4 50 –1 1 104 –1 1 51 –3 9 103 –2 4 49 –2 4 104 –1 1 530 –4 16 101 –4 16 70 1050 40 CU IDOL SELF LEARNING MATERIAL (SLM)
264 Business Mathematics and Statistics For shares A: ¦ xf 530 Mean = n 10 = 53 ¦ fd 2 SD = n 70 SD = 10 SD = 7 = 2.646 (From table of Sq. roots) SD Coefficient of Variation = Mean * 100 2646 = 53 * 100 = 4.99 For shares B: ¦ xf 1050 Mean = n 10 = 105 SD = ¦fd2 n 40 SD = 10 SD = 4 = 2 (From table of Sq. roots) SD Coefficient of Variation = Mean * 100 2 = 105 * 100 CU IDOL SELF LEARNING MATERIAL (SLM)
Dispersion 265 200 = 105 = 1.905 approx. Comparing the respective coeficients of variation in the prices of shares A and B, we find 1.905 is less than 4.99. Therefore the prices of share B are more stable than that of A. Problem 17: The scores of two batsmen A and B in ten innings during a certain season are as under. Which of the batsmen is more consistent in scoring? A: 32 28 47 63 71 39 10 60 96 14 B: 19 331 48 53 67 90 10 62 40 80 Solution: d d2 B d d2 A –14 196 19 –31 961 –18 324 32 +1 31 –19 361 28 +17 1 47 ±25 289 48 –2 4 63 625 71 –7 49 53 +3 9 39 –36 1296 10 + 14 199 67 + 17 289 60 +50 2500 96 –32 1024 90 +40 2600 14 6500 10 –40 1600 460 62 +12 144 40 –10 100 80 +30 900 500 5968 For Batsman A: ¦ x 460 AM = n 10 = 46 ¦ fd 2 SD = n 6500 SD = 10 CU IDOL SELF LEARNING MATERIAL (SLM)
266 Business Mathematics and Statistics SD = 650 = 25.49 SD Coefficient of Variation = Mean * 100 25.49 = 46 * 100 2549 = 46 = 55.40 For Batsman A: ¦ x 500 AM = n 10 = 50 ¦ fd 2 SD = n 5968 SD = 10 SD = 596.8 = 24.43 SD Coefficient of Variation = Mean * 100 24.43 = 50 * 100 2443 = 50 = 48.86 The coefficient of variation in the scores of batsman B is less than that of batsman A. Hence batsman B is more consistent in scoring. CU IDOL SELF LEARNING MATERIAL (SLM)
Dispersion 267 Problem 18: Given the sets numbers 2, ?, 8, 11, 14 and 2, 8, 14, find – (i) The mean of each set. (ii) The variance of each set. (iii) The coefficient of variation of the combined or pooled sets. Solution: Let the two sets of numbers be denoted by A and B respectively. Set A: 2 + 5 + 8 + 11 + 14 40 Mean = 8 5 5 ¦ fd2 Variance = n (28)2 + (58)2 + (88)2 + (118)2 + (148)2 = 5 3690936 90 =5 5 = 18 Set B: 2814 24 Mean = 3 =8 3 ¦ fd2 Variance = n V2 (28)2 (88)2 (148)2 = 3 36036 72 =3 3 =24 Coefficient of Variation of the combined set A and set B: Combined Mean X' = n1x1' n 2 x ' 2 n1 n2 CU IDOL SELF LEARNING MATERIAL (SLM)
268 Business Mathematics and Statistics 5*83*8 8(53) Combined Mean X' = 53 53 = 8 Variance of set A: 12 = 18 Variance of set B: 22 = 24 X'1 = 8, X'2 = 8, n1 = 5, n2 = 3, X' = 8. s= n1V 2 n2V 2 n1(x1 x)2 n2 (x2 x)2 n1 n2 5*18 *3* 24 *5(88)23(88)2 s= 53 90 72 s= 8 162 s= 8 s = 20.25 s = 4.5 Problem 19: Find the standard deviation or the data given below, after finding the missing frequency. The mean of the data is 132. Class Frequency 100-110 2 110-120 4 120-130 7 130-140 – 140-150 5 150-160 2 160-170 1 CU IDOL SELF LEARNING MATERIAL (SLM)
Dispersion 269 Class MV fD d d 100-110 105 2 –30 –3 –6 110-120 115 4 –20 –2 –8 120-130 125 7 –10 –1 –7 130-140 135 — 0 0 0 140-150 145 5 + 10 +1 +5 150-160 155 2 +20 +2 +4 160-170 165 1 +330 +3 +3 21 + x –9 Solution: n = 21 + x where x is the missing frequency. ¦ fd Mean = a + n *i (9) Mean = 135 + 21x * 10 90 132 = 135 + 21x 90 21x = 135 – 132 90 21x = 3 Therefore 90 = 63 + 3x Therefore 27 = 3x Therefore x = 9 CU IDOL SELF LEARNING MATERIAL (SLM)
270 Business Mathematics and Statistics Calculation of S.D. Class MV f D d fd fd2 100-110 105 2 –30 –3 –6 18 110-120 115 120-130 125 4 –20 –2 –8 16 130-140 135 140-150 145 7 –10 –1 –7 7 150-160 155 160-170 165 900p0 5 + 10 +1 +5 5 2 +20 +2 +4 8 1 +30 +3 +3 9 30 -9 63 SD = fd2 § fd ·2 * i n ©¨ n ¹¸ SD = 63 §¨©309 ·2 *10 30 ¹¸ SD = 2.100.09 *10 SD = 2.01 *10 SD = (1.418) * 10 SD = 14.18 §1· Note: 2.01 * 10 = Antilog ©¨ 2 ¹¸ log 2.01 * 10 § 0.3032 · = Antilog ©¨ 2 ¹¸ * 10 = Antilog (0.1516) * 10 = (1.418) * 10 = 14.18 Problem 20: From a certain frequency distribution consisting of 18 observations, the mean and the standard deviation were found to be 7 and 4 respectively. But on comparing with the original data it was found that a figure 12 was wrongly written as 21 in the calculation. Calculate the correct mean and standard deviation. CU IDOL SELF LEARNING MATERIAL (SLM)
Dispersion 271 Solution: Mean = Sum of the observations 18 Sum of the observations 7 = 18 Therefore sum of observations = 7 * 18 = 126 Correct sum of observations = 126 – 21 + 12 = 117 117 Correct Mean = 18 = 6.5 SD = V = ¦x2 § ¦x ·2 n ¨© n ¸¹ ?4 = ¦ x2 (7)2 18 ? 16 = ¦x2 -49 18 ¦x2 16 + 49 = 18 ?x2 = 18 * 65 = 1170 This is incorrect 6x2 Correct 6x2 = 1170 – (21)2 + (12)2 = 1170 – 441 + 144 = 873 Correct SD = 873 (6.5)2 18 = 48.542.25 CU IDOL SELF LEARNING MATERIAL (SLM)
272 Business Mathematics and Statistics = 6.25 = 2.5 Problem 21: The arithmetic means of two samples of sizes 60 and 90 are 2 and 48 respectively. Their standard deviations are 9 and 12 respectively. Find the standard deviation of the combined sample of size 150. We have x'l = 52, x'2 = 48, n1 = 60, n2 = 90, s1 = 9 and s2 = 12. Let x' and 0 be the AM and standard deviation respectively of the combined sample size of 150. Then, n1V12 n 2V 2 n1 (x1' x ')2 n (x '2 x '1 ) 2 2 V= 2 n1 n2 (608190 *144)60(5249.6)2 90(4849.6)2 V= 150 17820 345.6 230.4 V = 150 18396 V = 150 V = 11.07 11.2.5 Graphs: Histogram, Frequency Polygon and Curve, Ogive Curve Graphs Three types of graphs are used to represent Frequency distributions: histograms, frequency Polygons/Curves and the Ogive. Histogram This graph is used to draw a continuous frequency distribution. It consists of vertical bars adjacent to each other. The class boundaries are marked of on a suitable scale along the horizontal axis or the x-axis, while frequencies are marked along the, vertical axis or the y-axis. CU IDOL SELF LEARNING MATERIAL (SLM)
Dispersion 273 Rectangles are drawn on the class intervals marked of along the x-axis, with their heights proportional to the frequencies of the respective class intervals. The resulting figure would be a ‘histogram’ relating, to data of equal class intervals. Problem 1: Draw a histogram for the data given below: 16 20-30 30-40 40-50 50-60 60-70 70-80 14 12 10 8 6 4 2 0 10-20 In the above histogram one cm along the x-axis represents 10 marks and one cm along the y- axis represents two students. The total area of the histogram represents the total of the frequencies and the area of each rectangular bar is proportional to the frequency that it represents. Frequency Polygon and frequency Curve A frequency polygon is constructed by joining the mid-points of the upper sides of adjacent rectangles in the histogram with small straight lines. The frequency polygon is shown in figure 3. In this case, it becomes a closed polygon when the mid-point of the * class 0-10 is joined with the mid-point of the upper side of the’ rectangle on the class interval 10-20. Similarly, the mid-point of the class interval 80-90 may be joined with the mid-point of the upper side of the last rectangle. The area of the closed frequency polygon is always equal to the area of the histogram. The frequency curve is formed by drawing a smooth curve through the mid-points of the upper sides of the adjacent rectangles of the histogram. CU IDOL SELF LEARNING MATERIAL (SLM)
274 Business Mathematics and Statistics The Ogive: This example explains the ogive. The cumulative frequency distribution or the data. Marks No. of Students Less than 20 4 Less than 30 10 Less than 40 18 Less than 50 33 Less than 60 43 Less than 70 48 Less than 80 50 Points are plotted on the x-axis and the y-axis representing the above frequency distribution on a suitable scale. These plotted points are (20, 4), (30, 10), (40, 18), (50, 33), (60, 43), (70, 48) and (80, 50). When these points are joined by a smooth freehand curve, the curve that is so formed is known as the ogive or the cumulative frequency curve. This curve enables us to find the median easily. or instance, or the given data, the procedure or finding the median is as follows: we draw a straight line parallel to the x-axis, through the point of the y-axis, This line cuts the curve at the point L. from L we draw a perpendicular on the x axis. The root of this perpendicular shows the value of the median (see Fig. 4). 11.3 Summary Measures that describe, characterize or represent a given data are known as ‘Statistical Averages’. There are two main objects while calculating a statistical average. Firstly, to indicate precisely the sum and substance of an entire mass numerical data. Secondly, to serve as a means as well as a measure of comparison with other similar groups of numerical data. The different types of statistical averages are: z Arithmetic mean. z Median z Mode z Geometric mean CU IDOL SELF LEARNING MATERIAL (SLM)
Dispersion 275 z Harmonic mean z Quadratic mean z Moving average z Progressive average The arithmetic mean (A.M.) is the quantity obtained by summing up the values of items in a variable and dividing the sum by the number of items. When the different items in a data are assigned weights according to their significance (relatively), the average then calculated is called: a weighted arithmetic average. The value of the middle item of a series that is ordered either in the ascending or descending order of magnitude is called Median. The value of the item in a variable that is repeated the greatest number of times is called the mode. In other words, it is the value of the item that occurs most frequently in the data. It is said to be the most prominent item as well as a typical measurement. If x1, x2, x3 … xn are n quantities, the geometric mean (G.M.) of these quantities is the nth root of the product of these quantities. If xl, x2, x3 … xn are n items, then the harmonic mean (H.M.) of these is the reciprocal of the average of the reciprocals of these n items. While calculating quadratic mean the average of the squares of the given quantities is to be obtained first and then its square-root calculated. As its name indicates, it is an average that moves over a period of time. It’s possible to find out the trend of some given values over a period of time with this average. Progressive average is calculated to get a progressive outlook or trend in business, profits, sales, output etc. The values of those items that divide a given data into four, ten or a hundred parts are called quartiles, deciles and percentiles respectively. Hence a given data has three quartiles denoted by Q1, Q2, Q3; nine deciles D1, D2, D3 .... D9; and 99 percentiles Pl, P2, P3 ..... P99. CU IDOL SELF LEARNING MATERIAL (SLM)
276 Business Mathematics and Statistics 11.4 Key Words/Abbreviations MD =Mean Deviation QD = Quartile Deviation SD = Standard Deviation CV = Coefficient of variation 11.5 Learning Activity 1. For the data given below, calculate: (a) Standard deviation and the coefficient of variation. (b) The wage limits or the central 50% of the wage earners Weekly Wages No. of Employees 208 21 9 22 11 23 15 24 18 25 26 26 17 27 14 28 5 29 2 2. From the prices of shares X and Y given below, state which is more stable. X 29 41 83 57 62 68 51 19 80 50 Y 50 53 47 61 49 63 70 57 60 40 3. From the prices of shares A and B given below, state which is more stable A 136 129 120 117 137 140 156 165 170 173 B 86 83 64 75 78 96 107 111 123 137 CU IDOL SELF LEARNING MATERIAL (SLM)
Dispersion 277 11.6 Unit End Questions (MCQ and Descriptive) A. Descriptive Type: Short Answer Type Questions 1. What do you understand by dispersion? Explain the different measures of dispersion pointing out their relative merits and demerits. 2. Explain the practical utility of the various measures of dispersion. 3. Distinguish between mean deviation and standard deviation. 4. What do you understand by coefficient of variation? Explain its uses. 5. Is standard deviation superior to the other measures of dispersion? Give reasons. 6. For the data given below calculate (a) The coefficient of variation (b) The wage limits or the central 50% of the employees (c) The wage limit below which 15% of the employees have their earnings. Weekly Wages No. of Employees 20 3 21 9 22 11 23 15 24 18 25 26 26 17 27 14 28 6 29 2 7. Find in which factory payment of wages is more consistent. Daily Wages Factory AB 15-20 07 05 20-25 13 15 25-30 18 18 30-35 22 24 CU IDOL SELF LEARNING MATERIAL (SLM)
278 Business Mathematics and Statistics 35-40 15 13 40-45 10 08 45-50 05 07 8. Find the mean deviation from the median and mode in the following data: Age Number 16 30 17 32 18 35 19 46 20 51 21 60 22 43 23 32 24 19 9. Calculate the mean deviation from the mean and quartile deviation for the data given below Class f 0-9 4 10-19 6 20-29 8 30-39 15 40-49 10 50-59 5 60-69 2 [Ans: Mean deviation from mean = 11.94, quartile deviation = 10.91] 10. Find in which factory payment of wages is more consistent. Daily Wages Factory AB 15-20 13 21 20-25 27 39 25-30 48 43 30-35 52 57 35-40 25 22 40-45 15 18 45-50 10 10 CU IDOL SELF LEARNING MATERIAL (SLM)
Dispersion 279 11. From the following observation prepare frequency distribution table in ascending order starting with 5-10 (inclusive method) and find the mean and standard deviation. Scores 19 18 15 35 32 30 8 12 17 8 18 19 18 30 36 42 35 37 30 39 25 24 26 28 35 8 17 19 22 24 10 16 15 18 17 12 21 19 30 31 19 26 24 24 29 27 25 28 22 22 21 28 12. From the following observations prepare a frequency distribution table in ascending order starting with 100-110 (exclusive method) and find out the coefficient of variation. 125 108 112 126 110 132 136 130 149 155 120 130 136 138 125 111 119 125 140 148 147 137 145 150 142 135 137 132 165 154 B. Multiple Choice/Objective Type Questions 1. The simplest measure of dispersion is __________. (a) Range (b) Mean (c) Mode (d) Deviation 2. __________ deviation is also called as semi-inter quartile range. (a) Fractile (b) Quartile (c) Standard (d) None of these 3. __________ is the term used to denote the square of the standard deviation. (a) Mean (b) median (c) Variance (d) None of these 4. __________ is used to draw a continuous frequency distribution. (a) Histogram (b) Graph (c) Frequency polygon (d) Ogive curve Answers: (1) (a); (2) (b); (3) (c); (4) (a) 11.7 References References of this unit have been given at the end of the book. CU IDOL SELF LEARNING MATERIAL (SLM)
280 Business Mathematics and Statistics UNIT 12 SAMPLING Structure 12.0 Learning Objectives 12.1 Introduction 12.2 Sample and Sampling 12.3 Main Object of Sampling 12.4 Basic Principles of sampling 12.5 Summary 12.6 Key Words/Abbreviations 12.7 Learning Activity 12.8 Unit End Questions (MCQ and Descriptive) 12.9 References 12.0 Leaning Objectives After studying this unit, you will be able to: z Explain about the sample, population, universe, census, survey. z Elaborate the main object of sample and the basic principles of sampling 12.1 Introduction An entire collection of all the individuals or things or units is called a Universe or Population. The entire count of all the units of the population is called a Census. A part of a storability of individual units (census) is a sample. Study of a census takes much time, money and energy. Sample studies are covered out. The basic principles of sampling deserve special study. CU IDOL SELF LEARNING MATERIAL (SLM)
Sampling 281 Census and Sample: Census being an exhaustive procedure necessitates much efforts and time. Hence, representative sample of a given, population is considered as sufficient. We study various sampling methods. 12.2 Sample and Sampling The word ‘sample’ means a part of any collection of things, individuals or results of operations that are quantitatively expressed. A totality or a collection of things or individuals is said to constitute a population. Hence, a sample simply means a part of a population. A finite population is one that has individuals or things that can be finitely expressed in numerical terms, whereas an infinite population is one that cannot be expressed finitely. For example, the heights of 100 persons form a finite population and is also called Universe. The following are a few examples of finite populations : population of marks of students at an examination, population of prices, weights, incomes etc. To draw certain conclusions about the characteristics of a population it is sufficient to select and study a part of it. In short for the purposes of statistical studies a sample would suffice. The entire work of selecting samples from a population is termed sampling. 12.3 Main Object of Sampling The principle aim of selecting a sample and studying it is to acquire the maximum information about the population with the least amount of time, money and energy. In brief, maximum information with minimum effort is the goal of sampling. 12.4 Basic Principles of Sampling Before undertaking the work of sampling it is necessary to bear in mind the following points: (1) No bias or prejudice should creep in the selection of a sample or samples. (2) All the members of the sample should be governed by the same rules and conditions of sampling. (3) Individual members of the sample should be entirely independent of one another. (4) Special importance should never be attached to certain parts of the population while selecting members for the sample. CU IDOL SELF LEARNING MATERIAL (SLM)
282 Business Mathematics and Statistics 12.5 Summary The word ‘sample’ means a part of any collection of things, individuals or results of operations that are quantitatively expressed. A totality or a collection of things or individuals is said to constitute a population. Hence a sample simply means a part of a population. A finite population is one that has individuals or things that can be finitely expressed in numerical terms, whereas an infinite population is one that cannot be expressed finitely. The entire work of selecting samples from a population is termed sampling. The principle aim of selecting a sample and studying is to acquire maximum information about the population with the least amount of time, money and energy. In brief, maximum information with minimum effort is the goal of sampling. The basic principles of sampling state the following: No bias or prejudice should creep into the selection of a sample or samples, All members of the sample should be governed by the same rules and conditions of sampling, Individual members of the sample should be entirely independent of one another, Special importance should never be attached to certain parts of the population while selecting members for the sample. 12.6 Key Words/Abbreviations Bias, Unbaised sample, census, representative, sample survey, census survey or count. 12.7 Learning Activity (1) To learn the procedure to be adapted for the purpose of conducting a sample survey. ........................................................................................................................................ ........................................................................................................................................ (2) To understand the distinguishing features between a sample method of enquiry and the census method. ........................................................................................................................................ ........................................................................................................................................ CU IDOL SELF LEARNING MATERIAL (SLM)
Sampling 283 12.8 Unit End Questions (MCQ and Descriptive) A. Descriptive Type: Short Answer Type Questions 1. Explain the following terms: Finite Population, Representative Sample, Biased Sampling. 2. State the main objective of sampling and the basic principles of sampling. 3. What are the requirements of a good sample? B. Multiple Choice/Objective Type Questions 1. A sample discovered from a heterogeneous population is not-representative. (a) Correct (b) Incorrect (c) Relevant (d) None of these 2. Census survey is more accurate than a sample survey (a) Irrelevant (b) False (c) True (d) None of these 3. Sample does not represent all the features/characteristics of a given population. (a) Incorrect (b) True (c) False (d) None of these 4. Maximum information with minimum effort is the goal of sampling (a) Correct (b) Incorrect (c) Irrelevant *d) None of these Answers: (1) (a); (2) (c); (3) (b); (4) (a) 12.19 References References of this unit have been given at the end of the book. CU IDOL SELF LEARNING MATERIAL (SLM)
284 Business Mathematics and Statistics UNIT 13 TYPES OF PROBABILITY Structure 13.0 Learning Objectives 13.1 Introduction 13.2 Methods of Sampling 13.3 Other Methods of Sampling 13.4 Statistical Laws 13.5 Sample Size 13.6 Summary 13.7 Key Words/Abbreviations 13.8 Learning Activity 13.9 Unit End Questions (MCQ and Descriptive) 13.10 References 13.0 Learning Objectives After studying this unit, you will be able to: z Describe the importance of the concept of productivity and its uses in the sampling process. z Illustrate the various methods of probability sampling and non-probability sampling. z Explain the relevance of statistical laws and the methods commonly used for the purpose of finding representative sample size. CU IDOL SELF LEARNING MATERIAL (SLM)
Types of Probability 285 13.1 Introduction When samples are drawn from a given population it is necessary to ensure their representativeness. For this purpose use of probability methods is considered as very much appropriate. Hence probability and non-probability sampling methods deserve special study. These methods are explained below. 13.2 Methods of Sampling Amongst the methods of sampling there are two categories: probability sampling and non probability sampling. In probability sampling, the process of selecting members is based on the chance factor. If the process of selection is not based on chance but on some arbitrary method then it is non-probability sampling. The following methods of sampling deserve special study: (1) Random Sampling (probability sampling). (2) Systematic Sampling (probability sampling). (3) Purposive Sampling (non probability sampling). (4) Stratified Sampling (probability sampling). (5) Multi-stage Sampling (probability sampling). 13.2.1 Random Sampling A sample that consists of members that are chosen at random from a population is called a random sample; the entire process of selecting members at random from a population is known as random sampling. This method ensures that the probability of each member of the population in the sample is equal. In other words, each member has the same chance of being included in the sample. Random sampling is a good method because members of the sample are chosen not only without bias or prejudice but purely according to the principles of random sampling. Further, to overcome bias or errors due to human weakness, certain methods of random sampling have been devised. For instance, the following are some of the methods: (a) Method based on mechanical devices. (b) Method based on random sampling numbers CU IDOL SELF LEARNING MATERIAL (SLM)
286 Business Mathematics and Statistics One of the simplest devices is the lottery drum method. Here, members of the population are numbered on tickets of uniform size and colour. These tickets are placed in a drum. Just like a pure lottery, tickets are drawn at random. In the case of a small size random sample a simple device is to number all the members of the population on small indentical cards — of uniform size and colour, like playing cards and draw them from the pack, one by one, with replacement after each shuffle and drawing. This procedure is continued till the cards drawn give the required size of the random sample. In the case of large scale random sampling these methods are not considered to be useful. Hence, tables of random sampling numbers such as Tippet’s random sampling numbers or Kendall and Babington- Smith’s random sampling numbers are used. 13.2.2 Systematic Sampling When complete population lists are available this method of sampling is considered to be the most appropriate one. Further it is a quick and easy method. The method consists selecting at random the first member and thereafter the other members in a systematic way. For example, if a population list comprises 48,000 members and a sample of 800 is required then we can select every sixtieth item of the population. But before doing that it is necessary to select the first member by random sampling, from the numbers 1 to 60. For instance if 22 happens to be the number selected at random then it will be the first number, 82 will be the second number, 142 the third and so on. Being quick and simple this method of sampling is considered to be very useful and hence has universal application. 13.2.3 Purposive Sampling (Judgement Sampling) The main object of this technique is to serve a particular purpose. Members of the sample are chosen strictly according to the criterion laid down. This sampling is also known as judgement sampling. As the individual members of the sample are chosen in accordance with a certain principle, the method of such a selection is called purposive sampling. The nature of this sampling technique is such that members of the population have unequal probabilities of inclusion in the sample, which in turn means that some memebrs have a very high probability of inclusion over others. For instance, while undertaking an opinion survey, only the personal judgement of a particular group of persons may be taken into consideration. CU IDOL SELF LEARNING MATERIAL (SLM)
Types of Probability 287 13.2.4 Stratified Sampling (Probability Sampling) According to this method, the size of the sample the members of which are selected fromm a particular stratum by random sampling is proportional to the size of the stratum. All individual items selected by random sampling from each stratum constitute a sample called the stratified sample. Therefore the process of such a sample selection is termed stratified sampling. When compared with the systematic and random sampling methods, stratified sampling stands prominent as the better choice. This is because it overcomes the chief hurdle of random sampling, viz. giving unequal representation. Here unequal representation implies that certain parts of the population may be better represented than others in the sample. Further, the purposive sampling technique introduces a certain bias in the selection of members which is not so in the case of stratified sampling. The chief merit of this method, apart from stratification, is that the manner of picking up items from each stratum is purely according to random sampling principles. 13.2.5 Multi-stage Sampling This is the best method to use when large scale nationwide surveys are to be undertaken. The notable fact about this method is that the entire sampling is undertaken in a certain number of stages depending upon the nature of the survey. The only problem is that the larger the number of stages the greater is the likelihood of accuracy being sacrificed. The chief merit of this method of sampling is that it is less expensive and more practicable. Here’s an example of how multi-stage sampling draws random samples at each of the different stages. For a study of the smoking habits of people inhabiting a particular State a multi-stage sampling may be undertaken as follows : the process of selecting a few districts in the State by random sampling constitutes the first stage; selecting by random sampling a few towns and cities from these selected districts constiutes the second stage; selecting by random sampling a few blocks and localities from these selected towns and cities constitutes the third. Finally selecting, by random sampling, individuals from these selected blocks and localities constitutes the final stage. 13.3 Other Methods of Sampling The other methods of sampling are: (i) Cluster Sampling (probability sampling). (ii) Quota Sampling (non-probability sampling). CU IDOL SELF LEARNING MATERIAL (SLM)
288 Business Mathematics and Statistics (iii) Convenience Sampling (non-probability sampling). (iv) Panel sampling (non-probability sampling). 13.4 Statistical Laws We now consider the following basic laws which provide the background to sampling techniques: (1) The Law of Statistical Regularity. (2) The Law of Inertia of Large Numbers. 13.4.1 The Law of Statistical Regularity This law asserts that a sample will always show the characteristics of the population to which it belongs. In other words when a few members from a population are chosen at random, then the members so selected undoubtedly explain the nature of the population. Further, whatever be the selected part of the population, it will reflect to a certain extent the regular characteristics of the entire population. In short the philosophy of this law is that a part represents the whole. Let us take an example. If we consider the marks secured by 40 candidates in a subject, amongst 200 candidates, then clearly the arithmetic average of the marks of these 40 candidates will not differ much from the average of the marks of all the 200 candidates. Of course, we might observe that the larger the sample size the more negligible would be the difference. The utility of this law can be at once seen when we deal with statistics of life insurance. Even statistics of agricultural experiments indicate the significance of this Law. 13.4.2 The Law of Inertia of Large Numbers This law simply means that the larger the sample size the stronger would be its position in fully representing the entire population. In other words, large samples always tend to give more accurate results about the population to which they belong. For example, if we toss a coin only a few times then the proportion of tails to the total number of tosses is likely to differ significantly from the proportion of the number of heads to the total number of tosses. But if we toss a coin a large number of times then there is every possibility for the proportions of heads and tails to be each equal to half. This clearly points out that large samples are always more stable, in the sense that they represent the population more accurately. CU IDOL SELF LEARNING MATERIAL (SLM)
Types of Probability 289 13.5 Sample Size For the calculation of sample size, the followinig formula can be used., N (i) Sample Size n = 1 Ne2 where N = total population, e = error (confidence level) such as 5% or 10% that is 0.05 or 0.10. Solved Problems Problem 1: Suppose N =- 10000, e = 0.05, then 10000 10000 n = 110000 (0.05)2 125 = 384.615 = 385 approximately. A random sample of size 385 would represent a population of 10000 at 5% level (error). Problem 2: suppose N = 60000, e = 0.10, then 60000 n = 16000(0.10)2 = 98.36 = 98 approximately (ii) Sample size in the case of proportions § 2·2 n = ©¨ e ¹¸ pq where Z = Standard limit depending upon confidence level. For instance, 1.96 limit corresponds to 5% level. e = Sampliing error such as 0.05. p = The given proportion CU IDOL SELF LEARNING MATERIAL (SLM)
290 Business Mathematics and Statistics Problem 3: Suppose Z = 1.96, e = 0.05V, p = 0.20 then, §1.96 ·2 §1.96 ·2 N = ¨© 0.05¸¹ pq = ©¨ 0.05¹¸ (0.20) (0.80) = 245.8624 = 246 approximately. (iii) Method of deriving the formula for finding the sample size. We know that the sammpling error X' = P, X' = Sample Mean, P = Population mean. (confidence limits for mm are x ± Z (SE) where / is the confidence coefficient such as 1.96 at 5% level of significane and SE is the standard eerror. Now X – P = X(SE) V e = Z(SE) = Z n VV e = Z n , i.e., n = Z T n= § V 2 · = § Z ·2 z2©¨¨ T 2 ¸¸¹ ¨© T ¹¸ V = standard deviation of the population § Z ·2 § Z ·2 p(1 p) n = ¨© e ¸¹ pq ©¨ e ¹¸ (iv) Use of tables for finding the sample size A researcher can use the tables prepared by Eckhardt, on the basis of the values of p, E and Z, E is the error and z is the confidence level. Refer Echardt (social research Methods, 1978:400) (v) Problems with reference to stratified sampling and cluster sampling. Problem 4: According to a survey on the basis of income levels, a researcher divides 25,000 families into five strata as mentioned below: Stratum 1: Income less than ` 5,000 p.m. = 8,000 Stratum 2: Income less than ` 15,000 and above ` 5,000 to 10,000 CU IDOL SELF LEARNING MATERIAL (SLM)
Types of Probability 291 Stratum 3: Income less than ` 5,001 p.m. = 4,000 Stratum 4: Income less tan ` 1 lakh p.m. = 2,500 Stratum 5: Income above ` 1 lakh p.m. = 500 Suppose the researcher has to study an appropriately representative sample of 100 families, then according to proportional memthod, he has to select at random from each stratum, the number of members mentoned below, as per the calculations. 4000 Stratum 3: Sample size = 25000 × 100 = 16 2500 Stratum 4: Sample size = 25000 × 100 = 10 500 Stratum 5: Sample size = 25000 × 100 = 2 The final sample size = 32 + 40 + 16 + 10 + 2 == 100 Strata with Different Variability We consider strata that not only differ in their sizes but also in their respective variability. Therefore, in regard to variability within a stratum, the standard deviation has to be taken into account. Suppose we consider as the number of strata i.e., N1, N2, N3 … Nx with standard deviatons S1, S2, S3 … Sx = respectively and n = n1 + n2 + … nx where n1, n2, n3… nx denote the respective sample sizes of the x strata then the sample size of the pth strata is given by n x Np Sp np = N1S1 N2S2 5}NxSx where p = 1, 2, 3 … x 13.6 Summary The methods of sampling are: Random Sampling (Probability Sampling), Systematic Sampling (Probability Sampling), Stratified Sampling (Probability Sampling), Multistage Sampling (Probability Sampling) CU IDOL SELF LEARNING MATERIAL (SLM)
292 Business Mathematics and Statistics The other methods of Sampling are: Cluster sampling (Probability Sampling), Quota Sampling (Non-Probability Sampling), Convenience Sampling (Non-Probability Sampling), Panel Sampling (Non-Probability Sampling). Basic laws which provide the background techniques are: ‘The law of Statistical Regularity’, ‘The law of Inertia of large Numbers.’ For the calculation of Sample size the following formula can be used: Sample Size n = N/1 + Ne2 (Given by Taro Yemane) 13.7 Key Words/Abbreviations Random, Systematic, Stratified, Cluster, Multi-stage, Convenience Judgement, Statistical regularity, Inertia, Probability, Non-probability 13.8 Learning Activity 1. Give some examples of situations and circumstances when we can appropriately use probability sampling and non-sampling methods. ......................................................................................................................................... ......................................................................................................................................... 13.9 Unit End Questions (MCQ and Descriptive) A. Descriptive Type: Short Answer Type Questions 1. Discuss the methods generally used in sampling. Explain in brief the law of statistical regularity. 2. Distinguish between population and sample. What do you understand by a random sample and how is it drawn.? 3. What is sampling? What are the requisites of a good sample? Describe any two methods of sampling. 4. What is a random sample? Discuss the merits and demerits of census and sample surveys. CU IDOL SELF LEARNING MATERIAL (SLM)
Types of Probability 293 5. What is meant by sample method of enquiry? When is it adopted? What are its advantages? What are the essential requisites of a good sample? 6. Explain the terms ‘sample’ and ‘population’. Distinguish between census and sample survey. 7. Describe briefly the stratified and systematic sampling methods. 8. Explain the methods of systematic and stratified sampling with suitable illustrations. 9. State the merits claimed by the method of random sampling in statistical surveys. 10. Compare and contrast the merits and demerits of the sample survey and the census methods: 11. Write short notes on: (i) Random sampling (ii) Systematic sampling (iii) Purposive sampling (iv) Stratified sampling (v) Multistage sampling (vi) Law of statistical regularity (vii) Law of inertia of large numbers 12. Describe any two sampling methods with suitable illustrations. 13. Write notes on: (i) Statistical populations and samples (ii) Stratified sampling 14. Describe briefly the following (i) Statistical random sampling (ii) Stratified sampling (iii) Systematic sampling CU IDOL SELF LEARNING MATERIAL (SLM)
294 Business Mathematics and Statistics 17. Describe the term ‘random sampling’ and explain why randomness is important in sample design? 18. Describe briefly probability and Non Probability methods of sampling? B. Multiple Choice/Objective Type Questions 1. If N = 12000, e = 0.05, then n = _______________. (a) 450 (b) 350 (c) 300 (d) None of these 2. If p = 0.20, q = 0.80, e = 0.05, z = 1.96, then N = 246 (a) correct (b) incorrect (c) non-representative (d) none of these 3. Purposive sampling is also called as _______________. (a) Systematic sampling (b) Judgement sampling (c) Random sampling (d) None of these 4. Probability and non-probability are the two categories of ___________________. (a) Sample (b) Probability (c) Sampling method (d) All the above Answers: (1) (d); (2) (a); (3) (b); (4) (c) 13.10 References References of this unit have been given at the end of the book. CU IDOL SELF LEARNING MATERIAL (SLM)
Search
Read the Text Version
- 1
- 2
- 3
- 4
- 5
- 6
- 7
- 8
- 9
- 10
- 11
- 12
- 13
- 14
- 15
- 16
- 17
- 18
- 19
- 20
- 21
- 22
- 23
- 24
- 25
- 26
- 27
- 28
- 29
- 30
- 31
- 32
- 33
- 34
- 35
- 36
- 37
- 38
- 39
- 40
- 41
- 42
- 43
- 44
- 45
- 46
- 47
- 48
- 49
- 50
- 51
- 52
- 53
- 54
- 55
- 56
- 57
- 58
- 59
- 60
- 61
- 62
- 63
- 64
- 65
- 66
- 67
- 68
- 69
- 70
- 71
- 72
- 73
- 74
- 75
- 76
- 77
- 78
- 79
- 80
- 81
- 82
- 83
- 84
- 85
- 86
- 87
- 88
- 89
- 90
- 91
- 92
- 93
- 94
- 95
- 96
- 97
- 98
- 99
- 100
- 101
- 102
- 103
- 104
- 105
- 106
- 107
- 108
- 109
- 110
- 111
- 112
- 113
- 114
- 115
- 116
- 117
- 118
- 119
- 120
- 121
- 122
- 123
- 124
- 125
- 126
- 127
- 128
- 129
- 130
- 131
- 132
- 133
- 134
- 135
- 136
- 137
- 138
- 139
- 140
- 141
- 142
- 143
- 144
- 145
- 146
- 147
- 148
- 149
- 150
- 151
- 152
- 153
- 154
- 155
- 156
- 157
- 158
- 159
- 160
- 161
- 162
- 163
- 164
- 165
- 166
- 167
- 168
- 169
- 170
- 171
- 172
- 173
- 174
- 175
- 176
- 177
- 178
- 179
- 180
- 181
- 182
- 183
- 184
- 185
- 186
- 187
- 188
- 189
- 190
- 191
- 192
- 193
- 194
- 195
- 196
- 197
- 198
- 199
- 200
- 201
- 202
- 203
- 204
- 205
- 206
- 207
- 208
- 209
- 210
- 211
- 212
- 213
- 214
- 215
- 216
- 217
- 218
- 219
- 220
- 221
- 222
- 223
- 224
- 225
- 226
- 227
- 228
- 229
- 230
- 231
- 232
- 233
- 234
- 235
- 236
- 237
- 238
- 239
- 240
- 241
- 242
- 243
- 244
- 245
- 246
- 247
- 248
- 249
- 250
- 251
- 252
- 253
- 254
- 255
- 256
- 257
- 258
- 259
- 260
- 261
- 262
- 263
- 264
- 265
- 266
- 267
- 268
- 269
- 270
- 271
- 272
- 273
- 274
- 275
- 276
- 277
- 278
- 279
- 280
- 281
- 282
- 283
- 284
- 285
- 286
- 287
- 288
- 289
- 290
- 291
- 292
- 293
- 294
- 295
- 296
- 297
- 298
- 299
- 300
- 301
- 302
- 303
- 304
- 305
- 306
- 307
- 308
- 309
- 310
- 311
- 312
- 313
- 314
- 315
- 316
- 317
- 318
- 319
- 320
- 321
- 322
- 323
- 324
- 325
- 326
- 327
- 328
- 329
- 330
- 331
- 332
- 333
- 334
- 335
- 336
- 337
- 338
- 339
- 340
- 341
- 342
- 343
- 344
- 345
- 346
- 347
- 348
- 349
- 350
- 351
- 352
- 353
- 354
- 355
- 356
- 357
- 358
- 359
- 360
- 361
- 362
- 363
- 364
- 365
- 366
- 367
- 368
- 369
- 370
- 371
- 372
- 373
- 374
- 375
- 376
- 377
- 378
- 379
- 380
- 381
- 382
- 383
- 384
- 385
- 386
- 387
- 388
- 389
- 390
- 391