An average or central value of a distribution is the variable's value representing the entire distribution. These representative values are called the measures of central tendency. Generally, there are the following five measures of central tendency:
(a) Mathematical average
(i) Arithmetic mean
(ii) Geometric mean
(iii) Harmonic mean
(b) Positional average
(i) Median
(ii) Mode
2. Arithmetic Mean:
(i) For ungrouped distribution: If x1,x2,…,xn are n values of variate xi then their mean x is defined as x=nx1+x2+…+xn=n∑i=1nxi⟹∑i=1nxi=nx
(ii) For ungrouped and grouped frequency distribution: If x1,x2,…,xn are values of variate with corresponding frequencies f1,f2,…,fn then their mean is given by x=f1+f2+…+fnf1x1+f2x2+…+fnxn=∑i=1nfi∑i=1nfixi=N∑i=1nfixi
where N=∑i=1nfi
(iii) By shortcut method:
Let di=xi−ax=a+N∑i=1nfidi
where a is the assumed mean.
3. Median:
The median of a series is the value of the middle term of the series when the values are written in ascending order. Therefore, the median divides an arranged series into two equal parts.
Formulae of Median:(i) For ungrouped distribution: Let n be the number of variates in a series then
Median={Middle term,Mean of (2n)th and (2n+1)th terms,if n is oddif n is even
(ii) For ungrouped frequency distribution: First, we prepare the cumulative frequency (c.f.) column and find the value of N. Then
Median={(2N+1)th term,Mean of (2N)th and (2N+1)th terms,if N is oddif N is even
(iii) For grouped frequency distribution: Prepare the cumulative frequency column and find the value of 2N. Then, find the class that contains the value of c.f. equal to or just greater than 2N, this is the median class.
Median=l+(f2N−F)×h
where:
l — lower limit of median class
f — frequency of median class
F — cumulative frequency of the class preceding the median class
h — class interval of median class
4. Mode:
In a frequency distribution, the mode is the value of that variate that has the maximum frequency.
Method for Determining Mode:
(i) For ungrouped distribution: The value of that variate is repeated the maximum number of times.
(ii) For ungrouped frequency distribution: The value of that variates with the maximum frequency.
(iii) For grouped frequency distribution: First, we find the class with the maximum frequency, the modal class.
Mode=l+(2f0−f1−f2f0−f1)×h
where:
l — lower limit of modal class
f0 — frequency of the modal class
f1 — frequency of the class preceding the modal class
f2 — frequency of the class succeeding the modal class
h — class interval of modal class
5. Relation Between Mean, Median, and Mode:
In a moderately asymmetric distribution, the following is the relation between a distribution's mean, median, and mode. It is known as the empirical formula.
Mode=3×Median−2×Mean
Note:
(i) The median always lies between the mean and mode.
(ii) The mean, median, and mode coincide for a symmetric distribution.
6. Measures of Dispersion:
The dispersion of a statistical distribution is the measure of the deviation of its values about their average (central) value. Generally, the following measures of dispersion are commonly used:
(i) Range
(ii) Mean deviation
(iii) Variance and standard deviation
Range:
The difference between the greatest and least values of a variate of a distribution is called the range of that distribution. If the distribution is a grouped distribution, then its range is the difference between the upper limit of the maximum class and the lower limit of the minimum class.
Coefficient of range=Sum of extreme valuesDifference of extreme values
Mean Deviation (M.D.):
The mean deviation of a distribution is the mean of the absolute value of deviations of variate from their statistical average (Mean, Median, Mode).
If A is any statistical average of a distribution, then the mean deviation about A is defined as
Mean deviation=n∑i=1n∣xi−A∣(for ungrouped distribution)
Mean deviation=N∑i=1nfi∣xi−A∣(for frequency distribution)
Note: It is minimum when taken about the median.
Coefficient of Mean deviation=AMean deviation
(where A is the central tendency about which Mean deviation is taken)
Variance and Standard Deviation:
The variance of a distribution is the mean of squares of deviations of variate from their mean. It is denoted by σ2 or var(x).
The positive square root of the variance is called the standard deviation. It is denoted by σ or S.D.
Standard deviation=variance
Formulae for Variance:
(i) For ungrouped distribution:
σx2=n∑(xi−x)2
σx2=n∑xi2−x2=n∑xi2−(n∑xi)2
σx2=n∑di2−(n∑di)2where di=xi−a
(ii) For frequency distribution:
σx2=N∑fi(xi−x)2
σx2=N∑fixi2−x2=N∑fixi2−(N∑fixi)2
σd2=N∑fidi2−(N∑fidi)2
σu2=h2(N∑fiui2−(N∑fiui)2)where ui=hxi−a
Some Results on Standard Deviation:1.σx=0 when all the variate values are equal.
2.σx is independent of change of origin but is dependent on the change of scale.
σa+bx=∣b∣σx
Combined Standard Deviation:
If x1, σ1, N1 and x2, σ2, N2 are the means, standard deviations, and number of observations of two distributions respectively, then the standard deviation σ of the combined distribution is given by