Chapter 14. Statistics Notes in English

 


Revision Notes on Statistics

Statistics

Statistics is one of the parts of mathematics in which we study about the collecting, organizing, analyzing, interpreting and presenting data.

Statistics is very helpful in real life situations as it is easy to understand if we represent a data in a particular number which represents all numbers. This number is called the measure of central tendency. Some of the central tendencies commonly in use are -

Mean

It is the average of “n” numbers, which is calculated by dividing the sum of all the numbers by n.

The meanof n values x1, x2, x3, ...... xis given by

Mean

Median

If we arrange the numbers in an ascending or descending order then the middle number of the series will be median. If the number of series is even then the median will be the average of two middle numbers.

If n is odd then the median isOddobservation.

If the n is even then the median is the average ofEvenobservation.

Mode

The number which appears most frequently in the series then it is said to be the mode of n numbers.

Mean of Grouped Data (Without Class Interval)

If the data is organized in such a way that there is no class interval then we can calculate the mean by

Mean of Grouped Data

where, x1, x2, x3,...... xn are the observations

f1, f2, f3, ...... fn are the respective frequencies of the given observations.

Example

Grouped Population Mean
xffx
2040800
40602400
60301800
80504000
100202000
 200∑fx = 11000 

Here, x1, x2, x3, x4, x5 are 20, 40, 60, 80, 100 respectively and f1, f2, f3 , f4, f5 are 40, 60, 30, 50, 20 respectively.


means

Mean of Grouped Data (With Class-Interval)

When the data is grouped in the form of class interval then the mean can be calculated by three methods.

1. Direct Method

In this method, we use a midpoint which represents the whole class. It is called the class mark. It is the average of the upper limit and the lower limit.

Formula of class mark

Direct Method

Example

A teacher marks the test result of the class of 55 students for mathematics. Find the mean for the given group. 

Marks of Students0 – 1010 – 2020 – 3030 – 4040 – 5050 – 60
Frequency27107542

To find the mean we need to find the mid-point or class mark for each class interval which will be the x and then by multiplying frequency and midpoint we get fx.

Marks of studentsFrequency(f)Midpoint(x)fx
0 – 10275135
10 – 201015150
20 – 30725175
30 – 40535175
40 – 50445180
50 – 60255110
  ∑f = 55∑fx = 925

frequency

2. Deviation or Assumed Mean Method

If we have to calculate the large numbers then we can use this method to make our calculations easy. In this method, we choose one of the x’s as assumed mean and let it as “a”. Then we find the deviation which is the difference of assumed mean and each of the x. The rest of the method is the same as the direct method.

Deviation or Assumed Mean Method

Example

If we have the table of the expenditure of the company's workers in the household, then what will be the mean of their expenses?

Expense(Rs.)100 - 150150 - 200200 - 250250 - 300300 - 350350 - 400
Frequency244033283022

Solution

As we can see that there are big values of x to calculate so we will use the assumed mean method.

Here we take 275 as the assumed mean.

Expenses(Rs.)Frequency(f)Mid value(x)d = x – 275fd
100 – 15024125- 150- 3600
150 – 20040175- 100- 4000
200 – 25036225- 50-1650
250 – 3002827500
300 – 35030325501500
350 – 400223751002200
 ∑f = 180  ∑fd = - 5550

method

3. Step Deviation Method

In this method, we divide the values of d with a number "h" to make our calculations easier.

Step Deviation Method

Example

The wages of the workers are given in the table. Find the mean by step deviation method.

Wages 20 - 3020 - 3030 - 4040 - 5050 - 60
No. of workers8912116

Solution

WagesNo. of workers (f) Mid-point(x)Assume mean (a) = 35, d = x - ah = 10, u = (x – a)/hfu
10 – 20815-20-2-16
20 – 30925-10-1-9
30 – 401235000
40 – 50114510111
50 – 6065520212
 ∑f = 46   ∑fu = -2

deviation

Mode of Grouped Data

In the ungrouped data the most frequently occurring no. is the mode of the sequence, but in the grouped data we can find the class interval only which has the maximum frequency number i.e. the modal class.

The value of mode in that modal class is calculated by

Formula of mode

l = lower class limit of the modal class

h = class interval size

f1 =frequency of the modal class

f0 =frequency of the preceding class

f2 = frequency of the succeeding class

Example

The table of the marks of the students of a class is given. Find the modal class and the mode.

Marks0 – 2020 – 4040 – 6060 – 8080 – 100
No. of students48675

Solution

Here we can see that the class interval with the highest frequency 8 is 20 – 40.

So this is our modal class.

Modal class = 20 - 40

Lower limit of modal class (l) = 20

Class interval size (h) = 20

Frequency of the modal class(f1) = 8

Frequency of the preceding class(f0) = 4

Frequency of the succeeding class (f2) = 6

Mode

Median of Grouped Data

To find the median of a grouped data, we need to find the cumulative frequency and n/2

Then we have to find the median class, which is the class of the cumulative frequency near or greater than the value of n/2.

Cumulative Frequency is calculated by adding the frequencies of all the classes preceding the given class.

Then substitute the values in the formula

Formula of median

where l = lower limit of median class

n = no. of observations

cf = cumulative frequency of the class preceding to the median class

f = frequency of the median class

h = size of class

Example

Find the median of the given table.

Class IntervalFrequencyCumulative Frequency (fc) 
1 – 5444
6 – 10374 + 3 = 7
11 – 156137 + 6 = 13
16 – 2051813 + 5 = 18
21 – 2522018 + 2 = 20 
 N = 20  

Solution

Let’s find the n/2.

n = 20, so n/2 = 20/2 = 10

The median class is 11 - 15 as its cumulative frequency is 13 which is greater than 10.

Median

13.5

Remark: The empirical relation between the three measures of central tendency is

3 Median = Mode + 2 Mean

Graphical Representation of Cumulative Frequency Distribution

The graph makes the data easy to understand. So to make the graph of the cumulative frequency distribution, we need to find the cumulative frequency of the given table. Then we can plot the points on the graph.

The cumulative frequency distribution can be of two types -

1. Less than ogive

To draw the graph of less than ogive we take the lower limits of the class interval and mark the respective less than frequency. Then join the dots by a smooth curve.

2. More than ogive

To draw the graph of more than ogive we take the upper limits of the class interval on the x-axis and mark the respective more than frequency. Then join the dots.

Example

Draw the cumulative frequency distribution curve for the following table.

Marks of students0 – 1010 – 2020 – 3030 – 4040 – 5050 – 60
No. of students710142063

Solution

To draw the less than and more than ogive, we need to find the less than cumulative frequency and more than cumulative frequency.

MarksNo. of studentsLess than cumulative frequencyMore than cumulative frequency
0 – 107Less than 107More than 060
10 – 2010Less than 2017More than 1053
20 – 3014Less than 3031More than 2043
30 – 4020Less than 4051More than 3029
40 – 506Less than 5057More than 409
50 – 603Less than 6060More than 503
    More than 600

Now we plot all the points on the graph and we get two curves.

The less than and more than ogive graph

Remark

  • The class interval should be continuous to make the ogive curve.

  • The x-coordinate at the intersection of the less than and more than ogive is the median of the given data.

0 Comments