Sponsor Area
Example : Marks obtained by 10 students in Class XI are : 38 45 42 36 30 40 60 65 70 11
Sponsor Area
(a) Series of invidividual observations.
(b) Discrete series.
(c) Continuous series.
Data may be arranged by time, or space or both. For example, we have time series data on aggregate income, aggregate consumption, size of the population, etc.
Continuous Variable : It can take all values in a given range. For example, heights and weights of individuals, prices of commodities, income of individuals may be treated as continuous variables.
Discrete Variables : If the variables can take only some particular value like whole numbers, it is called a discrete variable. For example, number of students in different classes or different schools or the size of the households.
Bivariate Frequency Distribution : A bivariate frequency distribution is the frequency distribution of two variables. For example the table shows the sales and advertisement expenditure of the firm. They are classed in different rows.
1. It helps in the comparison of data.
2. It helps us to understand the relationship among variables.
3. It highlights significant features of the data at a glance.
4. It makes the statistical analysis of the data easier.
5. It arranges and presents huge volume of raw data in meaningful and condensed form.
However, in Statistics, the data on any single variable, or a set of variables, for all individual units in a region, constitute the population of that variable, or variables.
Univariate : If the data are on a single variable, the set of measurements constitutes a univariate population of that variable. We have a bivariate or multivariate population of the set variables. For example, we may have a univariate population of prices, or a population of incomes. We may have a bivariate population of height and weights of all individuals in a region.
Multivariate : A multivariate population of expenditures on various items of consumption of all households.
2. in case of exclusive series, value of the upper limit of a class is included in the lower limit of the next class interval. In inclusive series value of the upper limit of a class is included in that very class interval.
3. Exclusive series is useful whether the value is complete number or in decimals, but inclusive series is useful only when value is in complete number.
Exclusive Method : This method is useful whether the value is complete number or in decimals. In case of exclusive series, value of the upper limit of a class is included in the lower limit of the next class interval. Value of the upper limit of the class is included in that very class interval.
Inclusive Series : In inclusive series value of the upper limit of a class is included in that very class interval. Inclusive series is useful when value is incomplete number.
In order to get a better idea about the distribution of values within the range, we should subdivide the total range into a number of class intervals and find out the number of values in different classes.
The main points underlying the construction of a frequency distribution are as follows :
(a) Construction of Discrete Frequency Distribution :
1. Prepare a table with three columns-first for variable under study, second for 'Tally bars'
and the third for the total, representing corresponding frequency to each value or size of the variable.
2. Place all the values of the variables in the first column in ascending order-beginning with the lowest end giving to the highest. The gap between one magnitude to another may preferably be the same.
3. Put pars (vertical lines) in front of the values according in the second column keeping in view the number of items a particular value repeats itself. This column is for facility in counting. Blocks of five bars orare prepared and some space is left between each block of bars.
4. Count the number of bars in respect of each value in the variable and place it in the third column made for total of frequency.
(b) Construction of Continuous Frequency Distribution :
There are two methods of classifying the data according to class intervals :
Exclusive Method : Under this method upper limits are excluded. The upper limits of class intervals is the lower limit of the next class. For example, if the students obtained marks are grouped as 5 – 10,10 – 15,15 – 20,20 – 25, 25 – 30 etc., we include in first group of students whose marks are 5 or more but under 10. If the marks of a students are 10 he is not included in the first group but in the second, i.e., 10 to 15.
Inclusive Method : Under this method upper class limits of classes are included in respective classes. For example, if the students obtained marks are grouped as 5 – 9, 10 – 14, 15 – 19,20 – 24,25 – 29 etc., in the group 5 – 9, we include in first group students whose marks is between 5 and 9. If the marks of students are 10 he is included in the next class, i.e., 10 to 14. If there are no whole numbers, the classes can be made 5 – 9.9, 10 – 14.9, 15 – 19.9 and so on.
Sponsor Area
1. The classical data become comparable.
2. Homogeneous data are kept or classified in one group.
(i) Homogeneity, (ii) Clarity, (iii) Stability, (iv) Flexibility, (v) Diversification.
A.
The average of the upper class limit and the lower class limit.A.
the actual values of observationsA.
the upper class limit of a class is excluded in the class intervalA.
difference between the largest and the smallest observations.A.
The average of the upper class limit and the lower class limit.B.
The upper class limit of a class is excluded in the class-interval1. The class midpoint is equal to :
The average of the upper class limit and the lower class limit.
(b) The product of upper class limit and the lower class limit.
The ratio of the upper class limit and the lower class limit
None of the above.
A.
The average of the upper class limit and the lower class limit.
Statistical calculations in classified data are based on:
the actual values of observations
the upper class limits
the lower class limits
the class midpoints
A.
the actual values of observations
Under Exclusive method:
The upper class limit of a class is excluded in the class interval
the upper class limit of a class is included in the class interval
The lower class limit of a class is excluded in the class interval
The lower class limit of a class is included in the class interval.
A.
The upper class limit of a class is excluded in the class interval
Range is the:
difference between the largest and the smallest observations.
difference between the smallest and the largest observations
average of the largest and the smallest observations
ratio of the largest to the smallest observation
A.
difference between the largest and the smallest observations.
Can there be any advantage in classifying things? Explain with an example from your daily life.
Advantage of Classification:
1. It helps in the comparison of data.
2. It helps us to understand the relationship among variables.
3. It highlights significant features of the data at a glance.
4. It makes the statistical analysis of the data easier.
5. It arranges and presents huge volume of raw data in meaningful and condensed form.
What is a variable? Distinguish between a discrete and a continuous variable.
n statistics the term variable is used, only if the changing characteristics can be numerically measured. The heights and weights of individuals are variables, as they can be measured in numerical terms. Prices of commodities vary time and space, and they can be numerically measured. So the price is variable.
Continuous Variable : It can take all values in a given range. For example, heights and weights of individuals, prices of commodities, income of individuals may be treated as continuous variables.
Discrete VariablesIf the variables can take only some particular value like whole numbers, it is called a discrete variable. For example, number of students in different classes or different schools or the size of the households.
Explain the ‘exclusive’ and ‘inclusive’ methods used in classification of data.
There are two methods of classifying the data according to class-intervals, namely Exclusive Method and Inclusive Method.
Exclusive Method : This method is useful whether the value is complete number or in decimals. In case of exclusive series, value of the upper limit of a class is included in the lower limit of the next class interval. Value of the upper limit of the class is included in that very class interval.
Inclusive Series : In inclusive series value of the upper limit of a class is included in that very class interval. Inclusive series is useful when value is incomplete number.
Use the data in Table (given below) that relate to monthly household expenditure (in Rs.) on food of 50 households and
(i) Obtain the range of monthly household expenditure on food.
(ii) Divide the range into appropriate number of class intervals and obtain the frequency distribution of expenditure.
(iii) Find the number of households whose monthly expenditure on food is :
(a) less than Rs. 2000
(b) more than Rs. 3000
(c) between Rs. 1500 and Rs. 2500
(i) Range = Largest value-Smallest value = 5500 - 1007 = 4083.
(ii).
Class Interval |
Tally Bars |
Frequency |
1000 – 1500 |
|
19 |
1500 – 2000 |
|
14 |
2000 – 2500 |
|
6 |
2500 – 3000 |
|
5 |
3000 – 3500 |
|
2 |
3500 – 4000 |
|
1 |
4000 – 4500 |
0 |
0 |
4500 – 5000 |
|
2 |
5000 – 5500 |
|
1 |
(iii) (a) Number of household whose expenditure on food is less than Rs. 2000 = 19 + 14 = 33
(b) More than Rs. 3000 = 2+1 + 2+1 = 6
(c) Between Rs. 1500 and Rs. 2500 = 14 + 6 = 20.
In a city 45 families were surveyed for the number of domestic appliances they used. Prepare a frequency array based on their replies as recorded below :
1 |
3 |
2 |
2 |
2 |
2 |
1 |
2 |
1 |
2 |
2 |
3 |
3 |
3 |
3 |
3 |
3 |
2 |
3 |
2 |
2 |
6 |
1 |
6 |
2 |
1 |
5 |
1 |
5 |
3 |
2 |
4 |
2 |
7 |
4 |
2 |
4 |
3 |
4 |
2 |
0 |
3 |
1 |
4 |
3 |
Variables |
Tally Sheet |
Frequency |
0 |
|
1 |
1 |
|
7 |
2 |
|
15 |
3 |
|
12 |
4 |
|
5 |
5 |
|
2 |
6 |
|
2 |
7 |
|
1 |
Total |
45 |
What is ‘loss of information’ in classified data?
The classification of data as a frequency distribution has an inherent shortcoming. While it summarises the raw data making it concise and comprehensible, it does not show the details that are found in raw data. There is a loss of information in classifying raw data though much is gained by summarising it as a classified data. Once the data are grouped into classes, an individual observation has no significance in further statistical calculations. For example : the class 20–30 contains 6 obervations : 25, 25, 20, 22, 25 and 28. So when these data are grouped as a class 20–30 in the frequency distribution,the latter provides only the number of records in that class (i.e. frequency = 6) but not there actual values. All values in this class are assumed to be equal to the middle value of teh class interval or class mark (i.e. 25). Further statistical calculations are based only on the values of class mark and not on the values of teh observations in that class. This is true for other classes as well. Thus the use of class mark instead of the actual values of the obervations in statistical methods involves considerable loss of information.
Do you agree that classified data is better than raw data?
The data collected from primary and secondary sources are raw data or unclassified data. Once the data is collected, the next step is to classify them for further statistical analysis. Classification brings order in the data. It is a tedious task to get information from large unclassified data. The raw data are summarised and made comprehensible by classification. When facts of similar characteristics are placed in the same class, it enables one to locate them easily, make comparison and draw inferences without any difficulty.
Distinguish between univariate and bivariate frequency distribution.
Univariate Frequency Distribution : The frequency distribution of a single variable is called a univariate distribution. For example marks of a student.
Bivariate Frequency Distribution : A bivariate frequency distribution is the frequency distribution of two variables. For example the table shows the sales and advertisement expenditure of the firm. They are classed in different rows.
What is variable?
Variable is a characteristic which is capable of being measured and changes its value over time. A variable may be either discrete or continuous.
Prepare a frequency distribution by inclusive method taking class interval of 7 from the following data:
28 |
17 |
15 |
22 |
29 |
21 |
23 |
27 |
18 |
12 |
7 |
2 |
9 |
4 |
6 |
1 |
8 |
3 |
10 |
5 |
20 |
16 |
12 |
8 |
4 |
33 |
27 |
21 |
15 |
9 |
3 |
36 |
27 |
18 |
9 |
2 |
4 |
6 |
32 |
31 |
29 |
18 |
14 |
13 |
15 |
11 |
9 |
7 |
1 |
5 |
37 |
32 |
28 |
26 |
24 |
20 |
19 |
25 |
19 |
20 |
Frequency Distribution
Class Interval |
Tally Bars |
Frequency |
1–7 |
|
15 |
8–14 |
|
12 |
15–21 |
|
15 |
22–28 |
|
9 |
29–35 |
|
7 |
36–42 |
|
2 |
Total |
60 |
What is meant by classification?
Classification is the grouping of related facts into different classes.
What is quantitative classification?
Statistical Data classified into classes or groups on the basis of their numerical values is called quantitative classification.
What is meant by statistical series?
Arranging of data in different classes according to a given order is called series.
Sponsor Area
What is meant by exclusive series?
Exclusive series is that series in which every class interval excludes items corresponding to its upper limit.
What is meant by inclusive series?
An inclusive series is that series which include all items upto its upper limit.
What is meant by class interval or class magnitude?
Magnitude of a class interval is the difference between the upper limit and the lower limit of a class.
What is meant by ‘frequency’?
Frequency is the number of times an item occurs (or repeat itself) in the series.
What is meant by individual series?
Individual series are those series in which the items are listed singly.
What is discrete series?
A discrete series or freqnency array is that series in which data are presented in a way that exact measurement of items are clearly shown.
What is meant by a countinuous series or frequency distribution?
It is that series in which items cannot be exactly measured. The items assume a range of values and are placed within the range or limits.
What is an open ended series?
An open ended series is that series in which lower limit of the first class interval and the upper limit of last class interval is missing.
What is meant by cumulative frequency series?
Cumulative frequency series is that series in which the frequencies are continuously added corresponding to each class interval in the series.
Daily expenses of five persons are 15,20, 35, 40, 55. Calculate the magnitude.
Magnitude = 55 – 15 = 40.
What do you understand by raw data?
The data in their original form are called raw data.
Example : Marks obtained by 10 students in Class XI are :
38 45 42 36 30 40 60 65 70 11
What is frequency?
The term frequency signifies the number of items or a value repeated in a given set of information.
What do you understand by the term ‘array’?
When individual data are arranged in an increasing or decreasing order, it is known as an array.
What is a series?
A series is a systematic arrangement of items in a particular order or sequence.
What are the three types of series?
These are three types of series as under:
(a) Series of invidividual observations.
(b) Discrete series.
(c) Continuous series.
What is discrete series?
A series is called discrete when the frequency of various items are given separately
What is frequency distribution?
The presentation of observations of classes horizontally and individual items which fall in class are written according to ascending order of magnitude.
What is cumulative frequency series?
The type of presentation where the frequencies are represented by adding the frequency of each previous class.
What is quantitative classification?
Classification done according to quantitative varieties like marks, wages, etc. is termed as quantitative classification.
What is qualitative classification?
Classification according to attributes like honesty, beauty is known as qualitative classification.
What do you understand by ‘classification of data’?
‘Classification’ means arranging things in an appropriate order and putting them into homogeneous groups. For example, in library, the books and periodicals are classified and arranged according to subjects; students are grouped according to division they secure in a certain examination; plants and animals may be assigned to groups distinguished by structure, origin, etc.
Data may be arranged by time, or space or both. For example, we have time series data on aggregate income, aggregate consumption, size of the population, etc.
Arrange the countries in ascending order and descending order of yield.
Country |
Yield of Wheat (kg. /acre) |
America |
1925 |
Brazil |
127 |
China |
893 |
Denmark |
225 |
France |
439 |
India862 |
In Ascending Order
Country |
Yield of Wheat (kg. /acre) |
Brazil |
127 |
Denmark |
225 |
France |
439 |
India |
862 |
China |
893 |
America |
1925 |
In Descending Order
Country |
Yield of Wheat (Kg. /acre) |
America |
1925 |
China |
893 |
India |
862 |
France |
439 |
Denmark |
225 |
Brazil |
127 |
Distinguish between (a) variable, and (b) attribute. Explain with examples.
Variable : In Statistics, the term ‘variable’ is used, only if, the changing characteristics can be numerically measured. Thus, heights and weights of individuals are variables, as they can be measured in numerical terms, price of commodities vary over time and space and they can be numerically measured. Therefore prices is a variable. Similarly, incomes of individuals, household expenditures on various items of consumption, size of households, input and outputs of firms are all variables.
Attribute : The looks of people, their intelligence and aptitude for art and music change from one individual to the other, they cannot be measured numerically in the same way as heights and weights, or prices and incomes. Therefore, they are not called variable in the statistical sense. They are called ‘attributes’. We may rank individual according to the quality of attributes. The ranks are sometimes used as their numerical values for purposes of analysis.
Distinguish between (a) Continuous and (b) Discrete variables.
Continuous Variables : It can take all conceivable values in the given range. For example, heights and weights of individuals, prices of commodities, income of individuals may be treated as continuous variables. Although, in practice, the measurements are taken only approximately, upto one or two places of decimal, the true values may be anything in a certain range.
Discrete Variables : If the variables can taken only some particular value (like whole numbers), it is called a ‘discrete’ or ‘discontinuous’ variable. For example, number of students in different classes, different schools, or the size of households are discrete variables, as they can take interval values only.
Explain the term ‘Population’ as it is used in Statistics. Define (a) Univariate, (b) Bivariate and (c) Multivariate population.
In common language the word ‘population’ means the number of persons living in a certain region. We may count the number of persons and obtain the size of population of that region. Similarly, we may find the population of certain animals in forests in a country or the population of certain plants in a garden and so on. The term population implies ‘head count’.
However, in Statistics, the data on any single variable, or a set of variables, for all individual units in a region, constitute the population of that variable, or variables.
Univariate : If the data are on a single variable, the set of measurements constitutes a univariate population of that variable. We have a bivariate or multivariate population of the set variables. For example, we may have a univariate population of prices, or a population of incomes. We may have a bivariate population of height and weights of all individuals in a region.
MultivariateA multivariate population of expenditures on various items of consumption of all households.
Define organisation of data.
After the data has been collected either with the help of the sampling method or census method the work of its organisation starts. The organisation of data means a systematic arrangements of collected figures so that the data becomes easy to understand and more convenient for further statistical treatment. This systematic arrangement changes into a statistical series.
What are the differences between exclusive and inclusive series?
1. In case of exclusive series, the upper limit of one class interval is the lower limit of the next class interval.
2. in case of exclusive series, value of the upper limit of a class is included in the lower limit of the next class interval. In inclusive series value of the upper limit of a class is included in that very class interval.
3. Exclusive series is useful whether the value is complete number or in decimals, but inclusive series is useful only when value is in complete number.
In the following time series in which year do you find the population of India to be the minimum. Find the year when it is the maximum.
Population of India (in Crores)
Year |
Population |
1951 |
35.7 |
1961 |
43.8 |
1971 |
54.6 |
1981 |
68.4 |
1991 |
81.8 |
2001 |
102.7 |
India’s population in the year 1951 was the minimum. It was 35.7 crores. Maximum population was in the year 2001. It was 102.7 crores.
In the given table frequency distribution of marks in mathematics of 100 students are given. Express the values of frequency as pecentage of total frequency (Relative frequency).
Which class has the maximum and minimum concentration of data ?
Frequency Distribution of Marks in Mathematics of 100 Students
Marks |
Frequency |
0–10 |
1 |
10–20 |
8 |
20–30 |
6 |
30–40 |
7 |
40–50 |
21 |
50–60 |
23 |
60–70 |
19 |
70–80 |
6 |
80–90 |
5 |
90–100 |
4 |
Total |
100 |
(i) Maximum concentration of data is in class (50 – 60).
(ii) Minimum concentration of data is in class (0 – 96; 10).
Find the country whose yield of wheat is slightly more than that of India. How much would that be in terms of percentage?
Yield of wheat for different countries
Country |
Yield of Wheat (kg I acre) |
America |
1925 |
Brazil |
127 |
China |
893 |
Denmark |
225 |
France |
439 |
India |
862 |
Country |
Yield of Wheat (kg. / acre) |
% increase |
India |
862 |
10.36% |
China |
893 |
more than |
India |
What is a frequency distribution? What are the main points underlying the construction of a frequency distribution?
he largest value of x is B and smaller value is A. Then x = B – A is the total range of x. A large range indicates that the values of x are spread over a large interval or the variation of values of x is large. A small range indicates smaller variation in the values of x. Thus, the range is measures of variation (or dispersion) of x.
In order to get a better idea about the distribution of values within the range, we should subdivide the total range into a number of class intervals and find out the number of values in different classes.
The main points underlying the construction of a frequency distribution are as follows :
(a) Construction of Discrete Frequency Distribution :
1. Prepare a table with three columns-first for variable under study, second for ‘Tally bars’ and the third for the total, representing corresponding frequency to each value or size of the variable.
2. Place all the values of the variables in the first column in ascending order-beginning with the lowest end giving to the highest. The gap between one magnitude to another may preferably be the same.
3. Put bars (vertical lines) in front of the values according in the second column keeping in view the number of items a particular value repeats itself. This column is for facility in counting. Blocks of five bars or are prepared and some space is left between each block of bars.
4. Count the number of bars in respect of each value in the variable and place it in the third column made for total of frequency.
(b) Construction of Continuous Frequency Distribution :
There are two methods of classifying the data according to class intervals :
Exclusive Method : Under this method upper limits are excluded. The upper limits of class intervals is the lower limit of the next class. For example, if the students obtained marks are grouped as 5 – 10,10 – 15,15 – 20,20 – 25, 25 – 30 etc., we include in first group of students whose marks are 5 or more but under 10. If the marks of a students are 10 he is not included in the first group but in the second, i.e., 10 to 15.
Inclusive Method : Under this method upper class limits of classes are included in respective classes. For example, if the students obtained marks are grouped as 5 – 9, 10 – 14, 15 – 19,20 – 24,25 – 29 etc., in the group 5 – 9, we include in first group students whose marks is between 5 and 9. If the marks of students are 10 he is included in the next class, i.e., 10 to 14. If there are no whole numbers, the classes can be made 5 – 9.9, 10 – 14.9, 15 – 19.9 and so on.
How many classes we choose? How does one decide about the size of class intervals?
There is no hard and fast rule about how many class we choose; but as a working rule the number of classes should lie between 5 and 15. It should be noted that the number of classes will be large if we choose small size class intervals and it will be small if the size of class intervals is large.
As an illustration, suppose the range is 70, and we choose classes of width 2 each. We would require 70 ÷ 2 = 35 classes. However, the number of classes would be 14 if the width of each class was 5.
Size of Class Intervals : We may choose all classes of the same width or of different width. In the case of equal class intervals the size of the class interval is determined as soon as we have decided about the number of classes.
Suppose n is the number of classes and all classes are of width h, then n × h = R.
Knowing the range R and number of classes
n we can abtain h = R/n as the width of class interval. If the range is 70 and we choose 10 classes, the width is 7.
Choice of Class Limits : Suppose x is a continuous variable, such that it can take any value in a given range. In that case, it is possible to choose class limits which are not equal to any of the observed values, For example, height of individuals is a continuous variable, even though, in practice, one can measure height to the nearest of the unit value (in centimetres) as 165, 170, 169, 171 .........; or to the nearest of tenth place of decimals as 165.3, 170.4, 168.9, 170.8, ........ We may specify class intervals as 160.55 165.55,165.55 ....... so that none of the observed values of x is equal to any of the class limits.
Distinguish between a frequency array and a frequency distribution.
Frequency Array : We obtain a frequency array if the variable x is discrete and we have frequencies corresponding to each value (there are no class intervals). Let us illustrate with the following example.
Example : A survey of 100 households was carried out to obtain information on their size, i.e., the number of members of households. The results of the survey are classified as a frequency array in table below :
Frequency Array of Size of Households
Size of the households |
Number of households |
X |
f |
(1) |
(2) |
1 |
5 |
2 |
15 |
3 |
25 |
4 |
35 |
5 |
10 |
6 |
5 |
7 |
3 |
8 |
2 |
Total |
100 |
The column (1) of the table gives the values which the variable x (size of the households) takes; and column (2) gives the corresponding frequencies (number of households). Thus, there are 5 households whose size is 1, there are 15 households of size 2, and so on.
Frequency Distribution : The largest value of X is B and smallest value is A. Then X = B – A is the total range of X. A large range indicates that the values of X are spread over a large interval or the variation of value of X is large. A small range indicates smaller variation in the values of X. Thus, the range is measure of variation (or dispersion) of X.
For example : Suppose we have data on monthly income of 10,000 individuals, the maximum of which is Rs.50,000 and minimum is Rs. 1,000. Thus, the range is Rs.49,000. We observe that majority of individuals say, 70% have small incomes close to Rs. 5,000 and minority, say 2% have income close to Rs.30,000.
In order to get a better idea about the distribution of values within the range, we should subdivide the total range into a number of class intervals and find out the number of values in different classes.
What is meant by individual series? Give one example.
Individual series : Under this method, the value of all the units are shown separately The following example will illustrate this:
Example : The marks obtained by 10 students in statistics are following :
S. No. |
1 |
2 |
3 |
4 |
5 |
6 |
7 |
8 |
9 |
10 |
Marks |
16 |
18 |
22 |
19 |
14 |
20 |
25 |
30 |
32 |
50 |
The individual series may be arranged in following two orders :
(a) Ascending Order : When data are arranged in ascending order i.e., a small value to a big value it is known as arranging them in ascending order. The figures of above example may be arranged in ascending order as follows :
S. No. |
1 |
2 |
3 |
4 |
5 |
6 |
7 |
8 |
9 |
10 |
Marks |
14 |
16 |
18 |
19 |
20 |
22 |
25 |
30 |
32 |
50 |
(b) Descending Order : When data are arranged serially starting from a big value to small value it is known as arrangement of data in descending order.
S. No. |
1 |
2 |
3 |
4 |
5 |
6 |
7 |
8 |
9 |
10 |
Marks |
14 |
16 |
18 |
19 |
20 |
22 |
25 |
30 |
32 |
50 |
What is meant by discrete series? Give the examples.
In these series all the items are divided in certain groups, but these groups are not continuous, therefore these series are known as discrete series. The numbered item that fall in every group are shown in each group which are known as frequencies. The following examples will illustrate this :
Examples :
S. Wo. |
1 |
2 |
3 |
4 |
5 |
6 |
7 |
8 |
9 |
10 |
Income in Rs. |
15 |
20 |
25 |
5 |
35 |
45 |
40 |
5 |
15 |
20 |
S. No |
11 |
12 |
13 |
14 |
15 |
16 |
17 |
18 |
19 |
20 |
Income in Rs. |
10 |
5 |
15 |
20 |
30 |
25 |
40 |
35 |
10 |
50 |
S. No. |
21 |
22 |
23 |
24 |
25 |
26 |
V |
28 |
29 |
30 |
Income in Rs |
15 |
20 |
5 |
25 |
10 |
5 |
45 |
20 |
50 |
25 |
S. No |
31 |
32 |
33 |
34 |
35 |
36 |
37 |
38 |
39 |
40 |
Income in Rs. |
20 |
15 |
30 |
10 |
30 |
25 |
5 |
5 |
35 |
15 |
(i) Discrete Series in Ascending Order :
Income in Rs. |
Tally Sheet |
No. of Persons (Frequencies) |
5 |
|
7 |
10 |
|
4 |
15 |
|
6 |
20 |
|
6 |
25 |
|
5 |
30 |
|
3 |
35 |
|
3 |
40 |
|
2 |
45 |
|
2 |
50 |
|
2 |
Total |
40 |
(ii) Discrete Series in Descending Order :
Income in Rs. |
Tally Sheet |
No. of Persons (Frequencies) |
50 |
|
2 |
45 |
|
2 |
40 |
|
2 |
35 |
|
3 |
30 |
|
3 |
25 |
|
5 |
20 |
|
6 |
15 |
|
6 |
10 |
|
4 |
5 |
|
7 |
Total |
40. |
What is meant by continuous series? Give example.
Under such series all the variables are divided in certain continuous groups and their respective frequencies will be written with them. The following example will clear the form of such series:
Example:
Marks : |
0–5 |
5–10 |
10–15 |
15–20 |
20–25 |
25–30 |
30–35 |
35–40 |
No. of Students : |
11 |
22 |
25 |
30 |
19 |
17 |
11 |
10 |
Following are the elements of a continuous series:
(i) Class Intervals : These are the measurements in which some problems is measured and written in continuous group. In the above example, (0 – 5), (5 – 10) etc. are the class intervals of the series.
(ii) Limits of Class Intervals : Each class interval figure is known as limits of class interval. Small figures class intervals are known as lower limit class interval. In class interval (0 – 5) 0 is lower limit and 5 is a upper limit of this class interval.
(iii) Magnitude of Class Intervals The difference between upper limit and lower limit of a class interval is known as its magnitude. In class interval (0 – 5) 5 is the magnitude.
(iv) Mid Value The average of two limits of the class interval is known as mid value e.g., the mid value of class interval
(v) Frequencies : Number of repetition of items of various class intervals in the universe is known as frequencies which will be written with them.
Exclusive and Inclusive Continuous Series:
(a) Exclusive Series : Where the value of upper limit is not included in the same group, but will be included in next group, it is known as exclusive series e.g.
Class interval |
0–10 |
10–20 |
20–30 |
30–40 |
40–50 |
50–60 |
Frequency |
8 |
12 |
15 |
16 |
9 |
3 |
In the above series, 10, 20, 30,40, 50, and 60 will not be included in first, second third, fourth, fifth and sixth group respectively.
(b) Inclusive Series : Where value of upper limit is included in the same group, it is known as inclusive series e.g.,
Class interval |
10–19 |
20–29 |
30–39 |
40–49 |
50–59 |
Frequency |
5 |
9 |
12 |
18 |
14 |
Differentiate between absolute errors and relative errors.
bsolute errors : Absolute error is the difference between the true and estimated value.
Absolute error = actual value – estimated value
Relative errors : It is the ratio of absolute error to the estimated value. It is found out by dividing the absolute error by the estimated error.
Relative error
= Actual value - Estimated value
Estimated error
Example : If the actual value is 400 and estimated value is 350. Find absolute and relative errors.
Solution : Here the actual value is 400 and estimated value is 350.
Absolute error = actual error – estimated value.
= 400 – 350 = 50
Relative error = 50 / 350 = 0.14
The relative error can also be expressed as percentage of 100.
What is open - end series?
In some series the lower class limit of the first class interval and the upper limit of the last class interval are missing. 'Less than' or below is specified in place of the lower class limit of the first class interval and 'More than' or above is specified in place of the upper class limit of the last class interval.
Example:
Marks |
Frequencies |
Below 5 |
2 |
5 – 10 |
6 |
10 – 15 |
8 |
15 and abova |
12 |
Sponsor Area
What is cumulative frequency series? Give an example.
Cumulative frequency series is that series in which the frequencies are continuously added corresponding to each class interval in the series. The frequencies than become cumulative frequencies. The cumulative frequency for the first class interval is the same as frequency itself. But for the second class interval the cumulative frequency would be the both the second as well as first class intervals e.g.
Marks |
Frequency |
Cumulative Frequencies |
5 – 10 |
2 |
2 |
10 – 15 |
5 |
2 + 5 = 7 |
15 – 20 |
7 |
7 + 7 = 14 |
20 – 25 |
8 |
14 + 8 = 22 |
25 – 30 |
18 |
22 + 8 = 30 |
How will you convert the cumulative frequency series into simple frequency series? Give an example.
Cumulative frequency series may be converted into simple frequency series in the following manner:
Marks |
No. of student |
Less than 10 |
8 |
20 |
40 |
30 |
80 |
40 |
96 |
50 |
100 |
Conversion into simple frequency table from
Cumulative Frequency Series Simple Frequency Series
Marks less than |
No. of Students |
10 |
8 |
20 |
40 |
30 |
80 |
40 |
96 |
50 |
100 |
In an entrance examination, for admission to the engineering course of a certain university, the top 50 students, who were selected, got the following percentage of marks in the aggregate:
95 92 |
91 |
90 |
88 |
82 |
86 87 |
87 88 |
70 |
72 |
76 |
77 |
77 79 |
79 79 |
78 |
78 |
68 |
67 |
67 66 |
65 63 |
63 |
63 |
63 |
64 |
58 59 |
60 60 |
60 |
60 |
61 |
62 |
62 62 |
S = smallest) = 95 % - 50% = 45%
Marks |
No. of Students |
0 – 10 |
8 |
10 – 20 |
40 – 8 = 32 |
20 – 30 |
80 – 40 = 40 |
30 – 40 |
96 – 80 = 16 |
40 – 50 |
100 – 96 = 4/p |
55 |
55 |
56 |
57 |
54 |
53 |
52 |
|
55 |
50 |
(i) Obtain the range of the distribution of percentage of marks.
(ii) Classify the data in class intervals as(a) 45 – 55,55 – 65,65 – 75, ...........
(b) 50 – 55, 55 – 60, 60 – 65, ...........
(iii) Examine the error of grouping in each case.
(ii)(a)
C. I. |
Tally Bars |
fx |
45 – 55 |
|
5 |
55 – 65 |
|
20 |
65 – 75 |
|
7 |
75 – 85 |
|
9 |
85 – 95 |
|
1 8 |
95 – 105 |
|
1 |
50 |
(b)
C.I. |
Tally Bars |
fx |
50 – 55 |
|
5 |
55 – 60 |
|
7 |
60 – 65 |
|
13 |
65 – 70 |
|
5 |
70 – 75 |
|
2 |
75 – 80 |
|
8 |
85 – 85 |
|
1 |
85 – 90 |
|
5 |
90 – 95 |
|
3 |
95 – 100 |
|
1 |
50 |
(iii) Error of grouping as per (ii) (a)
C. I. |
A. Value |
M |
(A.V. – M) Σ of Group |
45 – 55 |
54, 53, 52, 50 |
|
4, 3, 2, 2, 0 |
55 – 65 |
63, 58 |
||
65 – 75 |
70, 68, 72, 67, 67, 66, 65 |
|
0, –2, 2, –3, –3, –4, –5 |
75 – 85 |
79, 72, 79, 76, 77, 78, 77, 78 |
|
–1, –8, –1, –4, –1, –3 |
85 – 95 |
88, 86, 92, 87, 91, 90, 87, 88 |
|
–2, –4, 2, –3, 1, 0, –3, –2 |
95 – 105 |
95 |
|
–5 |
Distinguish between error and mistake.
Statistical Error:Term error is used in statistics in a technical sense. It is the difference between the estimated or approximated value and the true value.
Mistake:The mistake arises because of miscalculations, use of wrong methods of calculations and wrong interpretation of the result. These may arise due to the unrepresentative character of the data.
Difference between mistake and error
Basis |
Error |
Mistake |
1. Origin |
1. It arises from the inadequacy of data in statistical techniques and ignorance about the nature and scope of enquiry. |
1. It arises due to carelessness, wrong method of calculation and mistake in computation procedure. |
2. Nature |
2. It arises involuntarily. |
2. It is deliberate. |
3. Stage Estimation |
3. State of interpreting statistical data possible. |
3. State of statistical investigation impossible. |
Distinguish between (a) continuous and (b) discrete variables.
Continuous Variables : If a variable can taken any numerical value within a certain range, it is called continuous variable. It passes one value to the next.
Discrete Variables : If a variable can taken only some particular value, it is called discrete variable. There are gaps between one value and the next possible one.
Example : Continuous Series :
Marks |
40–50 |
50–60 |
60–70 |
70–80 |
80–90 |
No. of Students |
3 |
8 |
14 |
19 |
8 |
Example : Discrete Series :
X |
10 |
15 |
20 |
30 |
35 |
40 |
Explain the following terms:
Frequency distribution, Class frequency, Class limits, Class interval, Class mark, Frequency curve.
Frequency distribution : It is a comprehensive way to classify raw data of a quantitative variable. It shows how the different values of a variable are distributed in different classes alongwith their class frequencies.
Class frequency : It means the number of values in a particular class.
Class limit : Each unit in a frequency distribution table is bounded by class limits. Class limits are the two ends of a class. The lowest value is called the lower class limit and the highest value, the upper class limit.
Class interval: Class interval or class width is the difference between the upper class limit and the lower class limit.
Class mark : The class mid-point or class mark is the middle value of a class. It lies half way between the lower class limit and the upper class limit of a class.
Clas mark or class mid point = Upper Class limit + Lower Class Limit
2
Frequency Curve : It is a graphic representation of a frequency distribution. To obtain the frequency curve we plot the class marks on the X-axis and frequency on the Y-axis.
What is the principal objective of classification of data?
The principal objective of classification of data is to capture and dinstinctively present the diverse characteristics of data.
What is the main difference between geographical classification and chronological organisation?
In geographical classification, the collected data are grouped with reference to location or geographical division such as countries, states, cities, districts or colonies, but in chronological classification the collected data are grouped with reference to time, such as years, months, weeks etc.
What is attribute?
Attribute is a characteristic which is not capable of being measured quantitatively. Blindness deafness sickness are examples of attribute.
What is relative frequency?
Relative frequency is the frequency value expressed as the percentage of the total frequency.
Write any two characteristics of classification.
Two characteristics of classification:
1. The classical data become comparable.
2. Homogeneous data are kept or classified in one group.
Write down the essentials of classification.
Essentials of classification:
(i) Homogeneity, (ii) Clarity, (iii) Stability, (iv) Flexibility, (v) Diversification.
Below is given the classification of population. Name the type of classification– simple or manifold.
What are discrete variables?
Discrete variables are those variables which are exact or finit and are not expressed in fraction. For example number of students in a class number of members in a family are discrete series.
What are continuous variables?
Continuous variables are those variables which assume values in fraction like 2.4, 2.6.
Name the types of statistical series.
Statistical series are of three series namely, (i) Individual series, (iii) Disrete series and (iii) continuous series.
What is difference between simple frequency series and cumulative frequency series?
In simple frequency series, the frequency corresponding to each class interval in shown separately and individually but in cumulative frequency series. The frequencies are progressively totaled and aggregates are shown.
Write down the difference between individual, discrete and continuous series.
These are differences between individual, discrete and continuous series
Individual Series |
Discrete Series |
Continuous Series |
1. There is always one frequency for each item. |
1. There is more than one frequency for each item. |
1. There is more than one frequency for each class-interval. |
2. In this series there is no column for frequency. |
2. There is column for frequency. |
2. This series has also column for frequency. |
3. In this series values are given in definite break. |
3. This series has also values in definite break. |
3. In this series values are given in the form of group. |
What is the basic difference between exclusive series and inclusive series?
In case of exclusive series, upper limit of one class-interval repeats itself as a lower limit of the next class-interval but in case of inclusive series, it does not.
Convert the following frequency series into a simple frequency series:
6 students obtained less than 10 marks
22 students obtained less than 20 marks
30 students obtained less than 30 marks
40 students obtained less than 40 marks
50 students obtained less than 50 marks
Cumulative Frequency Series
Marks less than |
No. of Students |
10 |
6 |
20 |
22 |
30 |
30 |
40 |
40 |
30 |
50 |
Simple Frequency Distribution
Marks |
No. of Students |
0–10 |
6 |
10–20 |
22 – 6 = 16 |
20–30 |
30 – 22 = 8 |
30–40 |
40 – 30 = 10 |
40–50 |
50 – 40 = 10 |
Sponsor Area
Sponsor Area