Unit Sampling: Structure
Unit Sampling: Structure
Unit Sampling: Structure
Structure
12.1 12.2 12.3 12.4 Introduction Objectives Meaning of Population and Sample Methodsl'esigns of Sampling
12.4.1 12.4.2 12.4.3 Probability Sampling Non-probability Sampling Criteria for Selecting Sampling
12.5
Probability Sampling
125:l 12.5.2 12.5 3 12.5.4 1255 Simple Random Sampling Systematic Sampling Stratified Sampling Cluster Sampling Multi-stage and Multi-phase Sampling
12.6
Non-probability Sampling
12.6.1 12.6.2 12.6.3 Incidental Sampling Purposive Sampling y t a Sampling
12.1 INTRODUCTION
In Unit 11, we have discussed the meaning and type of hypothesis. We also know by this time that testing the hypothesis is central to the research. In order to accomplish this it is imperative to collect requisite, recent and most relevant data. This helps us to decide from whom and how to collect these data. The primary purpose of research is to discover principles that have universal application. But to study a whole population in order to amve at generalizations would be impracticable if not impossible. Some populations are so that their characteristics cannot be measured, because before the measurement could be completed, the populations would have changed. Imagine the difficulty of conducting an experiment with all fifth-gradeIndian children [as subjects] on numerical ability. The study of this population would require the services of thousands of researchers, the expenditure of millions of rupees and thousands of class hours. In view of this it becomes imperative to collect the data from a smaller group of the population instead of collecting from the whole population.
30
There are various ways to achieve this. We will discuss these in this unit.
12.2
OBJECTnTES
Sampling
After completing the study of this unit, you will be to : define the terms 'population and sample', justify the need of selecting a sample, explain the meaning of probability sampling, describe various probability sampling methods, explain the meaning of non-probability sampling, describe various non-probability sampling methods, state the characteristics of a good sample.
12.3
A "population" is any group of individuals /units that have one or more characteristics in common which are of interest to the researcher, for a particular research. A population may include all the individuals of a particular type or a more restricted part of that group, e.g. a group of all the university teachers, or a group of male / female university teachers, or distance learners enrolled with IGNOU. For assessing the study habits of adolescent girls in city, all the adolescent girls of that city who are studying in schools and colleges, make up the population for this study. A "sample" is a small proportion of a population selected for the study. By observing the characteristics of the sample, one can make certain inferences about the characteristics of the population from which it is drawn. The term sampling refers to the strategies which enable us to pick a subgroup from a larger group and then use the subgroup as a basis for making judgement about the larger group. In order to use such a subgroup to make decisions about the larger group, the subgroup has to resemble the larger group as closely as possible.
31
As per the second concept, sampling distribution approaches normal distribution provided - more the irregular distribution in the population, larger is the sample and sample is selected to avoid biases.
ii)
Simple, straightforward and workable methods, adapted to available facilities and personnel, should be used. Achieving optimum balance between expenditure incurred and maximum of reliable information should be the guiding principle.
The decision whether a probability sampling or a non -probability sampling is to be applied rests on the constraints which are not very different from those stated earlier. These are - objectives of the study, type of study and availability of the resources for the study. If the objective of the research is to apply the results of the study to a small local group then sampling may not be given as much consideration as in a study the results of which are to be applied to a larger group. In experimental research internal validity is of more concern than the external validity.
Action research generally does not require sampling from a larger group. Most of the times sampling is not very essential in historical research also. Whereas survey studies generally have a more rigorous sampling. The availability of time, funds, manpower and equipment required is another importantconsiderationin deciding about the size and technique of sampling.
iii) If one is interested in obtaining an estimate of the sampling error, one may resort to probability sampling rather than to a non-probability one.
Notes: a) Space is given below for writing your answer. b) Compare your answer with that given a the end of the unit. t
1.
State the meaning of the following terms: Population, Sample, ProbabilitySampling,Non-pmbability Sampling.
12.5
PROBABILITY SAMPLING
Sampling
We know the meaning of and requirement for probability sampling. Now we will take a brief account of different methods of probability sampling.
i) Lottery Method After naming or numbering every unit in the population, they are well mixed. The required numbers of units are then drawn from all these well-mixed chits. The individualslobjects with these identification namednumbers are then picked up for inclusion in the sample.
However this technique has some objections. When the population is very large and includes such individuals/objects, which are of such nature that could not be mixed and further if 'well mixing' is not attained despite all efforts,the principle of randomness in the population may be violated.
ii) Random Table Method
Research Design
will be too cumbersome to recommend in case of large population. In such situation, computer generated random selection should be resorted to, in order to save time and labour. Tables of random numbers have been generated by computers producing a random sequence of digits e.g. random digit table by Rand Corporation and prepared by Kendall & Smith, by Fisher & Yates and by Tippett are frequently used. The required number of units are selected from such a table in any convenient and systematic way. Now suppose we have select to 20 distance learners for interview from 80-distance learners registered at a study centre. We may start with any column and any row. Because we want 20 numbers i.e. two digit numbers, we have to select only the first two digits from each number. If we select the first column and start from first row then we will get following twenty two digit numbers - 23,05, 14,38,97, 11, 43, ................ 61. You will notice that numbers greater than 80 will have to be deleted from this list and for the remaining numbers selecting any other column and the row the procedure will have to be repeated, till we get required number i.e. 20. If any number is repeated in this list, it is to be substituted by selecting the next number. Until a sample of desired size is obtained, the selection procedure is to be continued. Advantages 1. This method calls for no special expertise and training or even insight. It can be used mechanically by anybody.
However, best results are achieved by adopting simple random sampling method. Still, it is not free from criticism. Limitations Practically listing of all the units in the population may not be possible.
1.
In case of population with infinite numbers, listing is out of the question. It is difficult though not impossible, but it involves high cost. In case of heterogeneous population the selected random sample may not truly represent the characteristics of the population.
2. 3.
4.
The systematic sample is spread more evenly over the population which makes this method more precise than stratified random sampling.
Sampling
Limitations
I. Selection of every element other than the first selected randomly is linked with the first element. This makes the process different from the simple random method where selection of every element is independent of other one. When the list of elements has a periodic arrangement, there is a risk that the sample interval may coincide with the periodic interval in the list. Suppose, A, B, C, D and E are the 5 schools selected and from each school 100 students are selected. The students from school A are placed starting from 1, from school B starting from 2, from school C starting from 3, from school D starting from 4 and from school E starting from 5 with an interval of 5. Thus the school A students will hold the numbers 1,6,11,16,21,........496. The school B students will hold the numbers 2,7,12,17,22,. .........497. Now in systematic samplingprocedure suppose we decide to select 5 % of the total and randomly choose any number from 1 to 5 say '3' then starting from 3 we will have to select every 5" number. These numbers will be 3,8,13,18.. ......498. Have you noticed that all these numbers belong to school C? Why has it happened so?
The answer is Because every school is repeated in the list with an interval of '5' and elements are selected with an interval of ' 5 ' .
3.
Another limitation of the systematic sampling method is the trend of the listed population. This is explained below -
Suppose 100 students are listed in the decreasing order of academic merit. We want to draw a sample of 20 students from this using systematic sampling method. 20 out of 100 means the size of interval is ' 5 ' . We can draw many samples from this listed population. If we randomly pick up a number from amongst 1 to 5, say 3 then the ,sample will comprise the elements 3"', 81h, 13", 18Ih..... .......981h instead , if we . randomly pick '5' then the sample will comprise the 5"', loth,151h-,............,100" elements. Is it not obvious that the two samples will not be comparable in terms of merit ? The mean average of these two samples would be significantly different with respect to merit and other associated variables. Calculations made from such samples cannot pinpoint the sources of variability.
dividing the population under consideration into strata on the basis of stratification characteristics/criteria. listing the units in each stratum separately.
F
35
ResearchDesign
selecting requisite number of elements from each stratum using appropriate random selection technique.
Thus all the elements selected from all the strata compose the required sample. Important points to be noted i) ii) iii) The criteria for dividing the population into strata should be correlated with the variable being studied. The criteria should be practical. It should not yield a unwieldy number of strata.
A good measure of the stratification criteria should be available; e.g. if reliable and valid tool of determiningsocioeconomic status is not available ,stratification on this basis would lead to confounding of the results.
iv) Selection of the elements at random from each stratum in the same proportion as that of the actual size of the stratum in the population improves the representativeness of the sample and helps in achieving higher efficiency at a reduced cost. v) In some studies (like census) stratification is not possible before the data have been collected. After collecting the data stratification as per sex, age, educational level is effected. Or a simple random sample of the required size is selected and the classification into strata is observed.
Advantages
I.
2.
Stratified random sampling is very useful when a list of the elements in the population is not available. It is the most applicable method of sampling when the population is heterogeneous.
for the purpose of data collection. For example a school complex (a group of schools) is a cluster. Some such clusters are selected to make a sample. Here a sampling elementtunit is a group/cluster. In social survey, the cluster sampling is described as 'area sampling'. This method involves following steps -
deciding the nature of the cluster required. indentifyingnocating such clusters to make the population. selecting the clusters in required number at random.
Advantages
This method of sampling is economic, especially when the cost of measuring a unit is relatively small.
Limitations When the sampling unit is to be an individual elementtunitor number in the population, this method is not applicable.
Advantages
1. 2. 3. 4. In both the methods burden on respondents is reduced. Relative cost also gets reduced. Two-phase sampling is useful in studying rare cases. In two-phase sampling resulting gain in precision is more due to possibility of getting more information in details.
Check Your Progress Notes: a) Space is given below for writing your answers. b) Compare your answers with those given at the end of the unit.
iv) Stratified random 11) Sampling Method i) Stratified random ii) Simple random iii) Systematic iv) Cluster
Limitations a) Samplingunit is not an individual element b) Not applicable to heterogeneous population c) Listing the elements in sub-population necessary d) Periodic arrangement of elements e) Periodic arrangement of elements Special Feature a) Every unit in the population has equal chance of being selected b) Spread more evenly over the population C) Useful in case of heterogeneous population d) Applicable in case of infinite nopulation
e) Same type of sampling unit at each phase 3. Distinguish between multi-stage and multi-phase sampling methods.
Research Design
Advantages
The administrative convenience of obtaining sample for the study, the ease of testing, saving in time, completeness of the data collected are some of the merits of this method.
Limitations
Since there is no well-defined population and no random sampling method is applied to select the sample, the standard error formulae apply with a high degree of approximation. Hence no valid generalization can be drawn. Any attempt at generalization based on such data and conclusion thereof will be misleading.
Advantages
1.
18
This method of sampling is useful where a small sample is required. It is focused on solving problems of particular groups.
2.
Limitations This method is applicable only for the selection of samples including typicallspecial cases such as 'best teacher award winners' from the population of teachers or 'meritorious past students of the school' from the population of the past students.
Sampling
It is free from error due to bias or due to deliberate selection of the units of the sample. There is no substitution of originally selected unit by some other more convenient way. It does not suffer from incomplete coverage of the units selected for study. It includes such units, which are as far as possible independent. It represents the population in the strict sense that it is a miniature or replica in all respects of the population from which it has been drawn. This should at least apply to the characteristicsdirectly under the investigation or those likely to affect these characteristics indirectly. It is adequate or sufficient in size to allow confidence in the stability of its characteristics. An adequate sample is one that contains enough cases to ensure reliable results.
5.
6.
..............................................................................................................
Research Design
6.
4
f
I
C
t
I
- Purposive
f
I
- Quota
- Cluster
Get five Theses/Dissertations based on different research methods e.g. case study, survey, research etc. Study the sampling methods adopted in these studies and prepare a chart including the following columns -Research method, Sampling method, Sample size, rationale for adopting the sampling method.
Dalen, Van. Deobold, & Meyer, William. J. (1962): UnderstandingEducational ~ e s e a r c hAn Introduction. New York: McGraw Hill Book Company Inc. : Kerlinger, Fred N. (1993): Foundation of Behavioural Research :Educational and Psychological Enquiry. New York: Holt, Rinehart and Winston. Koul, Lokesh. (1997): Methodology of Educational Research. New Delhi: Vikas Publishing House Pvt. Ltd. Upasani, K.N. (1987): Conducting Educatioml Research. Examination Reforms Unit, S.N.D.T. Women's Univerisity, Pune: Kalpana Mudranalaya, Vockell, Edward L. (1983): Educational Research. New York: MacMillan Publishing Co. Inc.
Population - A group of individuals/ units having one or more characteristics in common which are of interest to the researcher for a particular research. Sample - A small representative proportion of a population selected for a particular research. Probability Sampling - Sampling based on some statistical concepts such as the 'Law of Large Numbers', 'Central Limit Theorem', and the 'Normal Distribution' is known as probability sampling. Non-probability Sampling - Sampling based on the judgements of the researcher as the most important element of contml is known as non-probability sampling.
2.
I)
3.
4.
(c) (a) iii) m ) (b) iv) iv) (d) The main distinction between multi-stage and multi-phase sampling is the use of unit of sampling at different levels. In multi-stage, sampling is done at various levels such as national, state, district level. In multi-phase, sampling units are of ~ the same type at each phase only a few of them are asked for m o information than others. When a readily or easily available group is selected as per the conveniemx of the researcher, it is termed as the 'incidental sample'.
-
i)
11)
i)
5.
i)
ii)
Similarity - Purposive and quota sampling both include stratification. Difference - In purposive sampling actual selection of the units from a stratum, to be included in the sample is done purposively rather than by random methods. In quota sampling quota is usually determined by the pmporcion of the strata and quota within the strata is selected as per the availability and convenience and not randomly. Free from error due to bias or deliberate selection of some units. Originally selected units are not i 8' :tmted and incom:&rs coverage of units is not involved. As far as possible independent units are included.
6.
Ti)
iv) v)
Research Design
12.13 GLOSSARY
Parameter Sampling Sampling Bias
: : :
It is a population value representing any trait or characteristic of the population as a whole. It is the process of selecting a sample from the population. When the mean of the sampling distribution does not coincide with the parameter, it is said to be biased. It is the relative frequency distribution of an infinity of determinations of the value of this statistic. It is the difference between the value of parameter and that of corresponding statistic or the difference between population value and sample value of a characteristic. It is a complete, accurate, and up-to-date list of all the units in apopulation. It is the sample value representing any trait or characteristic of the members of the sampling.
:.
:
: