Unit 4 Research Design
Unit 4 Research Design
Unit 4 Research Design
Unit 4
Research Design
A “research design” is defined as” the arrangement of conditions for collection and analysis of
data in a manner that aims to combine relevance to the research purpose with economy in
procedure.”
The research design is the conceptual structural within which research is conducted.
It constitutes the blueprint for the collection, measurement and analysis of data
The design incorporates a framework of what the researcher is going to do
Research Design is a Road Map or a Plan of Action
Research design can be thought of as the structure of research
it is the "glue" that holds all of the elements in a research project together
It is also a framework or blueprint for conducting the business research project
1. The sampling design: deals with the method of selecting items to be observed for the
given study
2. The Observational design: relates the conditions under which the observations are to be
made;
3. The statistical design: concerns with the question of how many items are to be
observed, and how the information and data gathered are to be analyzed.
4. The Operational Design: deals with the techniques by which the procedures specified
in the sampling, statistical and observational designs can be carried out.
(i) A plan: - specifies the sources & types of information relevant to the research problem
2
(ii) A strategy: - Which approach will be used for gathering and analyzing the data.
(iii) The time and cost budgets: - most studies are done under these two constraints
A variable can be anything that has more than one value (e.g. sex, age, weight, income,
religion, ESLCE scores).
The variable that is antecedent to the dependent variable is termed as an independent variable.
E.g. researcher could examine the impact of heavy drinking of alcohol on liver.
Heavy drinking of alcohol would be an independent variable and the effect e.g. liver disease
could be considered as dependent variable
2. Control
The technical term ’control’ is used when we design the study in a way that minimizes the
effects of extraneous variables.
Independent variables that are not related to the purpose of the study, but may affect the
dependent variable are termed as extraneous variables.
E.g. hypothesis: there is a relationship between children’s gains in social studies achievement
and their self-concepts.
When the group is exposed to some novel or special condition, it is termed an ‘experimental
group
4
4. Treatments.
The different conditions under which experimental and control groups are put are usually
referred to as ‘treatments’
5. Experiment.
The process of examining the truth of a statistical hypothesis, relating to some research problem,
is known as an experiment.
Experiments can be of two type’s viz. absolute experiment and comparative experiment.
But if we want to determine the impact of one fertilizer as compared to the impact of some
other fertilizer, our experiment will be termed as a comparative experiment.
6. Experimental Unit(s)
The predetermined plots or the block, where different treatments are used, are known as
experimental units. Such experimental units must be selected (defined) very carefully.
The main purpose: formulating a problem for more precise investigation/ developing
the working hypotheses from an operational point of view.
The major emphasis in such studies is on the discovery of ideas and insights.
The research design appropriate for such studies must be flexible enough to provide
opportunity for considering different aspects of a problem under study.
5
In this method, hypotheses formulated by earlier working may be reviewed and their usefulness
evaluated as a basis for further research.
The researcher should review and build upon the work already done by others,
If hypotheses have not yet been formulated, his task is to review the available material for
deriving the relevant hypotheses from it.
B. Experience Survey.
It means the survey of people who had practical experience with the problem to be studied.
The main purpose: obtain insight into the relationships between variables and new ideas
relating to the research problem.
People who are competent and can contribute new ideas may be carefully selected as respondents
The researcher must prepare an interview schedule for the systematic questioning of
informants who are carefully selected.
The interview must ensure flexibility in the sense that the respondents may be allowed to raise
issues and questions which the investigator has not previously considered.
Descriptive research concerned with specific predictions, with narration of facts and
characteristics concerning individual, group or situations
Experimental studies (or hypothesis –testing research studies), the researcher tests the hypotheses
of causal relationships between variability.
It requires procedures that will not only reduce bias and increase reliability, but will permit
drawing inferences about causality.
This principle refers to the repetition of the experiment more than once.
Divide the field into two parts and grow one variety in one part and the other variety in
the other part.
compare the yield of the two parts and draw conclusion on that basis
Under this principle, the extraneous factor, the known source of variability, is made to
vary deliberately over as wide a range as necessary.
The variability it causes can be measured and hence eliminated from the experimental
error.
This means that we should plan the experiment in a manner that we can perform a two-
way analysis of variance
8
--all items are covered, no element of chance is left and highest accuracy is obtained.
Even the slightest element of bias in such an inquiry will get larger and larger as the
number of observation increases.
There is no way of checking the element of bias or its extent except through a resurvey or use
of sample checks. Besides, this type of inquiry involves a great deal of time, money and
Energy.
When the field of inquiry is large, this method becomes difficult to adopt because of the
resources involved.
Government is the only institution which can get the complete enumeration carried out.
Sometimes it is possible to obtain sufficiently accurate results by studying only a part of total
population. In such cases there is no utility of census surveys
When field studies are undertaken in practical life:- considerations of time and cost almost
invariably lead to a selection of respondents i.e., selection of only a few items.
The respondents selected should be as representative of the total population as possible in order
to produce a miniature cross-section.
The selected respondents constitute what is technically called a ‘sample’ and the selection
process is called ‘sampling technique.’ The survey so conducted is known as ‘sample survey’.
Algebraically, let the population size be N and if a part of size n (which is < N) of this population
is selected according to some rule for studying some characteristic of the population, the group
consisting of these n units is known as ‘sample’.
9
Researcher must prepare a sample design for his study i.e., he must plan how a sample should
be selected and of what size such a sample would be.
A sample design is a definite plan for obtaining a sample from a given population.
It refers to the technique or the procedure the researcher would adopt in selecting items for the
sample.
There are many sample designs from which a researcher can choose.
Some designs are relatively more precise and easier to apply than others.
Researcher must select/prepare a sample design which should be reliable and appropriate
for his research study.
While developing a sampling design, the researcher must pay attention to the following points:
The first step in developing any sample design is to clearly define the set of objects, technically
called the Universe, to be studied.
A decision has to be taken concerning a sampling unit before selecting sample. Sampling unit
may be:-
An individual.
It is also known as ‘sampling frame’ from which sample is to be drawn. It contains the names of
all items of a universe (in case of finite universe only).
It is extremely important for the source list to be as representative of the population as possible.
Refers to the number of items to be selected from the universe to constitute a sample.
The size of sample should neither be excessively large, nor too small.
It should be optimum.
An optimum sample is one which fulfills the requirements of: efficiency, representativeness,
Reliability and flexibility.
While deciding the size of sample, researcher must determine the desired precision as also an
acceptable confidence level for the estimate.
In determining the sample design:-One must consider the question of the specific population
parameters which are of interest.
For instance, we may be interested in estimating the proportion of persons with some
characteristic in the population, or we may be interested in knowing some average or the other
measure concerning the population.
There may also be important sub-groups in the population about whom we would like to
make estimates.
11
Cost considerations, from practical point of view, have a major impact upon decisions relating to
size and type of sample. This fact can even lead to the use of a non-probability sample.
The researcher must decide: - the type of sample he will use, The technique to be used in
selecting the items for the sample and Technique or procedure stands for the sample
design itself.
Researcher must keep in view the two causes of incorrect inferences viz., systematic bias and
sampling error.
A systematic bias (non-sampling error) results from errors in the sampling procedures, and
it cannot be reduced or eliminated by increasing the sample size. Usually a systematic bias is
the result of one or more of the following factors:
If the sampling frame is inappropriate i.e., a biased representation of the universe, it will
result in a systematic bias.
In survey work, systematic bias can result if the questionnaire or the interviewer is biased.
12
Similarly, if the physical measuring device is defective there will be systematic bias in the data
collected through such a measuring device.
Questionnaire
Interview
respondents
3. Non-respondents:
If we are unable to sample all the individuals initially included in the sample, there may arise a
systematic bias.
The reason is that in such a situation the likelihood of establishing contact or receiving a
response from an individual is often correlated with the measure of what is to be estimated.
4. Indeterminacy principle:
Sometimes we find that individuals act differently when kept under observation than what they
do when kept in non-observed situations. For instance, if workers are aware that somebody is
observing them in course of a work study on the basis of which the average length of time to
complete a task
Natural bias of respondents in the reporting of data is often the cause of a systematic bias in
many inquiries.
There is usually a downward bias in the income data collected by government taxation
department, whereas we find an upward bias in the income data collected by some social
organization.
People in general understate their incomes if asked about it for tax purposes, but they
overstate the same if asked for social status or their affluence.
Generally in psychological surveys, people tend to give what they think is the ‘correct’ answer
rather than revealing their true feelings.
Sampling errors are the random variations in the sample estimates around the true population
parameters.
Since they occur randomly and are equally likely to be in either direction, their nature happens
to be of compensatory type and the expected value of such errors happens to be equal to zero.
13
Sampling error decreases with the increase in the size of the sample, and it happens to be of a
smaller magnitude in case of homogeneous population.
Sampling error can be measured for a given sample design and size.
The measurement of sampling error is usually called the ‘precision of the sampling plan’. If
we increase the sample size, the precision can be improved.
But increasing the size of the sample has its own limitations viz.,
Thus the effective way to increase precision is usually to select a better sampling design which
has a smaller sampling error for a given sample size at a given cost.
The procedure causes a relatively small sampling error and helps to control the systematic bias
in a better way.
probability sampling
Non-probability sampling.
unrestricted or
Restricted.
When each sample element is drawn individually from the population at large, then the sample
so drawn is known as ‘unrestricted sample’, whereas all other forms of sampling are covered
under the term ‘restricted sampling’.
Classification of Sampling
1. Non-probability samples
– Convenience
– Judgment
– Quota
– snowball
2. Probability samples
Convenience Sampling:
Judgment Sampling
An experienced individual selects the sample based on his or her judgment about some
appropriate characteristics required of the sample member
Snowball Sampling
A variety of procedures
Additional respondents are obtained from information provided by the initial respondents
Quota Sampling
Ensures that the various subgroups in a population are represented on pertinent sample
characteristics to the exact extent that the investigators desire
Quota sampling
Probability sampling
Cluster sampling
In simple random sampling, every individual in the target population has an equal chance of
being part of the sample.
Suppose there are N=800 students in a school from which a sample of n=8 students is to be
taken.
The students are numbered from 1 to 800. Since our data runs into three digits we use random
numbers that contain three digits
All numbers exceeding 800 are ignored because they do not correspond to any serial numbers in
the data
2. Systematic Sampling
Suppose the sample size is n, then we have to calculate the sampling interval(i) by dividing N by
n and generate a number between 1 and N/n and select that data item to be in the sample.
Other items in the sample are obtained by adding the sampling interval N/n successively to the
random number.
Advantage of this method is that the sample is evenly distributed over the entire data.
The town of Fairfax is divided up into N = 200 blocks which are numbered consecutively
Good Example:
17
i= 200/10 = 20
In stratified random sampling, the researcher first divides the population into groups based on a
relevant characteristic and then selects participants within those groups.
Stratified random sampling will be used to ensure adequate representation of both males and
females.
Stratified Random Sampling is applied whenever the researcher has interest on issues like:
Demographics,
Age,
Gender,
Race,
Region,
Urban/Rural etc
3. Divide the entire population into H strata. Based on the classification variable, each element
of the population is assigned to one of the H strata
4. In each stratum, number the elements from 1 to Nh (the pop. size of stratum h)
5. Determine the sample size of each stratum, nh, based on proportionate or disproportionate
stratified sampling,
4. Multi-Stage Sampling
In multi-stage sampling, other sampling techniques may be used at the different stages.
For example, the first stage may use random sampling, the second stage may use purposive
sampling, and the third stage may use stratified sampling.
1. Organize the sampling process into stages where the unit of analysis is systematically grouped.
2. Select a sampling technique for each stage.
3. Systematically apply the sampling technique to each stage until the unit of analysis has been
selected.
5. Cluster sampling
Cluster: a group of sampling units close to each other i.e. crowding together in the same area or
neighborhood
19
In studies where the plan is to estimate the mean of a continuous outcome variable in a single
population, the formula for determining sample size is given below:
Where
Z is the value from the standard normal distribution reflecting the confidence level that will be
used (e.g., Z = 1.96 for 95%),
σ is the standard deviation of the outcome variable and E is the desired margin of error.
Example 1:
In studies where the plan is to estimate the proportion of successes in a dichotomous outcome
variable (yes/no) in a single population, the formula for determining sample size is:
Where
Example 2:
An investigator wants to estimate the proportion of freshmen at his University who currently
smoke cigarettes (i.e., the prevalence of smoking). How many freshmen should be involved in
the study to ensure that a 95% confidence interval estimate of the proportion of freshmen who
smoke is within 5% of the true proportion?
Because we have no information on the proportion of freshmen who smoke, we use 0.5 to
estimate the sample size as follows: