Icte Lesson
Icte Lesson
Icte Lesson
Statistical Data
- Facts about, or summaries of data (e.g. sales statistics, employment statistics, etc.)
Statistical Method
- Statistics is a science that deals with the method of collection, organization, presentation, analysis and
interpretation of quantitative data.
Classification of Statistics
- Hypothesis testing – (ex) Test the claim that the population mean weight is 120 pounds
Important Terms
Population
- The collection of all responses, measurements, or counts that are of interest.
- The population must be defined explicitly before the study begins and the research hypothesis/questions specify
the population being studied
- Defined by certain characteristics: - Inclusion criteria - Exclusion criteria
Applications of Statistics
Types of Data
Data Sources
Sampling Techniques
- The process of selecting a portion (or subset) of the larger population and study that portion (sample) to gain
information about the population is called sampling
- When gathering data, you will not always have the luxury of collecting all available data. for example, economists
cannot measure the entire unemployment of the population, so they must take a random sample instead.
Likewise, in a manufacturing facility, quality control managers do not have the resources to test every product that
comes off the line; it is simply not feasible. instead, they take samples at various points during the production
process to test the quality of the products the firm produces.
Sampling
There are a number of methods employed in sampling data. It is important that the sampling method fits the application.
Random Sampling- each member of a population initially has an equal chance of being selected for the
sample
(1) Simple Random Sampling
- Every possible sample of a given size has an equal chance of being selected
e.g. A guidance counselor uses a computer to generate 50 random numbers and then picks students whose names
correspond to the numbers.
e.g. We could stratify (group) the college population by department and then choose a proportionate simple random
sample from each stratum (each department) to get a stratified random sample. Those samples picked from the SABH
department, picked from the SEAITE department and so on represent the members who make up the stratified sample
e.g. Suppose you have to do a phone survey. Your phone book contains 20,000 residence listings. You must choose 400
names for the sample. Number the population 1 - 20,000 and then use a simple random sample to pick a number that
represents the first name of the sample. Then choose every 50th name thereafter until you have a total of 400 names (you
might have to go back to your phone list)
Non-Random Sampling
Dichotomous Variable
- A variable that can have only two values
Qualities of Variables
Exhaustive – Should include all possible answerable responses
Mutually exclusive – No respondent should be able to have two attributes simultaneously
e.g. Employed vs. Unemployed - it is possible to be both if looking for a second job while employed
Types of Variables
(a)Discrete Variable
- Quantitative variables whose observations can assume only a countable number of values
- Values are obtained by counting
- e.g. No. of children in the family, No. of COVID-19 cases in the Philippines, No. of dates in the past month
(b)Continuous Variable
- Quantitative variables whose observations can assume any one of the countless number of values.
- Values are obtained by measuring
- e.g. Height, Weight, Time
Intervening Variable
- Sometimes referred to as test or control variables.
- Used to test whether the observed relations between the independent and dependent variables are spurious
- Serve either to increase or decrease the effect the independent variable has on the dependent variable
Scales of Measurement
Measurement - refers to the procedure of attributing qualities or quantities to specific characteristics of objects, persons
or events. Measurement is a key process in quantitative research and evaluation. If the measurement procedures are
inadequate its usefulness will be limited (Polgar & Thomas, 2008)
Levels of Measurement
❑ Nominal ❑ Ordinal ❑ Interval ❑ Ratio
❑ Interval and ratio variables-can be changed to become ordinal or nominal variables but not vice versa
WEEK 2
DATA COLLECTION
Data collection is the first stage in any statistical investigation. It is the process of obtaining (gathering) a set of related
measurements or counts to meet predetermined objectives.
Types of Data
The statistical data may be classified into two categories depending upon the sources utilized. These categories are
primary and secondary data.
(1) Primary Data – data which is collected by the investigator himself for the purpose of a specific inquiry or study.
These data are those collected for the first time (original/first-hand data) either through surveys or direct observation.
Examples:
Data on banking and finance collected by the Central Bank
Data on opinions and sentiments of people on current issues collected by the Pulse Asia
(2) Secondary Data – data which has been collected by others and used by an investigator for his own purposes. Such
data can be obtained from journals, official reports, government publications, publications of professional and research
organizations and so on.
Examples:
Documented data used by a medical researcher for his research which was originally collected or published by
the Department of Health.
Data published from Business journals
Remember!
Secondary data should be used with utmost care. So before using this data, the following three points should be
considered.
1. Whether the data are suitable for the purpose of investigation. This can be judged in the light of the nature and
scope of investigation.
2. If the data obtained is suitable for our purpose it should be assessed whether the data are adequate for the
purpose of investigation. This can be determined based on the time period and geographical area covered by the
data.
3. Whether the data are reliable. The data obtained should be checked for its accuracy. If the data are based on a
sample, one should see whether the sample is a representative of the population.
The most commonly used methods of data collection are the following:
(a) Survey Method – a data collection technique in which information is collected directly or indirectly from certain
individuals.
The most common methods of data collection for survey are self-administered questionnaire and personal interview.
(b) Questionnaire Method – a data collection method in which respondents are given prepared questionnaires with a
series of questions to answer and is used to record the responses.
(c) Interview Method – this method involves a face-to-face interaction between the interviewer and the interviewee.
Apart from face-to-face interviews, it can also be conducted over the phone or the computer terminal via video
conferencing technology.
(d) Observation Method - this method involves collecting information without asking questions. It is used when the
study relates to behavioral science This method is more subjective, as it requires the researcher (or observer) to add their
judgement to the data.
(f) Experiment Method – this method is used if the researcher would like to determine the cause and effect
relationship of certain phenomena under investigation. It is also used in making scientific inquiry
(g) Document Review Method - a method of utilizing the existing data or fact or information for a study.
WEEK 3
STATISTICAL ANALYSIS
DATA PRESENTATION
Body - main part of the table that contains the information or figures
Stubs or Classes - classification or categories describing the data and usually found at the leftmost side of the table
Caption - designations or identifications of the information contained in a column, usually found at the top most of the
column
THREE Types of Frequency Distribution Table (FDT)
(1) The Qualitative or Categorical frequency distribution table
- is used for data that can be placed in specific categories, such as nominal, or ordinal level data.
EXAMPLE:
EXAMPLE:
Frequency
RF=
N
Frequency
% RF= X 100 %
N
STEP 2
Determine the number of classes, k.
k =√ N
where N is the total number of observations/ scores in a given set of data
STEP 3
Determine the class size, c.
R
c=
k
The class width should be an odd number. This ensures that the midpoint of each class has the same place
value as the data.
STEP 4
Determine the lower limit (LL) of the first interval.
LL = Lowest score
Example 1
When 40 people were surveyed in Tuguegarao city, they reported the distance they drove to the mall, and the results (in
kilometers) are given below. Construct a frequency distribution table.
Solution
1. Determine the range, R.
R = highest score (HS) – lowest score (LS)
R = 40 – 1
R = 39
c=6.5 ≅ 7
The class width should be an odd number. This ensures that the midpoint of each class has the same place value as the
data.
NOTE
To determine the entries in the LESS THAN Cumulative Frequency ( < CF), add successively the frequencies of all the
previous classes/intervals including the class against which it is written. The cumulate is started from the frequency of
lowest class/interval to the frequency of the highest class/interval.
Example 3
Determine the greater than CF (>CF) of the given table:
NOTE
To determine the entries in the GREATER THAN Cumulative Frequency ( > CF), add successively the frequencies of all
the previous classes/intervals including the class against which it is written. The cumulate is started from the frequency of
highest class/interval to the frequency of the lowest class/interval.
Example 4
Construct a grouped frequency table for the given data below.
Solution
Following the steps discussed earlier, the frequency distribution of the given data is shown below. The classes are
arranged in increasing order.
GRAPHICAL PRESENTATION OF DATA
- A graph or a chart is a device for showing numerical values or relationships in pictorial form.
Advantages
1. Main features and implication of a body of data can be seen at once
2. Can attract attention and hold the reader’s interest
3. Simplifies concepts that would otherwise have been expressed in so many words
4. Can readily clarify data, frequently bring out hidden facts and relationships
It is accurate.
A good graph should not be deceptive, distorted or misleading, or in any way susceptible to wrong interpretations as a
result of inaccurate or careless construction.
It is clear.
An effective graph can be easily read and understood. The graph should focus on the message it is trying to
communicate.
It is simple.
The basic design of statistical graph should be simple, straightforward, not loaded with irrelevant, or trivial symbols. There
should be no distracting elements in a chart that inhibit effective visual communication.
Pie Chart
- A circular graph that is useful in showing
how a total quantity is distributed
among a group of categories. The
pieces of pie represent the proportions
of the total that fall into each category.
(4) Ogives
Example 1. The Statistics exam scores for a sample of 20 students are as follows: 50 53 59 59 63 63 72 72 72 72 72 76
78 81 83 84 84 87 90 93. Calculate the mean score of the students.
Solution: The sum of the scores is
To find the mean score, divide the sum of the scores by the number of scores in the sample data
1463/20 = 73.15
The mean score of the students in the Statistics exam is 73.15
ANSWERS: