[1] stat introduction
[1] stat introduction
Introduction of Statistics
Definition of Statistics:
From the first meaning of Statistics it may be mentioned that Statistics is the mirror by which
one can look at all aspects of a nation. It also indicates that simple or isolated items cannot be
termed as Statistics unless these are a part of aggregate of facts relating to any particular field of
inquiry. The aggregate of facts is a part of applied mathematics. According to R.A. “Fisher
Statistics is the branch of applied mathematics, where the mathematics is used to aggregate the
collected data.”
In other word Statistics is the science of collecting, presenting, analyzing and interpreting data to
assist in making more effecting decision.
1. Descriptive Statistics.
2. Inferential Statistics.
Descriptive Statistics:
A data set in its original form is usually very large. Consequently, such a data set is not very
helpful in drawing conclusions or making decision. It is easier to draw conclusions from
summary tables and diagrams than from the original version of a data set. So, we reduce data to a
manageable size by constructing tables, drawing graphs, or calculating summary measures such
as averages. Descriptive Statistics involve graphical and numerical procedures that are used to
summarize and process data and to transform data into information.
Example: A study found that 49% of the people in a survey knew the name of the first book of
the Bible. The Statistic 49 describes the number out of every 100 persons who knew the answer.
Inferential Statistics:
Inferential Statistics are used to make inferences about a population from a sample in order to
generalize (make assumptions about this wider population) and/or make prediction about the
future. Inferential Statistics provided the bases for prediction, forecasts and estimates that are
used to transform information into knowledge.
Examples: we may make some decisions about the political views of all college and university
students based on the political views of 1000 students selected from a few colleges and
universities.
1
Introduction of Statistics
Importance of Statistics:
Uses of Statistics:
It presents facts in a definite form.
It facilitates comparison.
It helps in formulating and testing hypothesis.
It helps in prediction.
It helps in the formulation of suitable policies.
Statistics is important for all branches of science, because it is incredibly helpful in decision
making and testing the accuracy of the decisions at the end.
In computer science and machine learning, the efficiency of algorithms can be significantly
increased by using statistics. You can decrease the cost of the processing significantly by using
statistics.
Without understanding statistics, it is not possible to understand the logic behind algorithms and
to develop them.
Human brain makes decisions based on data that it gets from its surroundings via sensors and it
uses statistics and data to come a conclusion for an action. Your brain uses statistics even if you
are not aware of this process at every instant of your life.
Limitations of Statistics:
2
Introduction of Statistics
Some definition:
Population: A Population is a collection of all possible individuals, objects or measurements of
interest.
Variable:
Any characteristics of a population which may vary from person to person or object to object
either in magnitude or in quality are called variables.
Examples:
Variable
A variable that cannot assume a numerical value but can be classified into different categories
are called qualitative or categorical variables.
3
Introduction of Statistics
Examples:
Brand of PC
Marital Status
Hair color.
Eye color.
Quantitative Variables:
The characteristics of an unit or item that are expressed in numerical form or in numbers are
called quantitative variables.
Examples:
Height of a person.
Weight of a person.
Family size.
No. of accident in one day etc.
Discrete Variable: A quantitative variable which possesses isolated or integral value is called
discrete variable.
Examples:
Continuous Variable: A quantitative variable which takes values within a range or limit is
called continuous variable.
Examples:
Height of a person.
Weight of a person.
The qualitative variable measures the The quantitative variable measures the
qualitative characteristics. quantitative characteristics.
For qualitative variable only the median and We can calculate all the measures of central
mode can be calculated. tendency for quantitative variable.
4
Introduction of Statistics
The qualitative variable classified according to The quantitative variable classified according
quality or attribute. to numerical value
Level of Measurement:
Level of measurement or scale of measurement is a classification that describes the nature of
information within the numbers assigned to variables. There are actually four different levels of
measurement.
The first level of measurement is nominal level of measurement. The variable of interest is
divided into categories or outcomes. And there is no natural order to the outcomes. Even when
the labels are numerically coded, the data categories have no logical order.
Examples:
Gender
Religion
Eye color
Marital status
Examples:
The interval level of measurement not only classifies and orders the measurements, but it also
specifies that the distances between each interval on the scale are equivalent along the scale from
low interval to high interval.
5
Introduction of Statistics
Examples:
Temperature.
IQ test score.
Examples:
Data
Statistics always deals with information and information should be expressed in number if we
like to analyze it. Certain type of information that is expressed in numerical form can be termed
as data. Data is the collections of any number of related observations.
1. Primary data.
2. Secondary data.
Primary data:
Primary data is the one, which is collected by the investigator himself for the purpose of a
specific inquiry or study. Such data is original in character and is generated by survey conducted
by individuals or research institution or any organization. The primary data can be collected by
the following methods:
Secondary data:
Secondary data are those data which have been already collected and analyzed by some earlier
agency for its own use and later the same data are used by a different agency.
6
Introduction of Statistics