Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
0% found this document useful (0 votes)
38 views

Engineering Data Analysis

1) The document introduces statistical thinking and methods used in engineering data analysis and statistics. It discusses collecting and presenting data through frequency distributions, graphs, and descriptive statistics. 2) Key concepts covered include qualitative and quantitative data, levels of measurement, sampling methods, and graphical methods for describing data distribution including histograms, stem-and-leaf plots, and time series graphs. 3) Examples are provided to demonstrate how to construct frequency distributions, histograms, stem-and-leaf plots, line graphs, and time series graphs from sample engineering and statistical data sets.
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
38 views

Engineering Data Analysis

1) The document introduces statistical thinking and methods used in engineering data analysis and statistics. It discusses collecting and presenting data through frequency distributions, graphs, and descriptive statistics. 2) Key concepts covered include qualitative and quantitative data, levels of measurement, sampling methods, and graphical methods for describing data distribution including histograms, stem-and-leaf plots, and time series graphs. 3) Examples are provided to demonstrate how to construct frequency distributions, histograms, stem-and-leaf plots, line graphs, and time series graphs from sample engineering and statistical data sets.
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 10

ENGINEERING DATA ANALYSIS

Introduction to Statistics and Data Analytics


Chapter 1

1.1 The Engineering Method and Statistical Thinking

Statistical Thinking

The steps in the engineering method are as follows:

• An engineer is someone who solves problems of interest to society by the efficient


application of scientific principles.
• The engineering or scientific, method is the approach to formulating and solving these
problems

Statistical Thinking

o Statistics is the science of data.


o Statistics deals with the collection, presentation, analysis, and use of data to make decisions,
solve problems, and design products and processes.
o Statistical methods are used to help us describe and understand variability.
o Statistical methods are used to analyze data from a process in order to gain more sense of
where in the process changes may be made to improve the quality of the process.

Statistical Thinking
o Inferential statistics uses techniques that allow us to go beyond merely reporting
data to drawing conclusions (or inferences) about the scientific system. ➢
Statistical inference is one type of reasoning
Statistical Thinking
o Reasoning based on measurements from some objects to measurements on all
objects can result in errors (called sampling errors).
o However, if the sample is selected properly, these risks can be quantified, and an
appropriate sample size can be determined.

1.2 Collecting Engineering Data

Sampling Methods:

• Stratified Random Sampling


• Cluster Random Sampling
• Systematic Random Sampling

Three (3) basic methods of collecting data:

• Retrospective Study
• Observational Study
• Designed Experiment
o Statistic - sample
o Parameter – population

Two Categories of Data:

• Qualitative Data
• Quantitative Data
• Quantitative Discrete Data
• Quantitative Continuous Data

o A sample should have the same characteristics as the population it is


representing.
o Two or more samples from the same population, taken randomly, and having
close to the same characteristics of the population will likely be different from
each other, or has variation.

Four levels of measurement:

• Nominal Scale Level


• Ordinal Scale Level
• Interval Scale Level
• Ratio Scale Level

o A frequency is the number of times a value of the data occurs.


o A relative frequency is the ratio (fraction or proportion) of the number of times a
value of the data occurs in the set of all outcomes to the total number of outcomes.
o Cumulative relative frequency is the accumulation of the previous relative
frequencies.
1.3 Descriptive Statistics

Tabular Methods in Descriptive Statistics

Frequency Distribution (Frequency Table)

• Ungrouped Data (n<30)


• Grouped Data

Ungrouped Data Frequency Table

• Raw Data
• Array
• Frequency
• Relative Frequency

Sample Problems
Example 1.1

A certain machine is to dispense 1.6 kilos of alum. To determine whether it is properly adjusted to
dispense 1.6 kilos of quality control engineer weighed thirty bags of 1.6 kilo alum after the machine was
adjusted. The data given below refer to the net weight (in kilos) of each bag. Arrange the data in
frequency table.

Grouped Data Frequency Distribution


1. Find the range (R). The range is the difference the largest and smallest value.
2. Decide on a suitable number of classes depending upon what information the table is supposed to
present. Sturge suggested the number of classes (m) as m=1+3.3 log n where n=number of cases.
3. Determine the class size (c). C=R/m
4. Find the number of observations in each class. This is the class frequency (f).

Sample Problems
Example 1.2
The following are the observed gasoline consumption in miles per gallon of 40 cars. Arrange the data in a
frequency distribution. Also provide the class boundaries and class marks for each corresponding classes.
Lastly, provide a table for the cumulative frequency distribution of the gasoline consumption.
Graphical Methods in Descriptive Statistics

• Stem-and-leaf plots (Stemplot)

• Line graphs

• Bar graphs

• Histogram

• Frequency polygons

• Time series graphs

Sample Problems
Example 1.3
For Prof. Dumbledore’s calculus class, scores for the first exam were as follows (lowest to highest): 32, 42,
49, 49, 53, 55, 55, 61, 63, 67, 68, 68, 69, 72, 73, 74, 78, 80, 83, 88, 88, 88, 90, 92, 94, 94, 94, 94, 96, 100.
Construct a stemplot and describe the data.
Sample Problems
Example 1.4
In a survey, 40 mothers were asked how many times per week a teenager must be reminded to do his or
her chores. The results are: 2 mothers said 0 times, 5 said once, 8 said twice, 14 said 3 times, 7 said 4
times, 4 said 5 times. Construct a line graph and interpret the survey.

Sample Problems
Example 1.5
By the end of 2020, TikTok had over 146 million users in the Southeast Asia. The table below shows three
age groups, the number of users in each age group. Construct a bar graph using this data and describe
the graph.
Sample Problems
Example 1.6
The following data are the heights (in inches to the nearest half inch) of 100 male professional soccer
players. The heights are continuous data, since height is measured.
60; 60.5; 61; 61; 61.5; 63.5; 63.5; 63.5; 64; 64; 64; 64; 64; 64; 64; 64.5; 64.5; 64.5; 64.5;
64.5; 64.5; 64.5; 64.5; 66; 66; 66; 66; 66; 66; 66; 66; 66; 66; 66.5; 66.5; 66.5; 66.5; 66.5;
66.5; 66.5; 66.5; 66.5; 66.5; 66.5; 67; 67; 67; 67; 67; 67; 67; 67; 67; 67; 67; 67; 67.5;
67.5; 67.5; 67.5; 67.5; 67.5; 67.5; 68; 68; 69; 69; 69; 69; 69; 69; 69; 69; 69; 69; 69.5;
69.5; 69.5; 69.5; 69.5; 70; 70; 70; 70; 70; 70; 70.5; 70.5; 70.5; 71; 71; 71; 72; 72; 72;
72.5; 72.5; 73; 73.5; 74
Calculate the width of each bar or class interval, relative frequency of each bar and construct a
histogram.
Sample Problems
Example 1.7
Construct an overlay frequency polygon comparing the students’ calculus final test scores with the
students’ final numeric grade.
Sample Problems
Example 1.8
The following data shows the Annual Consumer Price Index, each month for ten years. Construct
a time series graph for the Annual Consumer Price Index data only.

You might also like