Data Visualization, Volume II
Data Visualization, Volume II
Volume II
Data Visualization,
Volume II
Uncovering the Hidden Pattern in
Data Using Basic and New Quality
Tools
Amar Sahay
Data Visualization, Volume II: Uncovering the Hidden Pattern in Data Using
Basic and New Quality Tools
Copyright Business Expert Press, LLC, 2017.
10987654321
Keywords
big data, business intelligence, charts and graphs, data, data visualization,
information visualization, quality tools, seven basic tools of quality, seven
new tools of quality, visual representation
Contents
Preface...................................................................................................ix
Acknowledgments..................................................................................xiii
Graphical and Visual Tools for Improving Product
and Service Quality........................................................xv
Chapter 1 Overview and Data Visualization.......................................1
Chapter 2 Data and Data Analysis Concepts....................................11
Chapter 3 Systems Processes and Variation.......................................19
Chapter 4 Current Trends in Data Visualization...............................25
Chapter 5 Data Visualization Concepts and Applications.................33
Chapter 6 Seven Basic Quality Tools: Graphical Tools to Solve
Quality Problems.............................................................41
Chapter 7 The Seven New Tools for Quality Improvement...............87
Chapter 8 Other Visual Information Tools in Quality
Improvement.................................................................123
Bibliography .......................................................................................147
Index..................................................................................................151
Preface
The purpose of this book is to introduce the graphical tools and informa-
tion visualization tools widely used in data analysis, visualization, and
quality improvement to analyze, enhance, and improve the quality of
products and services. Visual tools are an easy way to gain a first look at
your data and they have been used to gain an insight into the data before
applying more complex analysis. The book provides a collection of visuals
and graphical tools. The visual tools are commonly referred to as graphi-
cal tools. A number of charts and graphs are commonly used to create
visuals that provide a quick summary, trends, and patterns in the data
which are not usually apparent from the data in raw form.
The first part of the book presents background information and the
fundamental concepts relating to data visualization. The following con-
cepts are covered in the first part:
The second part of the book is devoted to quality tools. These are a set of
graphical and information visualization tools that have been developed and
used over the years in quality improvement and Lean Six Sigma programs.
The use of these data visualization and quality tools is not limited to qual-
ity programs. The key areas where these tools are applied include business
process improvement, business data analysis, health care, finance, manu-
facturing, engineering process improvement, and product and process
design, to name a few. These visual tools are powerful decision-making
tools.
x PREFACE
The quality tools in this text represent data visually that enable the
analyst to immediately see the important features and characteristics of
data. The graphs and charts provide the current state of the process and
can also show the opportunities for improvement.
Some of the visual displays, for example, flow diagrams and value
stream mapping, have been successfully used in studying, developing, and
improving business and engineering processes. They also help to rede-
sign more efficient processes. Besides improving the process design, many
specially designed graphs and charts are used in product and process de-
sign and improvement. In many cases, these visual tools provide an idea
about the variation in the process that allows the opportunity for reduc-
ing variation. Variation reduction is one of the major goals of process
improvement and quality improvement. In many cases, the visual tools
also help reveal the waste in the process. These graphical tools are critical
in identifying waste and variation in any process. All processesservice
or manufacturinghave two things in commonwaste and variation.
Minimizing and eliminating wastes and defects leads to a lean and defect-
free process with enhanced quality. Waste and variation reduction also
can significantly reduce the cost of poor quality. The quality tools in this
book are problem-solving and decision-making tools that can be applied
to improve the product and service quality. The data and information vi-
sualization tools discussed in this book have been successfully applied to:
Introduction
This book is about visual representation of data commonly known as data
visualization. The visualization tools or the graphical displays can be di-
vided into the following two categories:
1. Data Visualization
2. Information Visualization
Data Visualization
Data visualization usually represents graphs and charts that are visual rep-
resentations of data. These graphical displays provide a powerful way of
summarizing and presenting data in a way that most people find easier
to comprehend. Charts and graphs enable us to see the main features or
characteristics of the data. The graphs not only enable us to present the
numerical findings of a study, but also provide the shape and pattern of
the data which are critical in data analysis and decision making.
Some examples where visual displays (in the form of graphs) are used
to summarize data are presented below. These graphs summarize the sales
and revenue of the top computer companiesAmazon and Apple Inc.
It is said that a picture is worth a thousand words; this is particu-
larly true when a large set of data is effectively presented using charts and
graphs that quickly reveal important features. Visual displays of data are
easily recognizable and are found ubiquitously in business periodicals,
financial magazines, on the Internet, and televisions.
2
Top Five Revenue - Internet Apple Inc. Revenue by Category-
Companies ($Billion) Fiscal Q1 2012
iPad CPU
Dell 51.9 Sales Sales
20% iPod 14%
Amazon 24.5 Other Sales
3% 5%
Google 23.7
8.7 ITunes
eBay
Store
Yahoo 4% iPhone
6.5
Sales
0 20 40 60 54%
DATA VISUALIZATION, VOLUME II
20 18.7
30
26.3 25.4
24.6
15 22.5
12.8
10.2 20 18.0 18.1
10
Sales ($Billion)
8.1
0 0
2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011
Year Year
The above examples show how a number of charts and graphs are used
to describe the key features of data. A solid understanding of these graphs
will enable you to describe the key concept of the data visually, and will
aid in both your personal and professional life. With the advancement in
technology, high-quality and complex charts and graphs can be produced
easily. A number of charts and graphs can be found in reports of finan-
cial periodicals like The Economist, Business Week, Fortune, and many other
business and engineering periodicals. Almost every issue of USA Today and
The Wall Street Journal contains a number of visual displays in their articles.
Information Visualization
Software Applications
Most of the graphs in this text can be produced using statistical and data
visualization software. We will illustrate several examples where the com-
puter software including EXCEL and MINITAB are used to construct
the charts and graphs. Some other graphical displays, for example, flow
diagrams, process maps, and value stream maps, widely used in studying
and improving process are created using specialized software. MINITABs
Quality Companion, Microsoft Visio, and Smart Draw are some of
the widely used programs for this purpose. Another widely used software
for Data Visualization and Visual Analytics is Tableau Software. This
software is capable of handling big data and creates high-level graphs and
charts to visually display data. An added feature of Tableau is the analyt-
ics feature built into it that can answer many queries not apparent from
the graphs and charts alone.
Overview and Data Visualization
Chapters at a Glance
The first part of the text provides the basic concepts and fundamentals of
data and data analysis including the types of data and types of data visu-
alization. It also presents the concepts of systems and processes followed
by the current trend in data visualization and big data.
As outlined, visualizing data graphically helps to detect potential prob-
lems and identify the areas of improvement opportunities. The chapters in
the text are divided into sections with different data visualization concepts
and tools. A brief outline of the chapters in this book is provided below.
Chapter 1
Chapter 2
Chapter 2 discusses the basic concepts related to data and data analy-
sis. Types of dataqualitative or categorical data, quantitative data, and
other classifications of dataare presented. This chapter also presents the
concept of variablesboth qualitative and quantitative. Almost all data
show variation, and visual tools are an excellent way to study variation in
the data. We discuss the sources of data and how data are collected for
research and analysis. The types of data based on measurement scales and
recent trends in data visualization are introduced.
8 DATA VISUALIZATION, VOLUME II
Chapter 3
Chapter 4
This chapter introduces big data, current trends, and applications that in-
volve massive amounts of data. The chapter outlines the concepts of Data
Visualization and Visual Analytics using big data. Recent applications in data
visualization involving massive amounts of data are discussed. This chapter
introduces data visualization as visual communication that presents the data
in graphical form. The visual tools in the form of charts, graphs, and other
visuals including flowcharts to communicate the information in the data ef-
fectively are discussed. The software applications to create the visuals from
the big data and applications of big data in various fields are introduced.
Chapter 5
the same data. A discussion follows on how visualization with big data
is becoming a requirement because of the increase in the volume of data
being collected and stored along with the challenges in storing, analyzing,
processing, and communicating the huge amount of data.
Chapter 6
This chapter discusses the graphical techniques that are widely used in
quality improvement, lean six sigma, and also in analyzing business-related
data and processes. These are commonly referred to as the Seven basic
tools or the basic tools of quality. The graphical and visual tools in this
category include:
Chapter 7
This chapter deals with another set of graphical tools commonly known
as the Seven new tools of quality. More appropriately, these tools are re-
ferred to as graphical and information visualization tools. They have wide
applications in decision-making and quality improvement programs. The
following visualization tools are discussed in this chapter:
Chapter 8
Summary
This chapter provided an overview and importance of data visualization. It
lays a foundation for the rest of the book by outlining the chapter contents.
The subsequent chapters present concepts and explain the data and infor-
mation visualization tools that can be applied in areas ranging from simple
to advanced analysis. The charts and graphs find wide applications in data
analysis and also in quality improvement projects to detect and solve a
number of problems. These graphs and charts are critical in understanding
the process from which the data are collected. They range from commonly
used graphical tools to data and information visualization tools known as
basic and new tools of quality.
Index
D F
D3, 38 Facebook, 32
Data Federal Reserve Economic Data,
characteristics of, 1314 (FRED), 16
classification of, 1415, 20 Flow diagrams, 5, 39
collection of, 1617 for online order process, 8
continuous, 15 for recruitment process, 7
cross-sectional, 14 Flow process charts, 5, 39, 4852
discrete, 15 business process mapping, 52
levels of measurement, 1820 of hiring process, 49
qualitative, 14 of medical services, 50
quantitative, 14 of recruitment process, 51
time series, 14
Data analytics, 29 G
Data collection Google, 16, 31, 32
experimental design, 16 Graphical tools, of quality, 4344
Google, 16 cause-and-effect/fishbone diagrams,
government agencies, 16 8082
internet sites and available data, 17 check sheets, 5859
processes, 17 control charts, 7580
telephone/mail surveys, 1617 histograms, 5963
Data mining, 27 pareto charts, 8285
Data visualization process maps, 4557
big data, 2832 run charts, 6775
different forms of, 36 scatter diagrams/plots, 6366
effective graphical displays, 36 Graphs, 39
fundamental concepts in, 3536 data visualization using, 4
information displays, types of, 3839
information visualization, 56, 7, 8
introduction to, 35
H
Histograms, 37, 5963
quantitative messages by, 3738
process capability, evaluating, 6063
recent trends, 27
shift and variation in process,
software applications, 6, 3942
detecting, 60
software tools for, 38
uses of, 5960
terminology for, 38
House of Quality (HOQ), 125
using charts and graphs, 4
competitive assessments, 134135
Dell, 31
construction and implementation
Discrete data, 15
of, 129138
Dundas BI, 31
customer requirements (WHATs),
129130
E interrelationship matrix between
eBay.com, 32 HOWs, developing, 132134
Effective graphical displays, 36 prioritized customer requirements,
EMC, 31 developing, 135, 137138
Event, 112, 113 relationship matrix between
EXCEL, 6, 38 WHATs and HOWs,
Exploratory data analysis, 38 131132, 133
INDEX
153
S T
SAP, 31 Table, 39
SAS, 38 Tableau Software, 6, 31
Scales, 39 Time series data, 14
Scatter diagrams/plots, 6366. Tree diagrams, 97100
See also Scatter plot decision tree
with box plots of x and for loan application process, 100
y variables, 66 in manufacturing process, 100
with dot plots of x and for project development, 9899
y variables, 66 steps to construct, 98
with histograms of x and using, 9798
y variables, 65
nonlinear relationship between U
x and y, 63 U.S. Census Bureau, 16
of temperature vs.month, 64 US Library of Congress, 32
Scatter plot, 37
SIPOC (supplier, input, process,
output, and customer) V
diagrams, 5 Value stream mapping (VSM), 5, 39,
SIPOC process map, 4652 5357
flow process charts, 4852 creating, 56
of online order processing, 47 major steps of, 5455
symbols and their meaning in, 48 production and distribution
Smart Draw, 6 system, 57
SOFA, 38 symbols used in, 56
Software AG, 31 value-adding and nonvalue-adding
Software applications, 6 activities, 5354
EXCEL, 6 Variable, 1516
Microsoft Visio, 6 Variable control chart, 78
MINITAB, 6 for shaft diameter, 80
Quality Companion, 6 using MINITAB, 7980
Smart Draw, 6 Variation
Tableau Software, 6 in products and processes, 2526
Statistical graphics, 36 in quality characteristic, 25
Statistical thinking, 21 Visual analytics, 2829
fundamental principles, 21 Visual objects, 39
processes, 2325
systems, 2223 W
variation, 2426 Walmart, 32
OTHER TITLES IN QUANTITATIVE APPROACHES
TO DECISION MAKING COLLECTION
Donald N. Stengel, California State University, Fresno, Editor
Regression Analysis: Understanding and Building Business and Economic Models Using
Excel, Second Edition by J. Holton Wilson, Barry P. Keating, and Mary Beal-Hodges
Operations Methods: Managing Waiting Line Applications, Second Edition
by Kenneth A. Shaw
Using Statistics for Better Business Decisions by Justin Bateh and Bert G. Wachsmuth
Applied Regression and Modeling: A Computer Integrated Approach by Amar Sahay
The Art of Computer Modeling for Business Analytics: Paradigms and Case Studies
by Gerald Feigin
Data Visualization, Volume I: Recent Trends and Applications Using Conventional and
Big Data by Amar Sahay
a one-time purchase,
that is owned forever,
allows for simultaneous readers,
has no restrictions on printing, and
can be downloaded as PDFs from within the library community.
Our digital library collections are a great solution to beat the rising cost of textbooks. E-books
can be loaded into their course management systems or onto students e-book readers.
The Business Expert Press digital libraries are very affordable, with no obligation to buy in
future years. For more information, please visit www.businessexpertpress.com/librarians.
To set up a trial in the United States, please email sales@businessexpertpress.com