0% found this document useful (0 votes)

2 views

Data Visualization and Communication Introduction

Uploaded by

bsf23000703

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

2 views

Data Visualization and Communication Introduction

Uploaded by

bsf23000703

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 14

Data visualization and communication

Data visualization is the process of showing data using different visuals like graphs and charts. Organizing
the data in a pattern helps you analyze, evaluate, and generate reports.

In this lesson, you will learn

• Analyze the concept of data visualization and its role in data analysis and report generation

• Evaluate different methods of data visualization and determine the appropriate method to use
based on the type of data

• Apply knowledge of data visualization methods to create and interpret tables, graphs, and charts

• Synthesize insights from data visualizations to make data-driven decisions and communicate
findings effectively

• Evaluate the accuracy, clarity, and effectiveness of data visualizations

Skills Covered In Lesson:

• Skill 4.1: Report data

• Skills 4.2a and 4.3a: Create and derive conclusions from visualizations that compare one or more
categories of data

• Skills 4.2b and 4.3b: Create and derive conclusions from visualizations that show how individual
parts make up the whole

• Skills 4.2c and 4.3c: Create and derive conclusions from visualizations that analyze trends

• Skills 4.2d and 4.3d: Create and derive conclusions from visualizations that determine the
distribution of data

• Skills 4.2e and 4.3e: Create and derive conclusions from visualizations that analyze the
relationship between sets of values

Report data introduction

Data reporting is the process of collecting and organizing raw data and representing it with a suitable
visualization to analyze the data. There are different visualization methods to organize and represent data
for analysis. This section helps you understand how data can be organized by using tables and charts.

This skill covers how to:

• Use tables and charts to display information

• Disaggregate data

Using tables and charts to display information

Data reports can be prepared and represented using different methods. Some of the basic methods to
visualize data use tables and charts. By using tables, qualitative and numerical data can be arranged into
different categories or groups by separating them into rows and columns.

For example, tables can be typed directly into Excel. Table 4-1 illustrates that student data can be
organized using tables:

Table 4-1

ID Name Score
1182 James 80
3701 Matthew 66
3853 Robert 69
4461 Joseph 87
4641 Thomas 75
6001 Mike 85
6637 Anee 76
6701 Alen 88
8159 John 82
9225 Daniel 63
In Table 4-1, students’ id numbers, names, and scores are organized in rows and columns. It is easier to
interpret and compare data in tables than it is in regular text.

R can also be used to organize data into a table. First, you load each column into a list, and then use
a data.frame command to organize the lists into a table. One way to do this is shown in the following code
and Figure 4-1

ID <- c(1182, 3701, 3853, 4461, 4641, 6001, 6637, 6701, 8159, 9225)

Name <- c("James", "Matthew", "Robert", "Joseph", "Thomas", "Mike", "Anee", "Alen",
"John", "Daniel")

Score <- c(80, 66, 69, 87, 75, 85, 76, 88, 82, 63)

ScoreTable <- data.frame(ID, Name, Score)

ScoreTable
Figure 4-1

Charts are another way of representing data. Using charts, data can be represented with different color
codes and patterns, which makes it easier to analyze the data. There are different types of graphical
representations used to visualize data, such as column charts, bar charts, pie charts, line charts, etc. The
students’ scores can be represented in the form of an Excel chart, using the following steps.

After entering the data in Table 4-1, next highlight data and headers (here in A1: C:11), click on Insert and
choose the 2-D column chart. Next, click on Select Data, and change the chart data range to
“=Sheetx!$B1:$C$11” where x is the sheet containing the data, then click OK like in Figure 4-2.

Figure 4-2

Data labels can be added to this chart by clicking Add Chart Element, then Data Labels, and finally Center.
The chart that is generated is shown in Figure 4-3.
Figure 4-3

In R, a basic chart can be made after loading the data into a data frame by using the barplot() command,
as below.

barplot(height=ScoreTable$Score, names=ScoreTable$Name, ylim=c(0,100), xlab="",

ylab="Score", space=0.05, las=2)

mtext("Name", side=1, line=4)

In these commands, height is the response variable (the y-axis values), names is the predictor variable
(the x-axis variable), ylim scales the y-axis, ylab gives the axis labels, the space command positions the
bars next to one another, and the las command rotates the x-axis labels to vertical. Because of the long
names, the xlab label command is left blank, and a text line is added below the chart for axis labeling
with mtext. The result is shown in Figure 4-4:
Figure 4-4

Both tables and charts can be used to visualize data. Depending on the purpose, different visualization
methods (tables or charts) can be used to display and analyze the data. If the purpose of the data analysis
is to sort or search, tables can be used. However, charts can be better suited to interpreting the data
visually.

Example: The following data report of a retail shop is represented using both a table and a chart.

First, detailed information about the items, unit price, units sold, purchase date, revenue, total cost, and
total price can be organized and formatted in a table.

In Excel, this is accomplished by entering the data in rows and columns and highlighting the dataset. On
the Home tab highlight Format as table and choose the desired style. Then check the data range and click
the “My data has headers” box, as shown in Figure 4-5:

Figure 4-5
This produces an easy-to-read table of retail data, as shown in Table 4-2 and provided as an Excel file
named “Table 4_2_Retail Data.xlsx” in the course downloads.

Table 4-2

Items Unit Price ($) Units Purchase Revenue Total Cost ($) Profit
Sold Date ($) ($)
Vegetables 150 2550 2/2/2022 450000 382500 67500
Fruits 200 3000 2/3/2022 750000 600000 150000
Grains 125 2250 4/2/2023 400250 281250 119000
Dairy 350 5000 1/15/2022 2550000 1750000 800000
Cosmetics 500 4000 3/22/2023 2750000 2000000 750000
Toys 120 3500 12/1/2022 550000 420000 130000
Stationery 50 3250 6/2/2022 245000 162500 82500
Items
Using this table, the profit for each item can be easily compared by using the Profit column to sort the
dataset from highest to lowest (or vice versa). However, the relationship between cost, profit and
revenue cannot be easily seen without a graphical representation.

Using a chart, the relationship between these key economic measurements is visualized. For instance, we
can assess

• how profit levels relate to revenue levels for this retail shop.

Furthermore, we can start to ask questions such as:

• do low profit levels match low revenue?

• Or do high profit levels closely match low cost levels?

By asking these questions and using a graphical representation to answer these questions, we can begin
to start to make conclusions about how these economic measurements are related to each other, if they
are related at all.

In Excel, highlight the dataset and on the Insert tab choose 2-D Column. Right click on the chart that
appears and choose Select Data to see the window in Figure 4-6. Then under Horizontal (Category) Axis
Labels, choose the data for the labels (here A2:A8).
Figure 4-6

The resulting chart makes it much easier to visually inspect trends and comparisons in data, as shown
in Figure 4-7:

Figure 4-7

In R, a chart like this can be created by rearranging the data slightly and using the graphics library ggplot2.
First, the data is entered by using one column for Revenue, Cost, and Profit amounts and a second column
to indicate the category. This code can be run in your browser at rdrr.io or with RGui installed on your
machine. The generated table is shown in Figure 4-8.
Items <- c("Vegetables", "Fruits", "Grains", "Dairy", "Cosmetics", "Toys", "Stationery
Items", "Vegetables", "Fruits", "Grains", "Dairy", "Cosmetics", "Toys", "Stationery
Items", "Vegetables", "Fruits", "Grains", "Dairy", "Cosmetics", "Toys", "Stationery
Items")

UnitPrice <- c(150, 200, 125, 350, 500, 120, 50, 150, 200, 125, 350, 500, 120, 50, 150,
200, 125, 350, 500, 120, 50)

UnitsSold <- c(2500, 3000, 2250, 5000, 4000, 3500, 3250, 2500, 3000, 2250, 5000,
4000, 3500, 3250, 2500, 3000, 2250, 5000, 4000, 3500, 3250)

PurchaseDatex <- c("2/2/22", "2/3/22", "4/2/23", "1/15/22", "3/22/23", "12/1/22",

"6/2/22", "2/2/22", "2/3/22", "4/2/23", "1/15/22", "3/22/23", "12/1/22", "6/2/22",
"2/2/22", "2/3/22", "4/2/23", "1/15/22", "3/22/23", "12/1/22", "6/2/22")

PurchaseDate <- as.Date(PurchaseDatex, format = "%m/%d/%y")

Dollars <- c(450000, 750000, 400250, 2550000, 2750000, 550000, 245000, 328500,
600000, 281250, 1750000, 2000000, 420000, 162500, 67500, 150000, 119000, 800000,
750000, 130000, 82500)

Attribute <- c("Revenue", "Revenue", "Revenue", "Revenue", "Revenue", "Revenue",

"Revenue", "Total Cost", "Total Cost", "Total Cost", "Total Cost", "Total Cost", "Total
Cost", "Total Cost", "Profit", "Profit", "Profit", "Profit", "Profit", "Profit", "Profit")

StoreTable <- data.frame(Items, UnitPrice, UnitsSold, PurchaseDate, Dollars, Attribute)

StoreTable
Figure 4-8

Immediately after creating the table, you can build the table with the ggplot2 package with the code
below.

library (ggplot2)

ggplot(StoreTable, aes(fill=Attribute, y=Dollars, x=Items)) + geom_bar(position='dodge',

stat='identity')

If you are using RGui and you get an error, you might need to install the ggplot2 package using the
following code. Fortunately, this line of code only needs to be run once.

install.packages("ggplot2")

In the ggplot command, you call up the data frame, then choose the fill (categories for separate bars), y
values (response variable), and x values (predictor variable). Then you use the command ‘dodge’, to plot
the bars next to one another for each value of x, as shown in Figure 4-9:
Figure 4-9

From these charts (R or Excel), you can notice that the profit for the two highest cost categories is a fair
bit higher than those for the lower cost categories. This can help direct efforts in the retail operations. A
column chart is an appropriate visualization in this scenario because it makes it easier for the audience to
compare profit, cost, and revenue (sales) across multiple categories by comparing the heights of the bars.
For example, in this scenario, the business can see that, although cosmetics generate more sales than
dairy, the higher cost makes dairy a slightly more profitable product.

Disaggregate data

Disaggregate data is aggregate data (sums, totals, averages, rates, etc.) that retains some of its original
information about different subgroups (gender, age, economic status, etc.) linked to these aggregated
measures. Analyzing disaggregated data allows you to retain the simplicity of summarized (aggregate)
data metrics but still makes available the ability to compare these measurements between and within
these subgroups.
A large data set can have a number of factors or attributes for each data point. For example, in Table 4-3a,
the aggregated data provides information on the average annual income of individuals based solely on
their country. However, by disaggregating this data, you can retain information about additional factors
such as skill level and gender. This disaggregated view enables you to gain insights into the variations and
differences based on these specific factors, offering a more detailed analysis of the income patterns and
disparities.

Table 4-3a

Country Annual Income in USD ($)

USA 36832.37
MEX 5141.96
Table 4-3b shows a sample of aggregated data for annual average income for workers in the USA and MEX,
disaggregated to retain information about gender, skill level, and economic sector.

Table 4-3b

Annual Income in USD ($)

Female Male
Country Sector Unskilled Skilled Unskilled Skilled
USA AFS 17716.66 24545.44 23060.08 32230.86
USA ATP 30944.21 36078.03 37742.14 56375.02
USA B_T 27683.43 33189.47 29067.60 51581.51
USA BPH 27315.35 43688.90 44920.22 70441.86
USA C_B 27762.58 44459.74 29150.71 48693.49
MEX AFS 2964.39 5826.95 4303.06 5210.09
MEX ATP 5197.749 8463.39 5457.64 8005.26
MEX B_T 2405.46 5794.55 5385.37 9504.20
MEX BPH 2405.46 5794.55 5385.37 9504.20
MEX C_B 1457.71 2762.03 2708.13 4303.46
Table 4-3b Part 2

Code Sector
afs Accommodation, Food, and Service Activities
atp Air Transport
b_t Beverages and Tobacco Products
bph Basic Pharmaceutical Products
c_b Sugar Cane, Sugar Beet

Plotting and analyzing the data in the complete table is complicated because of the volume of data that
is available. In a disaggregated dataset, the data is broken down into smaller, more manageable subsets
based on specific criteria. It allows you to extract detailed insights by diving deeper into specific groups or
categories.

In Excel, you would highlight the entire data set and from the Insert tab choose the Pivot Table button.
When the table pops up, drag the Country and Sector fields to the Rows box, Gender and Skill Level to
the Columns box, and the Annual Income field to the Values box. Then click on the Annual Income field,
chose Value Field Settings, and the Average option, per Figure 4-10:

Figure 4-10:

Figure 4-10

You can divide into subgroups by using the filtering that you just set up with the headers. Using the down
arrow found in the Sector heading, choose the AFS (Accommodation, Food and Service Activities) as
in Figure 4-11, with the resulting table as in Figure 4-12:
Figure 4-11

Figure
4-12
Figure 4-12

This data can be displayed graphically, as in Figure 4-13. The filtered dataset is provided below. We can see
from this dataset that US workers earn a vastly larger wage than workers in Mexico, in all subgroups.
Furthermore, there is minimal difference between Female and Male Skilled Workers in Mexico, but a
significant difference between these two subgroups in the US.

Figure 4-13

Data Visualization Exploring and Explaining With Data J.camm Bibis - Ir
100% (1)
Data Visualization Exploring and Explaining With Data J.camm Bibis - Ir
418 pages
How To Choose The Right Data Visualization
100% (1)
How To Choose The Right Data Visualization
26 pages
A Quick and Easy Guide in Using SPSS for Linear Regression Analysis
From Everand
A Quick and Easy Guide in Using SPSS for Linear Regression Analysis
Jurex Gallo
No ratings yet
Callister Solutions of Ch08
100% (1)
Callister Solutions of Ch08
42 pages
Dataspeed Quick Guide To Running Carla Simulator
No ratings yet
Dataspeed Quick Guide To Running Carla Simulator
26 pages
Lesson 4 part 1-3
No ratings yet
Lesson 4 part 1-3
61 pages
Microsoft Excel Data Visualisation
No ratings yet
Microsoft Excel Data Visualisation
16 pages
Lesson 3-Bus. Math
No ratings yet
Lesson 3-Bus. Math
23 pages
Module 4
No ratings yet
Module 4
69 pages
DV Lab Manual (Ex - No.1-10)
No ratings yet
DV Lab Manual (Ex - No.1-10)
23 pages
Unit 3 DATA VISUAIZATION
No ratings yet
Unit 3 DATA VISUAIZATION
25 pages
DAV Lab Sample
No ratings yet
DAV Lab Sample
21 pages
DV Unit-I
No ratings yet
DV Unit-I
25 pages
Unit 4
No ratings yet
Unit 4
21 pages
Analysis Process: Data Visualization
No ratings yet
Analysis Process: Data Visualization
33 pages
DV Co1 All PDF
No ratings yet
DV Co1 All PDF
196 pages
From Data To Charts
No ratings yet
From Data To Charts
27 pages
DATA VISUALIZATION INTRO
No ratings yet
DATA VISUALIZATION INTRO
25 pages
_
No ratings yet
_
26 pages
DV-Viva-Voice-Data Visualization
No ratings yet
DV-Viva-Voice-Data Visualization
12 pages
LAB Manual Updated
No ratings yet
LAB Manual Updated
97 pages
Document (9)
No ratings yet
Document (9)
8 pages
Written Report - Chapter 3 - Visualizing Data
No ratings yet
Written Report - Chapter 3 - Visualizing Data
5 pages
Lecture 4. Visualization(1)
No ratings yet
Lecture 4. Visualization(1)
38 pages
2/ Organizing and Visualizing Variables: Dcova
No ratings yet
2/ Organizing and Visualizing Variables: Dcova
4 pages
UNIT4
No ratings yet
UNIT4
8 pages
(English) Charts Are Like Pasta - Data Visualization Part 1 - Crash Course Statistics #5 (DownSub - Com)
No ratings yet
(English) Charts Are Like Pasta - Data Visualization Part 1 - Crash Course Statistics #5 (DownSub - Com)
8 pages
Chap 03 Data Visualization
100% (1)
Chap 03 Data Visualization
61 pages
Data Visualization Techniques
No ratings yet
Data Visualization Techniques
20 pages
MCA_S3_Data Visualisation_U2
No ratings yet
MCA_S3_Data Visualisation_U2
17 pages
DVDG - Sessions 1 and 2
No ratings yet
DVDG - Sessions 1 and 2
42 pages
Data Visualization
No ratings yet
Data Visualization
23 pages
Lesson 4
No ratings yet
Lesson 4
64 pages
Week 2 Lesson 2
No ratings yet
Week 2 Lesson 2
15 pages
DA Unit 1
No ratings yet
DA Unit 1
43 pages
03 GraphicalPart2 Numerical
No ratings yet
03 GraphicalPart2 Numerical
43 pages
Discriptive Statistics
No ratings yet
Discriptive Statistics
52 pages
Diagrammatic and Graphical Presentation of Data
No ratings yet
Diagrammatic and Graphical Presentation of Data
17 pages
15 Questions DV 3rd Year a Sec
No ratings yet
15 Questions DV 3rd Year a Sec
51 pages
Introduction to Data Science Module 1 (1)
No ratings yet
Introduction to Data Science Module 1 (1)
32 pages
RM Lesson 10
No ratings yet
RM Lesson 10
28 pages
BUS 4055 Week 5
No ratings yet
BUS 4055 Week 5
16 pages
DV UNIT 2
No ratings yet
DV UNIT 2
5 pages
Runit 2
No ratings yet
Runit 2
50 pages
Chapter 3 - Visualizing Data
No ratings yet
Chapter 3 - Visualizing Data
70 pages
Evans Analytics3e PPT 03 Accessible v2
No ratings yet
Evans Analytics3e PPT 03 Accessible v2
36 pages
kundan jiiiiii
No ratings yet
kundan jiiiiii
26 pages
Data Visualization Techniques 1
No ratings yet
Data Visualization Techniques 1
27 pages
Richardson_DAA_3e_PPT_Ch04 (2) (3) (1)
No ratings yet
Richardson_DAA_3e_PPT_Ch04 (2) (3) (1)
37 pages
Data Visualisation
No ratings yet
Data Visualisation
11 pages
M2 - Visualization of Categorical and Numerical Data
No ratings yet
M2 - Visualization of Categorical and Numerical Data
20 pages
Data Visualization Notes
No ratings yet
Data Visualization Notes
4 pages
DATA VISUALIZATION - R PROGRAMMING POWER BI
No ratings yet
DATA VISUALIZATION - R PROGRAMMING POWER BI
51 pages
Chapter 3 - Data Visualization
No ratings yet
Chapter 3 - Data Visualization
36 pages
U1T3 - White Paper - Data Visualization Techniques From Basics To Big Data With SAS Visual Analytics
No ratings yet
U1T3 - White Paper - Data Visualization Techniques From Basics To Big Data With SAS Visual Analytics
19 pages
Data Visualization Techniques: Dr. D. Koteswara Rao
No ratings yet
Data Visualization Techniques: Dr. D. Koteswara Rao
41 pages
Data Visualization Exploring and Explaining With Data (Jeffrey D. Camm, James J. Cochran Etc.) (Z-Library)-1-173
No ratings yet
Data Visualization Exploring and Explaining With Data (Jeffrey D. Camm, James J. Cochran Etc.) (Z-Library)-1-173
173 pages
Pertemuan 2 Pengantar Bistat
No ratings yet
Pertemuan 2 Pengantar Bistat
26 pages
Group 3
No ratings yet
Group 3
25 pages
Ameer Data Visualization and Techniques
No ratings yet
Ameer Data Visualization and Techniques
4 pages
Tableau 8.2 Training Manual: From Clutter to Clarity
From Everand
Tableau 8.2 Training Manual: From Clutter to Clarity
Larry Keller
No ratings yet
Excel Statistics: Step by Step
From Everand
Excel Statistics: Step by Step
Stephanie Glen
4/5 (8)
Lesson 3 Notes
No ratings yet
Lesson 3 Notes
53 pages
Google Cluster Data Preprocessing - Updated
No ratings yet
Google Cluster Data Preprocessing - Updated
4 pages
Histogram, box and whisker plots
No ratings yet
Histogram, box and whisker plots
7 pages
Simple Linear Regression Using a Real Dataset in R and Excel
No ratings yet
Simple Linear Regression Using a Real Dataset in R and Excel
4 pages
Information Security Lecture 6
No ratings yet
Information Security Lecture 6
12 pages
Information_Security_Lecture_2
No ratings yet
Information_Security_Lecture_2
15 pages
Information Security Lecture 5
No ratings yet
Information Security Lecture 5
12 pages
Chapter 11 Introduction To Urban Hydrology
No ratings yet
Chapter 11 Introduction To Urban Hydrology
7 pages
LISTENING 271024
No ratings yet
LISTENING 271024
2 pages
Deep Multimodal Representation Learning A Survey
No ratings yet
Deep Multimodal Representation Learning A Survey
22 pages
LEADERSHIP THEORIES AND STYLES Handout
No ratings yet
LEADERSHIP THEORIES AND STYLES Handout
7 pages
S-44 For Dummies
No ratings yet
S-44 For Dummies
6 pages
Role For Human Reliability Analysis (HRA)
No ratings yet
Role For Human Reliability Analysis (HRA)
28 pages
When Can Transformers Reason With Abstract Symbols?
No ratings yet
When Can Transformers Reason With Abstract Symbols?
55 pages
Saes A 114
No ratings yet
Saes A 114
2 pages
Preliminary Program ICHQP 2014 PDF
No ratings yet
Preliminary Program ICHQP 2014 PDF
22 pages
Strategic Human Resource Management: Mba Iv Sem
No ratings yet
Strategic Human Resource Management: Mba Iv Sem
6 pages
Unit Test 2.1
No ratings yet
Unit Test 2.1
2 pages
Curriculum Vitae of Vanessa Boikhutso Relebogile Shole
No ratings yet
Curriculum Vitae of Vanessa Boikhutso Relebogile Shole
4 pages
Presentation 1
No ratings yet
Presentation 1
10 pages
2007 EM16-16R Babu Et Al - Mod & Inv of Mag and VLF-EM Data
No ratings yet
2007 EM16-16R Babu Et Al - Mod & Inv of Mag and VLF-EM Data
8 pages
BLS 102
No ratings yet
BLS 102
9 pages
2d. Planter
No ratings yet
2d. Planter
51 pages
Alphabrain: How A Group of Iconoclasts Are Using Cognitive Science To Advance The Business of Alpha Generation Stephen Duneier
No ratings yet
Alphabrain: How A Group of Iconoclasts Are Using Cognitive Science To Advance The Business of Alpha Generation Stephen Duneier
62 pages
IGCSE Biology Section 4 Lesson 1
No ratings yet
IGCSE Biology Section 4 Lesson 1
49 pages
5 0 Applications First - Order - Odes
No ratings yet
5 0 Applications First - Order - Odes
16 pages
HKDSE 2013 Physics (English Paper) - PaperPapa
No ratings yet
HKDSE 2013 Physics (English Paper) - PaperPapa
1 page
Lesson 12 EAPP
No ratings yet
Lesson 12 EAPP
2 pages
Applications of Statistics in Daily Life
No ratings yet
Applications of Statistics in Daily Life
11 pages
Quotation Marks Lesson Plan
No ratings yet
Quotation Marks Lesson Plan
6 pages
FINAL 01 RPMS 2022-2023 PURPLE TEMPLATE - Results-Based-Performance-Management-System
No ratings yet
FINAL 01 RPMS 2022-2023 PURPLE TEMPLATE - Results-Based-Performance-Management-System
43 pages
IMP Questions For MP-I
No ratings yet
IMP Questions For MP-I
5 pages
Theory and Applications of Monte Carlo Simulations 2013
No ratings yet
Theory and Applications of Monte Carlo Simulations 2013
284 pages
OIV World Wine Production Outlook 2023
No ratings yet
OIV World Wine Production Outlook 2023
9 pages
Semillas Ecologia de La Regeneracion en Plantas PDF
100% (1)
Semillas Ecologia de La Regeneracion en Plantas PDF
423 pages