Basic Charts and Multidimensional Visualization
Basic Charts and Multidimensional Visualization
BASIC CHARTS
MULTIDIMENSIONAL VISUALIZATION
PRESENTED BY:
G.IYSWARYA
20PMB057
II-MBA- B SECTION
DATA VISUALIZATION
FEATURES
● Identify areas that need attention or improvement.
● Clarify which factors influence customer behavior.
● Decision-making Ability.
● Integration Capability.
● Predict sales volumes.
BASIC CHARTS
A chart is a graphical representation for data visualization, in which "the data is represented by symbols,
such as bars in a bar chart, lines in a line chart, or slices in a pie chart".
❏ Bar charts
❏ Line graphs
❏ Scatter plots
These plots are easy to create in R and are the plots most commonly used in the current business world, in
both data exploration and presentation.
Basic charts support data exploration by displaying one or two columns of data (variables) at a time.
This is useful in the early stages of getting familiar with the data structure, the amount and types of
variables, the volume and type of missing values, etc.
Bar chart
A bar chart or bar graph is a chart or graph that presents categorical data with rectangular bars with
heights or lengths proportional to the values that they represent. The bars can be plotted vertically or
horizontally. A vertical bar chart is sometimes called a column chart.
WHEN TO USE:
To show a distribution of data points or perform a comparison of metric values across different subgroups of
your data.
APPLICATIONS :
● Analysts use bar charts in technical analysis to help monitor price trends, volatility, security movement
indicators, and more.
● Bar charts in technical analysis are also referred to as open, high, low, and closing (OHLC).
They are helpful in spotting trends, monitoring stock prices, and helping trading analysts make decisions.
EXAMPLE:
APPLICATIONS:
The Treemap visualization is used to display hierarchical data as a set of nested rectangles. Rectangles of each level
are of different sizes and colors.
Each characteristic of the building tiles (rectangles) has a role in the data analysis:
● Color - shows the categories by which the treemap visualization is divided. Fields data, used to define this
characteristic, can be numerical (123), string (ABC), or date.
● Size - shows the value for each category. Fields data, used to determine size, can only be numerical (123).
SHAPES AND MULTIPLE PANEL
When the number of categories is large, a better alternative is to use multiple panels.
Creating multiple panels (also called “trellising”) is done by splitting the observations according to a
categorical variable, and creating a separate plot (of the same type) for each category.
Animation can bring data to life during both the visual exploration and storytelling phases. By animating data visualization,
one can engage viewers in many ways. Static data visualization, especially the unique ones, can be persuasive and aid
decision-making. Animating them will boost these advantages even further.
Adding a temporal dimension to a plot to show how the information changes over time can be achieved via animation.
Manipulations: Rescaling, Aggregation
and Hierarchies, Zooming, Filtering
RESCALING
The process of changing the scale or proportions of something.This operation applies
to quantitative graphs in particular.
All graphs have at least one quantitative scale along an axis. Ordinarily, the quantitative
scale places equal space between equal intervals of value. This common type of scale is
sometimes called a linear scale
The graph below displays two sets of sales When we wish to compare rates of change
and avoid this optical suggestion that rate
values: one for hardware sales and one for of change for higher-priced items is greater
software sales. Which is increasing at a than for lower-priced items, log scales are a
faster rate: hardware or software sales? convenient solution. Look at what happens
when I do nothing to the graph above but
change it from a linear to a log scale.
AGGREGATIONS AND HIERARCHIES
Another useful manipulation of scaling is changing the level of aggregation.
Examining different aggregations, or hierarchies supports both supervised and unsupervised tasks in that
it can reveal patterns and relationships at various levels, and can suggest new sets of variables with
which to work.
For a temporal scale, we can aggregate by different granularity (e.g., monthly, daily, hourly) or even by a
“seasonal” factor of interest such as month-of-year or day-of-week. A popular aggregation for time series
is a moving average, where the average of neighboring values within a given window size is plotted.
Moving average plots enhance visualizing a global trend.
Non-temporal variables can be aggregated if some meaningful hierarchy exists: geographical (tracts
within a zip code in the Boston Housing example), organizational (people within departments within
units), etc
AGGREGATION EXAMPLE
Trend lines and using in-plot labels also help to detect patterns and outliers. Trend lines serve as a reference, and
allow us to more easily assess the shape of a pattern. Although linearity is easy to visually perceive, more elaborate
relationships such as exponential and polynomial trends are harder to assess by eye. Trend lines are useful in line
graphs as well as in scatter plots.
The use of in-plot labels can be useful for better exploration of outliers and clusters. Labels make it easier for
users to understand data visualizations by using text to reinforce visual concepts.
Labels are traditionally used to label axes and legends, however, they can also be used inside of data visualizations
to communicate categories, values, or annotations.
EXAMPLE: DATA LABELS
Scaling up to Large Datasets
SCALING UP-TO LARGE DATABASE
When the number of observations (rows) is large, plots that display each individual
observation (e.g., scatter plots) can become ineffective.
Aside from using aggregated charts such as boxplots, some alternatives are:
2.Multiple concurrent charts and tables can be easily combined and displayed on a single screen.
3.A set of visualizations can be linked, so that operations in one display are reflected in the other
displays.