Data Visualization

Uploaded by

Omkar Shinde
Copyright: © All Rights Reserved

 What is EDA?

i. Exploratory Data Analysis (EDA) is an important step that helps us look at the data before making any assumptions.
ii. EDA can help identify obvious errors, better understand patterns within the data, detect outliers or
anomalous events, and find interesting relations among the variables.
iii. Exploratory data analysis is an initial step in data analysis. It is where the analyst takes a bird's-eye
view of the data and tries to make some sense of it.
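The points above can be sketched with the pandas library. This is only a minimal illustration: the data frame, its column names and the 1.5 × IQR outlier rule are assumptions chosen for the example, not part of the notes.

```python
import pandas as pd

# Hypothetical sample data: one age is missing and 250 is a deliberate outlier.
df = pd.DataFrame({
    "age": [23, 25, 31, 35, None, 29, 250],
    "income": [32000, 35000, 41000, 52000, 47000, 39000, 45000],
})

# Bird's-eye view: size, column types and summary statistics.
print(df.shape)
print(df.dtypes)
print(df.describe())

# Obvious errors: count the missing values in each column.
missing = df.isna().sum()

# Outliers: flag values more than 1.5 * IQR away from the quartiles
# (a common rule of thumb, assumed here for illustration).
q1, q3 = df["age"].quantile([0.25, 0.75])
iqr = q3 - q1
outliers = df[(df["age"] < q1 - 1.5 * iqr) | (df["age"] > q3 + 1.5 * iqr)]

# Relations among the variables: pairwise correlation.
corr = df.corr()

print(missing)
print(outliers)
print(corr)
```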

 Define Data Visualization. Why is data visualization important for data analysis? Explain.

Data visualization presents data in a graphical format, on the premise that the human brain processes visual content
better than plain textual information.
Data visualization gives us a clear idea of what the information means by giving it visual context through maps or
graphs.
Data visualization can be useful to :-

Identify outliers in data, improve response time, achieve greater simplicity, make patterns easier to see, simplify
business analysis, and enhance collaboration.

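 What is Visual encoding? Types of Visual Encoding.

Visual encoding means translating data into a visual element on a chart, map, or other graphics object.
Visual encoding is broadly classified into planar and retinal variables.
Planar variables encode data through position on the x and y axes, while retinal variables encode it through
properties such as size, color, and shape.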
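The two classes of encoding can be illustrated with a single scatter plot. The sketch below assumes the usual reading of the terms (planar means x/y position, retinal means properties such as marker size and color); the data values, labels and output file name are hypothetical.

```python
import matplotlib
matplotlib.use("Agg")  # render off-screen; drop this line for an interactive window
import matplotlib.pyplot as plt

# Hypothetical data: four cities with four attributes each.
area = [120, 300, 80, 450]          # planar encoding: x position
population = [1.2, 3.5, 0.6, 5.1]   # planar encoding: y position
income = [30, 45, 25, 60]           # retinal encoding: marker size
rainfall = [800, 400, 1200, 600]    # retinal encoding: marker color

fig, ax = plt.subplots()
points = ax.scatter(area, population,
                    s=[v * 10 for v in income],  # scale income to a visible marker size
                    c=rainfall, cmap="viridis")
ax.set_xlabel("Area (sq. km)")
ax.set_ylabel("Population (millions)")
fig.colorbar(points, ax=ax, label="Rainfall (mm)")
fig.savefig("visual_encoding.png")
```

Four variables are shown at once here: two through position and two through the appearance of each marker.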

 Explain the concept of Visualization graph.

Role of Data Visualization    Possible Illustrative Data Visualization Graph
Distribution                  Scatter chart, 3D Area chart, Histogram
Relationship                  Bubble chart, Scatter chart
Comparison                    Bar chart, Line chart, Column chart, Area chart
Composition                   Pie chart, Waterfall chart, Stacked column chart, Stacked area chart
Location                      Bubble map, Choropleth map, Connection map
Connection                    Matrix chart, Node-link diagram, Word cloud, Alluvial diagram, Tube map

 Explain in brief any two software tools used for Data Visualization.

SAP Analytics Cloud :-
Can generate focused reports and provides collaborative tools for online discussion. Provides import and export
features for spreadsheets and visuals.
Features: easy forecasting, setting up important events, cloud-based protection.

Plotly :-
Provides a vast variety of colorful designs for data visualization. The Chart Studio can be used to create web-based
reporting templates.
Features: 2D and 3D chart options, open-source coding, interactivity, hover tool capabilities.

 Explain Data visualization libraries in Python.

matplotlib library :-
matplotlib is the most commonly used standard Python library for plotting 2D data visualizations.
It is one of the earliest data visualization libraries developed for Python, and many later libraries were built on top
of it.
This library is used to create a variety of visualization graphs such as line plots, pie charts, scatter plots, bar charts,
histograms, stem plots, and spectrograms. It allows easy use of labels, axis titles, grids, legends, and other graphic
elements with customizable values and text.
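A minimal sketch of a matplotlib line plot using the labels, axis titles, grid and legend mentioned above. The sales figures and the output file name are hypothetical.

```python
import matplotlib
matplotlib.use("Agg")  # render off-screen; drop this line for an interactive window
import matplotlib.pyplot as plt

# Hypothetical monthly sales for two years.
months = ["Jan", "Feb", "Mar", "Apr"]
sales_2022 = [10, 12, 9, 15]
sales_2023 = [11, 14, 13, 18]

fig, ax = plt.subplots()
ax.plot(months, sales_2022, marker="o", label="2022")
ax.plot(months, sales_2023, marker="s", label="2023")
ax.set_title("Monthly sales")   # chart title
ax.set_xlabel("Month")          # axis titles
ax.set_ylabel("Units sold")
ax.grid(True)                   # background grid
ax.legend()                     # legend built from the label= arguments
fig.savefig("line_plot.png")
```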
seaborn library :-
The seaborn library builds on the power of the matplotlib library to create attractive charts with very few lines of code.
This library offers creative styles and rich color palettes, which make visualization plots more attractive and
modern.
As seaborn is considered to be a higher-level library, it provides certain special visualization tools such as violin
plots, heat maps and time series plots.
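A short sketch of two of the seaborn plots named above, a violin plot and a heat map. It assumes seaborn and pandas are installed; the exam scores and the output file name are hypothetical.

```python
import matplotlib
matplotlib.use("Agg")  # render off-screen; drop this line for an interactive window
import matplotlib.pyplot as plt
import pandas as pd
import seaborn as sns

# Hypothetical exam scores for two groups of students.
scores = pd.DataFrame({
    "group": ["A"] * 5 + ["B"] * 5,
    "math": [55, 60, 65, 70, 75, 70, 75, 80, 85, 90],
    "physics": [50, 58, 66, 69, 80, 65, 72, 78, 88, 91],
})

sns.set_theme(style="whitegrid")  # apply one of seaborn's built-in styles
fig, (ax_violin, ax_heat) = plt.subplots(1, 2, figsize=(10, 4))

# Violin plot: distribution of math scores within each group.
sns.violinplot(data=scores, x="group", y="math", ax=ax_violin)

# Heat map: correlation between the two numeric columns.
sns.heatmap(scores[["math", "physics"]].corr(), annot=True, cmap="coolwarm", ax=ax_heat)

fig.savefig("seaborn_examples.png")
```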
plotly library :-
Plotly is an online platform and Python library for data visualization.
It can be used to make interactive plots that are not readily available in many other Python libraries.
A few such plots are dendrograms, contour plots, and 3D charts.
Besides these, basic visualization graphs such as area charts, bar charts, box plots, histograms,
polar charts, and bubble charts can also be created using the plotly library.

 Basic Data Visualization tools.

The common basic visualization tools often used by analysts for data analysis are histograms, bar charts/graphs,
scatter plots, line charts, area plots, pie charts, and donut charts.
i. A histogram is a graphical display of data using bars of different heights.
It displays the shape and spread of continuous sample data.
ii. A bar chart has rectangular bars whose lengths are proportional to the values they represent.
iii. A scatter plot is a two-dimensional plot used to observe and display relationships between two variables.
iv. A line chart is a graph used to represent continuous data points joined by a line.
v. An area plot or area chart is similar to a line chart, except that the area between the x axis and the line is filled in
with color or shading.
vi. A pie chart, as the name suggests, looks similar to a pie. It is a circular graphic that is divided into slices.
vii. A donut chart is similar to a pie chart, the main difference being that an area at the center is cut out to give
the look of a doughnut.
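Five of the chart types listed above (scatter, line, area, pie and donut) can be sketched side by side with matplotlib. The data values and the output file name are hypothetical.

```python
import matplotlib
matplotlib.use("Agg")  # render off-screen; drop this line for an interactive window
import matplotlib.pyplot as plt

# Hypothetical data.
x = [1, 2, 3, 4, 5]
y = [2, 4, 3, 6, 5]
labels = ["A", "B", "C"]
shares = [50, 30, 20]

fig, axes = plt.subplots(1, 5, figsize=(18, 3.5))

axes[0].scatter(x, y)
axes[0].set_title("Scatter plot")

axes[1].plot(x, y)
axes[1].set_title("Line chart")

axes[2].plot(x, y)
axes[2].fill_between(x, y)  # fill the area between the line and the x axis
axes[2].set_title("Area chart")

axes[3].pie(shares, labels=labels)
axes[3].set_title("Pie chart")

# A donut is a pie whose wedges are narrower than the radius, leaving a hole.
axes[4].pie(shares, labels=labels, wedgeprops={"width": 0.4})
axes[4].set_title("Donut chart")

fig.savefig("basic_charts.png")
```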

 Explain the concept of Geospatial data, its libraries and tools.


Geospatial data, also referred to as spatial data, GIS data or geodata, consists of numeric data that denotes the
position of a physical object in a geographic coordinate system (latitude, longitude, and elevation).
The physical object can be as small as a building or a street, or as big as a city, a state, or a country. This spatial data
gives us information about the location, size, area, and shape of a physical object.
Libraries used for geospatial data :- 1. shapely, 2. fiona, 3. geopandas, 4. rasterio, 5. gdal.
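As a small illustration of how location, area and shape can be derived from coordinates, the sketch below uses shapely, the first library in the list. It assumes shapely is installed; the building footprint and the point are hypothetical (longitude, latitude) pairs.

```python
from shapely.geometry import Point, Polygon

# Hypothetical building footprint given as (longitude, latitude) corners.
building = Polygon([(73.85, 18.52), (73.86, 18.52), (73.86, 18.53), (73.85, 18.53)])
visitor = Point(73.855, 18.525)

print(building.area)               # area in squared degrees, not square metres
print(building.centroid)           # location of the centre of the footprint
print(building.bounds)             # (min lon, min lat, max lon, max lat)
print(building.contains(visitor))  # whether the point lies inside the footprint
```

Because the coordinates are in degrees, the area is not a real-world measurement; the data would first have to be projected to a metric coordinate system.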

 Specialized Data Visualization tools.

A Venn diagram is a visualization tool used to display all possible logical relations among a finite collection of sets.
The treemap visualization tool is mainly used for displaying hierarchical data that can be structured in the form of a
tree.
A 3D scatter plot is one of the most frequently used three-dimensional graphs for comparing three characteristics
of a given dataset.
A wordcloud is a visualization tool for understanding and determining patterns and evolving trends in text data.
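A minimal sketch of a 3D scatter plot comparing three characteristics, using matplotlib's built-in 3D projection. The height, weight and age values and the output file name are hypothetical.

```python
import matplotlib
matplotlib.use("Agg")  # render off-screen; drop this line for an interactive window
import matplotlib.pyplot as plt

# Hypothetical dataset: three characteristics recorded per person.
height = [150, 160, 165, 172, 180]
weight = [50, 58, 63, 70, 82]
age = [21, 25, 30, 35, 41]

fig = plt.figure()
ax = fig.add_subplot(projection="3d")  # create a three-dimensional axes
ax.scatter(height, weight, age)
ax.set_xlabel("Height (cm)")
ax.set_ylabel("Weight (kg)")
ax.set_zlabel("Age (years)")
fig.savefig("scatter_3d.png")
```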

 Write a short note on - Advanced Data Visualization Tool.


Wordclouds :-
Most basic visualization tools are applied to numeric data such as integers and floats.
In many applications, however, we deal with large amounts of text data.
For example, social media posts, comments, user feedback, and messages from messaging apps or Twitter
are in the form of text.
A wordcloud is a visualization tool for understanding and determining patterns and evolving trends in text data.
Each word in the cloud has a variable font size and color tone. This representation helps to identify the words
of prominence.
A bigger font size indicates that a word is more prominent relative to the other words in the cloud.
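The link between word frequency and font size can be sketched in plain Python. This only illustrates the principle and is not a full wordcloud renderer (a dedicated package would also handle layout and color); the feedback text, the stop-word list and the font size range are hypothetical.

```python
import re
from collections import Counter

# Hypothetical user feedback.
text = """
Great phone, great camera and great battery.
The camera is sharp but the battery drains fast.
Battery life could be better.
"""

STOP_WORDS = {"the", "is", "and", "but", "be", "could", "a"}
MIN_SIZE, MAX_SIZE = 10, 40  # assumed font size range, in points

# Split into lowercase words and drop common words that carry little meaning.
words = [w for w in re.findall(r"[a-z]+", text.lower()) if w not in STOP_WORDS]
counts = Counter(words)
top = counts.most_common(1)[0][1]  # frequency of the most common word

# Scale the font size linearly with frequency: the most frequent word gets MAX_SIZE.
font_sizes = {
    word: MIN_SIZE + (MAX_SIZE - MIN_SIZE) * (n - 1) / max(top - 1, 1)
    for word, n in counts.items()
}

for word, size in sorted(font_sizes.items(), key=lambda item: -item[1]):
    print(f"{word:<10} count={counts[word]}  font size={size:.0f}")
```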

 What is a Venn diagram? How to create it? Explain with example.


A Venn Diagram (also called primary diagram, set diagram or logic diagram) is a diagram that shows all possible
logical relationships between a finite collection of different sets.
In a Venn diagram, each set is represented by a circle.
In area-proportional Venn diagrams, the size of a circle reflects the size or importance of the group.
The circles may overlap where the sets share values (their intersection).
The overlapping region, usually shown in a different color, depicts the similarity between the sets, while the
non-overlapping areas depict the differences.
A Venn diagram is usually plotted for two or three sets. If more than three sets are involved, the diagram becomes
difficult to interpret.
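One possible way to create a two-set Venn diagram is sketched below. It uses Python sets to work out the regions and plain matplotlib circles to draw them, which is an assumption made for this example rather than a dedicated Venn diagram library; the student names, course names and file name are hypothetical.

```python
import matplotlib
matplotlib.use("Agg")  # render off-screen; drop this line for an interactive window
import matplotlib.pyplot as plt
from matplotlib.patches import Circle

# Hypothetical sets: students enrolled in two courses.
python_course = {"Asha", "Ravi", "Meena", "John"}
stats_course = {"Meena", "John", "Priya"}

# The three regions of a two-set Venn diagram.
only_python = python_course - stats_course
only_stats = stats_course - python_course
both = python_course & stats_course

fig, ax = plt.subplots()
# Two overlapping, semi-transparent circles; the overlap shows a blended color.
ax.add_patch(Circle((-0.5, 0), 1, color="tab:blue", alpha=0.4))
ax.add_patch(Circle((0.5, 0), 1, color="tab:orange", alpha=0.4))

# Label each region with its members.
ax.text(-1.0, 0, "\n".join(sorted(only_python)), ha="center", va="center")
ax.text(0, 0, "\n".join(sorted(both)), ha="center", va="center")
ax.text(1.0, 0, "\n".join(sorted(only_stats)), ha="center", va="center")
ax.text(-0.5, 1.1, "Python", ha="center")
ax.text(0.5, 1.1, "Statistics", ha="center")

ax.set_xlim(-1.7, 1.7)
ax.set_ylim(-1.3, 1.4)
ax.set_aspect("equal")  # keep the circles round
ax.axis("off")
fig.savefig("venn.png")
```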

 What is a Histogram and a Bar Chart? How to create them? What is the difference between them?
Difference between Histogram and Bar chart
A bar graph looks similar to a histogram, since both consist of a set of bars based on the data, but there are some
major differences between them.
One is that there are gaps between the bars in a bar chart, whereas in a histogram the bars are placed adjacent to
each other.
Another is that a histogram displays the frequency of numerical data, while a bar chart uses bars to compare
different categories of data.
A histogram is a graphical display of data using bars of different heights. A histogram displays the shape and spread
of continuous sample data.
A bar chart has rectangular bars in which the lengths are proportional to the values which are represented.
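Both charts can be created with matplotlib, as sketched below, which also makes the difference visible: the histogram bins numerical values into adjacent bars, while the bar chart draws one separated bar per category. The heights, fruit names and file name are hypothetical.

```python
import matplotlib
matplotlib.use("Agg")  # render off-screen; drop this line for an interactive window
import matplotlib.pyplot as plt

# Hypothetical data: numerical heights and categorical sales counts.
heights_cm = [150, 152, 155, 158, 160, 161, 163, 165, 165, 168, 170, 172, 175, 180, 185]
fruits = ["Apple", "Banana", "Cherry"]
units_sold = [30, 45, 12]

fig, (ax_hist, ax_bar) = plt.subplots(1, 2, figsize=(10, 4))

# Histogram: the values are grouped into 4 bins and drawn as adjacent bars.
counts, bin_edges, _ = ax_hist.hist(heights_cm, bins=4, edgecolor="black")
ax_hist.set_title("Histogram")
ax_hist.set_xlabel("Height (cm)")
ax_hist.set_ylabel("Frequency")

# Bar chart: one bar per category, with gaps, and length proportional to the value.
ax_bar.bar(fruits, units_sold, width=0.6)
ax_bar.set_title("Bar chart")
ax_bar.set_xlabel("Fruit")
ax_bar.set_ylabel("Units sold")

fig.savefig("hist_vs_bar.png")
```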
