Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
0% found this document useful (0 votes)
22 views

Understanding Data

Uploaded by

lathak ECP1004
Copyright
© © All Rights Reserved
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
22 views

Understanding Data

Uploaded by

lathak ECP1004
Copyright
© © All Rights Reserved
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 8

UNDERSTANDING

DATA

BY:
MANO RAJ
 introduction to data
• Data refers to any information that can be processed or
analyzed to generate meaningful insights. It is a fundamental
concept in computing, research, statistics, and everyday
decision-making.

Data Sources
• Data can come from various sources, such as:
• Primary Data: Data collected directly by the researcher
through methods like surveys, experiments, or interviews.
• Secondary Data: Data that has been collected by someone
else and is available for use, such as government reports,
 Data Representation
• Data can be represented in multiple formats, including:
• Textual Data: Plain text, such as written descriptions.
• Numerical Data: Numbers representing quantities or measurements.
• Graphical Data: Visual representations like charts, graphs, or plots.
• Multimedia Data: Images, videos, and audio files.

 Importance of Data
• Data is crucial because it enables decision-making based on evidence rather than
assumptions. It is used in various fields:
• Business: For market analysis, customer behavior, and strategic planning.
• Healthcare: To track patient information, conduct clinical trials, and improve
treatments.
• Science and Research: For hypothesis testing, analyzing trends, and drawing
conclusions.
• Technology: In machine learning, data is the foundation for training models and
 TYPES OF DATA
• Structured data
• Structured data refers to any data that is organized in a
defined format, making it easily searchable and analyzable
by computers. It follows a specific schema or structure, such
as tables with rows and columns in a relational database.
This type of data is highly organized and can be easily stored,
processed, and retrieved.
• Examples
Spreadsheets:of Structured
Data Data
arranged in rows and columns in tools like Microsoft Excel or
Google Sheets.

• Data in XML or JSON Format: While primarily used for data interchange, they have
structured formats
Unstructured Data
Definition
• Unstructured data lacks a specific format or structure and is
often qualitative rather than quantitative. It does not fit
neatly into rows and columns like structured data. Instead, it
may come in various forms and require advanced techniques
for analysis and processing.
Examples of Unstructured Data
• Textual Data: Emails, social media posts, articles, reports,
and customer reviews.
• Multimedia Data: Images, videos, audio recordings, and
graphical content.
• Web Content: Web pages, blogs, and forums.
MEASURES OF CENTRAL TENDENCY
• Mean
• Definition: The sum of all values in a dataset divided by the
total number of values.
Formula:

Example: In the dataset [3,5,7,9,11][3, 5, 7, 9, 11][3,5,7,9,11],


the mean is:
3+5+7+9+11/5=7
• Median
• Definition: The middle value in an ordered dataset. If there is an even
number of values, the median is the average of the two middle numbers.
• Example: For the dataset [3,5,7,9,11], the median is 7.
For the dataset [3,5,7,9], the median is:
5+7/2=6
Mode
• Mode is a measure of central tendency that represents the value or
values that appear most frequently in a dataset. Examples of Mode:
1.Single Mode (Unimodal)
1.Dataset: [3,5,7,7,9]
2.Mode: 7 (because 7 appears most frequently)
2.Multiple Modes (Bimodal)
1.Dataset: [1,2,2,3,3,4]
2.Modes: 2 and 3 (both appear twice, which is the highest frequency)
Thank you

You might also like