
Introduction to Data Science: Maths, Business, Technology, Programming (Python, SAS, R)

The document introduces data science including the roles of data scientists, common tools and technologies used, and an overview of the data science process from data collection and cleaning to modeling, visualization, and deployment. Sample data on monthly sales for different cities is shown to illustrate concepts.

Introduction to Data Science

Maths         Statistics, Linear Algebra, Probability
Business      Domain knowledge of the system
Technology    Programming: Python, SAS, R

Business domain + scientific methods + technology = Data Science

Tables      Columns           Rows
Datasets    Fields (names)    Observations
            Variables

Example field: ice cream shop name (values: abc, def)

Roles of Data Scientists

Collect data
Explore
Analyze
Visualize

Data science is the art of making decisions using historical or live data.

Data Science -> Extracting knowledge + insights = decisions

Excel, txt files


Month    New York    San Francisco        Month    New York
Jan      35000       42000                May      35670
Feb      33000       42560                June     25300
Mar      31000       39789                July     26500
April    32450       41000                Aug      23400

Master data
Month    New York    San Francisco
Jan      35000       42000
Feb      33000       42560
Mar      31000       39789
April    32450       41000
May      35670       36500
June     25300       36660
July     26500       35000
Aug      23400       37800
Sep      22100       39800
Oct      42000       41000
Nov      32000       41500
Dec      28567       39765
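
The master data is the partial source sheets combined into one table. A minimal sketch in plain Python, assuming each piece arrives as a list of (month, value) pairs; the variable and function names are illustrative and not from the original notes.

```python
# Partial sheets as they appear in the source; variable names are illustrative.
ny_part1 = [("Jan", 35000), ("Feb", 33000), ("Mar", 31000), ("April", 32450)]
ny_part2 = [("May", 35670), ("June", 25300), ("July", 26500), ("Aug", 23400)]
ny_part3 = [("Sep", 22100), ("Oct", 42000), ("Nov", 32000), ("Dec", 28567)]
sf_part1 = [("Jan", 42000), ("Feb", 42560), ("Mar", 39789), ("April", 41000)]
sf_part2 = [("May", 36500), ("June", 36660), ("July", 35000), ("Aug", 37800)]
sf_part3 = [("Sep", 39800), ("Oct", 41000), ("Nov", 41500), ("Dec", 39765)]


def build_master(ny_parts, sf_parts):
    """Combine per-city pieces into rows of (month, new_york, san_francisco)."""
    ny = dict(pair for part in ny_parts for pair in part)
    sf = dict(pair for part in sf_parts for pair in part)
    # dicts keep insertion order, so the months stay in calendar order
    return [(month, ny[month], sf[month]) for month in ny]


master = build_master(
    [ny_part1, ny_part2, ny_part3],
    [sf_part1, sf_part2, sf_part3],
)
for row in master:
    print(*row)
```

Looking up San Francisco by month name, rather than by position, means a missing month in one sheet raises an error instead of silently misaligning the rows.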

Apache Hadoop and Apache Spark: big-data storage and processing


Python, SAS and R: data science programming
Tableau, Power BI, MATLAB, Jupyter Notebook: data visualization
scikit-learn, TensorFlow, PyTorch: machine learning and deep learning

Data Science Process:

Understand Business Problem

Data Collection/Acquisition
Flat files, databases, APIs, logs from web servers
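
As an illustration of acquiring data from a flat file, here is a minimal sketch using Python's standard csv module. The file is simulated in memory with io.StringIO, and the column names (month, new_york, san_francisco) are assumptions chosen for this example; the values are the first rows of the sample sales data.

```python
import csv
import io

# Stand-in for a flat file on disk; column names are assumed for this example.
flat_file = io.StringIO(
    "month,new_york,san_francisco\n"
    "Jan,35000,42000\n"
    "Feb,33000,42560\n"
    "Mar,31000,39789\n"
)


def load_sales(handle):
    """Read CSV rows and convert the revenue columns from text to integers."""
    rows = []
    for record in csv.DictReader(handle):
        rows.append({
            "month": record["month"],
            "new_york": int(record["new_york"]),
            "san_francisco": int(record["san_francisco"]),
        })
    return rows


sales = load_sales(flat_file)
print(sales[0])
```

With a real file, the same function could be called as load_sales(open(path, newline="")), where path is wherever the flat file lives.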

Data Preparation/Data Cleaning and Exploring


Variables
Missing values
Duplicates
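
The two cleaning checks listed above, duplicates and missing values, can be sketched in plain Python. The raw records below are hypothetical: a duplicate row and a missing value were introduced on purpose for the example. Filling a gap with the mean is only one possible choice, not a recommendation from the original notes.

```python
# Hypothetical raw records: one exact duplicate and one missing value (None).
raw = [
    {"month": "Jan", "new_york": 35000},
    {"month": "Feb", "new_york": 33000},
    {"month": "Feb", "new_york": 33000},   # duplicate row
    {"month": "Mar", "new_york": None},    # missing value
    {"month": "April", "new_york": 32450},
]


def drop_duplicates(records):
    """Keep the first occurrence of each identical record."""
    seen = set()
    unique = []
    for rec in records:
        key = tuple(sorted(rec.items()))
        if key not in seen:
            seen.add(key)
            unique.append(rec)
    return unique


def fill_missing(records, field):
    """Replace None in `field` with the mean of the known values."""
    known = [r[field] for r in records if r[field] is not None]
    average = sum(known) / len(known)
    return [
        {**r, field: average if r[field] is None else r[field]}
        for r in records
    ]


clean = fill_missing(drop_duplicates(raw), "new_york")
for rec in clean:
    print(rec)
```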

Exploratory Data Analysis/EDA
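
A first EDA pass usually means summary figures per variable. A minimal sketch over the monthly sales master data, using only the standard library; the function name and the choice of statistics (total, mean, best and worst month) are illustrative assumptions.

```python
from statistics import mean

months = ["Jan", "Feb", "Mar", "April", "May", "June",
          "July", "Aug", "Sep", "Oct", "Nov", "Dec"]
new_york = [35000, 33000, 31000, 32450, 35670, 25300,
            26500, 23400, 22100, 42000, 32000, 28567]
san_francisco = [42000, 42560, 39789, 41000, 36500, 36660,
                 35000, 37800, 39800, 41000, 41500, 39765]


def summarize(values):
    """Return total, mean, and the months with the highest and lowest value."""
    best = max(range(len(values)), key=values.__getitem__)
    worst = min(range(len(values)), key=values.__getitem__)
    return {
        "total": sum(values),
        "mean": mean(values),
        "best_month": months[best],
        "worst_month": months[worst],
    }


ny_summary = summarize(new_york)
sf_summary = summarize(san_francisco)
print("New York:", ny_summary)
print("San Francisco:", sf_summary)
```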

Data Modeling
Deep Learning Algorithms
Machine Learning Algorithms

Visualization
QlikView, Tableau, Power BI

Deploy

Environments: Development, Testing, UAT (user acceptance testing)


WhatsApp    Write script    Test the data    1000

Sales per day    Sales per month
3000             90000
1000             30000
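
The two rows above are consistent with multiplying daily sales by a 30-day month (3000 × 30 = 90000). A small sketch of that check; the 30-day month is an assumption inferred from the figures, and the names are illustrative.

```python
DAYS_PER_MONTH = 30  # assumption inferred from the table, not stated in the notes


def monthly_sales(sales_per_day, days=DAYS_PER_MONTH):
    """Scale a daily sales figure up to a monthly one."""
    return sales_per_day * days


for daily in (3000, 1000):
    print(daily, monthly_sales(daily))
```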

Oracle
Month    San Francisco        Month    New York    San Francisco
May      36500                Sep      22100       39800
June     36660                Oct      42000       41000
July     35000                Nov      32000       41500
Aug      37800                Dec      28567       39765

[Charts: Monthly Revenue by month, Jan to Dec, for New York and San Francisco]

Find insights -> business strategy

Production/Live Data
Auto Sales
Customer ID    Sex       Age    Car Type    Quantity    Price      Satisfied
1              Male      33     Alto        1           500000     5
2              Female    26     Brezza      1           100000     4
3              Male      56     Alto        1           500000     1
4              Male      34     Alto        2           500000     5
5              Female    28     Ertiga      1           1400000    1
6              Male      32     Brezza      1           100000     1
7              Female    36     Ertiga      2           1400000    4
8              Male      42     Ertiga      1           1400000    3
9              Male      29     Alto        3           500000     4
10             Male      45     Brezza      1           100000     4
11             Female    39     Ertiga      2           1400000    5
12             Male      58     Brezza      1           100000     4
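
One way to draw insights from the Auto Sales table is to group it by car type. The sketch below rests on two assumptions not stated in the notes: Price is a per-unit price (so revenue = Quantity × Price), and Satisfied is a rating where higher is better. Function and field names are illustrative.

```python
from collections import defaultdict

# Rows: (customer_id, sex, age, car_type, quantity, price, satisfied)
auto_sales = [
    (1, "Male", 33, "Alto", 1, 500000, 5),
    (2, "Female", 26, "Brezza", 1, 100000, 4),
    (3, "Male", 56, "Alto", 1, 500000, 1),
    (4, "Male", 34, "Alto", 2, 500000, 5),
    (5, "Female", 28, "Ertiga", 1, 1400000, 1),
    (6, "Male", 32, "Brezza", 1, 100000, 1),
    (7, "Female", 36, "Ertiga", 2, 1400000, 4),
    (8, "Male", 42, "Ertiga", 1, 1400000, 3),
    (9, "Male", 29, "Alto", 3, 500000, 4),
    (10, "Male", 45, "Brezza", 1, 100000, 4),
    (11, "Female", 39, "Ertiga", 2, 1400000, 5),
    (12, "Male", 58, "Brezza", 1, 100000, 4),
]


def summarize_by_car(rows):
    """Units sold, revenue, and average satisfaction per car type."""
    units = defaultdict(int)
    revenue = defaultdict(int)
    ratings = defaultdict(list)
    for _cid, _sex, _age, car, quantity, price, satisfied in rows:
        units[car] += quantity
        revenue[car] += quantity * price  # assumes Price is per unit
        ratings[car].append(satisfied)
    return {
        car: {
            "units": units[car],
            "revenue": revenue[car],
            "avg_satisfaction": sum(ratings[car]) / len(ratings[car]),
        }
        for car in units
    }


by_car = summarize_by_car(auto_sales)
for car, stats in by_car.items():
    print(car, stats)
```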
