Advanced Analytics Using SAS

The document summarizes various SAS procedures for fundamental and advanced analytics including PROC MEANS, PROC UNIVARIATE, PROC FREQ, PROC CORR, PROC REG, and PROC SQL. These procedures allow for descriptive statistics, frequency tables, correlation analysis, linear regression modeling, and querying SAS data using SQL syntax.

Uploaded by

Arjun Khosla

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

54 views

Advanced Analytics Using SAS

Uploaded by

Arjun Khosla

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 14

Fundamental & Advanced

Analytics
Using SAS
SAS Procedures for Fundamental & Advanced
Analytics
› PROC MEANS
› PROC UNIVARIATE
› PROC FREQUENCY
› PROC CORR
› PROG REG
› PROC SQL
› Prints Descriptive Statistics
› Without any options, prints for all numeric
PROC MEANS variables in the data set
– No of Non-missing observations,
– mean,
SYNTAX – standard deviation,
– minimum and
PROC MEANS data=XYZ;
– maximum.
RUN; › With Options presents the opted measures
› Computing Statistics for Each Value- BY
Variable
– Pre-requisite the dataset must be sorted on the
BY variable
› CLASS –
– Substitute for BY,
– Sorted dataset not needed
PROC MEANS- Options
Option Description

N No of Non-missing Observations used to compute the Statistics

NMISS No of Missing Observations

MEAN The Mean

STD The Standard Deviation

CV The Coefficient of Variation

CLM The 95% confidence interval for the mean

STDERR The Standard Error

MIN The Minimum Value

MAX The Maximum Value

MEDIAN The Median

MAXDEC=n The Maximum no of Decimal Places in all table values

Example Code
PROC Sort DATA= SASHELP.SHOES PROC MEANS DATA= SASHELP.SHOES n
OUT=Sorted_Shoes; nmiss mean std;
BY REGION; CLASS REGION;
RUN; VAR STORES SALES INVENTORY;
PROC MEANS DATA= Sorted_Shoes n RUN;
nmiss mean std;
Including Multiple CLASS Variables
BY STORES;
PROC MEANS DATA= SASHELP.SHOES n
VAR STORES SALES INVENTORY; nmiss mean std;
RUN; CLASS REGION PRODUCT;
VAR STORES SALES INVENTORY;
RUN;
PRINTALLTYPES
› Used with Multiple CLASS Variables.
› Outputs Statistics broken down by every combination of CLASS
Variables

Sample Code:

PROC MEANS DATA= SASHELP.SHOES n nmiss mean std PRINTALLTYPES;

CLASS REGION PRODUCT;
VAR STORES SALES INVENTORY;
RUN;
› Similar to PROC MEANS
PROC
UNIVARIATE › Also produces Histograms & Probability
Plots.
› Options:
Syntax: › Histogram: Generates Histogram of all
variables on the VAR statement
Proc UNIVARIATE DATA= XYZ; › Qqplot: Produces quantile-quantile plot to
ID Var;
determine deviations from normality.
– Option NORMAL draws a straight line representing
Var v1 v2 v3; what a normal distribution would look like on the
Histogram; plot.
– mu(Mean) sigma(standard deviation) for theoretical
Qqplot /normal (mu=est normal plot.
sigma=est); – Option est helps get data to request these.
RUN;
One way Frequency Tables
PROC FREQ data= SASHELP.SHOES;
Tables Region product;
PROC FREQ Run;
Option: NOCUM: Eliminates cumulative frequencies
Generates Frequency Tables
PROC FREQ data= SASHELP.SHOES;
• One-way, Tables Region product /nocum;
Run;
• Two-way And
• Three-way
Two way & Three Way Frequency Tables
PROC FREQ data= SASHELP.SHOES;
Tables REGION * product; Two Way
Tables REGION * product * Sales;
Three Way
Run;
Region as rows and Product as columns

Option: Chisq: Chi square tables added in output

PROC FREQ data= SASHELP.SHOES;

Tables REGION * product / chisq; Run;
› Correlation Analysis of all SAS Variables
PROC CORR with each other.

Syntax › If variables are specified then their

correlation with each other will be
presented.
PROC CORR Data=XYZ;
RUN;

PROC CORR Data=XYZ;

Var v1 v2 v3…;
RUN;
› Models relationship between scalar dependent
PROC REG variable and one or more explanatory variables.
› Syntax 1 for Simple Linear Regression
Syntax 1: › Syntax 2 for Multiple Linear Regression
PROC REG Data=XYZ; Options: OUT, RESIDUAL, P
MODEL Var1=Var2 OUTPUT OUT=res RESIDUAL=resid P=pred
RUN; › OUT: For sending output to New dataset instead of
screen
› RESIDUAL: Residual Value
Syntax 2:
› P: Predicted Value
PROC REG Data=XYZ;
MODEL Var1=Var2 Var2 Var3
…; clm prints 95% confidence intervals for mean of each obs

RUN; cli prints 95% prediction intervals

PROC SQL › SAS offers extensive support to SQL by
using SQL queries inside SAS programs.

Syntax › Most of the ANSI SQL syntax is

supported.
› PROC SQL is used to process the SQL
PROC SQL;
statements.
SELECT Columns
› This procedure can
FROM TABLE – gives back the result of an SQL query,
WHERE Columns – can create SAS tables & variables.
GROUP BY Columns
;
QUIT;
Running SQL Commands
CREATING TABLES READING DATA

PROC SQL; PROC SQL;

CREATE TABLE EMPLOYEES AS SELECT make, model, type, invoice, horsepower

SELECT * FROM TEMP; FROM SASHELP.CARS;

QUIT; QUIT;

PROC PRINT data = EMPLOYEES; UPDATING DATA

RUN; PROC SQL;
UPDATE EMPLOYEES2 SET SALARY=SALARY*1.25;
WHERE CLAUSE QUIT;
PROC SQL; PROC PRINT data = EMPLOYEES2; RUN;
SELECT make, model, type, invoice,
horsepower DELETING DATA
FROM SASHELP.CARS PROC SQL;
Where make = 'Audi‘ and Type = 'Sports'; DELETE FROM EMPLOYEES2 WHERE SALARY >
900; QUIT;
QUIT;
PROC PRINT data = EMPLOYEES2; RUN;
INTCK Function
› Counts number of Intervals between two dates or times.
› SYNTAX:
INTCK(‘Interval’, From, To)

› INTERVAL may be
– ‘YEAR’, ‘SEMIYEAR’,
– ‘MONTH’, ‘SEMIMONTH’, ‘QTR’,
– ‘DAY’, ‘WEEKDAY’, ‘TENDAY’.
› Selects a Sample from a population.
PROC
SURVEYSELECT › Options:
› OUT=
– output data set that contains the sample.
PROC SURVEYSELECT
Data=XYZ <options>; › METHOD=
– Sample selection method.
STRATA variables;
– Default is simple random sampling (METHOD=SRS) with
CONTRAL variables; no SIZE statement.
– With SIZE statement, default is probability proportional to
SIZE variable; size without replacement (METHOD=PPS)
ID variables; › SAMPSIZE= number for sample size
› STRATA partitions input data set into nonoverlapping
groups
› ID lists variables from the input data set to be included
in the output data set else all variables inlcuded

PHC 6052 SAS Skills
No ratings yet
PHC 6052 SAS Skills
52 pages
SAS Interview Questions
100% (1)
SAS Interview Questions
40 pages
Unit Iii Sas Procedures
No ratings yet
Unit Iii Sas Procedures
27 pages
SAS Info 2
No ratings yet
SAS Info 2
4 pages
Chapter 6 - Evaluating Quantitative Data
No ratings yet
Chapter 6 - Evaluating Quantitative Data
21 pages
Proe Summary: A Powerful Exploratory Data Analysis Tool: Systems Seminar Consultants, Kalamazoo, MI
No ratings yet
Proe Summary: A Powerful Exploratory Data Analysis Tool: Systems Seminar Consultants, Kalamazoo, MI
10 pages
W3 Syntax Review
No ratings yet
W3 Syntax Review
4 pages
Proc Univariate HTML
No ratings yet
Proc Univariate HTML
1 page
Summary Syntax SAS
No ratings yet
Summary Syntax SAS
6 pages
Using PROC SGPLOT For Quick High-Quality Graphs
No ratings yet
Using PROC SGPLOT For Quick High-Quality Graphs
17 pages
Quick Reference: SAS Programming 1: Essentials
No ratings yet
Quick Reference: SAS Programming 1: Essentials
10 pages
DP01_06
No ratings yet
DP01_06
12 pages
base-programming-ref-sheet
No ratings yet
base-programming-ref-sheet
4 pages
PROC_Statements
No ratings yet
PROC_Statements
2 pages
EPG1V2_Summary of Lesson 3_ Exploring and Validating Data
No ratings yet
EPG1V2_Summary of Lesson 3_ Exploring and Validating Data
3 pages
SRME2 D
No ratings yet
SRME2 D
10 pages
SAS Procedures
No ratings yet
SAS Procedures
8 pages
HSPICEQuick Ref
No ratings yet
HSPICEQuick Ref
86 pages
SAS Introduction To Time Series Forecasting-Libre
No ratings yet
SAS Introduction To Time Series Forecasting-Libre
34 pages
Grid Search CV
No ratings yet
Grid Search CV
5 pages
6_Workflow
No ratings yet
6_Workflow
11 pages
it6312_dbms_manual
No ratings yet
it6312_dbms_manual
80 pages
Histograms Pre-12C and Now: Anju Garg
No ratings yet
Histograms Pre-12C and Now: Anju Garg
34 pages
Introduction To Sas: Reading Assignment: Selected Sas Documentation For Bios111 Part 1: Introduction To SAS Software
No ratings yet
Introduction To Sas: Reading Assignment: Selected Sas Documentation For Bios111 Part 1: Introduction To SAS Software
22 pages
SAS Programming For Data Mining: AUC Calculation Using Wilcoxon Rank Sum Test
No ratings yet
SAS Programming For Data Mining: AUC Calculation Using Wilcoxon Rank Sum Test
8 pages
Control Flow - Looping
No ratings yet
Control Flow - Looping
18 pages
CHAPTER 7 (SAS Session) 2023
No ratings yet
CHAPTER 7 (SAS Session) 2023
137 pages
CRM Cheat Sheet
No ratings yet
CRM Cheat Sheet
7 pages
Journal
No ratings yet
Journal
35 pages
Hspice Quick Ref
No ratings yet
Hspice Quick Ref
86 pages
Topic: Generating Reports
No ratings yet
Topic: Generating Reports
15 pages
SAS Slides 5: Functions
No ratings yet
SAS Slides 5: Functions
6 pages
Using DSR in ns2: Rishi Sinha
No ratings yet
Using DSR in ns2: Rishi Sinha
15 pages
5GRAPHS
No ratings yet
5GRAPHS
14 pages
Chapter2 35
No ratings yet
Chapter2 35
41 pages
OLAP Functions Part 1
No ratings yet
OLAP Functions Part 1
41 pages
Overview of Validating and Cleaning Data
No ratings yet
Overview of Validating and Cleaning Data
5 pages
Proc Means
No ratings yet
Proc Means
22 pages
SAS Presentation
No ratings yet
SAS Presentation
35 pages
Top Down Modeling and Test Bench Development: Verification Case Study: Pipeline ADC
No ratings yet
Top Down Modeling and Test Bench Development: Verification Case Study: Pipeline ADC
54 pages
SQR
100% (1)
SQR
81 pages
SAS Manipulate Datasets
No ratings yet
SAS Manipulate Datasets
32 pages
IBM DB2 To PostgreSQL Migration - SQLines Tools
No ratings yet
IBM DB2 To PostgreSQL Migration - SQLines Tools
5 pages
Chapter2 1
No ratings yet
Chapter2 1
41 pages
Lecture 8: SQL Programming and Transactions: Friday, January 24, 2003
No ratings yet
Lecture 8: SQL Programming and Transactions: Friday, January 24, 2003
28 pages
Dbmslabmanual
No ratings yet
Dbmslabmanual
81 pages
TEC BAS 10 - ABAP Performance Tips & Tricks - v2003
No ratings yet
TEC BAS 10 - ABAP Performance Tips & Tricks - v2003
26 pages
Dbms Unit II Plsql Pnr 2 of 2
No ratings yet
Dbms Unit II Plsql Pnr 2 of 2
81 pages
3 Singlerowfun
No ratings yet
3 Singlerowfun
30 pages
Proc
No ratings yet
Proc
85 pages
Descriptive Statistics Using SAS
No ratings yet
Descriptive Statistics Using SAS
10 pages
System Verilog Classes
No ratings yet
System Verilog Classes
105 pages
23bce0140 VL2024250105412 Ast05
No ratings yet
23bce0140 VL2024250105412 Ast05
9 pages
Unit4 - PL - SQL Oracle 2022
No ratings yet
Unit4 - PL - SQL Oracle 2022
49 pages
DB2 Application Programming
No ratings yet
DB2 Application Programming
45 pages
MSC - ProCOR 2006 User's Guide
No ratings yet
MSC - ProCOR 2006 User's Guide
224 pages
SAS Chapter 10
No ratings yet
SAS Chapter 10
5 pages
PSLP Lab File
No ratings yet
PSLP Lab File
30 pages
Random Sample Consensus: Robust Estimation in Computer Vision
From Everand
Random Sample Consensus: Robust Estimation in Computer Vision
Fouad Sabry
No ratings yet
Técnicas Estadísticas para la Ciencia de Datos a través de R. Aprendizaje Supervisado: Análisis Discriminante, Árboles de Decisión, Redes Neuronales y Modelos Lineales Generalizados
From Everand
Técnicas Estadísticas para la Ciencia de Datos a través de R. Aprendizaje Supervisado: Análisis Discriminante, Árboles de Decisión, Redes Neuronales y Modelos Lineales Generalizados
César Pérez López
No ratings yet
Pom Ndim
No ratings yet
Pom Ndim
30 pages
Sampling Fundamentals Modified
No ratings yet
Sampling Fundamentals Modified
45 pages
Operations Management: Dr. Mashkur Zafar
No ratings yet
Operations Management: Dr. Mashkur Zafar
30 pages
Operations Management Week 2 New1
No ratings yet
Operations Management Week 2 New1
112 pages
Working With SAS Dates
No ratings yet
Working With SAS Dates
6 pages
Operations Management Week 1
No ratings yet
Operations Management Week 1
49 pages
In Chapter 4 - Managing Product and Service Innovation Identifies The Following Key Questions
No ratings yet
In Chapter 4 - Managing Product and Service Innovation Identifies The Following Key Questions
18 pages
Chapter-4: Causal/Experimental Research Designs
No ratings yet
Chapter-4: Causal/Experimental Research Designs
34 pages
Data Collection Techniques Modified
No ratings yet
Data Collection Techniques Modified
32 pages
Study Material
No ratings yet
Study Material
88 pages
Study Mat
No ratings yet
Study Mat
34 pages
DPR
No ratings yet
DPR
5 pages
Domestic Investment in India
No ratings yet
Domestic Investment in India
42 pages
Ipo 1
No ratings yet
Ipo 1
4 pages
Advanced - Linear Regression
No ratings yet
Advanced - Linear Regression
57 pages
Ratan Goyal
No ratings yet
Ratan Goyal
6 pages
3 Statement & DCF Model
No ratings yet
3 Statement & DCF Model
17 pages
Cars
No ratings yet
Cars
31 pages
Credit Risk Modeling in R
100% (2)
Credit Risk Modeling in R
66 pages
MELT205 - LessonGuide - InterculturalCompetence - Banda
No ratings yet
MELT205 - LessonGuide - InterculturalCompetence - Banda
9 pages
Reading Response 5
No ratings yet
Reading Response 5
3 pages
Lesson 3 Con - Phil
No ratings yet
Lesson 3 Con - Phil
57 pages
Herpderp1909 Dragons Reworked Part IV - Dragon Hall of Fame
No ratings yet
Herpderp1909 Dragons Reworked Part IV - Dragon Hall of Fame
55 pages
Elijah Budd Resume
No ratings yet
Elijah Budd Resume
1 page
Automated Weighing Software
No ratings yet
Automated Weighing Software
18 pages
How To Use Phrasal Verb Get in English
No ratings yet
How To Use Phrasal Verb Get in English
4 pages
Unit 6 Half_Closed_Eyes_of_the_Buddha_Presentation
No ratings yet
Unit 6 Half_Closed_Eyes_of_the_Buddha_Presentation
13 pages
722875-june-2025-zone-2-timetable
No ratings yet
722875-june-2025-zone-2-timetable
13 pages
Ca Syllabus 2013 2014
No ratings yet
Ca Syllabus 2013 2014
4 pages
50 Hadith
No ratings yet
50 Hadith
10 pages
Leave Management System Web Application
No ratings yet
Leave Management System Web Application
6 pages
Blumea Balsamifera
No ratings yet
Blumea Balsamifera
4 pages
Ict Non Cs Sylb MQP QB
No ratings yet
Ict Non Cs Sylb MQP QB
4 pages
Segment Tree
No ratings yet
Segment Tree
36 pages
Speech Writing
No ratings yet
Speech Writing
2 pages
MBSE - Practical Use and Applications
No ratings yet
MBSE - Practical Use and Applications
37 pages
Discuss The File Documentation
No ratings yet
Discuss The File Documentation
3 pages
Palestine Test
No ratings yet
Palestine Test
2 pages
Literary Analysis Checklist
No ratings yet
Literary Analysis Checklist
1 page
Memory Management
No ratings yet
Memory Management
21 pages
The Linguistic Development of Genie, Susan Curtiss
No ratings yet
The Linguistic Development of Genie, Susan Curtiss
28 pages
Wilfred Owen
No ratings yet
Wilfred Owen
8 pages
White Paper c11 556985
No ratings yet
White Paper c11 556985
58 pages
CHAPTER 08 (Practice Questions Q.P)
No ratings yet
CHAPTER 08 (Practice Questions Q.P)
14 pages
1.3 Possessive Adjectives BBL
No ratings yet
1.3 Possessive Adjectives BBL
9 pages
Ctevt Exam 2079
No ratings yet
Ctevt Exam 2079
2 pages
logcat_1742457701861
No ratings yet
logcat_1742457701861
34 pages
Instructions for Matsutec AIS HA-102 Config
No ratings yet
Instructions for Matsutec AIS HA-102 Config
9 pages
A Survey of Sassanian Silver Coins Found in China
No ratings yet
A Survey of Sassanian Silver Coins Found in China
2 pages