Mgmt 469

Helpful Stata Commands

This document contains virtually all the Stata commands you will ever need. You may find it
helpful to experiment with them to move more rapidly down the learning curve.

File Manipulation Commands

set mem xxm Allocates xx megabytes of RAM for use by Stata. Must be done before you load
your data.

set mem xxm, permanently Always assigns xx megabytes of RAM when you start Stata. You never
need to perform this command again.

set matsize yyy “Reshapes” the available RAM to permit more variables into your model. The
default for Intercooled Stata is 40 variables. You will probably need more than this for the ISP
project.

use filename Reads in the Stata formatted file called filename.dta. Alternatively, you can use
the “open file” button in the usual way.

insheet using textfile.txt Use this if you have created a tab delimited text file in Excel.
Converts the file called textfile.txt into a Stata data file.

infile var1 var2 var3 using textfile.dat Reads in a plain text file named textfile.dat. In this
example, the plain text file has three variables, named var1 var2 and var3.

save filename Saves a Stata data set called filename.dta.

save filename, replace Replaces an existing Stata data set with a new version. Use this to save
a file that you have already been working on. Tip: Save an original version of your data.

clear Clears the workspace to permit you to enter new data.

drop var1 var2 var3 Drops these three variables from the data

keep var4 var5 var6 Keeps these three variables (and drops the rest!)

sort varname Sorts the data set in ascending order by the variable varname. (E.g., the data set
yogurtsmall.dta has been sorted by week. The first week of data appears in the first row, etc.)
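
As a rough illustration of how these file commands fit together, here is a minimal sketch. It assumes the example data set auto that ships with Stata (loaded with sysuse, a command not covered above) and a hypothetical output file name, auto_small; neither comes from the course materials.

```stata
* Load a built-in example data set, clearing anything already in memory
sysuse auto, clear

* Keep three variables and drop the rest
keep make price mpg

* Sort in ascending order of price
sort price

* Save a working copy; replace overwrites any earlier file with this name
save auto_small, replace
```

Because the copy is saved under a new name, the original data set stays untouched.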

Merging Data Sets

merge varname using newfile This merges the active file with another file called the using file.
(The active file is the one you are currently working with. In this example, the using file is
called newfile.) The two data sets are matched to each other using the variable called varname.
Before you merge, you need to sort both data sets by the matching variable.

After merging, you will see a new variable called _merge. Stata created this variable to help you
take stock of the merge. It will take on one of three values:

_merge = 1 if the observation was in the active file data but not in the using data
_merge = 2 if the observation was in the using data but not in the active data
_merge = 3 if the observation was in both data sets.

Prior to merging again, you will need to drop _merge
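
The full sequence (sort both files, merge, inspect _merge, drop _merge) might look like the sketch below. The file name activefile is hypothetical; newfile and varname are the placeholders used above. The merge line follows the syntax given in this document. Stata 11 and later prefer a form such as merge 1:1 varname using newfile, so treat this as a sketch to adapt to your version.

```stata
* Sort the using file by the matching variable and save it
use newfile, clear
sort varname
save newfile, replace

* Sort the active file by the same variable, then merge
use activefile, clear
sort varname
merge varname using newfile

* Take stock of the merge, then drop _merge before merging again
tab _merge
drop _merge
```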

Data Description Commands

label var varname "description of variable that you supply" I strongly recommend that you
label all variables that you create. This will keep you from forgetting what each variable means.

de Describes all the variables. Stata will list all the variables in your data set and give their
properties (such as whether they are alphanumeric). Displays the labels for each variable.

de varname Describes varname, rather than all the variables

de var* Describes all variables that start with the letters var. (The * is a “wildcard”. You can
use * in almost any command to avoid typing in a long list of variables. E.g., su z* will generate
summary statistics for all variables whose names begin with the letter z.)

list varname1 varname2 … Displays the values of the selected variables for all observations
in your data.
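
A short sketch of these description commands, again assuming Stata's built-in auto data set. The variable name price_k and the "in 1/5" qualifier (which limits list to the first five observations) are illustrative choices, not part of the course materials.

```stata
sysuse auto, clear

* Create a variable and label it right away
ge price_k = price/1000
label var price_k "Price in thousands of dollars"

* Describe one variable, then every variable starting with m
de price_k
de m*

* Show two variables for the first five observations only
list make price_k in 1/5
```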

Basic Statistics

su varname1 varname2 varname3 generates basic summary statistics such as mean, variance,
and sample size for a list of variables.

su varname, de generates detailed summary statistics, including median and percentiles of the
distribution, for one variable at a time.

corr varname1 varname2 generates a simple correlation table. You can list as many variables
as you like.

pwcorr varname1 varname2, sig generates a correlation table with significance levels

pwcorr varname1 varname2, star(.05) Puts a * next to correlations that are significant at
p=.05. (You can choose some other significance level, of course.)

tab varname tabulates varname (that is, it lists all the values in ascending order, as well as their
frequency of occurring in the data.)

table varname1 varname2 generates a two-way table. If varname1 takes on M possible values
and varname2 takes on N possible values, you will get an M x N table.

table varname1, c(mean varname2) reports the mean of varname2, broken out by categorical
varname1. For example, suppose you are working with the yogurtall data. If you type
table store, c(mean price1), you will get the mean value of price1 by store. Other statistics
such as median, max, min, and sd (standard deviation) may be substituted for mean.
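
The summary commands above can be tried on any data set. The sketch below assumes Stata's built-in auto data; the variable names price, mpg, weight, and rep78 belong to that data set, not to the yogurt data.

```stata
sysuse auto, clear

* Basic and detailed summary statistics
su price mpg weight
su price, de

* Correlation tables, with significance levels and with stars
corr price mpg weight
pwcorr price mpg, sig
pwcorr price mpg, star(.05)

* Frequency table for a categorical variable
tab rep78
```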

Creating and changing variables

ge newvar = varname1+varname2 Generate a new variable. Almost any mathematical
expression is possible.

replace oldvar=oldvar-2 Change the value of an existing variable

The egen command computes a summary statistic for all observations that belong to a group.
See the last section of this document for more information about egen.

Conditional commands

Use the if statement to execute a command for a subset of the data.

replace oldvar =oldvar*othervar if oldvar <5 Change the value of oldvar only if the initial
value of oldvar is less than 5.

The relational operators for the if statement are ==, >, <, >=, <=, and ~=.

You may combine conditions in the if statement. Note that & is “and”, | is “or”, and remember
that “.” represents missing values.

replace oldvar =oldvar*othervar if oldvar <5&othervar~=. (& is "and".)

replace oldvar =oldvar*othervar if oldvar <5|othervar>10 (| is "or")

The if statement can be used to modify virtually any command. For example:

su varname if varname >100

This will generate summary statistics for varname, but only for those values where varname>100

Note: Stata treats a missing value as larger than any number when it evaluates an if statement,
so a condition such as othervar>10 is also true for observations where othervar is missing.
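
To see how conditions and missing values interact, here is a minimal sketch using Stata's built-in auto data set, in which rep78 is missing for a few cars. The variable name ratio and the count command (which reports how many observations satisfy a condition) are not from the material above and are used only for illustration.

```stata
sysuse auto, clear

* Generate a new variable, then change it for a subset of observations
ge ratio = price/weight
replace ratio = ratio*2 if mpg < 20 & rep78 ~= .

* Summary statistics for a subset
su price if price > 10000

* The first count includes cars with missing rep78, because missing
* is treated as larger than any number; the second excludes them
count if rep78 > 3
count if rep78 > 3 & rep78 ~= .
```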

Regression and related commands

regress depvar predvar1 predvar2 Run a regression. The dependent variable comes first.

regress depvar predvar1 predvar2 if predvar2 >=20 Run the regression on the observations
for which predvar2 is greater than or equal to 20.

You can recover both the predicted values and residuals from your regression. After a
regression, type

predict varname to generate a new variable, varname, which equals the predicted values from
the regression. You can, of course, select any name you wish for this variable.

The predict command also enables you to generate residuals after a regression:

predict resname, residuals

The new variable resname will contain the values of the residuals. (You can, of course choose
any name for this variable that you wish.)

If you wish to generate a set of dummy variables in your model, use the xi: prefix

xi: regress depvar predvar1 predvar2 i.categoricalvar

Stata will generate a set of categorical dummy variables. If categoricalvar takes on N values,
Stata will generate N-1 dummies. Be sure to interpret all the coefficients relative to the omitted
category!

Note that the logic for other regression commands we will learn in class, including logit,
poisson, nbreg, and oprobit, is similar to that for regress.
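
A possible end-to-end run of the regression commands described so far, assuming Stata's built-in auto data set. The names phat and res are arbitrary choices for the new variables.

```stata
sysuse auto, clear

* Regression: the dependent variable comes first
regress price mpg weight

* Predicted values and residuals from that regression
predict phat
predict res, residuals

* Add dummies for each value of the categorical variable rep78;
* one category is omitted and serves as the reference
xi: regress price mpg weight i.rep78
```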

You may want to see if a group of variables belong in a regression. After the regression, type

test varlist where varlist is the list of variables you are testing

A useful interpretation of this test is as follows: “Did the variables collectively add more
predictive power to the model than would have been expected from the same number of random
variables?”

You can also compare the coefficients of different predictors to see if you can reject the null
hypothesis that the coefficients are equal. Try:

test var1 = var2

This tests whether the coefficient on var1 is different from the coefficient on var2.

You can test whether the coefficient on var2 is twice that of var1:

test 2*var1 = var2

or if one coefficient equals the negative of the other:

test var1 = -var2

In fact, you can test any algebraic relationship among coefficients!

Again, the logic is the same for logit and other commands.
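
The test commands can be tried after any regression. This sketch assumes Stata's built-in auto data set; the hypotheses are chosen only to show the syntax and have no substantive meaning.

```stata
sysuse auto, clear
regress price mpg weight length

* Joint test: do weight and length belong in the model?
test weight length

* Tests of relationships among coefficients
test weight = length
test 2*weight = length
test weight = -length
```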

Advanced commands for working with grouped data

1) To create a group level variable but retain the same level of observation

egen newvar = fxn(oldvar), by(groupname)

The egen command allows you to compute a function such as the mean or the minimum, at the
group level. It is best explained by means of an example. Suppose you have wage data on three
firms in each of two markets.

firm market wage

1 Chicago 10
2 Chicago 11
3 Chicago 12
4 Boston 10
5 Boston 12
6 Boston 14

Suppose you want to create a variable that equals the mean wage, by market. If you have
variables named wage and market, then you would type:

egen avgwage=mean(wage), by(market)

Your new data set will look like this:

firm market wage avgwage

1 Chicago 10 11
2 Chicago 11 11
3 Chicago 12 11
4 Boston 10 12
5 Boston 12 12
6 Boston 14 12

In addition to computing the mean, egen allows you to use the following functions: min, max,
median, sum, sd (standard deviation within the group), count (the number of observations
in the group), and many others described in the manual.

When you use the egen command, the number of observations remains unchanged.
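
To reproduce the example above, you could type the six observations in by hand and then run egen. The input command used here to enter the data is not covered in this document; it is used only to make the sketch self-contained.

```stata
* Enter the six observations by hand
clear
input firm str7 market wage
1 Chicago 10
2 Chicago 11
3 Chicago 12
4 Boston 10
5 Boston 12
6 Boston 14
end

* Mean wage by market; all six observations are retained
egen avgwage = mean(wage), by(market)
list
```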

2) To perform a similar group level calculation, but collapse the data so that the unit of
observation becomes the group, you need to perform the collapse command. The syntax is a bit
goofy, so watch carefully!

collapse (mean) avgwage =wage, by(market)

Your new data set will be

market avgwage
Chicago 11
Boston 12

As with egen, you can use many other functions besides mean. Unlike egen, you can compute
several different functions of several different variables, all at the same time. For example,

collapse (mean) avgwage =wage (min) minwage=wage, by(market)

will generate the following data:

market avgwage minwage

Chicago 11 10
Boston 12 10
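
The collapse example can be checked with the same hand-entered data. As before, the input command is an addition used only to make the sketch self-contained.

```stata
* Enter the six observations by hand
clear
input firm str7 market wage
1 Chicago 10
2 Chicago 11
3 Chicago 12
4 Boston 10
5 Boston 12
6 Boston 14
end

* One row per market, with the mean and the minimum wage
collapse (mean) avgwage=wage (min) minwage=wage, by(market)
list
```

Because collapse replaces the data in memory, save your data first if you will need the firm-level observations again.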
