Variance Guide Appendix C Sas
Variance Guide Appendix C Sas
EXAMPLES IN SAS
In SAS, the NCVS sample design must be appropriately specified for all survey
procedures using the varmethod, strata, and cluster statements. The varmethod=taylor statement
is included within the proc statement of each analysis. The following statements must be
included to provide the stratification and PSU variables.
The domain statement is used to limit the analysis to the appropriate set of cases 1. While
all combinations of variables included in the domain statement will be included in the results,
only the results meeting all subpopulation conditions are relevant. For this reason, prior to each
analysis a “sub” variable is created based on all exclusions (e.g. victimizations occurring outside
of the US and outside the year(s) of interest). Including this single recoded variable on the
domain statement simplifies examination of the results.
1
In Section 3, the term analysis domain is used to define subsets of the population based on victims or
victimizations. In SAS, the domain statement is used to identify records to include in analyses. To avoid
confusion between the two uses, the term “analysis class” will replace the term “analysis domain” in
Appendix C.
1
household files, and Example 7 demonstrates the calculation of victimization rates for
victimization characteristics not on the modified files. Examples are included for both single-
and pooled-year estimates.
Code:
data anset;
set adjincident;
sub=(exclude_outUS=0 and dummy=0 and year=2011);
run;
2
Code Comment(s):
a) Specifies input dataset and sample design
b) Subpopulation indicator
c) Specifies stratification
d) Specifies primary sampling units
e) Specifies analysis weight
f) Outcome of interest
Output:
Data Summary
Statistics
3
Example 1B - Total number of violent victimizations by sex, age category, and race/ethnicity,
2011
Year(s): 2011
Crime Type: violent victimizations (vcrime)
Analysis Classes: sex (sex), age category (agecat), and race/ethnicity (race_eth)
Weight: newwgt
Subpopulation: within the United States (exclude_outUS=0); non-dummy records (dummy=0);
2011 (year=2011)
Code:
data anset;
set adjincident;
sub=(exclude_outUS=0 and dummy=0 and year=2011);
run;
Code Comment(s):
a) Analysis class variables (victim characteristics), crossed with
subpopulation indicator
Output:
NOTE: Output is provided overall and by subpopulation. Only the output
under “Domain Analysis: sub” with a “sub” value of 1 is relevant
(highlighted rows).
Data Summary
Statistics
4
Domain Analysis: sub*sex
5
Example 1C - Total number of property victimizations, 2011
Year(s): 2011
Crime Type: property victimizations (pcrime)
Analysis Classes: n/a
Weight: newwgt
Subpopulation: within the United States (exclude_outUS=0); non-dummy records (dummy=0);
2011 (year=2011)
Code:
data anset;
set adjincident;
sub=(exclude_outUS=0 and dummy=0 and year=2011);
run;
Output:
Data Summary
Statistics
6
0 pcrime 68063730 1324122
1 pcrime 17063148 543579
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
7
Example 2: Number of crimes, aggregated years
Examples 2A and 2B are estimates of personal crimes, while Example 2C is an estimate
of a property crime. Examples 2A and 2C provide overall crime estimates and Example 2B
provides estimated totals for a victimization-level characteristic of interest. The code provided
calculates the estimated number of victimization across the pooled years. To calculate the
average number of victimizations per year, estimated totals and standard errors must be divided
by the number of pooled years, as shown in the following examples.
data anset;
set adjincident;
sub=(exclude_outUS=0 and dummy=0 and (2009 <= year) and
(year <=2011)); *a;
run;
Code Comment(s):
a) Subpopulation indicator incorporates the range of years to be
analyzed
Output:
8
The SURVEYMEANS Procedure
Data Summary
Statistics
NOTE: The estimate above represents the total number of aggravated assaults from 2009-
2011. To obtain the average number of aggravated assaults, both the estimate and the standard
error must be divided by the number of pooled years (3), as follows:
2940416
=
3
= 980139
160814
=
3
= 53605
9
Example 2B - Total and average number of aggravated assaults involving firearm, 2009–2011
Year(s): 2009-2011
Crime Type: aggravated assaults (aast)
Analysis Classes: weapon category (weapcat)
Weight: newwgt
Subpopulation: within the United States (exclude_outUS=0); non-dummy records (dummy=0);
2009-2011 ((2009 <= year) and (year <=2011))
Code:
data anset;
set adjincident;
sub=(exclude_outUS=0 and dummy=0 and (2009 <= year) and
(year <=2011));
run;
Output:
Data Summary
Statistics
10
Domain Analysis: sub*weapcat
NOTE: The estimate above represents the total number of aggravated assaults involving a
firearm from 2009-2011. To obtain the average number of aggravated assaults involving a
firearm, both the estimate and the standard error must be divided by the number of pooled years
(3), as follows:
778706
=
3
= 259569
81552
=
3
= 27184
11
Example 2C - Total and average number of household burglaries, 2009-2011
Year(s): 2009-2011
Crime Type: household burglary (hburg)
Analysis Classes: n/a
Weight: newwgt
Subpopulation: Within the United States (exclude_outUS=0); non-dummy records (dummy=0);
2009-2011 ((2009 <= year) and (year <=2011))
Code:
data anset;
set adjincident;
sub=(exclude_outUS=0 and dummy=0 and (2009 <= year) and
(year <=2011));
run;
proc surveymeans data=anset varmethod=taylor sum;
domain sub;
strata yr_grp v2117;
cluster v2118;
weight newwgt;
var hburg;
run;
Output:
Data Summary
Statistics
12
1 HBURG 10201317 336711
Ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
NOTE: The estimate above represents the total number of household burglaries from
2009-2011. To obtain the average number of household burglaries, both the estimate and the
standard error must be divided by the number of pooled years (3), as follows:
10201317
=
3
= 3400439
336711
=
3
= 112237
13
C.2 Victimization Proportions
The SAS surveyfreq procedure is used to estimate the percent distribution of
victimizations in a specific analysis class across covariates from the modified incident-level file.
Subpopulation and covariates of interest are specified on the table statement.
Example 3A - Distribution of female violent crime victims by age category, race/ethnicity, and
victim-offender relationship, 2011
Year(s): 2011
Analysis Classes: female (sex=2); violent victimizations (vcrime=1)
Covariate(s): age category (agecat); race/ethnicity (race_eth); victim-offender relationship
(direl)
Weight: newwgt
Subpopulation: within the United States (exclude_outUS=0); non-dummy records (dummy=0);
2011 (year=2011); female (sex=2); violent victimizations (vcrime=1)
Code:
data anset;
set adjincident;
sub=(exclude_outUS=0 and dummy=0 and year=2011 and sex=2 and
vcrime=1);
run;
Code Comment(s):
a) Specifies input dataset and sample design
b) Specifies stratification
c) Specifies primary sampling units
d) Specifies analysis weight
e) Subpopulation indicator
f) Covariates of interest
14
Output:
Data Summary
Total 100.000
----------------------------------------------
1 12 to 14 7.6428 1.8588
15 to 17 6.2620 1.3309
18 to 20 15.5020 5.3677
21 to 24 9.3116 1.6367
25 to 34 19.3023 2.3616
35 to 49 24.5841 3.0542
50 to 64 12.9129 1.9929
65 or older 4.4822 1.7220
Total 100.000
----------------------------------------------
Total 12 to 14
15 to 17
18 to 20
21 to 24
25 to 34
35 to 49
50 to 64
65 or older
Total
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
Frequency Missing = 1600
15
Total 100.000
------------------------------------------------------------------
1 Non-Hispanic White 59.8656 4.4568
Non-Hispanic Black 19.4289 5.1812
Hispanic 11.6743 1.9826
Non-Hispanic Other 4.4212 1.5083
Non-Hispanic More than One Race 4.6101 1.2872
Total 100.000
------------------------------------------------------------------
Total Non-Hispanic White
Non-Hispanic Black
Hispanic
Non-Hispanic Other
Non-Hispanic More than One Race
Total
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
Frequency Missing = 1600
Total 100.000
------------------------------------------------------------------
1 intimates 23.2306 3.5018
other relatives 9.9822 1.6418
well known/casual acquaintances 35.2831 4.5594
strangers 25.3356 3.1567
do not know relationship 2.7056 0.6099
do not know number of offenders 3.4630 0.9380
Total 100.000
16
------------------------------------------------------------------
Total intimates
other relatives
well known/casual acquaintances
strangers
do not know relationship
do not know number of offenders
Total
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
Frequency Missing = 26335
17
Example 3B - Distribution of property crime victims by household income, MSA status, and
region, 2011
Year(s): 2011
Analysis Classes: property victimizations (pcrime=1)
Covariate(s): household income (hincome); MSA status (msa); region (region)
Weight: newwgt
Subpopulation: within the United States (exclude_outUS=0); non-dummy records (dummy=0);
2011 (year=2011); property victimizations (pcrime=1)
Code:
data anset;
set adjincident;
sub=(exclude_outUS=0 and dummy=0 and year=2011 and pcrime=1);
run;
Output:
Data Summary
18
-----------------------------------------------------
0 Less than $7,500 5.8068 0.3420
$7,500 to $14,999 7.9607 0.3342
$15,000 to $24,999 9.7039 0.3290
$25,000 to $34,999 9.2893 0.3713
$35,000 to $49,999 11.4065 0.3506
$50,000 to $74,999 11.6421 0.3815
$75,000 or more 17.9292 0.4836
Unknown 26.2615 0.7121
Total 100.000
-----------------------------------------------------
1 Less than $7,500 6.4502 0.6755
$7,500 to $14,999 8.6763 0.6591
$15,000 to $24,999 10.2487 0.6577
$25,000 to $34,999 8.8855 0.6006
$35,000 to $49,999 11.7194 0.6256
$50,000 to $74,999 11.4977 0.6959
$75,000 or more 16.4669 0.7300
Unknown 26.0553 1.1284
Total 100.000
-----------------------------------------------------
Total
-----------------------------------------------------
Frequency Missing = 1600
Total 100.000
-------------------------------------------
1 Urban 40.5967 1.5111
Suburban 46.6167 1.6160
Rural 12.7866 1.6281
Total 100.000
19
-------------------------------------------
Total Urban
Suburban
Rural
Total
--------------------------------------------
Frequency Missing = 1600
Total 100.000
--------------------------------------------
Total 100.000
--------------------------------------------
Total Northeast
Midwest
South
West
Total
--------------------------------------------
Frequency Missing = 1600
20
Example 4: Percent distribution of victims across characteristics, aggregated years
Example 4A is an estimate of personal crimes, while Example 4B is an estimate of
property crimes. Both examples specify multiple covariates based on incident-level
characteristics.
Example 4A - Percent of violent victimizations reported and not reported to police by type of
crime, 2009-2011
Year(s): 2009-2011
Analysis Classes: violent victimizations (vcrime=1)
Covariate(s): report status (notify); type of crime (newoff)
Weight: newwgt
Subpopulation: within the United States (exclude_outUS=0); non-dummy records (dummy=0);
2009-2011 ((2009 <= year) and (year <=2011)); violent victimizations (vcrime=1)
Code:
data anset;
set adjincident;
sub=(exclude_outUS=0 and dummy=0 and (2009 <= year) and (year
<=2011) and vcrime=1);
run;
Code Comment(s):
a) The goal is to obtain the distribution of victimizations by
report status for each type of violent crime. Therefore,
newoff*notify is specified on the table statement and row
percents are requested. If the percent distribution of crimes
among those reported or not reported to police were desired,
column percents would be requested.
Output:
NOTE: Output is provided overall and by subpopulation. Only the output
under “Domain Analysis: sub” with a “sub” value of 1 is relevant
(highlighted row).
21
The SURVEYFREQ Procedure
Data Summary
Total 100.000
-------------------------------------------------------------------------
Robbery reported to police 64.8907 3.5035
not reported to police 35.1093 3.5035
do not know . .
Total 100.000
-------------------------------------------------------------------------
Aggravated Assault reported to police 54.4203 5.0909
not reported to police 43.9368 5.0355
do not know 1.6429 0.6685
Total 100.000
-------------------------------------------------------------------------
22
Example 4B - Percent of property victimizations reported and not reported to police by type of
crime, 2009-2011
Year(s): 2009-2011
Analysis Classes: property victimizations (pcrime=1)
Covariate(s): report status (notify); type of crime (newoff)
Weight: newwgt
Subpopulation: within the United States (exclude_outUS=0); non-dummy records (dummy=0);
2009-2011 ((2009 <= year) and (year <=2011)); property victimizations (pcrime=1)
Code:
data anset;
set adjincident;
sub=(exclude_outUS=0 and dummy=0 and (2009 <= year) and (year
<=2011) and pcrime=1);
run;
Output:
NOTE: Output is provided overall and by subpopulation. Only the output
under “Domain Analysis: sub” with a “sub” value of 1 is relevant
(highlighted row).
Data Summary
23
-------------------------------------------------------------------------
Rape & Sexual Assault reported to police 36.1639 4.6125
not reported to police 63.8361 4.6125
do not know . .
Total 100.000
-------------------------------------------------------------------------
Robbery reported to police 63.1051 2.4092
not reported to police 36.4855 2.4511
do not know 0.4093 0.2529
Total 100.000
-------------------------------------------------------------------------
Aggravated Assault reported to police 57.0243 2.6465
not reported to police 41.4185 2.6741
do not know 1.5572 0.6167
Total 100.000
-------------------------------------------------------------------------
Simple Assault reported to police 42.5540 1.3444
not reported to police 56.1714 1.3307
do not know 1.2746 0.2690
Total 100.000
-------------------------------------------------------------------------
Personal Theft reported to police 53.6418 4.1041
not reported to police 44.8037 4.0528
do not know 1.5545 0.9273
Total 100.000
-------------------------------------------------------------------------
Household Burglary reported to police 52.8492 1.3038
not reported to police 45.6522 1.2619
do not know 1.4986 0.5373
Total 100.000
-------------------------------------------------------------------------
Motor Vehicle Theft reported to police 82.4474 1.8638
not reported to police 17.1571 1.8509
do not know 0.3955 0.2299
Total 100.000
-------------------------------------------------------------------------
Theft reported to police 31.8270 0.7299
not reported to police 66.9627 0.7458
do not know 1.2103 0.1756
Total 100.000
-------------------------------------------------------------------------
Total reported to police
not reported to police
do not know
Total
-------------------------------------------------------------------------
24
Table of newoff by notify
Total .
-------------------------------------------------------------------------
Aggravated Assault reported to police . .
not reported to police . .
do not know . .
Total .
-------------------------------------------------------------------------
Simple Assault reported to police . .
not reported to police . .
do not know . .
Total .
-------------------------------------------------------------------------
Personal Theft reported to police . .
not reported to police . .
do not know . .
Total .
-------------------------------------------------------------------------
Household Burglary reported to police 54.5535 1.5082
not reported to police 44.6500 1.4979
do not know 0.7964 0.4070
Total 100.000
-------------------------------------------------------------------------
Motor Vehicle Theft reported to police 83.7377 1.4000
not reported to police 15.9900 1.3898
do not know 0.2723 0.1922
Total 100.000
-------------------------------------------------------------------------
Theft reported to police 31.1678 0.4962
not reported to police 68.1654 0.4969
do not know 0.6668 0.0674
Total 100.000
-------------------------------------------------------------------------
Total reported to police
not reported to police
do not know
Total
-------------------------------------------------------------------------
25
C.3 Victimization Rates
Victimization rates are calculated from the modified person-level file (for personal
crimes) or the modified household-level file (for property crimes) using the SAS surveymeans
procedure. Since the victimization rate is a function of a constant and two variables, the analysis
variable is created as a recode in a data step preceding the procedure. The victimization count for
the victimization type of interest is multiplied by the victimization adjustment factor
(ADJINC_WT), and this product is multiplied by 1,000. This new variable is used as the analysis
variable in surveymeans’ var statement, as demonstrated in the examples below.
The modified person and household files contain the victimization counts needed to
calculate victimization rates for the most common victimization characteristics analyzed using
NCVS data. Examples 5A-5C demonstrate the calculation of personal and property
victimizations rates that can be calculated directly from the modified person and household level
files provided for a single year and Examples 6A-6B demonstrate these calculations for pooled
year estimates. If an analyst wants to calculate a victimization rate for an incident-level
characteristic that is not included on the modified files, preprocessing steps are needed to
calculate victimization summaries from the incident-level file and move these summaries to the
person file (for personal crimes) or the household file (for property crimes). Section 3.2
documents these steps. Examples 7A-7B demonstrate the calculation of personal and property
victimization rates for incident characteristics not included on the modified person and
household files.
26
Example 5: Rate of crime, single year
Example 5A is an estimate of an overall personal victimization rate. Example 5B is an
estimate of a property victimization rate with the inclusion of household-level characteristics.
Example 5C is an estimate of a personal victimization rate, where the characteristics of interest
are incident-level characteristics. All three estimates are computed with variables available on
the modified person and household files.
Code:
data anset;
set adjper;
sub=(year=2011);
vrsast=adjinc_wt*sast*1000; *a;
run;
Code Comment(s):
a) Specifies victimization recode
b) Specifies input dataset and sample design
c) Subpopulation indicator
d) Specifies stratification
e) Specifies primary sampling units
f) Specifies analysis weight
Output:
27
The SURVEYMEANS Procedure
Data Summary
Statistics
Std Error
Variable Mean of Mean
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
vrsast 15.638820 0.610515
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
Std Error
sub Variable Mean of Mean
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
0 vrsast 15.707917 0.669826
1 vrsast 15.367133 1.176853
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
28
Example 5B - Rate of household burglary by MSA status, household income, and family
structure, 2011
Year(s): 2011
Crime Type(s): household burglary (hburg)
Analysis Classes: MSA status (msa); household income (hincome); family structure
(fam_structure2)
Weight: wgthhcy
Subpopulation: 2011 (year=2011)
Calculated Directly from Adjusted Files?: yes
Code:
data anset;
set adjhh;
sub=(year=2011);
vrburg=adjinc_wt*hburg*1000;
run;
Output:
Data Summary
Statistics
Std Error
Variable Mean of Mean
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
vrburg 28.319646 0.771345
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
Domain Analysis: sub*HOUSEHOLD LOCALE
29
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
0 Urban vrburg 33.388469 1.179397
Suburban vrburg 22.284276 1.161647
Rural vrburg 35.072801 2.724182
1 Urban vrburg 33.487009 2.515425
Suburban vrburg 25.445345 2.008611
Rural vrburg 33.019059 4.534446
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
Std Error
sub hincome Variable Mean of Mean
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
0 Less than $7,500 vrburg 59.651675 5.784041
$7,500 to $14,999 vrburg 56.887617 4.760163
$15,000 to $24,999 vrburg 36.988937 3.625684
$25,000 to $34,999 vrburg 32.461893 2.317476
$35,000 to $49,999 vrburg 28.371919 1.752191
$50,000 to $74,999 vrburg 21.164290 1.171311
$75,000 or more vrburg 16.811149 0.868502
Unknown vrburg 25.476500 1.281015
1 Less than $7,500 vrburg 67.164972 14.006749
$7,500 to $14,999 vrburg 58.845099 9.944788
$15,000 to $24,999 vrburg 42.776246 5.556782
$25,000 to $34,999 vrburg 33.340142 5.830933
$35,000 to $49,999 vrburg 26.301166 2.682444
$50,000 to $74,999 vrburg 21.506342 3.006216
$75,000 or more vrburg 12.288703 1.235878
Unknown vrburg 29.549576 2.702723
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
Std Error
sub Family Structure Variable Mean of Mean
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
0 Two or more adults - W/O Children vrburg 21.290559 0.930596
Two or more adults - With Children vrburg 28.441133 1.059043
One Male Adult - W/O Children vrburg 36.163618 3.603260
One Male Adult - With Children vrburg 52.526591 9.167820
One Female Adult - W/O Children vrburg 25.750941 1.588754
One Female Adult - With Children vrburg 64.961684 4.899564
1 Two or more adults - W/O Children vrburg 24.208885 2.304288
Two or more adults - With Children vrburg 29.610024 2.381918
One Male Adult - W/O Children vrburg 30.997012 2.922347
One Male Adult - With Children vrburg 46.669715 14.319366
One Female Adult - W/O Children vrburg 30.105671 4.797870
One Female Adult - With Children vrburg 61.962753 8.315094
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
30
Example 5C - Rate of violent crime by weapon involvement, injury, and victim-offender
relationship, 2011
Year(s): 2011
Crime Type(s): violent victimizations (rsa+rob+aast+sast)
Analysis Classes: weapon involvement (wpnuse); injury (inj), victim-offender relationship (rel)
Weight: wgtpercy
Subpopulation: 2011 (year=2011)
Calculated Directly from Adjusted Files?: yes
Code:
data anset;
set adjper;
sub=(year=2011);
*aggregate violent crimes into a single measure for each outcome of interest.
*a;
viol_weap1=(rsa_wpnuse1+rob_wpnuse1+aast_wpnuse1+sast_wpnuse1)*adjinc_wt*1000;
*b;
viol_weap2=(rsa_wpnuse2+rob_wpnuse2+aast_wpnuse2+sast_wpnuse2)*adjinc_wt*1000;
*c;
viol_weap3=(rsa_wpnuse3+rob_wpnuse3+aast_wpnuse3+sast_wpnuse3)*adjinc_wt*1000;
*d;
viol_inj1=(rsa_inj1+rob_inj1+aast_inj1+sast_inj1)*adjinc_wt*1000;
*e;
viol_inj2=(rsa_inj2+rob_inj2+aast_inj2+sast_inj2)*adjinc_wt*1000;
*f;
viol_inj3=(rsa_inj3+rob_inj3+aast_inj3+sast_inj3)*adjinc_wt*1000;
*g;
viol_inj4=(rsa_inj4+rob_inj4+aast_inj4+sast_inj4)*adjinc_wt*1000;
*h;
viol_rel1=(rsa_rel1+rob_rel1+aast_rel1+sast_rel1)*adjinc_wt*1000;
*i;
viol_rel2=(rsa_rel2+rob_rel2+aast_rel2+sast_rel2)*adjinc_wt*1000;
*j;
viol_rel3=(rsa_rel3+rob_rel3+aast_rel3+sast_rel3)*adjinc_wt*1000;
*k;
viol_rel4=(rsa_rel4+rob_rel4+aast_rel4+sast_rel4)*adjinc_wt*1000;
*l;
viol_rel5=(rsa_rel5+rob_rel5+aast_rel5+sast_rel5)*adjinc_wt*1000;
*m;
viol_rel6=(rsa_rel6+rob_rel6+aast_rel6+sast_rel6)*adjinc_wt*1000;
run;
31
Code Comment(s):
a) Yes, offender had a weapon
b) No, offender did not have a weapon
c) Do not know if offender had a weapon
d) No injury
e) Serious injury
f) Minor injury
g) Rape without other injuries
h) Intimates
i) Other relatives
j) well known/casual acquaintances
k) Strangers
l) Do not know relationship
m) Do not know number of offenders
Output:
Data Summary
Statistics
Std Error
Variable Mean of Mean
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
viol_weap1 4.831854 0.239978
viol_weap2 16.905851 0.657311
viol_weap3 1.585757 0.126202
viol_inj1 17.472264 0.660110
viol_inj2 1.019748 0.101134
viol_inj3 4.522141 0.241631
viol_inj4 0.294720 0.053443
viol_rel1 3.679001 0.235064
viol_rel2 1.700361 0.168869
viol_rel3 6.996892 0.373455
viol_rel4 8.648548 0.364130
viol_rel5 1.324181 0.088132
viol_rel6 0.968963 0.106546
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
32
Domain Analysis: sub
Std Error
sub Variable Mean of Mean
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
0 viol_weap1 4.881214 0.279946
viol_weap2 17.021172 0.732904
viol_weap3 1.612903 0.135616
viol_inj1 17.608513 0.731000
viol_inj2 0.991732 0.110364
viol_inj3 4.633226 0.274660
viol_inj4 0.263517 0.065760
viol_rel1 3.774519 0.255819
viol_rel2 1.634968 0.188501
viol_rel3 7.018989 0.395294
viol_rel4 8.728594 0.410245
viol_rel5 1.423006 0.109876
viol_rel6 0.928292 0.124775
1 viol_weap1 4.637771 0.348181
viol_weap2 16.452410 1.176199
viol_weap3 1.479021 0.300596
viol_inj1 16.936533 1.225512
viol_inj2 1.129908 0.191335
viol_inj3 4.085352 0.384823
viol_inj4 0.417409 0.097062
viol_rel1 3.303427 0.499653
viol_rel2 1.957484 0.354815
viol_rel3 6.910004 0.777927
viol_rel4 8.333804 0.686701
viol_rel5 0.935598 0.125684
viol_rel6 1.128885 0.215224
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
33
Example 6: Rate of crime, aggregated years
Example 6A is an estimate of a personal victimization rate with person-level
characteristics of interest. Example 6B is an estimate of a property victimization rate with the
inclusion of a household-level characteristic. Both estimates are computed with variables
available on the modified person and household files.
Example 6A - Rate of violent crimes reported to police by sex, age category, race/ethnicity,
and marital status, 2009–2011
Year(s): 2009-2011
Crime Type(s): violent victimizations reported to police
(rsa_rpt1+rob_rpt1+aast_rpt1+sast_rpt1)
Analysis Classes: sex (sex); age category (agecat); race/ethnicity (race_eth); marital status
(marital)
Weight: wgtpercy
Subpopulation: 2009-2011 ((2009 <= year) and (year <=2011))
Calculated Directly from Adjusted Files?: yes
Code:
data anset;
set adjper;
sub=((2009 <= year) and (year <=2011));
*aggregate violent crimes into a single measure for outcome of
interest;
viol_rpt1=(rsa_rpt1+rob_rpt1+aast_rpt1+sast_rpt1)*adjinc_wt*1000;
run;
Output:
34
The SURVEYMEANS Procedure
Data Summary
Statistics
Std Error
Variable Mean of Mean
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
viol_rpt1 10.942564 0.388276
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
Std Error
sub sex Variable Mean of Mean
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
0 Male viol_rpt1 12.263099 0.847527
Female viol_rpt1 11.735712 0.792736
1 Male viol_rpt1 9.682951 0.535355
Female viol_rpt1 10.801333 0.750092
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
Std Error
sub agecat Variable Mean of Mean
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
0 12 to 14 viol_rpt1 10.706604 1.514160
15 to 17 viol_rpt1 20.325239 2.983613
18 to 20 viol_rpt1 18.508326 2.103361
21 to 24 viol_rpt1 19.158965 2.595491
25 to 34 viol_rpt1 17.998077 2.201344
35 to 49 viol_rpt1 13.197065 1.221089
50 to 64 viol_rpt1 7.657778 0.944521
65 or older viol_rpt1 1.662752 0.359343
1 12 to 14 viol_rpt1 10.142548 1.528288
15 to 17 viol_rpt1 13.041282 1.607437
18 to 20 viol_rpt1 22.576998 4.639424
21 to 24 viol_rpt1 16.933566 1.895897
25 to 34 viol_rpt1 14.075923 1.173494
35 to 49 viol_rpt1 11.413987 0.893979
50 to 64 viol_rpt1 6.593241 0.582309
65 or older viol_rpt1 1.902745 0.371955
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
Std Error
sub race_eth Variable Mean of Mean
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
0 Non-Hispanic White viol_rpt1 11.779361 0.689952
Non-Hispanic Black viol_rpt1 17.080468 2.129403
Hispanic viol_rpt1 9.606242 1.012843
Non-Hispanic Other viol_rpt1 8.094436 2.965192
Non-Hispanic More than One Race viol_rpt1 20.653312 4.002124
1 Non-Hispanic White viol_rpt1 9.591058 0.556192
Non-Hispanic Black viol_rpt1 16.324494 2.150383
35
Hispanic viol_rpt1 9.110811 0.905751
Non-Hispanic Other viol_rpt1 6.215577 1.431772
Non-Hispanic More than One Race viol_rpt1 20.785537 3.675282
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
Std Error
sub marital Variable Mean of Mean
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
0 Never Married viol_rpt1 18.209104 1.219181
Married viol_rpt1 6.064960 0.536853
Widowed viol_rpt1 5.630742 2.102283
Divorced viol_rpt1 21.873231 2.587565
Separated viol_rpt1 37.178780 7.171082
1 Never Married viol_rpt1 15.248824 1.034491
Married viol_rpt1 5.035802 0.383114
Widowed viol_rpt1 3.738650 0.916177
Divorced viol_rpt1 18.372702 1.868214
Separated viol_rpt1 38.603385 5.885568
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
36
Example 6B - Rate of property crimes reported to police by household income, 2009–2011
Year(s): 2009-2011
Crime Type(s): property crimes reported to police (hburg_rpt1+tft_rpt1+mvtft_rpt1)
Analysis Classes: household income (hincome)
Weight: wgthhcy
Subpopulation: 2009-2011 ((2009 <= year) and (year <=2011))
Calculated Directly from Adjusted Files?: yes
Code:
data anset;
set adjhh;
sub=((2009 <= year) and (year <=2011));
vrproprpt=(hburg_rpt1+tft_rpt1+mvtft_rpt1)*adjinc_wt*1000;
run;
Output:
Data Summary
Statistics
Std Error
Variable Mean of Mean
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
vrproprpt 53.182550 0.936036
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
37
$7,500 to $14,999 vrproprpt 74.357555 6.668412
$15,000 to $24,999 vrproprpt 60.660916 3.555991
$25,000 to $34,999 vrproprpt 58.303682 4.122739
$35,000 to $49,999 vrproprpt 60.220186 3.250153
$50,000 to $74,999 vrproprpt 56.186640 3.568633
$75,000 or more vrproprpt 58.226671 2.115554
Unknown vrproprpt 49.863632 1.997445
1 Less than $7,500 vrproprpt 64.052739 4.366416
$7,500 to $14,999 vrproprpt 60.840821 3.678508
$15,000 to $24,999 vrproprpt 54.831987 2.794859
$25,000 to $34,999 vrproprpt 54.832067 3.222495
$35,000 to $49,999 vrproprpt 55.921312 3.030289
$50,000 to $74,999 vrproprpt 49.011134 2.029558
$75,000 or more vrproprpt 49.946119 1.490552
Unknown vrproprpt 43.616987 1.544342
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
38
Example 7: Computing victimization rates based on incident characteristics not included on
the provided file
The two examples below estimate personal and property victimization rates for incident-
level characteristics not contained on the modified person and household files, and thus require
the pre-processing steps outlined in Section 3.2. SAS is used to calculate victimization
summaries from the incident-level file and merge incident summaries onto the household and
person files. Victimization rates are then calculated from the modified person-level file (for
personal crimes) or the modified household-level file (for property crimes). Example 7A is an
estimate of a personal victimization rate and Example 7B is an estimate of a property
victimization rate. Example 7A is for a single year, and Example 7B is for aggregated years.
Code:
*step1: identify records with victimization characteristic(s) of
interest on the modified incident-level file ;
data example7a_inc;
set adjincident;
vcrime=(newoff in (1 2 3 4)); *a;
39
*create binary classifications for each location, excluding
dummy records and crimes that occurred outside of the united
states;
Code Comment(s):
a) Identify violent crimes
b) Respondent’s home
c) Near respondents home
d) Friend’s home
e) Commercial
f) Parking lot-garage
g) School
h) Open areas, on street or public transportation
i) Other
j) Missing
k) Respondent’s home/near respondent’s home
l) Other location
data example7a_per;
merge adjper(in=a)
violent_place_sums(in=b);
by yearq idper;
if a;
if violent_home=. then violent_home=0; *a;
if violent_other=. then violent_other=0;
run;
40
Code Comment(s):
a) The two incident count variables are missing for persons not
included on the victimization summary file, so they are set to ‘0’ (no
victimizations of this type)
data anset;
set example7a_per;
sub=(year=2011);
*a;
vrviolrhome=violent_home*adjinc_wt*1000;
vrviolother=violent_other*adjinc_wt*1000;
run;
Code Comment(s):
a) The final step outlined in section 3.2 was to calculate the
victimization adjustment factor, but it is already contained on
the modified person file.
Output:
Data Summary
Statistics
Std Error
Variable Mean of Mean
41
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
vrviolrhome 8.895617 0.371408
vrviolother 14.427845 0.557310
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
Std Error
sub Variable Mean of Mean
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
0 vrviolrhome 9.038327 0.425064
vrviolother 14.476962 0.608688
1 vrviolrhome 8.334482 0.631375
vrviolother 14.234720 1.153994
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
42
Example 7B - Rate of property crimes by time of day, 2009-2011
Year(s): 2009-2011
Crime Type(s): property crimes by time of day (property_day; property_night;
property_missing)
Analysis Classes: n/a
Weight: wgthhcy
Subpopulation: 2009-2011 ((2009 <= year) and (year <=2011))
Calculated Directly from Adjusted Files?: no
run;
Code Comment(s):
a) identify property crimes
b) day
c) night
d) missing
43
*calculate the number of victimizations of each type by household and
reporting period ;
proc means data=example7b_inc noprint;
by yearq idhh;
weight serieswgt;
var property_day property_night property_missing;
output out=property_time_sums(drop=_type_ _freq_)
sum=;
run;
Code Comment(s):
a) The three incident count variables are missing for households not
included on the victimization summary file, so they are set to ‘0’ (no
victimizations of this type)
data anset;
set example7b_hh;
sub=((2009 <= year) and (year <=2011));
*a;
vrpropday=property_day*adjinc_wt*1000;
vrpropnight=property_night*adjinc_wt*1000;
vrpropmiss=property_missing*adjinc_wt*1000;
run;
44
Code Comment(s):
a) The final step outlined in section 3.2 was to calculate the
victimization adjustment factor, but it is already contained on
the modified person file.
Output:
Data Summary
Statistics
Std Error
Variable Mean of Mean
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
vrpropday 54.546013 1.097167
vrpropnight 59.332574 1.222059
vrpropmiss 24.883746 0.735171
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
Std Error
sub Variable Mean of Mean
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
0 vrpropday 59.022848 1.518622
vrpropnight 63.701678 1.688110
vrpropmiss 26.018035 1.020754
1 vrpropday 51.620495 1.227080
vrpropnight 56.477455 1.331527
vrpropmiss 24.142513 0.806898
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
45
C.4 Identifying Low Quality Estimates
This section demonstrates how to implement the recommendations for identifying low
quality estimates and the rounding rules discussed in Section 4 of the user’s guide. Three types
of data are needed to identify and flag low quality estimates, regardless of the estimate type: the
estimate, the standard error of the estimate, and the unweighted sample size of the estimate.
Because Examples 1-7 provided details for obtaining estimates and standard errors, the examples
in this section will focus primarily on calculating the percent relative standard error (RSE),
calculating unweighted sample sizes, identifying estimates that should be flagged as unreliable,
and verifying that estimates rounding to zero are not presented.
Unweighted sample sizes should take into account the series adjustment. For example, a
series victimization with a series count of seven would count as seven victimizations in the
unweighted sample size, while a non-series victimization would only count as one. This series
adjustment is already incorporated in the victimization counts on the modified person and
household-level files, so sample sizes for victimization rates are obtained by taking unweighted
sums of victimization counts with the specified characteristic(s) of interest. However, for
victimization totals and proportions, the series count must be included in the calculation of the
sample size, as demonstrated in the examples below. Example 8A demonstrates implementation
of flagging and rounding rules for victimization totals, Example 8B for victimization proportions,
and Example 8C for victimization rates.
46
proc surveymeans data=anset varmethod=taylor sum;
domain sub*sex sub*race_eth;
strata yr_grp v2117;
cluster v2118;
weight newwgt;
var ptft;
run;
Code Comment(s):
a) Weight by the series weight to obtain the series-adjusted count
of victimizations
Output:
Data Summary
Statistics
47
Non-Hispanic Black PTFT 435124 52924
Hispanic PTFT 371336 44117
Non-Hispanic Other PTFT 172088 39824
Non-Hispanic More than One Race PTFT 27608 9540.767800
1 Non-Hispanic White PTFT 83956 21479
Non-Hispanic Black PTFT 29507 10989
Hispanic PTFT 30287 10489
Non-Hispanic Other PTFT 13519 5943.552122
Non-Hispanic More than One Race PTFT 8501.872300 5281.778471
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
𝑆𝑆𝑆𝑆(𝑇𝑇𝑇𝑇𝑇𝑇𝑇𝑇𝑇𝑇)
% 𝑅𝑅𝑅𝑅𝑅𝑅(𝑇𝑇𝑇𝑇𝑇𝑇𝑇𝑇𝑇𝑇) = ∗ 100
𝑇𝑇𝑇𝑇𝑇𝑇𝑇𝑇𝑇𝑇
Estimated totals, standard errors, and unweighted sample sizes from the output above are
included in Table C-1 (in the “Total,” “SE(Total),” and “n” columns, respectively). Percent
RSEs are calculated based on the formula above. Based on the recommendations, five of the
eight estimates are flagged as unreliable. The estimate for males is flagged because the percent
RSE is greater than 30 percent, while the other four estimates are flagged both because their
48
RSEs exceed 30 percent and because their unweighted sample sizes are less than or equal to 10.
All estimates can be displayed because no estimates round to zero.
49
Example 8B – Distribution of motor vehicle theft in the Northeast Census Region by report
status, 2011
Year(s): 2011
Domain(s): Northeast Census Region (region=1); motor vehicle theft (mvtft=1)
Covariate(s): report status (notify)
Weight: newwgt
Subpopulation: within the United States (exclude_outUS=0); non-dummy records (dummy=0);
2011 (year=2011); Northeast Census Region (region=1); motor vehicle theft (mvtft=1)
Code:
*calculate estimates and standard errors;
data anset;
set adjincident;
sub=(exclude_outus=0 and dummy=0 and year=2011 and region=1 and
mvtft=1);
run;
Code Comment(s):
a) Weight by the series weight to obtain the series-adjusted count
of victimizations. Because flagging rules are based on the
denominator sample size, only the overall sample size is needed
(not estimates by report status).
Output:
50
The SURVEYFREQ Procedure
Data Summary
Total 100.000
---------------------------------------------------------
1 reported to police 92.6011 5.4032
not reported to police 7.3989 5.4032
do not know . .
Total 100.000
---------------------------------------------------------
Total reported to police
not reported to police
do not know
Total
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
Frequency Missing = 4079
notify N Obs N
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
reported to police 17 17
51
RSE > 30%, where
The percent relative standard errors of the log of the percentages (P) and (1-P) are
calculated as:
𝑆𝑆𝑆𝑆(𝑃𝑃)
% 𝑅𝑅𝑅𝑅𝑅𝑅(log(𝑃𝑃)) = ∗ 100
𝑃𝑃
𝑃𝑃 ∗ 𝑎𝑎𝑎𝑎𝑎𝑎(log �100�)
𝑆𝑆𝑆𝑆(𝑃𝑃)
% 𝑅𝑅𝑅𝑅𝑅𝑅(log(100 − 𝑃𝑃)) = ∗ 100
𝑃𝑃
(100 − 𝑃𝑃) ∗ 𝑎𝑎𝑎𝑎𝑎𝑎(log �1 − 100�)
Estimated percentages, standard errors, and the unweighted denominator sample size
from the output above are included in Table C-2 below (in the “P,” “SE(P),” and “n (denom)”
columns, respectively). Percent RSEs are calculated based on the formulas above. Based on these
recommendations, none of the estimates are flagged as unreliable. The denominator sample size
is 19, which is greater than the recommended threshold of 10. The percent RSEs for the reported
and not reported estimates are 28.03, which are below the 30 percent threshold. However,
because no respondents reported that they did not know whether or not the motor vehicle theft
was reported to the police, the estimated percent is 0. This estimate should not be presented
because it rounds to zero.
52
Example 8C – Rate of rape/sexual assault by sex, 2011
Year(s): 2011
Crime Type(s): rape/sexual assault (rsa)
Domain(s): sex (sex)
Weight: wgtpercy
Subpopulation: 2011 (year=2011)
Calculated Directly from Adjusted Files?: yes
Code:
Code Comment(s):
b) Unweighted because the victimization count RSA already includes
the series adjustment
Output:
53
The SURVEYMEANS Procedure
Data Summary
Statistics
Std Error
Variable Mean of Mean
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
vrrsa 1.115027 0.141041
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
Std Error
sub Variable Mean of Mean
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
0 vrrsa 1.157469 0.166403
1 vrrsa 0.948146 0.162529
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
Std Error
sub sex Variable Mean of Mean
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
0 Male vrrsa 0.173447 0.043650
Female vrrsa 2.095539 0.323622
1 Male vrrsa 0.276019 0.089702
Female vrrsa 1.592869 0.332343
ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
54
RSE > 30%
The percent relative standard errors of the victimization rates (VR) are calculated as:
𝑆𝑆𝑆𝑆(𝑉𝑉𝑉𝑉)
% 𝑅𝑅𝑅𝑅𝑅𝑅(𝑉𝑉𝑉𝑉) = ∗ 100
𝑉𝑉𝑉𝑉
Estimates, standard errors, and unweighted sample sizes from the output above are
included in Table C-3 below (in the “VR,” “SE(VR),” and “n (num)” columns, respectively).
Percent RSEs are calculated based on the formula above. Based on these recommendations, the
estimated victimization rate for males is flagged as unreliable because the percent RSE is greater
than 30 percent and the sample size of the numerator is less than or equal to 10. Because none of
the estimates round to zero, all three estimates can be presented.
55