Lecture 4. Point Pattern Analysis
Lecture 4. Point Pattern Analysis
ANALYSIS
Lecture 4
Introduction to Point Pattern Analysis
Simplest Spatial Data
• A set of point locations within a defined region.
• A point pattern is a set of events in a study region.
• Each event is symbolized by a point object.
• Data are the locations of a set of point objects.
Applications
-Hot-spot analysis (crime, disease)
-Vegetation, archaeological studies
Examples
• Position of trees on a landscape.
• Location of crime incidents
• Single type of crime, interaction between types of crime,
• Banks, Distribution of grazing animals in a field
• Study the evolution of the point-pattern over time
• Interaction between two species
Introduction to Point Pattern Analysis
Requirements for a set of events to constitute a point pattern
-Pattern should be mapped on a plane (flat).
-Study area determined objectively
-Pattern is a census of the entities of interest
-One-to-one correspondence between objects and events
-Event locations are proper
Why Analyze Pattern Analysis?
Point Pattern
Analysis
Centrography
A very basic form of point
pattern analysis involves
summary statistics such as
the mean center, standard
distance and standard
deviational ellipse.
How Point Pattern Analysis is Different?
• From Centrographic Statistics :
• Centrographic Statistics calculates single summary measures,
• PPA analyzes the complete set of points
• From Spatial Autocorrelation :
• with PPA, the points have location only; there is no “magnitude” value.
• With Spatial Autocorrelation points have different magnitudes; there is an
attribute variable.
Use of Point Pattern Analysis
• These point pattern analysis techniques were popular before
computers were ubiquitous since hand calculations are not too
involved, but these summary statistics are too concise and hide far
more valuable information about the observed pattern.
• More powerful analysis methods can be used to explore point
patterns
Introduction to Point Pattern Analysis
we’ll make a distinction between the intensity of a spatial process and the
observed density of a pattern under study
Global Density
Local Density
3/17.08 = 0.17
Relationship of Elevation and Density
• We can plot the relationship between point density and elevation regions to help assess any
dependence between the variables.
•
Though there is a steep increase in density at the highest elevation range, this increase is not
monotonic across all ranges of increasing elevation suggesting that density may not be explained by
elevation alone.
• It’s important to note that how one chooses to tessellate a surface can have an
influence on the resulting density distribution. For example, dividing the elevation
into equal area sub-regions produces the following density values.
While the high density in the western part of the study area remains, the density values to the east are
no longer consistent across the other three regions.
Density-based Point Pattern Measures
Kernel-density estimation
• -Pattern has a density at any location in the study region
• -Good for hot-spot analysis, checking first-order stationary process,
and linking point objects together geographic data.
Kernel Density Function
• Kernel density estimation (KDE) is a non-parametric way to
estimate the probability density function of a random variable.
• Kernel density estimation is a fundamental data smoothing
problem where inferences about the population are made, based
on a finite data sample.
• The Kernel Density tool calculates the density of features in a
neighborhood around those features. It can be calculated for
both point and line features.
Kernel Density
• The kernel density approach is an extension of the quadrat method: Like the
quadrat density, the kernel approach computes a localized density for subsets
of the study area, but unlike its quadrat density counterpart, the sub-regions
overlap one another providing a moving sub-region window.
• This moving window is defined by a kernel. The kernel density approach
generates a grid of density values whose cell size is smaller than that of the
kernel window.
• Each cell is assigned the density value computed for the kernel
window centered on that cell.
Kernel Density
• A kernel not only defines the
shape and size of the window,
but it can also weight the points
following a well defined kernel
function. The simplest function is
a basic kernel where each point
in the kernel window is assigned
equal weight.
Example:
X= 1.5, Y = 8.5
1/9 = 0.11
Kernel Density
• Some of the most popular kernel
functions assign weights to points
that are inversely proportional to
their distances to the kernel
window center. A few such kernel
functions follow
a gaussian or quartic like
distribution function. These
functions tend to produce a
smoother density map.
A red one-to-one diagonal line is added to the plot. While an increase in predicted intensity is
accompanied with increasing observed intensity, the relationship is not linear. This can be
explained by the small area covered by these high elevation locations which result in fewer
observation opportunities and thus higher uncertainty for that corner of the study extent. This
uncertainty is very apparent in the ρ vs. elevation plot where the 95% confidence interval envelope
widens at higher elevation values (indicating the greater uncertainty in our estimated ρ value at
those higher elevation values).
Distance Based Methods
• Distance based methods for pattern analysis whereby the interest
lies in how the points are distributed relative to one another (a
second-order property of the point pattern) as opposed to how the
points are distributed relative to the study extent.
Three distance based approaches are covered next: The average nearest
neighbor (ANN), the G, F and K function.
Average Nearest Neighbor