Data Mining-Spatial Data Mining
Data Mining-Spatial Data Mining
Spatial Database
Stores a large amount of space-related data
Maps
Remote Sensing
Medical Imaging
VLSI chip layout
Have Topological and distance information
Require spatial indexing, data access, reasoning
,geometric
computation
and
knowledge
representation techniques
Spatial Data Mining
Extraction of knowledge, spatial relationships from
spatial databases
Can be used for understanding spatial data and spatial
relationships
Applications:
GIS, Geomarketing, Remote Sensing, Image
database exploration, medical imaging, Navigation
Challenges
Complexity of spatial data types and access
methods
Large amounts of data
Non-spatial Information
Same as data in traditional data mining
Numerical, categorical, ordinal, boolean, etc
e.g., city name, city population
Spatial Information
Spatial attribute: geographically referenced
Output
A map that reveals patterns: merged (similar)
regions
Goals
Interactive analysis (drill-down, slice, dice, pivot,
roll-up)
Fast response time
Minimizing storage space used
Challenge
A merged region may contain hundreds of
primitive regions (polygons)
Probably not.
It requires multi-megabytes of storage.
On-line computation is slow and expensive.
Progressive Refinement
Progressive Refinement:
spatial association mining needs to evaluate
multiple spatial relationships among a large no. of
spatial object expensive.
Hierarchy of spatial relationship:
First search for rough relationship and then
refine it
Superset coverage property all the potential
answers should be perserved (i.e.false-positive
test).
Two-step mining of spatial association:
Step 1: Rough spatial computation (as a filter)
Using MBR for rough estimation
Step2: Detailed spatial algorithm (as refinement)
Apply only to those objects which have passed
the rough spatial association test (no less than
min_support)
Spatial co-locations
Just what one really wants to explore.
Based on the property of spatial autocorrelation,
interesting features likely coexist in closely located
regions.
Efficient methods - Apriori , progressive
refinement,etc.
Spatial Cluster Analysis & Spatial Classification
Analyze spatial objects to derive classification
schemes, such as decision trees, in relevance to certain
spatial properties (district, highway, river, etc.)
Classifying medium-size families according to