Geographical Data in the Computer-1
Geographical Data in the Computer-1
• Record: A collection of related fields (such as a name, address, and phone number) treated as a
single unit of data in a database.
• File: A collection of records stored together, such as a document, image, or database table.
• Directory: A folder in a computer’s file system that contains files and possibly other directories
(subdirectories), organizing data hierarchically.
Data Volume:
• Kilobyte (KB): Approximately 1,024 bytes.
• Megabyte (MB): Approximately 1,024 kilobytes or about 1 million bytes.
• Gigabyte (GB): Approximately 1,024 megabytes or about 1 billion bytes.
• Terabyte (TB): Approximately 1,024 gigabytes or about 1 trillion bytes.
Coding the Basic Data Models
- Data structure that uses regular tessellation units such as square cell (pixel) to describe
geographical phenomena.
- Locational data – Direct reference to grid network
- Attribute data
Database Structures
• Essential feature of any data storage system
- Data to be accessed and cross-referenced quickly
• No single best method
• File and Data Access
- Simple lists
- Ordered sequential files
GMR Institute of Technology
- Indexed files
- Direct file
- Inverted file
Data file and database
Data file Database
Data are collected for specific Data are generic for the business of
purpose an organization
Collection of records usually of the Collection of interrelated records
same data type and format organized in one or more data files
Data file processing – computer Database processing- Associated with
programming DBMS
Used in support of information needs Used in support of day to day
GMR Institute of Technology
Basic unit of file is data record or tuple- Contains all the information related to entity.
hand
-Simplicity and ease of access via keys
Disadv: Large index files- Data redundancy- Storage and access costs
GMR Institute of Technology
• Network structure
- Linkage between the objects
Adv: -Useful, when relations or linkages can be specified beforehand.
- Data redundancy avoided
Disadv: Database is enlarged by overhead of pointers
GMR Institute of Technology
Is situated
Contains
Type
District User#
service
in
Type Address
Amount
User# Name Address Consu-
Used
mer
GMR Institute of Technology
TS TB Owner
Type Type
Lives at
Used
by
Addres
Buildin Owner Consum User#
s
g er
User #
Relational database for
Utility consumer management
Name TC AU
Appropriate database structure
• Hierarchical
– Dividing the spatial data into manageable themes
• Network approach
- Ideal for topologically linked vector lines and polygons
GMR Institute of Technology
• Relational approach
- Good for retrieving objects on the basis of their attributes
• O- O approach
- Useful when entities share attributes or interact in special ways.
Data organization in raster data structures
GMR Institute of Technology
GMR Institute of Technology
Different ways of creating raster data structure
Next Overlay
Next point
(a) Each cell referenced directly (b) Each overlay referenced directly
➢ Data Hungry structure ➢Two dimensional matrix
➢ No data on cell size or display symbols ➢Contains list of redundant coordinates
➢ No compression techniques ➢No data on cell size or display symbols
MAP FILE MAP FILE
Overlay Overlay
Number of rows, N
Title Number of columns, M
scale Cell Size
mapping unit 1 Projection
label X Minimum
display symbol Y Minimum
attribute Pointer to legend
GMR Institute of Technology
set of points
(X,Y coordinate pairs) Sequence of NXM values
• Chain codes
- Boundary of the region will be encoded
- Region will be represented in terms of its origin and sequence of unit vectors in
cardinal directions (E=0, N=1, W=2, S=3).
- Compact way of storing data
GMR Institute of Technology
RASTER MAP
GMR Institute of Technology
Chain Code
- Data structure consists of the origin (Centre or bottom left) and radius of the square.
- Medial Axis Transformation
- Efficient for large uniform shape
- Union and intersection can be performed
GMR Institute of Technology
• Quadtrees
- This will provide an approach to addressing the successively finer levels of information
- Region will be divided successively into quadrant blocks
- The block structure may be described by tree of degree 4, known as quadtree
GMR Institute of Technology
8
7
6
5
4
3
1
2
1
2
3
A
4
5
6
7
8
Data Organization in Vector Data Structures
GMR Institute of Technology
GMR Institute of Technology
POINT
Unique Identifier
(if simple)
GMR Institute of Technology
Symbol scale
orientation
Pointer to database
(if text) characters scale
font
justification
type style
(if node) symbol pointer to lines
angles of the junctions
Non-graphic attributes
Vector
• Adv: • Dis adv:
- Good representation of entity data - Complex data structure
models - Difficult overlay processing
GMR Institute of Technology