5. data input device
5. data input device
7.7 Activity
7.8 Summary
7.9 Unit End Questions
7.10 References
7.11 Further/Suggested Reading
7.12 Answers
7.1 INTRODUCTION
You have been introduced to the concepts of GIS, data, data models and data
structures in Block 2, Fundamentals of Geographic Information System of
MGY-003. Now we will introduce you to the methods of data input. You have
already learnt that data is the basic requirement to perform any analysis in GIS
platform. In this unit, we will discuss the methods, processes and devices of data
input and reference frameworks alongwith a short introduction to linking external
database.
Objectives
After studying this unit, you should be able to:
• define data input;
• list sources of data;
• describe data input devices;
• illustrate digitisation process; 5
GIS Database Creation • discuss level of input;
• link external databases and metadata; and
• explain reference frameworks and georeferencing.
Data
Plan and Georeference conversion Construct Enter
organise → Data → Edit → projection → vector/raster → database → attributes
entry
topology
Fig. 7.1: Chain of data input process in GIS (source: modified after Davis, 2001)
(a)
(b) (c)
Fig. 7.2: Different types of scanners; (a) flat-bed, (b) rotating drum and (c) large-
format feed scanners (sources: www.techfuels.com/scanner/1579-flatbed-
scanners.html (a), http://postcardprinters.us/drum-scanner (b) and
www.chinasystem.com.hk/en/main.htm (c)
After scanning, the raster image needs to be first corrected for errors caused by
scanning. This requires some level of image processing about which we have
already read in Block 4, Processing and Classification of Remotely Sensed
Images of MGY-002.
• Thresholding: It segregates the image grey values into two distinct values
(that is 0 for black and 255 for white) by a threshold value.
• Mouse
• Keyboard
Digitising Table
A digitising table as shown in Fig. 7.3 is the most common device used for hard
copy digitisation. It has an in-built electronic mesh, which can sense the
position of a cursor. Cursor is a mouse like device (also called puck ) which
contains a cross hair encased in a glass or transparent plastic that allows the
operator to place the cursor accurately over the map elements. Cursor also has
buttons to indicate the start and end of line or polygon or define left and right
polygons. A map document is fixed to the center of digitising table with a sticky
tape. Digitising table uses a local rectilinear coordinate system. Map and the
digitiser must be registered so that vector data can be captured in real-world
coordinates. This is achieved by digitising a series of four or more control points Tics are ground-control
also called reference points or ‘tics’ along the four corners so that it is well points for a vector data,
spread out and then their real-world values are entered. Digitiser control software which represent known
geographic locations and
calculates the transformation matrix and then automatically applies this to any
are used to register maps
future coordinates that are captured. mounted on a digitiser
table.
Digitising tables can range from small paper sized formats to large formats. Large
sized tables also have adjustable stands to alter the elevation of the digitiser as
per convenience of the operator. Modern digitisers provide good resolution of
about 0.001" and an accuracy of about 0.003". A good digitiser should have the
properties of stability, repeatability, linearity, resolution and skew (DeMers,
2009). Let us discuss about these properties in detail.
• Stability: It deals with the tendency of the digitiser not to change as its
temperature rises.
(b)
(a)
(c) (d)
Fig. 7.4: Mouse and digitisation; (a) roller ball, (b) optical, (c) laser mouse, and (d) on-screen
10 digitisation with the help of mouse
Keyboard Data Input
Keyboard is the simplest device to input data into a computer (Fig. 7.5). The
process is also known as key coding. This technique is mostly used to input
attribute data into a GIS. These attributes can be linked to map features in a
spatial database using identification codes. The coordinates of special features
like points, lines and polygons can also be encoded with the help of keyboard.
(a) (b)
Fig. 7.6: Two types of digitisation; (a) point mode and (b) stream mode
There are a number of digitisation errors related to vector data. This requires
quality checking and cleaning of vector data. After cleaning the vector data,
topology is created. Topology is the relationship between entities (point, line and
polygon) in a vector dataset.
......................................................................................................................
......................................................................................................................
......................................................................................................................
......................................................................................................................
7.4.3 Metadata
Metadata in simple words is known as data about data. In other words,
metadata is a summary document providing content, type, creation, quality and
spatial information about a data set. In geoinformatics, it is the information about
geospatial data. Metadata provides some basic information as to when the data
was created, by whom it was created, why it was created, where it was
created, etc. Metadata summarises, indexes, abstracts and describes the quality 13
GIS Database Creation and material of the data. Thus, metadata is an integral part of a GIS project and
is usually prepared during the data production process. A metadata is usually
presented as an XML (Extensible Markup Language) document (Fig 7.7). XML
HTML (Hyper Text is a markup language similar to HTML. In a metadata XML document, a title
Markup Language) may be stored as follows: <title>My Document</title>.
is used for
displaying web
pages.
a) Metadata Standards
Now we will learn about metadata standards. A metadata standard is a
document that contains rules or guidelines for standardising the content of
metadata. These standards can be created specifically for use within a particular
organisation and also allow to develop tools that can be used to create and
modify metadata that meets the standard. In fact, it is the way of verifying the
content of metadata in compliance with the standard. Metadata standards contain
rules or guidelines for the content, format, creation and update policies, and
conformance rules.
The following are some of the suitable standards that are followed while
representing metadata:
l Federal Geographic Data Committee (FGDC) has developed the
content standards for metadata. It lays down specifications for both spatial
as well as non-spatial data.
l International Standards Organisation Technical Committee (ISO/TC
211) develops standards related to geographic information. Recently ISO
has developed an international metadata standard ISO 19115. It attempts to
satisfy the requirements of all well-known metadata standards.
l Dublin Core Metadata is specifically intended to support resource
discovery which involves searching, locating and retrieving of information
resources on computer-based networks in response to user queries.
b) Metadata Formats
Metadata can be interpreted or valued differently by different users inspite of
following a particular standard. Some common metadata formats are listed below:
14
• unstructured notes or log files Data Input
• summary formats
• formats used for the purpose of indexing or searching but not for viewing
• some new formats being used specific to GIS software which enables to
view, create and manage metadata and
• some customisable metadata formats that are becoming popular with XML,
etc.
Till 1999, the most common format of metadata used to be Outline Format.
Soon FAQ (Frequently Asked Question) format was popularised as it was easier
to read. This format could be easily understood even by non-technical or non-
GIS users.
• what was the original source of dataset and techniques of its collection?
• rotation
Translation: It sets the distance to move the objects contained in the track, in
the x, y and z directions. In translation, objects slide to the new position without
being rotated. This is done by adding or subtracting the coordinate values
necessary for X and Y coordinates of the object as shown in Fig. 7.8. The new
X-coordinate say X’ would be equal to the original X-coordinate plus some
value Tx. Similarly, the new Y coordinate Y’ for each graphic object would be
equal to Y coordinate plus some value Ty.
X’ = X+ Tx
Y’ = Y+ Ty
Y Direction (n units)
Y Y
X X
(a) (b)
Fig. 7.8: Schematics showing the process of translation; (a) before translation, and
(b) after translation. Note displacement of the same object in Y-direction after
16 translation
Scale Change: It sets the x, y and z scale factors to make objects in the track Data Input
larger or smaller as shown in Fig. 7.9. This is also useful in comparing differently
scaled maps. Even the output can be represented in different scales. This is done
by multiplying the overall X-coordinate extent by a scale factor sx, and each set
of Y-coordinates by a scale factor sy.
X’ = X sx
Y’ = Y sy
where, sx and sy represent the amount or percentage of scale change.
Y Y
X X
Scale: 1 cm = 200 m Scale: 1 cm = 400 m
(a) (b)
Fig. 7.9: Schematics showing the process of scale change; (a) original object and
(b) after scale change. Note that the same object has become smaller after scale
change
Rotation: It sets the x, y and z rotation angles around a fixed point (the
origin) for objects contained within the track as you can see in Fig. 7.10.
Rotation around the x-axis is the roll, or yaw angle. Rotation around the
y-axis is the inclination, or pitch angle. Rotation around the z-axis is
the azimuth, or heading angle. It is used frequently during the process
of projection and inverse projection and uses the basic trigonometry. For
X- coordinate locations, the new location X’ would be found by multiplying
the original X location by the cosine of the new angle (è) and then adding
that value to the original Y- coordinate multiplied by the sine of the theta
(sin è). The new Y- coordinate location Y’ is found by multiplying the
negative of the original X value by the sine of the angle and again adding
that to the product of Y coordinate and sin è.
X’ = X cos è + Y sin è,
and
Y’ = − X sin è + Y sin è
where, è is the angular displacement needed. 17
GIS Database Creation
Y
Y
X X
(a) (b)
Fig. 7.10: Schematics showing the process of rotation; (a) object before rotation and
(b) same object after rotation by angle è
These are the basic three types of graphical manipulations which help in all
necessary transformations.
Spend Check Your Progress II
5 mins
1) What is meant by scale change?
......................................................................................................................
......................................................................................................................
......................................................................................................................
......................................................................................................................
......................................................................................................................
......................................................................................................................
......................................................................................................................
......................................................................................................................
......................................................................................................................
......................................................................................................................
7.6 GEOREFERENCING
Georegistration is Georeferencing is the process of locating any entity in real-world coordinates.
another term which is
Raster data is commonly obtained by scanning maps, topographical maps or
used alternatively with
georeferencing. collecting aerial photographs and satellite images. Scanned maps normally do not
contain spatial reference information. However, sometimes they might have
coordinate information marked along it with the help of which it can be
georeferenced (Fig. 7.11a). If there is no coordinate information, we can
georeference it by adjusting the map or image to the geographic location of a
“known good” reference image or map. The image or map being used as a
reference is called reference component and the image or map being adjusted is
18 called target component (Fig. 7.11b & c).
Data Input
(a)
(b) (c)
Fig. 7.11: Georeferenced images; (a) georeferencing by adding control points, (b) and
(c) georeferencing of map/image by using a reference image (b) and target image (c)
(source: www.georeference.org/doc/georegistration.html)
Fig. 7.12: Geocoding with the help of coordinate information of places given in table 7.1
Reverse geocoding can also be done by finding the street address with the help
of associated geographical coordinates.
7.6.2 Geotagging
Geotagging enables the user to find location of a particular media. It is the process of
adding location specific information or geographical identifiers to photographs, films,
videos, websites, etc. This is also called geospatial metadata and usually consists
of coordinate information in the form of latitude-longitude along with information
20 related to altitude, bearing, distance, place names, etc. (Fig. 7.13).
Data Input
7.6.3 Rubbersheeting
Rubbersheeting is a non-uniform adjustment of a dataset based on the movement
of known control points to new locations. It is a process that corrects flaws in
source map or vector drawing through geometric adjustment of coordinates with
the help of a more accurate target layer (Fig. 7.14). Errors may be due to
imperfect image registration, scanning, inaccurate flight alignment or camera
inaccuracies in case of aerial photographs, etc. Thus, inaccurate data can be
stretched or rubbersheeted over the accurate data using control points and place
marks common to both data sets.
Reference point 2
Reference point 2
Target map
Input point 2 Target map
(accurate)
(accurate)
Input point 2
Y
Reference point 1
(b)
Input point 1
X
Fig. 7.14: Illustration of rubbersheeting; (a) vector drawing which is more
accurately adjusted to the target drawing (b)
7.7 ACTIVITY
1) Scan a paper map which has coordinate information at the corners.
2) Georeference the map using a suitable coordinate and projection system
with the help of GIS software. Then input the rectified or georeferenced
map into the GIS software and digitise it (on-screen digitisation). Digitise
different features like roads, rivers, lakes, forests, etc. and input them as
different layers in the GIS database. Now add attributes to your digitised
features so as to represent their names, location and other attributes. 21
GIS Database Creation 3) Now you have learnt the basics of data inputting. After finishing this you can
scan topographical maps of a place and continue doing the same process as
mentioned above. If the topographical maps are in continuity then digitised
features can also be edge-matched once the georeferenced toposheets open
up side-by-side.
Note: For scanning of topographical maps prior permission from the
concerned authorities is needed.
7.8 SUMMARY
In this unit, you have learnt that:
• Data input is a method of getting data into a computer.
• There are different types of devices like digitising table, mouse, scanner and
keyboard which are used for data input.
• Map digitisation is the process of converting features on a paper map or
simply tracing a paper map into digital format with the help of digitising table
or on-screen digitisation by displaying raster maps on a computer screen
with the help of a GIS software. Digitised features can be in the form of
points, lines or polygons.
• Metadata is the data about data and is prepared mostly in the form of
XML during the time of data production. It forms the most integral part of a
GIS and reveals basic information about data as to when it was created, by
whom it was created, why it was created, where it was created, etc.
• A metadata standard is a document that contains rules or guidelines for
standardising the content of metadata.
• Before digitisation, it is important to georeference map using a suitable
projection system. Georeferencing is the process of locating any entity in
real-world coordinates. Some other processes related to georeferencing are
geocoding (adding coordinates to street locations), geotagging (adding
geographical information to photographs, films, etc.) and rubbersheeting
(geometric adjustment of coordinates with the help of a more accurate
map). GIS software has to perform a number of graphical manipulations that
results into different types of projections. They are translation, scale change
and rotation.
Spend
30 mins 7.9 UNIT END QUESTIONS
1) What do you understand by map digitisation?
2) Briefly describe different methods of raster and vector data input.
3) Give a brief discussion of metadata in GIS.
4) Explain the concept of georeferencing.
7.10 REFERENCES
• Chang, K.-t. (2010), Introduction to Geographic Information Systems,
Tata McGraw-Hill, New Delhi.
22
• Davis, B.E. (2001), GIS: A Visual Approach, Onword Press, Canada. Data Input
7.12 ANSWERS
Check Your Progress I
1) The names of different devices used for inputting geographical data are,
mouse, digitising table and puck/cursor, scanners, keyboard, etc.
2) The different modes of digitisation are point mode and stream mode
digitisation.
Check Your Progress II
1) Scale change sets the x, y and z scale factors to make objects in the track
larger or smaller. This is done by multiplying the overall X- coordinate
extent by a scale factor sx, and each set of Y coordinates by a Y scale
factor sy.
Unit End Questions
1) Refer to section 7.3.
2) Refer to section 7.3.3
3) Refer to section 7.4.3
4) Refer to section 7.6.
23