Study Module 2
Study Module 2
Study Session 1:
Learning Outcome:
Diskette
Hard disks
This is a variation of the hard disk in which hard disks enclosed in plastic or metal
cartridges are easily removable like floppy disks. It combines the best features of
hard and floppy disks. They are used to provide large economic, high, fast, and
portable storage facilities for data processing.
Magnetic tape has historically been found more convenient means of large data
storage over disk where media portability or removability is required for backup.
Magnetic Tape uses the same read/write techniques as disks. Data is stored on
flexible mylar tape covered with magnetic oxide. Data is stored in parallel tracks of
9, 18, or 36. Data on tapes are accessed sequentially. Tapes provide slow, very cheap,
large capacity backup for data. The rapid advances in disk storage technologies
resulting in and the improvement in disk storage density, and reducedprice, coupled
with arguably declining innovation in tape storage technology, has reduced the
market share of tape storage devices.
Figure 3: Tape
Optical Storage
The optical storage devices such as CDs and DVDs are means in which data is
written and read with a laser for archival or backup purposes. The optical storage
devices are fast replacing both hard drives in computers and tape backup in mass
storage. This is because optical media are more durable than tape and less vulnerable
to environmental conditions lasting up to seven times as long as traditional storage
media. However, at present optical media are slower than typical hard drive speeds
and offer lower storage capacities. Optical disk capacity ranges up to 6 gigabytes
(6,000,000 bytes), which is far more compared to the 1.44 megabytes (MB), that is,
1,440,000 bytes offered by a floppy disk. A newer technology, the digital versatile
disc DVD, has about 4.7-gigabyte storage capacity on a single-sided, one-layered
disk compared with 65 gigabyte of storage for a CD-ROM disk. Invariably, they can
be used to hold large amount of data.
Bits of data (0 s and 1 s) are stored on circular magnetic platters called disks and
rotates rapidly (& never stops). A disk head reads and writes bits of data as under
the head. Often, several platters are organized into a disk pack (or disk drive). A disk
contains concentric tracks. Tracks are divided into sectors. A sector is the smallest
addressable unit in a disk.
When a program reads a byte from the disk, the operating system locates the surface,
track, and sector containing that byte, and reads the entire sector into a particular
area in main memory called buffer. The bottleneck of disk access is moving the
read/write arm. So it makes sense to store a file in tracks that are below/above each
other on different surfaces rather than in several tracks on the same surface.
Cylinder
A cylinder is the set of tracks at a given radius of a disk pack. i.e. a cylinder is the
set of tracks that can be accessed without moving the disk arm. All the information
on a cylinder can be accessed without moving the read/write arm.
Disk Performance
This is the time it takes to position the proper sector under the read/write head. In
general, it is used to refer to the period of time that one component in a system is
spinning its wheels waiting for another component. Latency, therefore, is wasted
time. It makes sense to separate read latency and write latency, and in case of
sequential access storage, minimum, maximum and average latency. Consider a hard
disk which rotates at about 5000 rpm i.e. one revolution per 12 msec. The average
latency can be calculated as follows:
Min latency = 0
= max / 2
(b) Throughput
This is the rate at which information can be read from or written to the storage. It is
expressed in terms of megabytes per second or MB/s. A media accessed sequentially,
as opposed to randomly, typically yield maximum throughput.
This is the amount of time between when the CPU requests a file and when the first
byte of the file is sent to the CPU. Seek times between 10 and 20 milliseconds are
common.
Figure 6: A Flash
A common type of memory card, SD cards are used in multiple electronic devices,
including digital cameras and mobile phones. Although there are different sizes,
classes, and capacities available, they all use a rectangular design with one side
"chipped off" to prevent the card from being inserted into the camera or other device
the wrong way.
A solid-state drive uses flash memory to store data and is sometimes used in devices
such as netbooks, laptop, and desktop computers instead of a traditional hard disk
drive. The advantages of an SSD over an HDD include a faster read/write speed,
noiseless operation, greater reliability, and lower power consumption. The biggest
downside is cost, with an SSD offering lower capacity than an equivalently priced
HDD.
Cloud Storage
With users increasingly operating multiple devices in multiple places, many are
adopting online and cloud computing solutions. Cloud computing basically involves
accessing services over a network via a collection of remote servers. Although the
idea of a "cloud of computers" may sound rather abstract to those unfamiliar with
this metaphorical concept, in practice it can provide powerful storage solutions for
devices that are connected to the internet.
Punch Card
Punch cards (or punched cards) were a common method of data storage used in the
early computers. Basically, they consisted of a paper card with punched or perforated
holes that have been created by hand or machine. The cards were entered into the
computer to enable the storage and accessing of information. This form of data
storage media pretty much disappeared as new and better technologies were
developed.
Study Session 2:
File Organization
Learning Outcomes:
a. Program File
These are files that store sets of instructions written in a programming language. A
source program file, for example, contains the instructions written in a high-level
language such as BASIC or FORTRAN programming language by a programmer.
In contrast, the object file is the translated form of the source file in machine code
after. The files that contain the machine code are called executable files (or binary
files).
b. ASCII File
ASCII stands for American Standard Code for Information Interchange. ASCII files
are text-based files. The characters are represented in ASCII code (without
formatting such as underline, italics, boldface, or graphics). Files stored in this
format are used to transfer documents between incompatible computer platforms,
such as IBM and Macintosh.
c. Image File
Documents containing digitized graphics or images are stored in this format. Image
files: Image files contain digitized graphics. Audio and video files: Audio files
contain digitized sound, while video files contain digitized video images and
animation.
d. Audio and Video File
This is a file that is used to store digitized sound or digitized video images and
animation.
e. Data File
This refers to document files, contain data, not programs. Their contents are using
application software.
File Organization
The file is created, arranged, and maintained in data processing systems to
retrieve quickly. Computer systems store files permanently on secondary storage
devices. Records or files are arranged in several ways on the storage media, and
the arrangement determines how individual records can be accessed or retrieved.
Four common ways of file organization and access are:
Self-Assessment Questions
1. Define the term a computer file.
2. Explain three main techniques of file organization.
3. List and explain the two types of files.
References/Further Readings
• Jeffery L. Whitten, Lonnie D. Bentley, Kevin C. Dittman, Systems
Analysis and Design Methods, McGraw Hill, New York, 2004.
• Introduction to Computers and Information Technology, 2nd edition, Pearson,
2015, ISBN-13: 9781323237120.
Study Session 3:
Learning Outcomes:
Some problems with the traditional file environment are Data Dependence,
Program-Data Dependence, and Difficulty of Data-Sharing. These are discussed as
follow:
(a) Data Redundancy: This means the presence of duplicate data in multiple data
files and often in different formats. This is often the result when different
departments are allowed to collect the same piece of information about an object.
For instance, within the university environment, the hostels and student registration
department might collect the same student’s information (Name, Mat No, Level, and
Address). Because it is collected and maintained in so many different places, the
same data items may be repeated in different departments. When data fields are
repeated in different files, storage spaces are wasted, and much time is spent trying
to update the records.
Self-Assessment Questions
References/Further Readings
Dominic Giampaolo, Practical File System Design with the Be File System,
Morgan Kaufmann; Kindle edition (August 29, 2013), ISBN-13: 9781558604971.