Unit 1
Unit 1
Unit 1
The database is a collection of inter-related data which is used to retrieve, insert and
delete the data efficiently. It is also used to organize the data in the form of a table,
schema, views, and reports, etc.
For example: The college Database organizes the data about the admin, staff,
students and faculty etc.
Using the database, you can easily retrieve, insert, and delete the information.
Characteristics of DBMS
o It uses a digital repository established on a server to store and manage the
information.
o It can provide a clear and logical view of the process that manipulates data.
o DBMS contains automatic backup and recovery procedures.
o It contains ACID properties which maintain data in a healthy state in case of failure.
o It can reduce the complex relationship between data.
o It is used to support manipulation and processing of data.
o It is used to provide security of data.
o It can view the database from different viewpoints according to the requirements of
the user.
Advantages of DBMS
o Controls database redundancy: It can control data redundancy because it stores all
the data in one single database file and that recorded data is placed in the database.
o Data sharing: In DBMS, the authorized users of an organization can share the data
among multiple users.
o Easily Maintenance: It can be easily maintainable due to the centralized nature of
the database system.
o Reduce time: It reduces development time and maintenance need.
o Backup: It provides backup and recovery subsystems which create automatic backup
of data from hardware and software failures and restores the data if required.
o multiple user interface: It provides different types of user interfaces like graphical
user interfaces, application program interfaces
Disadvantages of DBMS
o Cost of Hardware and Software: It requires a high speed of data processor and
large memory size to run DBMS software.
o Size: It occupies a large space of disks and large memory to run them efficiently.
o Complexity: Database system creates additional complexity and requirements.
o Higher impact of failure: Failure is highly impacted the database because in most of
the organization, all the data stored in a single database and if the database is
damaged due to electric failure or database corruption then the data may be lost
forever.
Database
What is Data?
Data is a collection of a distinct small unit of information. It can be used in a variety
of forms like text, numbers, media, bytes, etc. it can be stored in pieces of paper or
electronic memory, etc.
Word 'Data' is originated from the word 'datum' that means 'single piece of
information.' It is plural of the word datum.
In computing, Data is information that can be translated into a form for efficient
movement and processing. Data is interchangeable.
What is Database?
A database is an organized collection of data, so that it can be easily accessed and
managed.
You can organize data into tables, rows, columns, and index it to make it easier to
find relevant information.
Database handlers create a database in such a way that only one set of software
program provides access of data to all the users.
There are many dynamic websites on the World Wide Web nowadays which are
handled through databases. For example, a model that checks the availability of
rooms in a hotel. It is an example of a dynamic website that uses a database.
There are many databases available like MySQL, Sybase, Oracle, MongoDB,
Informix, PostgreSQL, SQL Server, etc.
The Evolution
File-Based
1968 was the year when File-Based database were introduced. In file-based
databases, data was maintained in a flat file. Though files have many advantages,
there are several limitations.
One of the major advantages is that the file system has various access methods, e.g.,
sequential, indexed, and random.
1968-1980 was the era of the Hierarchical Database. Prominent hierarchical database
model was IBM's first DBMS. It was called IMS (Information Management System).
Below diagram represents Hierarchical Data Model. Small circle represents objects.
Like file system, this model also had some limitations like complex implementation,
lack structural independence, can't easily handle a many-many relationship, etc.
In this model, files are related as owners and members, like to the common network
model.
Data Models
Data Model is the modeling of the data description, data semantics, and consistency
constraints of the data. It provides the conceptual tools for describing the design of a
database at each level of data abstraction. Therefore, there are following four data
models used for understanding the structure of the database:
1) Relational Data Model: This type of model designs the data in the form of rows
and columns within a table. Thus, a relational model uses tables for representing data
and in-between relationships. Tables are also called relations. This model was initially
described by Edgar F. Codd, in 1969. The relational data model is the widely used
model which is primarily used by commercial data processing applications.
4) Semistructured Data Model: This type of data model is different from the other three
data models (explained above). The semistructured data model allows the data specifications
at places where the individual data items of the same type may have different attributes sets.
The Extensible Markup Language, also known as XML, is widely used for representing the
semistructured data. Although XML was initially designed for including the markup
information to the text document, it gains importance because of its application in the
exchange of data.
A schema diagram can display only some aspects of a schema like the name of
record type, data type, and constraints. Other aspects can't be specified through the
schema diagram. For example, the given figure neither show the data type of each
data item nor the relationship among various files.
In the database, actual data changes quite frequently. For example, in the given
figure, the database changes whenever we add a new grade or add a student. The
data at a particular moment of time is called the instance of the database.
Data Independence
o Data independence can be explained using the three-schema architecture.
o Data independence refers characteristic of being able to modify the schema at one
level of the database system without altering the schema at the next higher level.
(But in Oracle database, the execution of data control language does not have
the feature of rolling back.)
There are the following operations which have the authorization of Revoke:
The majority of users switched from terminals to PCs and workstations as hardware
prices decreased. Initially, Database Systems operated on these computers in a
manner akin to how they had operated display terminals. As a result, the DBMS itself
continued to operate as a centralized DBMS, where all DBMS functionality,
application program execution, and UI processing were done on a single computer.
The physical elements of a centralized architecture Client/server DBMS designs
emerged as DBMS systems gradually began to take advantage of the user side's
computing capability.
Client-server Architecture of DBMS:
We first talk about client/server architecture in general, and then we look at how
DBMSs use it. In order to handle computing settings with a high number of PCs,
workstations, file servers, printers, database servers, etc., the client/server architecture
was designed.
A network connects various pieces of software and hardware, including email and
web server software. To define specialized servers with a particular functionality is the
aim. For instance, it is feasible to link a number of PCs or compact workstations to a
file server that manages the client machines' files as clients. By having connections to
numerous printers, different devices can be designated as a printer server; all print
requests from clients are then directed to this machine. The category of specialized
servers also includes web servers and email servers. Many client machines can utilize
the resources offered by specialized servers. The user is given the proper user
interfaces for these servers as well as local processing power to run local applications
on the client devices. This idea can be applied to various types of software, where
specialist applications, like a CAD (computer-aided design) package, are kept on
particular server computers and made available to a variety of clients. Some devices
(such as workstations or PCs with discs that only have client software installed) would
only be client sites.
Classification of Database
Management Systems
ADRIENNE WATT
Database management systems can be classified based on several criteria, such as
the data model, user numbers and database distribution, all described below.
In recent years, the newer object-oriented data models were introduced. This
model is a database management system in which information is represented in the
form of objects as used in object-oriented programming. Object-oriented databases
are different from relational databases, which are table-oriented. Object-oriented
database management systems (OODBMS) combine database capabilities with
object-oriented programming language capabilities.
Centralized systems
With a centralized database system, the DBMS and database are stored at a single
site that is used by several other systems too. This is illustrated in Figure 6.1.
In the early 1980s, many Canadian libraries used the GEAC 8000 to convert their
manual card catalogues to machine-readable centralized catalogue systems. Each
book catalogue had a barcode field similar to those on supermarket products.