Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
0% found this document useful (0 votes)
15 views

Lecture 01 Introduction To Database PART I BACKGROUND

This document provides an overview of a database systems course. It outlines the assessments including a final exam, midterm, assignment, and attendance. It then introduces the concepts of databases and database management systems. The objectives and history of databases are discussed. Finally, the limitations of traditional file-based systems and advantages of the database approach are summarized.
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
15 views

Lecture 01 Introduction To Database PART I BACKGROUND

This document provides an overview of a database systems course. It outlines the assessments including a final exam, midterm, assignment, and attendance. It then introduces the concepts of databases and database management systems. The objectives and history of databases are discussed. Finally, the limitations of traditional file-based systems and advantages of the database approach are summarized.
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 39

DATABASE SYSTEM

DESIGN, IMPLEMENTATION, AND


MANAGEMENT.

Lecturer: Dr. Trieu Minh Nhut Le


1
Lecture 01:
INTRODUCTION TO DATABASE

2
Assessments
• Final Exam (50%) – Paper test
• Midterm Test (20%) – Paper test
• Assignment(20%) – Project report + Presentation
• Attendance (10%) – 8-out-of-10 exercises submission

3
Introduction
• This subject is an introduction about databases.
– Learn the concepts and techniques for the design and
programming of database applications with relational
database management systems
• Today, lecture is discussion about outline the syllabus of
the course.

4
TextBooks

5
TextBooks

6
TextBooks

7
TextBooks

8
TextBooks

9
TextBooks

10
Objectives
• Some common uses of database systems.
• The characteristics of file-based systems.
• The problems with the file-based approach.
• The meaning of the term ‘database’.
• The meaning of the term ‘Database Management
System’ (DBMS).
• The typical functions of a DBMS.
• The major components of the DBMS environment.
• The personnel involved in the DBMS environment.
• The history of the development of DBMSs.
• The advantages and disadvantages of DBMSs.
11
Why Study Databases?
• Databases are useful
– Many computing applications deal with large amounts of
information
– Database systems give a set of tools for storing, searching and
managing this information
• Databases in Computer Science
– Databases are a ‘core topic’ in computer science
– Basic concepts and skills with database systems are part of the
skill set you will be assumed to have as a computer science
graduate.

12
What is a Database?
• “A set of information held in a computer”
Oxford English Dictionary

• “One or more large structured sets of persistent data,


usually associated with software to update and query
the data”
Free On-Line Dictionary of Computing

• “A collection of data arranged for ease and speed of


search and retrieval”
Dictionary.com

13
The History of Database System
The history of database system research is one of exceptional
productivity and startling economic impact. Barely 20 years
old as a basic science research field, database research has
fueled an information services industry estimated at $10
billion per year in the U.S. alone. Achievements in database
research underpin fundamental advances in communications
systems, transportation and logistics, financial management,
knowledge-based systems, accessibility to scientific literature,
and a host of other civilian and defense applications. They
also serve as the foundation for considerable progress in the
basic science fields ranging from computing to biology.
(Silberschatz et al., 1996)

14
Introduction
• Purchases from the supermarket.
• Purchases using your credit card.
• Booking a holiday at the travel agents.
• Using the local library
• Taking out insurance
• Renting a video
• Using the Internet
• Studying at university
• Stock control
• Train timetables
• Airline bookings
• Student records …
15
Traditional File-Based Systems
Definition: A collection of application programs that
perform services for the end-users such as the production
of reports. Each program defines and manages its own
data.

16
Limitations of the File-Based Approach
1. Separation and isolation of data
– When data is isolated in separate files, it is more difficult to
access data that should be available
2. Duplication of data
– The decentralized approach taken by each department, the file-
based approach encouraged, if not necessitated, the
uncontrolled duplication of data
• Duplication is wasteful. It costs time and money to enter the data
more than once.
• It takes up additional storage space, again with associated costs.
• The duplication of data can be avoided by sharing data files.
• More importantly, duplication can lead to loss of data integrity,
other words, the data is no longer consistent

17
Limitations of the File-Based Approach
3. Data dependence
– The physical structure and storage of the data files and records
are defined in the application code. This means that changes to
an existing structure are difficult to make
4. Incompatible file formats
– The structure of files is embedded in the application programs,
the structures are dependent on the application programming
language
5. Fixed queries/proliferation of application programs
– File-based systems are very dependent upon the application
developer, who has to write any queries or reports. As a result,
the type of query or report that could be produced was fixed.
• No provision/providing for security or integrity
• Recovery, in the event of a hardware or software failure, was limited
or non-existent
• Access to the files was restricted to one user at a time – there was
no provision/providing for shared access.
18
Limitations of the File-Based Approach
6. Difficulty in representing data from the user's view:
– To create useful applications for the user, often data from various
files must be combined. In file processing it was difficult to
determine relationships between isolated data in order to meet
user requirements.
7. Data Security.
– The security of data is low in file based system because, the data
is maintained in the flat file(s) is easily accessible. For Example:
Consider the Banking System. The Customer Transaction file has
details about the total available balance of all customers. A
Customer wants information about his account balance. In a file
system it is difficult to give the Customer access to only his data in
the· file. Thus enforcing security constraints for the entire file or
for certain data items are difficult.
19
Limitations of the File-Based Approach
8. Transactional Problems.
– The File based system approach does not satisfy transaction
properties like Atomicity, Consistency, Isolation and Durability
properties commonly known as ACID properties.
9. Concurrency problems.
– When multiple users access the same piece of data at same
interval of time then it is called as concurrency of the system.
When two or more users read the data simultaneously there is ll(
problem, but when they like to update a file simultaneously, it
may result in a problem.
10. Poor data modeling of real world.
– The file based system is not able to represent the complex data
and interfile relationships, which results poor data modeling
properties.

20
Database Approach
All the above limitations of the file-based approach can be
attributed to two factors:
(1) The definition of the data is embedded in the application
programs, rather than being stored separately and independently
(2) There is no control over the access and manipulation of data
beyond that imposed by the application programs.

A new approach was required to cover these factors


• Database: A shared collection of logically related data, and a
description of this data, designed to meet the information
needs of an organization. In other words, the database
represents the entities, the attributes, and the logical
relationships between the entities.
21
History of Database
• First generation
– Hierarchical model
• Information Management System (IMS)
– Network model
• Conference on Data System Languages (CODASYL)
• Data Base Task Group (DBTG)
– Limitation
• Complex program for simple query
• Minimum data independence
• No theoretical foundation

22
History of Database
• Second generation
– Relational model
• E. R. Codd
• DB2, Oracle
– Limitation
• Limited data modeling
• Third generation
– Object-relational DBMS
– Object-oriented DBMS

23
The Database Management System (DBMS)
• DBMS: A software system that enables users to define,
create, maintain, and control access to the database.
– A database is a collection of information
– A database management system (DBMS) is the software
than controls that information
Examples:
– Oracle
– DB2 (IBM)
– MS SQL Server
– MS Access
– Ingres
– PostgreSQL
– MySQL 24
The Database Management System
The DBMS is the software that interacts with the users’
application programs and the database.
– It allows users to define the database, usually through a Data
Definition Language (DDL). The DDL allows users to specify the
data types and structures and the constraints on the data to be
stored in the database.
– It allows users to insert, update, delete, and retrieve data from
the database, usually through a Data Manipulation Language
(DML). Having a central repository for all data and data
descriptions allows the DML to provide a general inquiry facility to
this data, called a query language.
– It provides controlled access to the database.
• A security system prevents unauthorized users accessing the database
• An integrity system maintains the consistency of stored data
• A concurrency control system allows shared access of the database
25
Database Systems
• A database system consists of
– Data (the database)
– Software
– Hardware
– Users
• We focus mainly on the software.
• Database systems allow users to
– Store
– Update
– Retrieve
– Organise
– Protect
the data.
26
Database Users
• End users
– Use the database system to achieve some goal
• Application developers
– Write software to allow end users to interface with the
database system
• Database Administrator (DBA)
– Designs & manages the database system
• Database systems programmer
– Writes the database software itself

27
The Database Management System
DBMS (Database Management System): A software
system that enables users to define, create, maintain, and
control access to the database.

28
(Database) Application Programs
• Application program: A computer program that
interacts with the database by issuing an appropriate
request (typically an SQL statement) to the DBMS.

29
Components of the DBMS Environment

• Hardware: the hardware can range from a single personal computer, to a single
mainframe, to a network of computers.
• Software: Application programs are written in a third-generation programming
language (3GL), such as ‘C’, C++, Java, Visual Basic, COBOL, Fortran, Ada, or
Pascal or using a fourth-generation language (4GL), such as SQL, embedded in a
third generation language.
• Data: the data acts as a bridge between the machine components and the
human components.
• Procedures: the instructions and rules govern the design and use of the
database such as log on, start, and stop the DBMS, backup the database,
handle hardware or software failures, manage structure of a table, improve
performance and so on.
• People: End user, Application developers, Database Administrator (DBA),
Database systems programmer 30
Advantages and Disadvantages of DBMSs
• Advantages
– Control of data redundancy
– Data consistency
– More information from the same amount of data (with the
integration of the operational data)
– Sharing of data (the database belongs to the entire organization
and is shared by all authorized users)
– Improved data integrity Improved security (integrity
constraints)
– Enforcement of standards (integration allows the DBA to define
and enforce the necessary standards)
– Economy of scale (Combining all the organization’s operational
data into one database, and creating a set of applications that
work on this one source of data, can result in cost savings)
31
Advantages and Disadvantages of DBMSs
• Advantages
– Balance of conflicting requirements.
– Improved data accessibility and responsiveness increased
productivity (DBMSs provide a fourth-generation environment
consisting of tools to simplify the development of database
applications)
– Improved maintenance through data independence (The
provision of data independence simplifies database application
maintenance)
– Increased concurrency (DBMSs manage concurrent database
access and ensure such problems cannot occur)
– Improved backup and recovery services (DBMSs provide
facilities to minimize the amount of processing that is lost
following a failure)

32
Advantages and Disadvantages of DBMSs
• Disadvantages
– Complexity (DBMS an extremely complex piece of
software. Database designers and developers, the data
and database administrators, and end-users must
understand this functionality to take full advantage. Failure
to understand the system can lead to bad design decisions,
which can have serious consequences for an organization)
– Size (DBMS an extremely large piece of software,
occupying many megabytes of disk space and requiring
substantial amounts of memory to run efficiently)
– Cost of DBMSs (The cost of DBMSs varies significantly,
depending on the environment and functionality provided)

33
Advantages and Disadvantages of DBMSs
• Disadvantages
– Additional hardware costs (to achieve the required
performance, it is necessary to purchase a larger machine,
perhaps even a machine dedicated to running the DBMS)
– Cost of conversion (the cost of the DBMS and extra
hardware is insignificant compared with the cost of
converting existing applications to run on the new DBMS
and hardware such as training staff and DB administrator)
– Performance
– Higher impact of a failure (The centralization of resources
increases the vulnerability of the system)

34
Summary
• The Database Management System (DBMS) is now the
underlying framework of the information system and has
fundamentally changed the way that many organizations
operate. The database system remains a very active
research area and many significant problems have still to
be satisfactorily resolved.
• The DBMS was the file-based system performing services
for the end-users (reports). Each application defines and
manages its own data. The file-based system has
significant problems (data redundancy, data dependence).

35
Summary
• The database approach emerged to resolve the problems
with the file-based approach. A database is a shared
collection of logically related data, and a description of this
data, designed to meet the information needs of an
organization.
• A DBMS is a software system that enables users to define,
create, maintain, and control access to the database. An
application program is a computer program that interacts
with the database by issuing an appropriate request
(typically an SQL statement) to the DBMS. The more
inclusive term database system is used to define a collection
of application programs that interact with the database
along with the DBMS and database itself.
36
Summary
• All access to the database is through the DBMS. The DBMS
provides a Data Definition Language (DDL), which allows
users to define the database, and a Data Manipulation
Language (DML), which allows users to insert, update, delete,
and retrieve data from the database.
• The DBMS provides controlled access to the database. It
provides security, integrity, concurrency and recovery control,
& a user-accessible catalog. It also provides a view
mechanism to simplify the data that users have to deal with.
• The DBMS environment consists of hardware (the computer),
software (the DBMS, operating system, and applications
programs), data, procedures, and people. The people include
data and database administrators, database designers,
application developers, & end-users.
37
Summary
• The roots of the DBMS lie in file-based systems. The hierarchical
and CODASYL systems represent the first-generation of DBMSs.
The hierarchical model is typified by Information Management
System and the network or CODASYL model by Integrated Data
Store, both developed in the mid-1960s. The relational model,
proposed by E. F. Codd in 1970, represents the second-
generation of DBMSs. It has had a fundamental effect on the
DBMS community & there are now over one hundred relational
DBMS. The third-generation of DBMS are represented by the
Object-Relational DBMS & the Object-Oriented DBMS.
• Advantages of the database approach include control of data
redundancy, data consistency, sharing of data, and improved
security and integrity. Disadvantages include complexity, cost,
reduced performance, and higher impact of a failure.
38
Thank you for your listen
End of lecture 01

39

You might also like