DBMS
DBMS
A necessity for almost any enterprise to carry out its business. Consists of raw facts, and when organized may be transformed into information Information is data that has been processed for use.
What is Knowledge?
Knowledge is defined variously as Expertise, and skills acquired by a person through experience or education; the theoretical or practical understanding of a subject,
Knowledge Management
Data, Information & Knowledge Management The Management comprises a range of practices used by organizations to identify, create, represent, and distribute knowledge. The Management activities are: 1. Gathering 2. Organizing 3. Sharing 4. Analyzing & 5. Disseminating Information to improve systems structure.
Knowledge Management
There are two types of data, information &knowledge about the subject: What we know and what we do not know about the subject, Where to find the information about the subject. Data, information & Knowledge is accumulated and kept at/in 1. Peoples mind, 2. On paper notes, 3. Discussion transcripts, & 4. Places that are not readily available with the individuals.
Knowledge Management
The D.I.K. Management puts Procedures and Technologies for the following: Transfer individual data, information and knowledge into database - Gathering - Organizing takes place - Structure of DB is created if not there.
6
Knowledge Management
Recognize most relevant data, information and knowledge for decision making, depending upon the criteria. - Analyzing & Sharing of DB, Processing the Query Allow individuals to easily access the information as and where required. - Sharing of DB & Disseminating the report.
7
Other Concepts
Entity Anything about which information can be stored for ex. Person, concept, physical object.. Any real physical object about which information is stored. Group of similar objects identified by their properties is called Entity Set. Attribute Characteristic of interest about an entity. The values of the attributes describe a particular entity.
8
Other Concepts
Record Representation of Entity with its attributes. File Documentation and arrangement of Records according to a definite structure. Schema It is the structure of a DB system, described in a formal language supported by the DBMS. E.g.: In a RDBMS, the schema defines the tables, the fields, and the relationships between fields and tables.
Other Concepts
Database Collection of data designed to be used by different people . Data Dictionary System database that contain descriptions of the data min the database (metadata). It contains information about data, relationships, constraints and the entire schema that organize these features into a unified database. The data catalog can be queried to get information about the structure of the database.
10
The Traditional File Approach Separate files are created and stored for each
application program. Very common from the 50s through the 80s.
11
13
Reports
17
Components of DBMS
Structure Storage Manager - File Manager - Buffer Manager Data Manager Disk Manager Query Processor Data Dictionary Access Aids - Indexes Facilities Users Data Definition End Users Language - Nave Users (DDL) - Stand Alone Data - Sophisticated Manipulation - Casual Language Database (DML) Administrators Database Designers
20
Components of DBMS
DBMS Facilities Data Definition Language (DDL): Defines the basic structure of a database. Create Table structure Alter and delete Table. Data Manipulation Language (DML): Performs a query through the database. Insert, modify & delete the data. Defines procedures for working with the database.
24
Structure of a DBMS
The cylindrical component contains not only data, but also metadata, i.e. info about the structure of data. If DBMS is relational, metadata includes: names of relations, names of attributes of those relations, and data types for those attributes (e.g., integer or character string). A database also maintains indexes for the data. Indexes are part of the stored data. Description of which attributes have indexes is part of the metadata.
25
26
Storage Manager
The job of the Storage Manager is to obtain data from the data storage, and modify the data to the data storage when requested. Storage Manager has two components: File Manager handles files. Keeps track of the location of files Obtains block(s) of a file on request from the buffer manager. Buffer Manager handles main memory. Obtains and returns blocks of data from/to the file manager Stores blocks temporarily in main memory pages.
27
Query Processor
Query Processor handles: queries and modifications to the data. Finds the best way to carry out a requested operation and Issues commands to the storage manager which will carry them out. E.g. A bank has a DB with two relat.: Customers (name, SIN, address), Accounts (accountNo, balance, SIN) Query: Find the balances of all accounts of which Sally is the owner. SELECT Accounts.balance FROM Customers, Accounts WHERE Customers.SIN = Accounts.SIN AND Customers.name = 'Sally';
28
Transaction Manager
Transaction Manager is responsible for the integrity of the system. It must assure that: several queries running simultaneously do not interfere with each other and that, the system will not lose data even if there is a power failure. Transaction Manager interacts with: Query Manager, Because it may need to delay certain query operations to avoid conflicts. Storage Manager Because schemes for protecting data involve storing a log of changes to the data.
29
31
cost of purchasing and operating a DBMS in a mainframe operating environment. This applies to mainframes, and some minicomputers only. Increased cost of specialized staff. Increased vulnerability.
32
DATA WAREHOUSE
It is a collection of data designed to support management in the decision-making process. It is a Subject-Oriented, Integrated, Time-Variant, Non -Updatable collection of data used in support of management decision making processes and business intelligence.
33
DATA WAREHOUSE
Characteristics 1. Subject-Oriented Data are stored with a subject orientation that facilitates multiple views for data and decision making. E.g.: sales, products, sales by products, etc. 2. Integrated Provides a unified view of all data elements with a common definition and representation for all departments.
34
DATA WAREHOUSE
3. Time-Variant Data are historic in nature. A dimension is added to facilitate data analysis and time corporations. 4. Non-Updatable Data are changed, but, are only added periodically from operational systems. Once data are stored, no changes are allowed.
35
DATA MINING
Data Mining is the principle of sorting through large amounts of data and picking out relevant information. It is usually used by business intelligence organizations, and financial analysts, but it is increasingly used in the sciences to extract information from the enormous data sets generated by modern experimental and observational methods. It has been described as "the non-trivial extraction of implicit, previously unknown, and potentially useful information from data" and "the science of extracting useful information from large data sets or databases".
36
37
39