L01-Introduction - What Is Database
L01-Introduction - What Is Database
Dr Renata Borovica-Gajic
Lecture 01
What are Database Systems?
• Data
– known facts stored and recorded
– can include: text, numbers, dates, plus images, sound, video,
and other complex objects
• Information
– Data presented in context (can be summarised data)
– Data that has been processed increasing the users knowledge
• Data vs Information
– Data is known and available; Information is processed and more
useful
Baker, Kenneth D. 324917628
Doyle, Joan E. 476193248
Finkle, Clive R. 548429344
Lewis, John C. 551742186
McFerran, Debra R. 409723145
Sisneros, Michael 392416582
• Can include:
– structure, rules, constraints
• Why do we need Metadata?
– Consistency
– Meaning
• We generate a data dictionary as part of the analysis of
system requirements
• Example: a university
– Entities … such as courses, students, professors
– Relationships … such as enrollment, teaching
• Program-data dependence
– If the file structure changes, so does the program
– What if you change data structure for one program
• Duplication of data
– wasteful, inefficient, loss of data integrity
• Limited data sharing
– data tied to application, hard/slow to create adhoc reports
• Lengthy development times
– application has to do low level data management, figure out file
format each time
• Excessive program maintenance
– up to 80% of development time in traditional file based
organisations is for maintenance
Enrolled
sid cid grade Students
sid name login age gpa
53666 Carnatic101 5
53666 Jones jones@cs 18 5.4
53666 Reggae203 5.5
53688 Smith smith@eecs 18 4.2
53650 Topology112 6
53666 History105 5 53650 Smith smith@math 19 4.8
• Data independence
– separation of data and program, application logic
– central data repository, central management
• Minimal data redundancy
– redundancy can be controlled (normalization)
• Improved data consistency
– single store: no disagreements, update problems, less storage space
• Improved data sharing
– data is shared, a corporate resource, not a necessity for an application
– external users can be allowed access
– multiple views of data, arbitrary views of data
• Reduced program maintenance
– data structure can change without application data changing
• Novel ad hoc data access ‘without programming’
– SQL