Lecture 4 - CS50's Introduction To Databases With SQL

CS50’s Introduction to Databases with SQL

OpenCourseWare

Donate  (https://cs50.harvard.edu/donate)

Carter Zenke (https://carterzenke.me)

carter@cs50.harvard.edu
 (https://github.com/carterzenke)  (https://www.linkedin.com/in/carterzenke/)

David J. Malan (https://cs.harvard.edu/malan/)

malan@harvard.edu
 (https://www.facebook.com/dmalan)  (https://github.com/dmalan)  (https://www.instagram.com/davidjmalan/)  (https://www.linkedin.com/in/malan/)  (https://www.reddit.com/user/davidjmalan)  (https://www.threads.net/@davidjmalan)  (https://twitter.com/davidjmalan)

Lecture 4
• Introduction
• Views
• Simplifying
◦ Questions
• Aggregating
◦ Questions
• Common Table Expression (CTE)
• Partitioning
◦ Questions
• Securing
• Soft Deletions
• Fin

Introduction
 Thus far, we have learned about concepts that allow us to design complex databases and write data
into them. Now, we will explore ways in which to obtain views from these databases.
 Let’s go back to the database containing books longlisted for the International Booker Prize. Here is a
snapshot of tables from this database.
 To find a book written by the author Han Kang, we would need to go through each of the three tables
above: first finding the author’s ID, then the corresponding book IDs, and then the book titles.
Instead, is there a way to put together related information from the three tables in a single view?
 Yes, we can use the JOIN command in SQL to combine rows from two or more tables based on a
related column between them. Here is a visual representation of how these tables could be joined in
order to line up authors and their books.

This makes it simple to observe that Han Kang authored The White Book.
 One can also imagine removing the ID columns here, such that our view looks like the following.

Views
 A view is a virtual table defined by a query.
 Say we wrote a query to join three tables, as in the previous example, and then select the relevant
columns. The new table created by this query can be saved as a view, to be further queried later on.
 Views are useful for:
 simplifying: putting together data from different tables to be queried more simply,
 aggregating: running aggregate functions, like finding the sum, and storing the results,
 partitioning: dividing data into logical pieces,
 securing: hiding columns that should be kept secure.
 While there are other ways in which views can be useful, in this lecture we will focus on the above four.

Simplifying
 Let us open up longlist.db on SQLite and run the .schema command to verify that the three
tables we saw in the previous example are created: authors , authored and books .
 To select the books written by Fernanda Melchor, we would write this nested query.

SELECT "title" FROM "books"

WHERE "id" IN (
SELECT "book_id" FROM "authored"
WHERE "author_id" = (
SELECT "id" FROM "authors"
WHERE "name" = 'Fernanda Melchor'
)
);

 The above query is complex: it nests three SELECT queries inside one another. To simplify this,
let us first use JOIN to create a view containing authors and their books.
 In a new terminal, let us connect to longlist.db again, and run the following query.

SELECT "name", "title" FROM "authors"

JOIN "authored" ON "authors"."id" = "authored"."author_id"
JOIN "books" ON "books"."id" = "authored"."book_id";

 Observe that it is important to specify how two tables are joined, or the columns they are
joined on.
 Tip: The primary key column of one table is usually joined to the corresponding foreign key
column of the other table!
 Running this will pull up a table containing all the author names next to the titles of the books
they have authored.
 To save the virtual table created in the previous step as a view, we need to change the query.

CREATE VIEW "longlist" AS

SELECT "name", "title" FROM "authors"
JOIN "authored" ON "authors"."id" = "authored"."author_id"
JOIN "books" ON "books"."id" = "authored"."book_id";

The view created here is called longlist . This view can now be used exactly as we would use a
table in SQL.
 Let us write a query to see all the data within this view.

SELECT * FROM "longlist";

 Using this view, we can considerably simplify the query needed to find the books written by
Fernanda Melchor.

SELECT "title" FROM "longlist" WHERE "name" = 'Fernanda Melchor';

 A view, being a virtual table, consumes very little additional disk space. The data within a
view is still stored in the underlying tables, but is accessible through this simplified view.
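The simplification can be checked end to end with a small sketch. It uses Python's built-in sqlite3 module and an in-memory database; the three tables are cut down to the columns used above, and the sample rows are assumptions made for illustration, not the contents of longlist.db.

```python
import sqlite3

# In-memory database with a handful of made-up sample rows
# (assumption: the real longlist.db has more columns and far more data).
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE "authors" ("id" INTEGER PRIMARY KEY, "name" TEXT);
    CREATE TABLE "books" ("id" INTEGER PRIMARY KEY, "title" TEXT);
    CREATE TABLE "authored" ("author_id" INTEGER, "book_id" INTEGER);

    INSERT INTO "authors" VALUES (1, 'Fernanda Melchor'), (2, 'Han Kang');
    INSERT INTO "books" VALUES
        (10, 'Hurricane Season'), (11, 'Paradais'), (12, 'The White Book');
    INSERT INTO "authored" VALUES (1, 10), (1, 11), (2, 12);

    CREATE VIEW "longlist" AS
    SELECT "name", "title" FROM "authors"
    JOIN "authored" ON "authors"."id" = "authored"."author_id"
    JOIN "books" ON "books"."id" = "authored"."book_id";
""")

# The nested query from the start of this section.
nested = conn.execute("""
    SELECT "title" FROM "books" WHERE "id" IN (
        SELECT "book_id" FROM "authored" WHERE "author_id" = (
            SELECT "id" FROM "authors" WHERE "name" = 'Fernanda Melchor'
        )
    ) ORDER BY "title"
""").fetchall()

# The same question asked through the view.
via_view = conn.execute("""
    SELECT "title" FROM "longlist"
    WHERE "name" = 'Fernanda Melchor' ORDER BY "title"
""").fetchall()

print(nested == via_view)  # both queries return the same rows
print(via_view)
```

Both queries return the same titles, but the second only needs to know about one "table".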
Questions
Can we manipulate views to be ordered, or displayed differently?

 Yes, we can order books in a view in much the same way as we can in a table.
 As an example, let us display the data within the longlist view, ordered by the book titles.

SELECT "name", "title"

FROM "longlist"
ORDER BY "title";

 We could also have the view itself be ordered. We can do this by including an ORDER BY clause
in the query used to create the view.

Aggregating
 In longlist.db we have a table containing individual ratings given to each book. In previous weeks,
we saw how to find the average rating of every book, rounded to 2 decimal places.

SELECT "book_id", ROUND(AVG("rating"), 2) AS "rating"

FROM "ratings"
GROUP BY "book_id";

 The results of the above query can be made more useful by displaying the title of every book, and
perhaps the year in which each book was longlisted. This information is present in the books table.

SELECT "book_id", "title", "year", ROUND(AVG("rating"), 2) AS "rating"

FROM "ratings"
JOIN "books" ON "ratings"."book_id" = "books"."id"
GROUP BY "book_id";

 Here, we use a JOIN to combine information from the ratings and books tables, joining on
the book ID column.
 Notice the order of operations in this query — in particular, the placement of the GROUP BY
operation at the end of the query after the two tables are joined.
 This aggregated data can be stored in a view.

CREATE VIEW "average_book_ratings" AS

SELECT "book_id" AS "id", "title", "year", ROUND(AVG("rating"), 2) AS "rating"
FROM "ratings"
JOIN "books" ON "ratings"."book_id" = "books"."id"
GROUP BY "book_id";

 Now, let us see the data in this view.

SELECT * FROM "average_book_ratings";

 On adding more data to the ratings table, to obtain an up-to-date aggregate, we need to simply
requery the view using a SELECT command like the above!
 Each time a view is created, it gets added to the schema. We can verify this by running .schema to
observe that longlist and average_book_ratings are now part of this database’s schema.
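A minimal sketch of both points, using Python's built-in sqlite3 module: the view's results change as soon as the underlying ratings change, and the view is recorded in the schema. The single book and its ratings are invented sample data.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE "books" ("id" INTEGER PRIMARY KEY, "title" TEXT, "year" INTEGER);
    CREATE TABLE "ratings" ("book_id" INTEGER, "rating" INTEGER);

    -- Made-up sample data: one book with two ratings.
    INSERT INTO "books" VALUES (1, 'Sample Book', 2022);
    INSERT INTO "ratings" VALUES (1, 4), (1, 5);

    CREATE VIEW "average_book_ratings" AS
    SELECT "book_id" AS "id", "title", "year", ROUND(AVG("rating"), 2) AS "rating"
    FROM "ratings"
    JOIN "books" ON "ratings"."book_id" = "books"."id"
    GROUP BY "book_id";
""")

query = 'SELECT "rating" FROM "average_book_ratings" WHERE "id" = 1'
before = conn.execute(query).fetchone()[0]  # average of 4 and 5

# Add a rating to the underlying table, then simply requery the view.
conn.execute('INSERT INTO "ratings" VALUES (1, 3)')
after = conn.execute(query).fetchone()[0]   # average of 4, 5 and 3

# Views are listed in SQLite's schema table alongside tables.
views = [row[0] for row in
         conn.execute("SELECT name FROM sqlite_master WHERE type = 'view'")]

print(before, after, views)
```

The view never stored the old average, so there is nothing to refresh: each SELECT recomputes it from the ratings table.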
 To create temporary views that are not stored in the database schema, we can use CREATE
TEMPORARY VIEW . This command creates a view that exists only for the duration of our connection
with the database.
 To find the average rating of books per year, we can use the view we already created.

SELECT "year", ROUND(AVG("rating"), 2) AS "rating"

FROM "average_book_ratings"
GROUP BY "year";

Notice that we select the rating column from average_book_ratings , which already contains the
average ratings per book. Next, we group these by year and calculate the average ratings again,
which gives us the average rating per year!
 We can store the results in a temporary view.

CREATE TEMPORARY VIEW "average_ratings_by_year" AS

SELECT "year", ROUND(AVG("rating"), 2) AS "rating" FROM "average_book_ratings"
GROUP BY "year";
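The difference in lifetime can be observed by opening two connections to the same database file. The sketch below uses Python's built-in sqlite3 module; the file location, the simplified books table, and the view names regular_view and temporary_view are all hypothetical and chosen only for this illustration.

```python
import os
import sqlite3
import tempfile

# A throwaway database file, so that two separate connections can share it.
path = os.path.join(tempfile.mkdtemp(), "demo.db")

first = sqlite3.connect(path)
first.executescript("""
    -- Simplified stand-in table with made-up ratings.
    CREATE TABLE "books" ("id" INTEGER PRIMARY KEY, "year" INTEGER, "rating" REAL);
    INSERT INTO "books" VALUES (1, 2022, 4.0), (2, 2022, 3.0), (3, 2023, 5.0);

    CREATE VIEW "regular_view" AS
    SELECT "year", ROUND(AVG("rating"), 2) AS "rating"
    FROM "books" GROUP BY "year";

    CREATE TEMPORARY VIEW "temporary_view" AS
    SELECT "year", ROUND(AVG("rating"), 2) AS "rating"
    FROM "books" GROUP BY "year";
""")
# Within the connection that created it, the temporary view works normally.
rows = first.execute('SELECT * FROM "temporary_view" ORDER BY "year"').fetchall()
first.commit()
first.close()

# A new connection still sees the regular view, but not the temporary one.
second = sqlite3.connect(path)
regular_rows = second.execute('SELECT COUNT(*) FROM "regular_view"').fetchone()[0]
try:
    second.execute('SELECT * FROM "temporary_view"')
    temporary_survived = True
except sqlite3.OperationalError:
    temporary_survived = False
second.close()

print(rows, regular_rows, temporary_survived)
```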

Questions
Can temporary views be used to test whether a query works or not?

 Yes, this is a great use case for temporary views! To generalize a little, temporary views are used
when we want to organize data in some way without actually storing that organization long-term.

Common Table Expression (CTE)

 A regular view stays in our database schema until we drop it. A temporary view exists for the duration of our
connection with the database. A CTE is a view that exists for a single query alone.
 Let us recompute the average book ratings per year using a CTE instead of a temporary
view. First, we need to drop the existing view average_book_ratings so that we can reuse its
name.

DROP VIEW "average_book_ratings";

 Next, we create a CTE containing the average ratings per book. We then use the average ratings per
book to calculate the average ratings per year, in much the same way as we did before.

WITH "average_book_ratings" AS (
SELECT "book_id", "title", "year", ROUND(AVG("rating"), 2) AS "rating" FROM "ratings"
JOIN "books" ON "ratings"."book_id" = "books"."id"
GROUP BY "book_id"
)
SELECT "year", ROUND(AVG("rating"), 2) AS "rating" FROM "average_book_ratings"
GROUP BY "year";
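To see that a CTE leaves nothing behind, the query can be run from Python's built-in sqlite3 module against a few invented rows, followed by a look at the schema. The book titles and ratings here are placeholders, not data from longlist.db.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE "books" ("id" INTEGER PRIMARY KEY, "title" TEXT, "year" INTEGER);
    CREATE TABLE "ratings" ("book_id" INTEGER, "rating" INTEGER);

    -- Made-up sample data: two books from 2022, one from 2023.
    INSERT INTO "books" VALUES (1, 'Book A', 2022), (2, 'Book B', 2022), (3, 'Book C', 2023);
    INSERT INTO "ratings" VALUES (1, 4), (1, 2), (2, 5), (3, 4);
""")

per_year = conn.execute("""
    WITH "average_book_ratings" AS (
        SELECT "book_id", "title", "year", ROUND(AVG("rating"), 2) AS "rating"
        FROM "ratings"
        JOIN "books" ON "ratings"."book_id" = "books"."id"
        GROUP BY "book_id"
    )
    SELECT "year", ROUND(AVG("rating"), 2) AS "rating"
    FROM "average_book_ratings"
    GROUP BY "year"
    ORDER BY "year"
""").fetchall()

# The CTE existed only for the query above; the schema has no such object.
leftover = conn.execute(
    "SELECT COUNT(*) FROM sqlite_master WHERE name = 'average_book_ratings'"
).fetchone()[0]

print(per_year, leftover)
```

Book A averages 3.0 and Book B averages 5.0, so 2022 comes out at 4.0. This is an average of per-book averages, exactly as in the view-based version.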

Partitioning
 Views can be used to partition data, or to break it into smaller pieces that will be useful to us or an
application. For example, the website for the International Booker Prize has a page of longlisted
books for each year the prize was awarded. However, our database stores all the longlisted books in
a single table. For the sake of creating the website, or a different purpose, it might be useful to have
a different table (or view) of books for each year.
 Let us create a view to store books longlisted in 2022.

CREATE VIEW "2022" AS

SELECT "id", "title" FROM "books"
WHERE "year" = 2022;

 We can also see the data in this view.

SELECT * FROM "2022";

Questions
Can views be updated?

 No, because views do not have any data in the way that tables do. Views actually pull data from the
underlying tables each time they are queried. This means that when an underlying table is updated,
the next time the view is queried, it will display updated data from the table!
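Both halves of this answer can be tried out with a short sketch using Python's built-in sqlite3 module and invented book rows: a direct INSERT into the view is rejected by SQLite, while an INSERT into the underlying table shows up in the view on the next query.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE "books" ("id" INTEGER PRIMARY KEY, "title" TEXT, "year" INTEGER);
    -- Made-up sample data.
    INSERT INTO "books" VALUES (1, 'Book A', 2022), (2, 'Book B', 2021);

    CREATE VIEW "2022" AS
    SELECT "id", "title" FROM "books"
    WHERE "year" = 2022;
""")

# Writing to the view directly fails: SQLite treats it as read-only.
try:
    conn.execute("""INSERT INTO "2022" ("id", "title") VALUES (3, 'Book C')""")
    insert_allowed = True
except sqlite3.OperationalError:
    insert_allowed = False

# Writing to the underlying table works, and the view reflects it.
conn.execute("""INSERT INTO "books" VALUES (3, 'Book C', 2022)""")
titles = [row[0] for row in
          conn.execute('SELECT "title" FROM "2022" ORDER BY "id"')]

print(insert_allowed, titles)
```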

Securing
 Views can be used to enhance database security by limiting access to certain data.
 Consider a rideshare company’s database with a table rides that looks like the following.

 If we were to give this data to an analyst, whose job is to find the most popular ride routes, it would
be irrelevant and, indeed, not secure to give them the names of individual riders. Rider names are
likely categorized as Personally Identifiable Information (PII), which companies are not allowed to
share indiscriminately.
 Views can be handy in this situation — we can share with the analyst a view containing the origin
and destination of rides, but not the rider names.
 To try this out, let us open rideshare.db in our terminal. Running .schema should reveal one table
called rides in this database.
 We can create a view with the relevant columns, while omitting the rider column altogether. But
we will go one step further here, and create a rider column to display an anonymous rider for each
row in the table. This will indicate to the analyst that while we have rider names in the database, the
names have been anonymized for security.
CREATE VIEW "analysis" AS
SELECT "id", "origin", "destination", 'Anonymous' AS "rider"
FROM "rides";

 We can query this view to ensure that it is secure.

SELECT * FROM "analysis";

 Although we can create a view that anonymizes data, SQLite does not allow access control. This
means that our analyst could simply query the original rides table and see all the rider names we
went to great lengths to omit in the analysis view.
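The limitation is easy to demonstrate. In this sketch, which uses Python's built-in sqlite3 module, the rides table has an assumed column layout and two invented rows. The view hides the names, yet the same connection can still read them from the table.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Assumed column layout and made-up rows, for illustration only.
    CREATE TABLE "rides" (
        "id" INTEGER PRIMARY KEY,
        "origin" TEXT,
        "destination" TEXT,
        "rider" TEXT
    );
    INSERT INTO "rides" ("origin", "destination", "rider") VALUES
        ('Station A', 'Station B', 'Alice'),
        ('Station B', 'Station C', 'Bob');

    CREATE VIEW "analysis" AS
    SELECT "id", "origin", "destination", 'Anonymous' AS "rider"
    FROM "rides";
""")

# Through the view, every rider appears as 'Anonymous'.
view_riders = {row[0] for row in conn.execute('SELECT "rider" FROM "analysis"')}

# Nothing stops the same user from querying the table directly.
table_riders = sorted(row[0] for row in conn.execute('SELECT "rider" FROM "rides"'))

print(view_riders, table_riders)
```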

Soft Deletions
 As we saw in previous weeks, a soft deletion involves marking a row as deleted instead of removing
it from the table.
 For example, a piece of art called “Farmers working at dawn” is marked as deleted from the
collections table by changing the value in the deleted column from 0 to 1.

 We can imagine creating a view to display only the art that is not deleted.
 To try this, let us open mfa.db in our terminal. The collections table does not have a deleted
column yet, so we need to add it. The default value here will be 0, to indicate that the row is not
deleted.

ALTER TABLE "collections"

ADD COLUMN "deleted" INTEGER DEFAULT 0;

 Now, let us perform a soft delete on the artwork “Farmers working at dawn”, by updating it to have 1
in the deleted column.

UPDATE "collections"
SET "deleted" = 1
WHERE "title" = 'Farmers working at dawn';

 We can create a view to display information about the rows that are not deleted.

CREATE VIEW "current_collections" AS

SELECT "id", "title", "accession_number", "acquired"
FROM "collections"
WHERE "deleted" = 0;

 We can display the data in this view to verify that “Farmers working at dawn” is not present.

SELECT * FROM "current_collections";

 On soft deletion of a row from the underlying table collections , it will be removed from the
current_collections view on any further querying.

 We already know that it is not possible to insert data into or delete data from a view. However, we
can set up a trigger that inserts into or deletes from the underlying table! The INSTEAD OF trigger
allows us to do this.

CREATE TRIGGER "delete"

INSTEAD OF DELETE ON "current_collections"
FOR EACH ROW
BEGIN
UPDATE "collections" SET "deleted" = 1
WHERE "id" = OLD."id";
END;

 Every time we try to delete rows from the view, this trigger will instead update the deleted
column of the row in the underlying table collections , thus completing the soft deletion.
 We use the keyword OLD within our update clause to indicate that the ID of the row updated
in collections should be the same as the ID of the row we are trying to delete from
current_collections .

 Now, we can delete a row from the current_collections view.

DELETE FROM "current_collections"

WHERE "title" = 'Imaginative landscape';

We can verify that this worked by querying the view.

SELECT * FROM "current_collections";
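The whole soft-deletion flow can be reproduced in a self-contained sketch with Python's built-in sqlite3 module. The collections table below is a reduced version with the deleted column already in place, and the accession numbers and acquisition dates are placeholders rather than values from mfa.db.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Reduced table with placeholder accession numbers and dates.
    CREATE TABLE "collections" (
        "id" INTEGER PRIMARY KEY,
        "title" TEXT,
        "accession_number" TEXT UNIQUE,
        "acquired" TEXT,
        "deleted" INTEGER DEFAULT 0
    );
    INSERT INTO "collections" ("title", "accession_number", "acquired") VALUES
        ('Farmers working at dawn', 'A-1', '1911-08-03'),
        ('Imaginative landscape', 'A-2', '1956-04-12');

    CREATE VIEW "current_collections" AS
    SELECT "id", "title", "accession_number", "acquired"
    FROM "collections"
    WHERE "deleted" = 0;

    CREATE TRIGGER "delete"
    INSTEAD OF DELETE ON "current_collections"
    FOR EACH ROW
    BEGIN
        UPDATE "collections" SET "deleted" = 1
        WHERE "id" = OLD."id";
    END;
""")

# Deleting from the view fires the trigger instead of raising an error.
conn.execute("""DELETE FROM "current_collections" WHERE "title" = 'Imaginative landscape'""")

visible = [row[0] for row in conn.execute('SELECT "title" FROM "current_collections"')]
flags = dict(conn.execute('SELECT "title", "deleted" FROM "collections"'))

print(visible)  # the artwork is gone from the view
print(flags)    # but the row is still in the table, marked as deleted
```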

 Similarly, we can create a trigger that inserts data into the underlying table when we try to insert it
into a view.
 There are two situations to consider here. We could be trying to insert into a view a row that already
exists in the underlying table, but was soft deleted. We can write the following trigger to handle this
situation.

CREATE TRIGGER "insert_when_exists"

INSTEAD OF INSERT ON "current_collections"
FOR EACH ROW
WHEN NEW."accession_number" IN (
SELECT "accession_number" FROM "collections"
)
BEGIN
UPDATE "collections"
SET "deleted" = 0
WHERE "accession_number" = NEW."accession_number";
END;

 The WHEN keyword is used to check if the accession number of the artwork already exists in
the collections table. This works because an accession number, as we know from previous
weeks, uniquely identifies every piece of art in this table.
 If the artwork does exist in the underlying table, we set its deleted value to 0, indicating a
reversal of the soft deletion.
 The second situation occurs when we are trying to insert a row that does not exist in the underlying
table. The following trigger handles this situation.
CREATE TRIGGER "insert_when_new"
INSTEAD OF INSERT ON "current_collections"
FOR EACH ROW
WHEN NEW."accession_number" NOT IN (
SELECT "accession_number" FROM "collections"
)
BEGIN
INSERT INTO "collections" ("title", "accession_number", "acquired")
VALUES (NEW."title", NEW."accession_number", NEW."acquired");
END;

 When the accession number of the inserted data is not already present within collections , it
inserts the row into the table.
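The two insert triggers can be exercised together in a sketch using Python's built-in sqlite3 module. The table starts with one soft-deleted artwork; the accession numbers, dates, and the second artwork's title are placeholders made up for this example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Reduced table with placeholder data; one artwork is already soft deleted.
    CREATE TABLE "collections" (
        "id" INTEGER PRIMARY KEY,
        "title" TEXT,
        "accession_number" TEXT UNIQUE,
        "acquired" TEXT,
        "deleted" INTEGER DEFAULT 0
    );
    INSERT INTO "collections" ("title", "accession_number", "acquired", "deleted")
    VALUES ('Farmers working at dawn', 'A-1', '1911-08-03', 1);

    CREATE VIEW "current_collections" AS
    SELECT "id", "title", "accession_number", "acquired"
    FROM "collections"
    WHERE "deleted" = 0;

    CREATE TRIGGER "insert_when_exists"
    INSTEAD OF INSERT ON "current_collections"
    FOR EACH ROW
    WHEN NEW."accession_number" IN (SELECT "accession_number" FROM "collections")
    BEGIN
        UPDATE "collections" SET "deleted" = 0
        WHERE "accession_number" = NEW."accession_number";
    END;

    CREATE TRIGGER "insert_when_new"
    INSTEAD OF INSERT ON "current_collections"
    FOR EACH ROW
    WHEN NEW."accession_number" NOT IN (SELECT "accession_number" FROM "collections")
    BEGIN
        INSERT INTO "collections" ("title", "accession_number", "acquired")
        VALUES (NEW."title", NEW."accession_number", NEW."acquired");
    END;
""")

insert = """INSERT INTO "current_collections" ("title", "accession_number", "acquired")
            VALUES (?, ?, ?)"""
# Existing accession number: the soft deletion is reversed.
conn.execute(insert, ("Farmers working at dawn", "A-1", "1911-08-03"))
# Unknown accession number: a new row is added to the table.
conn.execute(insert, ("Hypothetical new artwork", "A-2", "2024-01-01"))

rows = conn.execute(
    'SELECT "title", "deleted" FROM "collections" ORDER BY "id"'
).fetchall()
print(rows)
```

After both inserts the table holds two rows, each with deleted set to 0, and no duplicate of the restored artwork.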

Fin
 This brings us to the conclusion of Lecture 4 about Viewing in SQL!
