Unit 3-Notes
Unit 3-Notes
Unit 3-Notes
UNIT – III
SQL
Databases
A database is structured collection of data. Databases may be stored on a computer and examined
using a program. These programs are often called ‘databases’ and examined using a program.
These programs are often called ‘databases’, but more strictly are database management systems
(DMS). There are many ways that a printed catalogue can be organized, there are many ways, or
models, by which a computerized database may be organized. One of the most common and
powerful models is the relational model and programs which use this model are known as
relational database management systems (RDMS).
What is SQL?
Database Tables
A database most often contains one or more tables. Each table is identified by a name
(e.g. "Customers" or "Orders"). Tables contain records (rows) with data.
The table above contains three records (one for each person) and five columns (P_Id, LastName,
FirstName, Address, and City).
SQL Statements
Most of the actions you need to perform on a database are done with SQL statements.
With SQL, we can query a database and have a result set returned.
The following SQL statement will select all the records in the "Persons" table:
In this tutorial we will teach you all about the different SQL statements.
Some database systems require a semicolon at the end of each SQL statement.
SQL can be divided into two parts: The Data Manipulation Language (DML) and the Data
Definition Language (DDL).
The query and update commands form the DML part of SQL:
The DDL part of SQL permits database tables to be created or deleted. It also define indexes
(keys), specify links between tables, and impose constraints between tables. The most important
DDL statements in SQL are:
and
Now we want to select the content of the columns named "LastName" and "FirstName" from
the table above.
We use the following SELECT statement:
LastName FirstName
Kumari Mounitha
Kumar Pranav
Gubbi Sharan
SELECT * Example
Now we want to select all the columns from the "Persons" table.
The DISTINCT keyword can be used to return only distinct (different) values.
The WHERE clause is used to extract only those records that fulfill a specified criterion.
Now we want to select only the persons living in the city "Bangalore" from the table above.
SQL uses single quotes around text values (most database systems will also accept double
quotes).
Numeric values should not be enclosed in quotes.
For text values:
This is correct:
SELECT * FROM Persons WHERE FirstName= 'Pranav' ;
This is wrong:
SELECT * FROM Persons WHERE FirstName= Pranav ;
This is correct:
SELECT * FROM Persons WHERE Year = 1965 ;
This is wrong:
SELECT * FROM Persons WHERE Yea r= '1965' ;
The AND & OR operators are used to filter records based on more than one condition.
OR Operator Example
Now we want to select only the persons with the first name equal to "Pranav" OR the first name
equal to "Mounitha":
If you want to sort the records in a descending order, you can use the DESC keyword.
The first form doesn't specify the column names where the data will be inserted, only their values:
INSERTINTO table_name
VALUES (value1, value2, value3,...)
The second form specifies both the column names and the values to be inserted:
INSERTINTO Persons
VALUES (4,'Nilsen', 'Johan', 'Bakken 2', 'Tumkur')
The following SQL statement will add a new row, but only add data in the "P_Id", "LastName"
and the "FirstName" columns:
UPDATE table_name
SET column1=value, column2=value2
WHERE some_column=some_value
Note: Notice the WHERE clause in the UPDATE syntax. The WHERE clause specifies which
record or records that should be updated. If you omit the WHERE clause, all records will be
updated!
UPDATE Persons
SET Address='Nissestien 67', City='Bangalore'
WHERE LastName='Tjessem' AND FirstName='Jakob'
Be careful when updating records. If we had omitted the WHERE clause in the example above,
like this:
UPDATE Persons
SET Address='Nissestien 67', City='Bangalore'
Note: Notice the WHERE clause in the DELETE syntax. The WHERE clause specifies which
record or records that should be deleted. If you omit the WHERE clause, all records will be
deleted!
Now we want to delete the person "Tjessem, Jakob" in the "Persons" table.
It is possible to delete all rows in a table without deleting the table. This means that the
table structure, attributes, and indexes will be intact:
Note: Be very careful when deleting records. You cannot undo this statement!
The TOP clause can be very useful on large tables with thousands of records. Returning a
large number of records can impact on performance.
MySQL Syntax
SELECT column_name(s)
FROM table_name
LIMIT number
Oracle Syntax
SELECT column_name(s)
FROM table_name
WHERE ROWNUM <= number
Now we want to select only the two first records in the table above.
Now we want to select only 50% of the records in the table above.
The LIKE operator is used in a WHERE clause to search for a specified pattern in
a column.
Now we want to select the persons living in a city that starts with "B" from the table above.
The "%" sign can be used to define wildcards (missing letters in the pattern) both before and
after the pattern.
Next, we want to select the persons living in a city that ends with an "r" from the "Persons"
table.
Next, we want to select the persons living in a city that contains the pattern "mk" from
the "Persons" table.
It is also possible to select the persons living in a city that NOT contains the pattern "mk"
from the "Persons" table, by using the NOT keyword.
SQL Wildcards
SQL Wildcards
SQL wildcards can substitute for one or more characters when searching for data in a database.
Wildcard Description
% A substitute for zero or more characters
_ A substitute for exactly one character
[charlist] Any single character in charlist
[^charlist] Any single character not in charlist
or
[!charlist]
Now we want to select the persons living in a city that starts with "sa" from the "Persons" table.
Now we want to select the persons with a first name that starts with any character, followed
by "ri" from the "Persons" table. We use the following SELECT statement:
SELECT * FROM Persons WHERE FirstName LIKE '_ri'
Next, we want to select the persons with a last name that starts with "P", followed by any
character, followed by "an", followed by any character, followed by "v" from the "Persons"
table.
Now we want to select the persons with a first name that starts with "b" or "s" or "p"
from the "Persons" table.
Next, we want to select the persons with a last name that do not start with "b" or "s" or "p"
from the "Persons" table.
SQL IN Operator
The IN Operator
SQL IN Syntax
SELECT column_name(s)
FROM table_name
WHERE column_name IN (value1,value2,...)
IN Operator Example
Now we want to select the persons with a last name equal to "Kumari" or "Gubbi" from the
table above.
The BETWEEN operator is used in a WHERE clause to select a range of data between
two values. The values can be numbers, text, or dates.
Now we want to select the persons with a last name alphabetically between "Kumari" and
"Gubbi" from the table above.
In some databases a person with the LastName of "Kumari" or "Gubbi" will not be listed
(BETWEEN only selects fields that are between and excluding the test values).
In other databases a person with the last name of "Kumari" or "Gubbi" will be listed
(BETWEEN selects fields that are between and including the test values).
And in other databases a person with the last name of "Kumari" will be listed, but "Gubbi" will
not be listed (BETWEEN selects fields between the test values, including the first test value
and excluding the last test value).
Example 2
To display the persons outside the range in the previous example, use NOT BETWEEN:
SQL Alias
SQL Alias
You can give a table or a column another name by using an alias. This can be a good thing to
do if you have very long or complex table names or column names.
Alias Example
Assume we have a table called "Persons" and another table called "Product_Orders". We will
give the table aliases of "p" an "po" respectively.
Now we want to list all the orders that "Mounitha Kumari" is responsible for.
As you'll see from the two SELECT statements above; aliases can make queries easier to both
write and to read.
SQL Joins
SQL joins are used to query data from two or more tables, based on a relationship
between certain columns in these tables.
SQL JOIN
The JOIN keyword is used in an SQL statement to query data from two or more tables, based
on a relationship between certain columns in these tables.
A primary key is a column (or a combination of columns) with a unique value for each row.
Each primary key value must be unique within the table. The purpose is to bind data together,
across tables, without repeating all of the data in every table.
Note that the "P_Id" column is the primary key in the "Persons" table. This means that no two
rows can have the same P_Id. The P_Id distinguishes two persons even if they have the same
name.
Note that the "O_Id" column is the primary key in the "Orders" table and that the "P_Id"
column refers to the persons in the "Persons" table without using their names.
Notice that the relationship between the two tables above is the "P_Id" column.
• JOIN: Return rows when there is at least one match in both tables
• LEFT JOIN: Return all rows from the left table, even if there are no matches in the
right table
• RIGHT JOIN: Return all rows from the right table, even if there are no matches in the
left table
• FULL JOIN: Return rows when there is a match in one of the tables
The INNER JOIN keyword return rows when there is at least one match in both tables.
The INNER JOIN keyword return rows when there is at least one match in both tables. If there
are rows in "Persons" that do not have matches in "Orders", those rows will NOT be listed.
Now we want to list all the persons and their orders - if any, from the tables above
The LEFT JOIN keyword returns all the rows from the left table (Persons), even if there
are no matches in the right table (Orders).
2 44678 3
3 22456 1
4 24562 1
5 34764 15
Now we want to list all the orders with containing persons - if any, from the tables above.
The RIGHT JOIN keyword returns all the rows from the right table (Orders), even if there
are no matches in the left table (Persons
The FULL JOIN keyword return rows when there is a match in one of the tables.
Now we want to list all the persons and their orders, and all the orders with their persons.
The FULL JOIN keyword returns all the rows from the left table (Persons), and all the rows
from the right table (Orders). If there are rows in "Persons" that do not have matches in
"Orders", or if there are rows in "Orders" that do not have matches in "Persons", those rows will
be listed as well.
The UNION operator is used to combine the result-set of two or more SELECT statements.
Notice that each SELECT statement within the UNION must have the same number of columns.
The columns must also have similar data types. Also, the columns in each SELECT statement
must be in the same order.
Note: The UNION operator selects only distinct values by default. To allow duplicate values,
use UNION ALL.
PS: The column names in the result-set of a UNION are always equal to the column names in
the first SELECT statement in the UNION.
"Employees_India":
E_ID E_Name
01 Kumari, Mounitha
02 Kumar, Pranav
03 Kumar, Stephen
04 Gubbi, Sharan
"Employees_USA":
E_ID E_Name
01 Turner, Sally
02 Kent, Clark
03 Kumar, Stephen
04 Scott, Stephen
Now we want to list all the different employees in Norway and USA.
Note: This command cannot be used to list all employees in India and USA. In the example
above we have two employees with equal names, and only one of them will be listed. The
UNION command selects only distinct values.
Result
E_Name
Kumari, Mounitha
Kumar, Pranav
Kumar, Stephen
Gubbi, Sharan
Turner, Sally
Kent, Clark
Kumar, Stephen
Scott, Stephen
The SQL SELECT INTO statement can be used to create backup copies of tables.
The SELECT INTO statement selects data from one table and inserts it into a different table.
The SELECT INTO statement is most often used to create backup copies of tables.
SELECT *
INTO new_table_name [IN externaldatabase]
FROM old_tablename
Or we can select only the columns we want into the new table:
SELECT column_name(s)
INTO new_table_name [IN externaldatabase]
FROM old_tablename
SQL SELECT INTO Example
Make a Backup Copy - Now we want to make an exact copy of the data in our "Persons"
table.
SELECT *
INTO Persons_Backup
FROM Persons
We can also use the IN clause to copy the table into another database:
SELECT *
INTO Persons_Backup IN 'Backup.mdb'
FROM Persons
We can also copy only a few fields into the new table:
SELECT LastName,FirstName
INTO Persons_Backup
FROM Persons
The following SQL statement creates a "Persons_Backup" table with only the persons who
lives in the city "Bangalore":
SELECT LastName,Firstname
INTO Persons_Backup
FROM Persons
WHERE City='Bangalore'
The following example creates a "Persons_Order_Backup" table contains data from the two
tables "Persons" and "Orders":
SELECT Persons.LastName,Orders.OrderNo
INTO Persons_Order_Backup
FROM Persons
INNER JOIN Orders
ON Persons.P_Id=Orders.P_Id
The data type specifies what type of data the column can hold. For a complete reference of all the
data types available in MS Access, MySQL, and SQL Server, go to our complete Data Types
reference.
Now we want to create a table called "Persons" that contains five columns: P_Id, LastName,
FirstName, Address, and City.
CREATE TABLEPersons
(
P_Id int,
LastName varchar(255),
FirstName varchar(255),
Address varchar(255),
City varchar(255)
)
The P_Id column is of type int and will hold a number. The LastName, FirstName, Address, and
City columns are of type varchar with a maximum length of 255 characters.
The empty table can be filled with data with the INSERT INTO statement
SQL Constraints
Constraints are used to limit the type of data that can go into a table.
Constraints can be specified when a table is created (with the CREATE TABLE statement) or
after the table is created (with the ALTER TABLE statement).
• NOT NULL
• UNIQUE
• PRIMARY KEY
• FOREIGN KEY
• CHECK
• DEFAULT
The NOT NULL constraint enforces a column to NOT accept NULL values.
The NOT NULL constraint enforces a field to always contain a value. This means that you
cannot insert a new record, or update a record without adding a value to this field.
The following SQL enforces the "P_Id" column and the "LastName" column to not accept
NULL values:
CREATE TABLEPersons
(
P_Id int NOT NULL,
LastName varchar(255) NOT NULL,
FirstName varchar(255),
Address varchar(255),
City varchar(255)
)
SQL UNIQUE Constraint
The UNIQUE and PRIMARY KEY constraints both provide a guarantee for uniqueness for a
column or set of columns.
Note that you can have have many UNIQUE constraints per table, but only one PRIMARY
KEY constraint per table.
The following SQL creates a UNIQUE constraint on the "P_Id" column when the "Persons"
table is created:
The PRIMARY KEY constraint uniquely identifies each record in a database table.
Each table should have a primary key, and each table can have only one primary key.
The following SQL creates a PRIMARY KEY on the "P_Id" column when the "Persons" table is
created:
CREATE TABLEPersons
(
P_Id int NOT NULL,
LastName varchar(255) NOT NULL,
FirstName varchar(255),
Address varchar(255),
City varchar(255),
PRIMARY KEY (P_Id)
)
To allow naming of a PRIMARY KEY constraint, and for defining a PRIMARY KEY
constraint on multiple columns, use the following SQL syntax:
MySQL:
Let's illustrate the foreign key with an example. Look at the following two tables:
Note that the "P_Id" column in the "Orders" table points to the "P_Id" column in the
"Persons" table.
The "P_Id" column in the "Persons" table is the PRIMARY KEY in the "Persons" table.
The "P_Id" column in the "Orders" table is a FOREIGN KEY in the "Orders" table.
The FOREIGN KEY constraint is used to prevent actions that would destroy link between tables.
The FOREIGN KEY constraint also prevents that invalid data is inserted into the foreign key
column, because it has to be one of the values contained in the table it points to.
The following SQL creates a FOREIGN KEY on the "P_Id" column when the "Orders" table is
created:
MySQL:
CREATE TABLEOrders
(
O_Id int NOT
NULL, OrderNo
int NOT NULL,
P_Id int,
PRIMARY KEY (O_Id),
FOREIGN KEY (P_Id) REFERENCES Persons(P_Id)
)
To allow naming of a FOREIGN KEY constraint, and for defining a FOREIGN KEY constraint
on
What if we only want to delete the data inside the table, and not the table itself?
The ALTER TABLE statement is used to add, delete, or modify columns in an existing table.
To delete a column in a table, use the following syntax (notice that some database systems don't
allow deleting a column):
To change the data type of a column in a table, use the following syntax:
ALTERTABLEtable_name
ALTER COLUMN column_name datatype
Notice that the new column, "DateOfBirth", is of type date and is going to hold a date. The data
type specifies what type of data the column can hold. For a complete reference of all the data
types available in MS Access, MySQL, and SQL Server, go to our complete Data Types
reference.
Now we want to change the data type of the column named "DateOfBirth" in the "Persons" table.
Notice that the "DateOfBirth" column is now of type year and is going to hold a year in a two-
digit or four-digit format.
Next, we want to delete the column named "DateOfBirth" in the "Persons" table.
SQL Views
A view contains rows and columns, just like a real table. The fields in a view are fields from
one or more real tables in the database.
You can add SQL functions, WHERE, and JOIN statements to a view and present the data as if
the data were coming from one single table.
SQL CREATE VIEW Syntax
Note: A view always shows up-to-date data! The database engine recreates the data, using
the view's SQL statement, every time a user queries a view.
If you have the Northwind database you can see that it has several views installed by default.
The view "Current Product List" lists all active products (products that are not discontinued)
from the "Products" table. The view is created with the following SQL:
Another view in the Northwind sample database selects every product in the "Products" table
with a unit price higher than the average unit price:
Another view in the Northwind database calculates the total sale for each category in 1997.
Note that this view selects its data from another view called "Product Sales for 1997":
CREATE VIEW [Category Sales For 1997] AS
SELECT DISTINCT CategoryName,Sum(ProductSales) AS CategorySales
FROM [Product Sales for 1997]
GROUP BY CategoryName
We can also add a condition to the query. Now we want to see the total sale only for the
category "Beverages":
Now we want to add the "Category" column to the "Current Product List" view. We will
update the view with the following SQL:
The most difficult part when working with dates is to be sure that the format of the date you
are trying to insert, matches the format of the date column in the database.
As long as your data contains only the date portion, your queries will work as expected.
However, if a time portion is involved, it gets complicated.
Before talking about the complications of querying for dates, we will look at the most
important built-in functions for working with dates.
SQL aggregate functions return a single value, calculated from values in a column.
SQL scalar functions return a single value, based on the input value.
Tip: The aggregate functions and the scalar functions will be explained in details in the
next chapters.
Now we want to find the customers that have an OrderPrice value higher than the average OrderPrice
value.
We use the following SQL statement:
The COUNT() function returns the number of rows that matches a specified criteria.
The COUNT(column_name) function returns the number of values (NULL values will not be
counted) of the specified column:
The COUNT(DISTINCT column_name) function returns the number of distinct values of the specified
column:
Note: COUNT(DISTINCT) works with ORACLE and Microsoft SQL Server, but not with
Microsoft Access.
The result of the SQL statement above will be 2, because the customer Nilsen has made 2 orders in
total:
CustomerNilsen
2
SQL COUNT(*) Example
NumberOfOrders
6 which is the total number of rows in the table.
Now we want to count the number of unique customers in the "Orders" table.
The MAX() function returns the largest value of the selected column.
The MIN() function returns the smallest value of the selected column.
SmallestOrderPrice
100
The GROUP BY statement is used in conjunction with the aggregate functions to group the
result-set by one or more columns.
Now we want to find the total sum (total order) of each customer.
Customer SUM(OrderPrice)
Kumari 2000
Nilsen 1700
Jensen 2000
Customer SUM(OrderPrice)
Kumari 5700
Nilsen 5700
Kumari 5700
Kumari 5700
Jensen 5700
Nilsen 5700
Explanation of why the above SELECT statement cannot be used: The SELECT statement
above has two columns specified (Customer and SUM(OrderPrice). The "SUM(OrderPrice)"
returns a single value (that is the total sum of the "OrderPrice" column), while "Customer"
returns 6 values (one value for each row in the "Orders" table). This will therefore not give us the
correct result. However, you have seen that the GROUP BY statement solves this problem.
We can also use the GROUP BY statement on more than one column, like this:
The HAVING clause was added to SQL because the WHERE keyword could not be
used with aggregate functions.
Now we want to find if any of the customers have a total order of less than 2000.
Customer SUM(OrderPrice)
Nilsen 1700
Now we want to find if the customers "Kumari" or "Jensen" have a total order of more than 1500.
Now we want to select the content of the "LastName" and "FirstName" columns above, and
convert the "LastName" column to uppercase.
LastName FirstName
KUMARI Mounitha
KUMAR Pranav
GUBBI Sharan
Now we want to select the content of the "LastName" and "FirstName" columns above, and
convert the "LastName" column to lowercase.
LastName FirstName
Kumari Mounitha
Kumar Pranav
Gubbi Sharan
Parameter Description
column_name Required. The field to extract characters from.
start Required. Specifies the starting position (starts at 1).
length Optional. The number of characters to return. If omitted, the MID() function
returns the rest of the text.
Now we want to extract the first four characters of the "City" column above.
SmallCity
Bang
Bang
Tumk
The LEN() function returns the length of the value in a text field.
Now we want to select the length of the values in the "Address" column above.
LengthOfAddress
5
8
6
The ROUND() function is used to round a numeric field to the number of decimals specified.
Parameter Description
column_name Required. The field to round.
decimals Required. Specifies the number of decimals to be returned.
Now we want to display the product name and the price rounded to the nearest integer.
The NOW() function returns the current system date and time.
Now we want to display the products and prices per today's date.
Parameter Description
column_name Required. The field to be formatted.
format Required. Specifies the format.
Now we want to display the products and prices per today's date (with today's date displayed in
the following format "YYYY-MM-DD").
or
SELECT column_name AS
AS (alias) column_alias
FROM table_name
or
SELECT column_name
FROM table_name AS table_alias
or
CREATE UNIQUE INDEX
index_name
ON table_name (column_name)
or
DELETE FROM table_name
(Note: Deletes the entire table!!)
DROP DATABASE
IN SELECT column_name(s)
FROM table_name
WHERE column_name
IN (value1,value2,..)
or
ON table_name1.column_name=table_name2.column_name
SELECT * SELECT *
FROM table_name
or
SELECT column_name(s)
INTO new_table_name [IN externaldatabase]
FROM old_table_name