0% found this document useful (0 votes)

2 views

SQL

The document provides an overview of SQL and its application in managing structured data within relational databases, emphasizing the importance of queries, keys, and debugging practices. It also covers data analysis techniques using Google Sheets, including data cleaning, statistical analysis, and functions for manipulating and extracting insights from datasets. Key concepts include sorting, filtering, and conditional functions, as well as the use of VLOOKUP for searching information in tables.

Uploaded by

bemamdangiu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

2 views

SQL

Uploaded by

bemamdangiu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 9

MÁY TÍNH TRONG KINH DOANH

SQL
1. Data lives in databases. Just about every company and organization
relies on some form of database to store and organize information.
TRUE

Bài học rút ra:

 SQL allows you to work with data stored in a database

 The SELECT query is used to get data from a table

1. Data that can be stored in tables is called structured data.

2. SQL makes working with structured data very fast and effective. A
single data query can access thousands and thousands of records in
the blink of an eye.
3. Real databases can contain a very high amount of information.

A query is a way to access only the data you are interested in.

4. Unstructured data is information that is difficult to store in tables.

Ex: sales table: structured; audio file: unstructured.
5. SQL stands for Structured Query Language.
With SQL you'll be able to extract data from massive datasets with
thousands of fields and records.
SQL is used to work with structured data in the form of tables.

1. The relational database is the most common type of database. The

image shows a relational database.
A relational database can contain several tables
2. The different tables in a relational database connect to each other
using fields (columns) with values in common. These fields are
called keys.
Ex: Star wars: Lucasfilm; Frozen: Walt Disney
3. A relational database stores data in tables.
What data category can a relational database handle? Structured
data.
4. Key fields are used to connect the tables in a relational database.
5. You might have used spreadsheets like Excel to work with data before.
Databases are better for storing larger and more complex collections of
data.
Working with data in 1 single table: database.
Working with data in multiple, larger tables: spreadsheet.
⭐ relational databases store information in interconnected
tables

⭐ keys connect tables in a relational database

⭐ you can query data FROM different tables in the database

REVIEW
1. A database is an organized collection of data
2. SQL is used to work with the data in a database

Debugging

1. If your SQL query contains a mistake, you’ll not be able to access the
data.
2. A schema is a visual representation of how a database is organized,
showing its tables, fields and keys. Arrows are used to show how the
different tables are related.
3. The * symbol allows you to select all the fields in a table. This way
you can avoid typos when listing field names.

⭐ Bugs in queries cause error messages

⭐ The schema of a database helps you avoid errors

⭐ SQL queries are organized in different lines so they are easier to read by
humans

REVIEW

1. The term for a request of information from a database is query

2. Error in code: bug
Information request: query
Visual representation of a database: schema
3. A relational database stores information in table
4. What connects different tables in a relational database? Common key
fields

Standards & Best Practices

1. Comments in code are Explanations for humans

2. It's a good practice to use comments in your code
3. You can use comments to temporarily disable a query or part of a
query when testing your code. This way the computer will skip the
instruction.
4. If you need to make comments with multiple lines, you can use /* … */
block comments.
5. Lowercase for field and table names
⭐ You can add comments to your code with the double hyphens (--)
⭐ You can add a block comment with /* … */
⭐ SQL is a case-insensitive language.

1. Sorting consists in putting data in a meaningful order

2. The ORDER BY command is used to sort the extracted data in the
results table.
3. Data is sorted by fields.
4. Your extracted data will be sorted in ascending order by default. If you
need the data sorted in descending order (from largest to smallest) you
need to add the DESC keyword.
5. By default, data is sorted in ascending order. You can use the explicit
ASC keyword to clarify and make your queries more readable,
particularly when writing complex queries.
⭐ You can sort extracted data with the ORDER BY command
⭐ DESC sorts data in descending order (from largest to smallest, or Z
to A)
⭐ ASC sorts data in ascending order (from smallest to largest, or A to
Z)

Limiting Data
1. The LIMIT keyword extracts a limited number of records.

GG SHEETS
Bảng tính cho phép thực hiện nhiều nhiệm vụ chính trong qtrinh phân tích dữ
liệu:

 Dọn dẹp và thao tác dữ liệu (cleaning + manipulating data)

 Phân tích thống kê (statistical analysis)
 Ptich và tạo hình ảnh trực quan (Visualizations)

Để báo cáo – tất cả đều cần rất ít hoặc k cần mã.

Phân tích dữ liệu: là quá trình trích xuất những hiểu biết có nghĩa từ dl mặc
dù mỗi dự án sẽ có mtieu riêng nhưng hầu hết đều tuân theo 1 khuôn khổ
(Data analysis: process of extracting meaningful insights from data).

Các hàm tích hợp: Các phép tính được viết sẵn có trong các công thức
(Built-in functions: Pre-written calculations that are available in formulas).

 Hàm làm tròn đến chữ số thập phân ROUND(value, [place])

o Required arguments: value (Đối số bắt buộc)
o Optional argument: place  0 by default (Đối số tùy chọn:

place mặc định 0)

Exploring data

 Characterize the data

 Identify data quality issues

Summary statistics

1. Measures of frequency (tần suất): How often dóe a value occur?

- Count:
 COUNT() : cells containing numerical data (đếm số ô trong 1 phạm vi)
o Dates
o Currencies
 COUNTA() : cells containing any data type
o Empty strings (“”) chuỗi trống
o Errors (#DIV/0!)
 COUNTBLANK() : cells
o Empty cells
o Empty strings (“”) chuỗi trống
2. Measures of center: What does a typical value look like?
 Aim to describe a “typical” value
 Mean “average”: Sum of values/Count of values
 Median: The middle number in a sorted list of values; used when
there are outliers (gtri ngoại lai, làm sai lệch k cân xứng phép tính tb)
3. Measures of spread (độ phân tán): How do values vary across the
datasets?

Identifying data quality issue

Missing data Errouneous data

COUNTBLANK() MAX()  maximum value in a range
COUNT() MIN()  Minimum value in a range
COUNTA()

Finding uniqe values

 Categorical data: can only be one of a finite number of value (Dữ liệu
phân loại: chỉ có thể là một trong số hữu hạn các giá trị)
UNIQUE(range)  find the number of unique values

Filtering

 Extract subsets of the dataset for more detailed exploration (lọc dl hữu
ích cho việc khám phá dữ liệu)
FILTER(range, condition1, [condition2,…])
range: phạm vi
condition1: dữ liệu của phạm vi
condition2: điều kiện đề cho

 Identify largest and smallest values

SORT(range, sort_column, is_ascending)
Range: có thể 1/nhiều cột
Sort_column: sắp xếp phạm vi
Is_ ascending (TRUE/FALSE): chỉ định xem muốn sắp xếp theo tt
tăng/giảm dần

Cleaning and preparing data

 80/20 rule: 80% cleaning, 20% analyzing

 A clean dataset
o Can be easily processed during analysis
o will return valid conclusions
o save more time dủing analysis

Dates and times

 Collected for measurements over time

 Continuous (liên tục) data: can take any value
 Discrete data: can take one of a finite number of categories (DL rời
rạc: có thể chứa 1 số lượng hữu hạn các phạm trù)

Trích xuất tp năm từ 1 ngày, YEAR(date)

MONTH(date)

Chuyển month  “mmm” TEXT(number, format)

WEEKDAY(date, [type])

 type: the numbering system to use

o 1 (default): Start Sunday=1
o 2: Start Monday=1
o 3: Start Monday=0

Functions to extract time components:

HOUR(time)

MINUTE(time)
SECOND(time)

Tính toán khoảng thời gian ngày

TODAY()

NOW()

 Spreadsheet is refreshed (gtri hiển thi trong ô sẽ đc cập nhật bất cứ khi
nào bảng tính đc làm mới)

DATEDIF(start_date, end_date, unit)

 End_date > start_date

 Unit: “Y”, “M”, “D”,…

Result are chopped

chèn TODAY sau start_date

Làm sạch dữ liệu văn bản

PROPER(): viết hoa- thường

LOWER(): chữ thường

UPPER(): chữ hoa

Removing whitespace:

Extra whitespace:

Leading space before text

Trailing space after text

Repeated >1 space between characters

TRIM(“ text “): xóa khoảng trắng

Combining text data:

CONCATENATE(string1, [string2,…]) dán 2 ô lại k có khoảng trắng muốn thì, “

“,

Combining text data – email addresses:

Thao tác dữ liệu văn bản

LEN(): trả về độ dài chuỗi, bằng vs chỉ số cuối cùng

SEARCH(search_for, text_to_search, [starting_at])

search_for: chuỗi cần tìm kiếm

text_to_search: vb cần tìm kiếm

starting_at (default=1): chỉ mục bắt đầu tìm kiếm, theo mặc định là vtri bắt
đầu của văn bản

LEFT(string, [number_of_characters])

RIGHT (string, [number_of_characters])

SUBSTITUTE(text_to_search, search_for, replace_with, [occurrence_number])

text_to_search: the text to search through

search_for: the string to search for

replace_with: the replacement string

occurrence_number: which occurrence should be substituted

Conditional functions and logic

Conditional functions: return diffrent results depending on criteria

IF(logical_expression, value_if_true, value_if_false)

AND(logical_expression1, [logical_expression2, …])

Returns TRUE if all logical expressions return TRUE

Conditional aggregations

COUNT()

SUM()

AVERAGE()

COUNTIF(range, criterrion)
 Criterrion (tiêu chí)
o String (chuỗi) to match ex “United Kingdom”
o Number to match ex 150
o String containing a number and comparison operator ex “>9”

Nhiều tiêu chí COUNTIFS( criterion1, [criteria_range2, criterion2,…])

SUMIF(range, criterion, [sum_range])

SUMIFS(sum_range , criteria_range1, criterion1, [criteria_range2, criterion2,

…])

AVERAGEIF(criteria_range, criterion, [average_range])

AVERAGE(average_rang, criteria_range1, criterion1, [criteria_range2,

criterion2,...])

VLOOKUP

VerticalLOOKUP: search for information in a table based on search keys

Lookup table & main table

VLOOKUP(search_key, range, index, [is_sorted])

 search_key: the value in main table to search for in the lookup table
 range: the range containing the lookup table (usually absolute
references)
 index (chỉ mục): the column index in the lookup table to return
 is_sorted: TRUE/FALSE to indicate is the lookup table is sorted

How To Become A Virtual Assistant
No ratings yet
How To Become A Virtual Assistant
4 pages
Slide Lesson 1 SQL Server Lv1
No ratings yet
Slide Lesson 1 SQL Server Lv1
31 pages
Giôùi Thieäu Veà Microsoft Access: 1. Khaùi Nieäm
No ratings yet
Giôùi Thieäu Veà Microsoft Access: 1. Khaùi Nieäm
144 pages
03 SQL Select Command
No ratings yet
03 SQL Select Command
40 pages
Bai Tap Thuc Hanh Phan 1
No ratings yet
Bai Tap Thuc Hanh Phan 1
16 pages
Postgree SQL
No ratings yet
Postgree SQL
7 pages
Fresher Training Program Relational Database Management System
No ratings yet
Fresher Training Program Relational Database Management System
71 pages
Create Table Insert Into Select Update Alter Table Delete From
No ratings yet
Create Table Insert Into Select Update Alter Table Delete From
3 pages
database
No ratings yet
database
5 pages
SQL
No ratings yet
SQL
9 pages
Copy-of-DB10rrrr
No ratings yet
Copy-of-DB10rrrr
5 pages
Data Structures and Algorithm
From Everand
Data Structures and Algorithm
Knowledge Flow
No ratings yet
Chapter 3 - SQL Statement
No ratings yet
Chapter 3 - SQL Statement
20 pages
Chapter 7 8
No ratings yet
Chapter 7 8
50 pages
SQL Interview Success From Beginner To Pro
From Everand
SQL Interview Success From Beginner To Pro
Shana
No ratings yet
CSDL 1 1
No ratings yet
CSDL 1 1
39 pages
ISA
No ratings yet
ISA
4 pages
Week 1 Introduction To Database
No ratings yet
Week 1 Introduction To Database
30 pages
Copy-of-DB12rrrr
No ratings yet
Copy-of-DB12rrrr
5 pages
Order of Execution in SQL
No ratings yet
Order of Execution in SQL
12 pages
LUYỆN TẬP TRUY VẤN SQL
No ratings yet
LUYỆN TẬP TRUY VẤN SQL
8 pages
Summ Revision XII IP
No ratings yet
Summ Revision XII IP
4 pages
List of SQL Commands
100% (1)
List of SQL Commands
5 pages
sql keys
No ratings yet
sql keys
8 pages
Self-Notes: Data Manipulation Using SQL
No ratings yet
Self-Notes: Data Manipulation Using SQL
4 pages
01 SQL DDL Commands
No ratings yet
01 SQL DDL Commands
42 pages
Lecture 09 - Read SQL Tables - For Student
No ratings yet
Lecture 09 - Read SQL Tables - For Student
29 pages
SQL+Commands
No ratings yet
SQL+Commands
13 pages
Apii 4 DB
No ratings yet
Apii 4 DB
10 pages
SQL PLSQL
No ratings yet
SQL PLSQL
25 pages
sql notes
No ratings yet
sql notes
10 pages
st1
No ratings yet
st1
8 pages
đề gki by me
No ratings yet
đề gki by me
6 pages
1 RDBMS Concepts
No ratings yet
1 RDBMS Concepts
45 pages
Chapter 9
No ratings yet
Chapter 9
3 pages
SQL-Commands-revision - Sheet (Nisha - Jha)
No ratings yet
SQL-Commands-revision - Sheet (Nisha - Jha)
1 page
SQL 1
No ratings yet
SQL 1
58 pages
Copy-of-DB13rrrr
No ratings yet
Copy-of-DB13rrrr
5 pages
SQL Alias:: To Make Selected Columns More Readable.
No ratings yet
SQL Alias:: To Make Selected Columns More Readable.
17 pages
exit sign
No ratings yet
exit sign
11 pages
Csc10006 Chapter 4 SQL II.2425
No ratings yet
Csc10006 Chapter 4 SQL II.2425
155 pages
RDBMS
No ratings yet
RDBMS
49 pages
12 - Information Practices
No ratings yet
12 - Information Practices
14 pages
Lec4 - SQL ASR
No ratings yet
Lec4 - SQL ASR
55 pages
Structured Query Language (SQL)
No ratings yet
Structured Query Language (SQL)
29 pages
Database Systems: Nguyễn Văn Diêu
No ratings yet
Database Systems: Nguyễn Văn Diêu
121 pages
My SQL Notes
No ratings yet
My SQL Notes
13 pages
Steps: Database Dbms Rdbms SQL, Sqlplus, Mysql
No ratings yet
Steps: Database Dbms Rdbms SQL, Sqlplus, Mysql
4 pages
Untitled document
No ratings yet
Untitled document
41 pages
Database
No ratings yet
Database
24 pages
Keyword PDF
No ratings yet
Keyword PDF
9 pages
Basics of SQL 1
No ratings yet
Basics of SQL 1
21 pages
Hsslive-CS-chapt-9-Structured-Query-Language
No ratings yet
Hsslive-CS-chapt-9-Structured-Query-Language
3 pages
RDBMS and DBMS Concepts
No ratings yet
RDBMS and DBMS Concepts
5 pages
Example
No ratings yet
Example
48 pages
Basic Commands of SQL
No ratings yet
Basic Commands of SQL
63 pages
Oracle 11g SQL Handbook
100% (1)
Oracle 11g SQL Handbook
91 pages
Alter Table: Table - Name ADD Column - Name Datatype
No ratings yet
Alter Table: Table - Name ADD Column - Name Datatype
5 pages
DBI Assignment Form 2024
No ratings yet
DBI Assignment Form 2024
3 pages
Oracle
No ratings yet
Oracle
103 pages
DAWP Eng
No ratings yet
DAWP Eng
187 pages
البيانات الضخمة وأثرها في عملية اتخاذ القرار
No ratings yet
البيانات الضخمة وأثرها في عملية اتخاذ القرار
16 pages
UML Use Case Diagram For Online Shopping
No ratings yet
UML Use Case Diagram For Online Shopping
1 page
Online System Based On E-Commerce Platform
100% (1)
Online System Based On E-Commerce Platform
128 pages
Capacity Planning - by ByteByteGo and Diego Ballona
100% (1)
Capacity Planning - by ByteByteGo and Diego Ballona
12 pages
Vigor 3900 CLI Guide PDF
No ratings yet
Vigor 3900 CLI Guide PDF
97 pages
XG Firewall On Ms Azure Infrastructure
No ratings yet
XG Firewall On Ms Azure Infrastructure
1 page
SQL Practical
No ratings yet
SQL Practical
6 pages
4hana 2.0
No ratings yet
4hana 2.0
49 pages
MongoDB Data Models Guide
100% (1)
MongoDB Data Models Guide
39 pages
Ebook Cybersecurity Tips For Employees
No ratings yet
Ebook Cybersecurity Tips For Employees
12 pages
aspenONE InstCtfgV8 - 8 PDF
No ratings yet
aspenONE InstCtfgV8 - 8 PDF
67 pages
State of Maine - Partner VPN Form: Contact Information
No ratings yet
State of Maine - Partner VPN Form: Contact Information
5 pages
Real-Time Machine Learning: The Missing Pieces
No ratings yet
Real-Time Machine Learning: The Missing Pieces
6 pages
Enterprise Resource Planning: ERP Demystified (Second Edition) by Alexis Leon (2008)
No ratings yet
Enterprise Resource Planning: ERP Demystified (Second Edition) by Alexis Leon (2008)
38 pages
A Day in The Life of Your Data
No ratings yet
A Day in The Life of Your Data
11 pages
Commands
No ratings yet
Commands
10 pages
ADD A TITLE SLI-WPS Office
No ratings yet
ADD A TITLE SLI-WPS Office
30 pages
Lec 4 - Network Layer - II - Inside A Router
No ratings yet
Lec 4 - Network Layer - II - Inside A Router
14 pages
Amulya DataStag Resume
No ratings yet
Amulya DataStag Resume
4 pages
BABOK 3 KA Cheat Sheet - Solution Evaluation
100% (1)
BABOK 3 KA Cheat Sheet - Solution Evaluation
1 page
Add Users-Web Image Monitor
No ratings yet
Add Users-Web Image Monitor
4 pages
L1 - SS13.R43 Fintech in Investment Management Lesson 1
No ratings yet
L1 - SS13.R43 Fintech in Investment Management Lesson 1
6 pages
01 What Is SAP BASIS - Complete Tutorial
No ratings yet
01 What Is SAP BASIS - Complete Tutorial
3 pages
0133128903
No ratings yet
0133128903
10 pages
Computer Repair (Priya Saini-20671601573, Diksha Katal-20671601583)
No ratings yet
Computer Repair (Priya Saini-20671601573, Diksha Katal-20671601583)
79 pages
Stamford University Bangladesh: Submitted To
No ratings yet
Stamford University Bangladesh: Submitted To
26 pages
Cloud Computing Program Brochure
No ratings yet
Cloud Computing Program Brochure
19 pages
06-Schema Design and Normalization
No ratings yet
06-Schema Design and Normalization
13 pages
Unit 5 PHP Question Bank
No ratings yet
Unit 5 PHP Question Bank
14 pages

SQL

Uploaded by

SQL

Uploaded by

MÁY TÍNH TRONG KINH DOANH

Bài học rút ra:

 SQL allows you to work with data stored in a database

1. Data that can be stored in tables is called structured data.

4. Unstructured data is information that is difficult to store in tables.

1. The relational database is the most common type of database. The

⭐ keys connect tables in a relational database

⭐ Bugs in queries cause error messages

⭐ The schema of a database helps you avoid errors

1. The term for a request of information from a database is query

Standards & Best Practices

1. Comments in code are Explanations for humans

1. Sorting consists in putting data in a meaningful order

 Dọn dẹp và thao tác dữ liệu (cleaning + manipulating data)

Để báo cáo – tất cả đều cần rất ít hoặc k cần mã.

 Hàm làm tròn đến chữ số thập phân ROUND(value, [place])

place mặc định 0)

 Characterize the data

1. Measures of frequency (tần suất): How often dóe a value occur?

Identifying data quality issue

Missing data Errouneous data

Finding uniqe values

 Identify largest and smallest values

Cleaning and preparing data

 80/20 rule: 80% cleaning, 20% analyzing

Dates and times

 Collected for measurements over time

Trích xuất tp năm từ 1 ngày, YEAR(date)

Chuyển month  “mmm” TEXT(number, format)

 type: the numbering system to use

Functions to extract time components:

Tính toán khoảng thời gian ngày

DATEDIF(start_date, end_date, unit)

 End_date > start_date

Result are chopped

chèn TODAY sau start_date

Làm sạch dữ liệu văn bản

PROPER(): viết hoa- thường

LOWER(): chữ thường

UPPER(): chữ hoa

Leading space before text

Trailing space after text

Repeated >1 space between characters

TRIM(“ text “): xóa khoảng trắng

Combining text data:

CONCATENATE(string1, [string2,…]) dán 2 ô lại k có khoảng trắng muốn thì, “

Combining text data – email addresses:

LEN(): trả về độ dài chuỗi, bằng vs chỉ số cuối cùng

SEARCH(search_for, text_to_search, [starting_at])

search_for: chuỗi cần tìm kiếm

text_to_search: vb cần tìm kiếm

RIGHT (string, [number_of_characters])

SUBSTITUTE(text_to_search, search_for, replace_with, [occurrence_number])

text_to_search: the text to search through

search_for: the string to search for

replace_with: the replacement string

occurrence_number: which occurrence should be substituted

Conditional functions and logic

Conditional functions: return diffrent results depending on criteria

IF(logical_expression, value_if_true, value_if_false)

AND(logical_expression1, [logical_expression2, …])

Returns TRUE if all logical expressions return TRUE

Nhiều tiêu chí COUNTIFS( criterion1, [criteria_range2, criterion2,…])

SUMIF(range, criterion, [sum_range])

SUMIFS(sum_range , criteria_range1, criterion1, [criteria_range2, criterion2,

AVERAGEIF(criteria_range, criterion, [average_range])

AVERAGE(average_rang, criteria_range1, criterion1, [criteria_range2,

VerticalLOOKUP: search for information in a table based on search keys

Lookup table & main table

VLOOKUP(search_key, range, index, [is_sorted])

You might also like