Cloud AWS - Database On AWS
Cloud AWS - Database On AWS
Database
Week 5 – Module 5
Tuan Vo
Solution Architect
Microsoft
SQL Server
Easy to administer Available and durable Highly scalable Fast and secure
No need for infrastructure Automatic Multi-AZ Scale database compute SSD storage and
provisioning or installing data replication; and storage with a guaranteed provisioned
and maintaining database automated backup, few clicks with no I/O; data encryption at
software snapshots, and failover application downtime rest and in transit
Spend time innovating & building new apps, not managing infrastructure
Automatic fail-over
Backup & recovery
Isolation & security
Schema design Industry compliance
You AWS Push-button scaling
Query construction
Query optimization Automated patching &
upgrades
Advanced monitoring
Routine maintenance
• CPU/ Memory / IOPS / • Process / Thread list • SQL / State / User / Host
Network • Per second metric storage (“Database Load”)
• Per minute metric storage in Amazon CloudWatch • Per second metric storage in
in Amazon CloudWatch Logs Amazon RDS
• Network security
• Amazon Virtual Private Cloud (VPC) security groups act as a virtual firewall to
control inbound and outbound traffic
• Data encryption
• Encryption at rest using AWS KMS or Oracle/Microsoft TDE
• SSL protection for data in transit
Application Database
servers failure Standby
Fault tolerance across
multiple data centers
• Automatic failover
New standby
• Synchronous replication Availability Zone A
• Enabled with one click
Primary
Availability Zone B
Volume
• Always incremental
• Amazon S3 à
99.999999999% durability
• Supports encryption
Bucket Snapshot 1 Snapshot 2 Snapshot 3
• Copy across accounts,
across regions
Compute
Node
SQL
Caching
Attached
Storage
Compute
Compute and storage have different Node
lifetimes SQL
Transactions
• Instances fail and may be replaced Caching
• Instances are shut down Logging
• Instances are scaled up/down
• Instances are added to cluster to scale out
Purpose-built log-structured
distributed storage
AZ 1 AZ 2 AZ 3
Amazon S3
Scale-out, distributed, multi-tenant storage architecture
AZ 1 AZ 2 AZ 3
DB Cluster
Up to 16 DB instances/nodes in a
regional cluster, spanning Writer Reader Reader
SQL SQL SQL
multiple AZs
Transactions Transactions Transactions
Caching Caching Caching
Amazon S3
Database Migration
• Simple to use
• Minimal downtime
• Supports widely used
databases
• Low cost
• Fast and easy to setup
• Reliable
Sources* Targets**
Oracle Oracle
SQL Server SQL Server
On-premises
Azure SQL PostgreSQL
database
PostgreSQL MySQL
MySQL Amazon Redshift
SAP ASE SAP ASE
MongoDB Amazon S3
Amazon S3 Amazon DynamoDB
IBM DB2 Amazon Kinesis
Amazon ElasticSearch
*
https://docs.aws.amazon.com/dms/latest/userguide/CHAP_Source.html
** https://docs.aws.amazon.com/dms/latest/userguide/CHAP_Target.html
You can use different tasks with different settings for different environments
Features
Schema conversion between database engines
Database Migration Assessment report for choosing the best target engine
Code browser that highlights places where manual edits are required
Relational Key - Value Document Wide Collumn In-memory Graph Time Series Ledger
Relational Key - Value Document Wide Collumn In-memory Graph Time Series Ledger
1K WCU or 3K RCU
T1.p1 T1.pn
up to 10 GB
© 2021, Amazon Web Services, Inc. or its Affiliates.
DynamoDB Table
Table
A1 A2 A3 A4 A5
(partition key) (sort key)
A1 A2
(partition key) (sort key)
Items
A1 A2 A6 A4 A5
(partition key) (sort key)
Keyspace
Partition B
OrderId: 2
CountryCode : 1 Hash(2) = 48
ASIN: [B00OQVZDJM]
AA
OrderId: 3 Partition C
CountryCode : 1 Hash(3) = CD
ASIN: [B00U3FPN4U]
FF
Hash.MAX = FF
AVA I L A B I L I T Y RR RR RR RR RR
ZONE 1
RR RR RR RR RR
RR RR RR RR RR
AVA I L A B I L I T Y RR RR RR RR RR
ZONE 2
RR RR RR RR RR
Network
RR RR RR RR RR
AVA I L A B I L I T Y RR RR RR RR RR
ZONE 3
RR RR RR RR RR
Data Scientist
Amazon
Amazon
Amazon Amazon
Exploration, Integration,
S3 Amazon Amazon
Kinesis Neptune S3 QuickSight SageMaker Predictive Models
Systems of
Record
Amazon Data Experts
S3 Glacier Amazon Amazon Amazon
AWS
Glue Redshift DynamoDB Athena Ad-hoc Reports,
Raw Data Create KPIs
Systems of
Engagement
AWS Amazon
AWS
Database
Amazon
Amazon
Elasticsearch
Amazon
ElastiCache Lambda API Gateway Business Users
Service
Migration Service
S3 Dashboarding,
Sensor & Move Data Prepared Data Consumable Data Insights Consumption
Use KPIs, Slice & Dice
Log Data
…
Downstream Systems
Amazon AWS Amazon Amazon Amazon Amazon Amazon Data Feeds,
External Data Athena Glue EMR Transcribe Rekognition Comprehend SageMaker Information Hub
Data Processing, Metadata Management Machine Learning
Analytical Data …
Insights Applications
Actionable Insights at
AWS AWS AWS Amazon AWS AWS
the Point of Impact
Data Sources KMS IAM CloudTrail CloudWatch CloudFormation Config
Security, Identity and Compliance Management and Governance Data and Insights Applications
Amazon RDS
On-premises data
Streaming data
Glue
Data Catalog
• Automatically discovers data and stores schema
Discover data and • Catalog makes data searchable, and available for ETL
extract schema
• Catalog contains table and job definitions
Compliance
• Serverless
Giải thưởng:
• 5 áo thun AWS cho 5 anh/chị hoàn thành bài lab nhanh nhất.
• 5 áo thun AWS được trao ngẫu nhiên cho 5 anh/chị hoàn thành bài lab.
• Phần quà cho các anh chị tìm ra lỗi trong bài lab hoặc đưa ra ý tưởng hay để cải thiện bài lab.
Lưu ý:
- Chụp màn hình AWS Console sau khi hoàn thành các bài lab và đăng vào kênh lab-week-3 trên Slack.
- Hình chụp phải bao gồm Account ID (Ở góc trên bên phải của AWS Console).
- Tài khoản Lab có thể sử dụng đến 5 giờ chiều thứ 2 tuần sau.
Lưu ý:
- Bài Lab tuỳ chọn, không cần nộp bài.