In addition to running databases in Amazon EC2, AWS customers can choose among a variety of managed database services. These services save effort, save time, and unlock new capabilities and economies. In this session, we make it easy to understand how they differ, what they have in common, and how to choose one or more. We explain the fundamentals of Amazon DynamoDB, a fully managed NoSQL database service; Amazon RDS, a relational database service in the cloud; Amazon ElastiCache, a fast, in-memory caching service in the cloud; and Amazon Redshift, a fully managed, petabyte-scale data-warehouse solution that can be surprisingly economical. We will cover how each service might help support your application, how much each service costs, and how to get started.
4. If you host your databases on-premises
Power, HVAC, net
Rack and stack
Server maintenance
OS patches
DB software patches
Database backups
Scaling
High availability
DB software installs
OS installation
you
App optimization
5. If you host your databases on-premises
Power, HVAC, net
Rack and stack
Server maintenance
OS patches
DB software patches
Database backups
Scaling
High availability
DB software installs
OS installation
you
App optimization
6. If you host your databases in Amazon EC2
Power, HVAC, net
Rack and stack
Server maintenance
OS patches
DB software patches
Database backups
Scaling
High availability
DB software installs
OS installation
you
App optimization
7. If you host your databases in Amazon EC2
OS patches
DB software patches
Database backups
Scaling
High availability
DB software installs
you
App optimization
Power, HVAC, net
Rack and stack
Server maintenance
OS installation
8. If you choose a managed DB service
Power, HVAC, net
Rack and stack
Server maintenance
OS patches
DB software patches
Database backups
App optimization
High availability
DB software installs
OS installation
you
Scaling
9. Quick summary of the options
Self-managed EC2 instances Fully managed
Corporate data
center
Database DB on EC2
instance
RDS
11. A managed service for each major DB type
Amazon
Redshift
Data
warehouse
Amazon
DynamoDB
Document
and key-
value store
Amazon
RDS
Relational
database
engines
Amazon
ElastiCache
In-memory
key-value
store
13. NoSQL vs. relational DB for a new app: How to choose?
• Schemaless, easy reads
and writes, simple data
model
• Scaling is easy
• Focus on performance and
availability at any scale
• Strong schema, complex
relationships,
transactions and joins
• Scaling is difficult
• Focus on consistency
over scale and availability
NoSQL SQL
19. Amazon Aurora: Fast, available, and MySQL-compatible
SQL
Trans-
actions
AZ 1 AZ 2 AZ 3
Caching
Amazon
S3
5x faster than MySQL on
same hardware
SysBench: 100K writes/sec
and 500K reads/sec
Designed for 99.99%
availability
6-way replicated storage
across 3 AZs
Scale to 64 TB and 15 Read
Replicas
20. Amazon RDS is simple and fast to scale
Database instance types
offer a range of CPU and
memory selections
Scale up or down among
instance types on demand
Database storage is
scalable on demand
21. Amazon RDS offers fast, predictable storage
General Purpose
(SSD) for most
workloads
Provisioned IOPS
(SSD) for OLTP
workloads up to
30,000 IOPS
Magnetic for small
workloads with
infrequent access
22. High availability Multi-AZ deployments
Enterprise-grade fault tolerance solution for
production databases
23. Choose Read Replicas for scalability and enhanced
data locality
Relieve pressure on your master
node for supporting reads and
writes
Even faster recovery in the event
of disaster
Bring data close to your
customers
Promote to a master for easy
migration
24. Choose cross-region snapshot copy for even greater
durability, ease of migration
Copy a database snapshot
to a different AWS Region
Warm standby for disaster
recovery
Base for migration to a
different region
25. How do Amazon RDS backups work?
Automated backups
Restore your database to a point in
time
Enabled by default
Choose a retention period, up to 35
days
Manual snapshots
Build a new database instance from a
snapshot when needed
Initiated by you
Persist until you delete them
Stored in Amazon S3
26. Monthly
bill
= +
Further details at http://aws.amazon.com/rds/pricing/
You pay for the resources that you use
Storage
consumed
Duration for which DB
instances were used
(Price depends on
type of storage)
(Price depends on
type of DB instance)
Free tier (for first 12 months)
• 750 micro DB instance hours
• 20 GB of DB storage
• 20 GB for backups
• 10 million I/O operations
GBN ×
30. Amazon DynamoDB: a managed document and
key-value store
• Simple and fast to deploy, easy to scale
• Data is automatically replicated
• Fast, predictable performance
• Helps provide security and control
• Fine-grained access control
• No cost to get started
• Pay only for what you consume
31. Popular use cases
Ad tech IoT Gaming
Mobile
& web
Ad serving,
retargeting, ID
lookup, user
profile
management,
session-
tracking, RTB
Tracking state,
metadata and
readings from
millions of
devices, real-
time
notifications
Recording
game details,
leaderboards,
session
information,
usage history,
and logs
Storing user
profiles,
session details,
personalization
settings, entity
specific
metadata
32. Writes
Replicated continuously to 3 AZs
Persisted to disk (custom SSD)
Reads
Strongly or eventually consistent
No latency trade-off
Automatic replication for rock-solid durability and
availability
33. Amazon DynamoDB: A schemaless database
Attributes
Schemaless
Schema is defined per item
Items
Table
Item
key
34. DynamoDB: What are capacity units?
One write capacity unit One read capacity unit
One strongly consistent
read per second up to 4KB
or
Two eventually consistent
reads per second
One write per
second up to 1KB
35. Simple app architecture with Amazon DynamoDB
Elastic Load
Balancing Amazon EC2
app instances
Clients
DynamoDB
Business logic
36. You pay for the resources that you use
Monthly
bill = GB +
Pricing varies by region. Further details at http://aws.amazon.com/dynamodb/pricing/
Storage
consumed
Write
capacity
units
(WCUs)
+
Read
capacity
units
(RCUs)
Free tier:
• Generous free tier of 25 GB, 25 WCUs, and 25 RCUs
• That is, you get over 60M read requests and 60M write request for free in a month
• The free tier is indefinite—you benefit from this every month
39. Amazon
Redshift
a lot faster
a lot cheaper
a whole lot simpler
Relational data warehouse
Massively parallel; petabyte scale
Fully managed
HDD and SSD platforms
$1,000/TB/year; starts at $0.25/hour
40. Popular use cases
10x cheaper
Easy to provision
Higher DBA productivity
Traditional
enterprises
10x faster
No programming
Easily leverage BI tools,
Hadoop, machine
learning, streaming
Companies
with big data
Analysis in-line with
process flows
Pay as you go, grow as
you need
Managed availability and
disaster recovery
SaaS
companies
41. Amazon Redshift architecture
Leader node
• Simple SQL endpoint
• Stores metadata
• Optimizes query plan
• Coordinates query execution
Compute nodes
• Local columnar storage
• Parallel/distributed execution of all
queries, loads, backups, restores,
resizes
Start at just $0.25/hour, grow to 2 PB
(compressed)
• DC1: SSD; scale 160 GB–326 TB
• DS2: HDD; scale 2 TB–2 PB
10 GigE
(HPC)
Ingestion
Backup
Restore
JDBC/ODBC
42. Amazon Redshift is fast
Dramatically less I/O
Column storage
Data compression
Zone maps
Direct-attached storage
Large data block sizes
10 | 13 | 14 | 26 |…
… | 100 | 245 | 324
375 | 393 | 417…
… 512 | 549 | 623
637 | 712 | 809 …
… | 834 | 921 | 959
10
324
375
623
637
959
ID Age State Amount
123 20 CA 500
345 25 WA 250
678 40 FL 125
957 37 WA 375
43. Fully managed, continuous/incremental backups
Multiple copies within cluster
Continuous and incremental backups
to Amazon S3
Continuous and incremental backups
across regions
Streaming restore
Amazon S3
Amazon S3
Region 1
Region 2
44. Amazon Redshift offers rock-solid fault tolerance
Amazon S3
Amazon S3
Region 1
Region 2
Disk failures
Node failures
Network failure
Availability Zone– or region-level
disasters
45. Security is built-in
• Load encrypted from S3
• SSL to secure data in transit
• Amazon VPC for network isolation
• Encryption to secure data at rest
• On-premises HSM and AWS
CloudHSM support
• SOC 1, 2, and 3; PCI-DSS;
FedRAMP; BAA
10 GigE
(HPC)
Ingestion
Backup
Restore
Customer VPC
Internal
VPC
JDBC/ODBC
46. You pay for what you use
Further details at https://aws.amazon.com/redshift/pricing/
Monthly
bill = N ×
Duration for which the
nodes were used
Number of nodes
(Price depends on type
of node)
2 month free trial
Leader node is free
No upfront costs, pay as you go
47. Amazon Redshift has a large ecosystem
Data Integration Systems IntegratorsBusiness Intelligence
54. How ElastiCache billing works
Monthly
bill = N ×
Further details at http://aws.amazon.com/elasticache/pricing/
Duration for which the
nodes were used
Number of nodes
(Price depends on type
of node)
Free tier (for first 12 months)—750 micro cache node hours
57. Benefits of AWS managed database services
Pay only for what
you use
No upfront cost
Fully managed
services
AWS handles
installs, patching,
restarts
Easy to scale
Grow as you need
Designed for use
with other AWS
services
AWS
Data Pipeline
Amazon
EC2
Amazon
S3
Amazon
CloudWatch
Amazon
SNS
Amazon
VPC