5 Years Of Building SaaS On AWS

© 2015, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Christian Beedgen
October 2015
5 Years of Building SaaS on AWS
A Story by Sumo Logic

$ whoami
Co-Founder & CTO, Sumo Logic
Cloud-based Machine Data Analytics Service
Applications, Operations, Security
Chief Architect, ArcSight
Major SIEM player in the enterprise space
Log Management for security and compliance

From Data to Decisions
DEVOPS
Streamline continuous
delivery
Monitor KPI’s and
Metrics
Accelerate
Troubleshooting
IT INFRASTRUCTURE
AND OPERATIONS
Monitor all workloads
Troubleshoot and
increase uptime
Simplify, Modernize,
and save costs
COMPLIANCE AND
SECURITY
COMPLIANCE AND
SECURITY
Automate and
demonstrate compliance
Audit all systems
Think beyond rules
Cloud Analytics Platform
DEVOPS
IT INFRASTRUCTURE
AND OPERATIONS
COMPLIANCE AND
SECURITY

Cloud Analytics Platform
From Data to Decisions
DEVOPS
IT INFRASTRUCTURE
AND OPERATIONS
COMPLIANCE AND
SECURITY
Customer A Cloud
COLLECTOR COLLECTOR
Customer A Data Center Customer B Data Center
COLLECTOR
Customer B Cloud
COLLECTOR

Why SaaS?
Because enterprise software sucks™

Why SaaS?
Too much pain for the customer
Time spent running the system is not spent using the system
Expensive when done adding hardware and people

Why SaaS?
Too much pain for the customer
Time spent running the system, not spent using the system
Expensive when done adding hardware and people
Disastrous for the vendor
No control over the runtime, hard to diagnose problems
Kills innovation because each release lives forever

Why AWS?
We are developers, not data center people
AWS has turned the data center into an API
As developers, we understand reuse (libraries, OSs, …)
Today’s systems require reuse on a higher level
Do you really want to care for 4,000 machines? HA? DR?

Anti-monolithic
In previous gigs, we dealt with monolithic systems
With Sumo, we knew what we needed to build, no MVP required
Get data into the system, index it, provide query function
So we had a logical breakdown immediately
And we knew it had to scale…
…not just to the biggest customer, but to all customers!

Ingestion Path
Receiver Bus Index
Raw
CQ
S3

Analytics Path
Query
Service
CQ
S3

Scale Today
50 TB of new incoming data per day
Double-digit PB of data under management
>2,000,000 queries/day
Thousands of instances in 4 regions globally

Divide & Conquer
3 to 1000s of instances!

Divide & Conquer
Each box in the previous images
is an application

Divide & Conquer
Blast radius, bulk-heading,
concern separation

Divide & Conquer
Not everything will break all the
time – repair engines, not plane

Divide & Conquer
Not everybody will need to work
on everything all the time

What We Actually Did
Compose applications from layers of modules
Whole system is Scala on top of the JVM
One Maven POM per module, one main() per application
Initially one GitHub repository per module, today just one project
Right size AWS instance for each application cluster
Each application exposes a façade
Avro over HTTP, or Avro over HornetQ, or Avro over Kafka

How I Actually Visualize Microservices

Deployment wide services
Ingest
Search
Internal tools
receiver
hornetq-
forge
forge
cqsplitter
search
cloud
collector
service
api
con-
cierge
stream
katta
glass,
ganglia
bill
mix
meta
conﬁg
zoo-
keeper
appvault org
raw
hornetq-
inbound
cocoa
bloom
ﬁlter
analyticscsi
cqmerger
rework
view
autoview
depman
hornetq-
internal
hornetq-
metadata
nrt
2 to the power of 5 services
(“32”), 170+ modules
Don’t even ask about the #
of dependencies
At least 3 of each –
everything is a separately
scalable cluster

Service Discovery
Loose coupling in the large…
A deployment is made up of many things
Some of these things need to talk to each other
Some of these things come and go
Don’t pass in a huge list of static dependencies
Start each application with one parameter
$ bin/receiver prod.service-registry.sumologic.com

Anti-singletenant
Multi-dimensional scaling predicates multitenancy
This is a data processing platform – cost matters!
Autoscaling single tenants is too fine-grained for us
Also, efficiency… one code line “master” in deployment
Customers aren’t pets, they are cattle 

Anti-singletenant
Multi-dimensional scaling predicates multitenancy
This is a data processing platform – cost matters!
Autoscaling single tenants is too fine-grained for us
Also, efficiency… one code line “master” in deployment
Customers aren’t pets, they are cattle 
Yum yum yum…
FEATURE FLAGS!!!

Just one typical Sumo Logic customer - 8x Variance!

Money flushed down the toilet

Money flushed down the toilet
Load per tenant fluctuates wildly, but
aggregated system load just goes up slowly

Anti-manual
We use Jenkins, of course
We still build system versions as cross-cuts and QA them
We are busy moving toward true continuous delivery
Application Groups for things that evolve together…
…and that can be deployed together

dsh: Another AWS Deployment Tool
Model-driven, describe desired state, run to make it so
High performance due to parallelization
Covers all layers of the stack – AWS, OS, Sumo Logic
Easy to use and extend, scriptable CLI
Developer-friendly, Scala-based, high-level APIs

Data Access
Layer
Delivery
Authentication &
Authorization
MeteringMonitoring
Ordering
Provisioning
Billing
Analytics
Resource Management SaaS Application(s)
Business Services Core Platform Services
Interaction
Application
AdditionalApplications
Application Lifecycle
Management
EC2
EC2
Route53
S3 Glacier
CloudFront
DynamoDB RDSElastiCache
DynamoDB
DynamoDB RedShift
WorkSpaces
CloudWatch CloudTrail
IAM
CodeDeploy
BeanstalkCloudFormationOpsWorksSWF
SWF
EMR EMR Kinesis
SNS
Mobile
Analytics
Kinesis SNS
CognitoDirectory
Service
CloudSearch
AppStream
SES SQS
SWF XCode
Data
Pipeline

Data Access
Layer
Delivery
Authentication &
Authorization
MeteringMonitoring
Ordering
Provisioning
Billing
Analytics
Interaction
Application
Management
3 ELBs (service, api, receiver)

Data Access
Layer
Delivery
Authentication &
Authorization
MeteringMonitoring
Ordering
Provisioning
Billing
Analytics
Interaction
Application
Management
EC2, obviously
RIs, dabbling with Spot

Data Access
Layer
Delivery
Authentication &
Authorization
MeteringMonitoring
Ordering
Provisioning
Billing
Analytics
Interaction
Application
Management
SES for alert emails to our
customers

Data Access
Layer
Delivery
Authentication &
Authorization
MeteringMonitoring
Ordering
Provisioning
Billing
Analytics
Interaction
Application
Management
SQS for user registration from
corporate website

Data Access
Layer
Delivery
Authentication &
Authorization
MeteringMonitoring
Ordering
Provisioning
Billing
Analytics
Interaction
Application
Management
Petabytes of S3

Data Access
Layer
Delivery
Authentication &
Authorization
MeteringMonitoring
Ordering
Provisioning
Billing
Analytics
Interaction
Application
Management
ElastiCache Memcache for
client object caches

Data Access
Layer
Delivery
Authentication &
Authorization
MeteringMonitoring
Ordering
Provisioning
Billing
Analytics
Interaction
Application
Management
DynamoDB for feature flags
and configuration

Data Access
Layer
Delivery
Authentication &
Authorization
MeteringMonitoring
Ordering
Provisioning
Billing
Analytics
Interaction
Application
Management
RDS MySQL for configuration
and content objects

Data Access
Layer
Delivery
Authentication &
Authorization
MeteringMonitoring
Ordering
Provisioning
Billing
Analytics
Interaction
Application
Management
SimpleDB for deployment
location

Data Access
Layer
Delivery
Authentication &
Authorization
MeteringMonitoring
Ordering
Provisioning
Billing
Analytics
Interaction
Application
Management Sumo Logic 

Data Access
Layer
Delivery
Authentication &
Authorization
MeteringMonitoring
Ordering
Provisioning
Billing
Analytics
Interaction
Application
Management Sumo Logic 
CloudWatch, CloudTrail

Data Access
Layer
Delivery
Authentication &
Authorization
MeteringMonitoring
Ordering
Provisioning
Billing
Analytics
Interaction
Application
Management
Sumo Logic!

Data Access
Layer
Delivery
Authentication &
Authorization
MeteringMonitoring
Ordering
Provisioning
Billing
Analytics
Interaction
Application
Management
Zuora for billing

Data Access
Layer
Delivery
Authentication &
Authorization
MeteringMonitoring
Ordering
Provisioning
Billing
Analytics
Interaction
Application
Management Jenkins, GitHub

Data Access
Layer
Delivery
Authentication &
Authorization
MeteringMonitoring
Ordering
Provisioning
Billing
Analytics
Interaction
Application
Management
Our own automation
framework – “dsh”

Data Access
Layer
Delivery
Authentication &
Authorization
MeteringMonitoring
Ordering
Provisioning
Billing
Analytics
Interaction
Application
Management
CloudFormation for Mesos
cluster setup

Integrations
Generic S3 Collection
Amazon S3 Audit
Elastic Load Balancing
Amazon CloudFront
AWS CloudTrail
Amazon VPC Flow Logs
AWS Config

What Does the Future Hold?
Super happy to see Amazon EFS introduced
Borderline unnaturally excited about AWS KMS
Planning on using AWS Lambda as a “plugin system”
Implementing Mesos for new services
Very excited about Docker to enable better utilization

5 Years Of Building SaaS On AWS

Recommandé

Recommandé

Contenu connexe

Tendances

Tendances (20)

En vedette

En vedette (20)

Similaire à 5 Years Of Building SaaS On AWS

Similaire à 5 Years Of Building SaaS On AWS (20)

Dernier

Dernier (20)

5 Years Of Building SaaS On AWS

Notes de l'éditeur