Start Counting - How we unlocked platform efficiency and reliability, while saving over $730,000
David Wu, Ph.D - Senior Staff Solutions Architect - VMware Tanzu Labs
David Filippelli - Lead Site Reliability Engineer - Manulife
Alvin Kwame Coch - Senior Site Reliability Engineer - Manulife
2nd September 2021
$1 Million
Content
- PCF @ Manulife
- Efficiency vs Reliability
- Improving Efficiency & Reliability
- Tallying The Savings
- References
- Acknowledgements
- Q & A
PCF @ Manulife
- 7 PCF Foundations in Azure North America and Azure Asia
- Total: ~8000 AIs and 540 Diego Cells across all foundations
- Problem Statement:
- How can we improve the efficiency and reliability of our platforms while saving money?
Efficiency vs Reliability
- Efficiency - Make the platform run better, and therefore make platform engineers' lives easier
- Optimize resources used
- Reduce cost and time to do something
- Improve monitoring strategies
- Reliability - Deliver an exceptional customer first experience
- Increase application uptime
- Increase service availability
- Improve Recovery Point Objective (RPO)
Improving Efficiency & Reliability
- Diego Cell Scheduler
- Switching TAS Internal blobstore to External blobstore
- Changing and Tuning Diego Cell VM types
- Foundation Configuration Tuning
Improving Efficiency & Reliability – Diego Cell Scheduler
- Manulife Scheduler App
- Developer self-service: developers subscribe which apps to stop/start and when (day of week and time).
- Incentive to developers: Save costs on charge back
Improving Efficiency & Reliability - Diego Cell Scheduler
- Dev and Sandbox environments are not fully utilized on weekends, from Friday evening until Monday morning.
- Costs are incurred by both the app teams and the platform team. How to save money?
- A Concourse pipeline queries diego cell memory utilization after the apps have stopped, to determine how many diego cells to scale down by. A pipeline restores the original diego cell count on Monday morning.
[Figure: diego cell count by day of week (M T W T F S S), shown over several weeks]
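The scale-down decision described above can be sketched as simple arithmetic. This is a minimal illustration, not the Manulife pipeline: the function name, the 20% headroom and the minimum cell count are assumptions, and the memory figures would come from whatever the pipeline queries.

```python
import math


def cells_to_scale_down(cell_count, cell_memory_gb, used_memory_gb,
                        headroom=0.2, min_cells=1):
    """Return how many diego cells could be removed after apps have stopped.

    headroom is the fraction of each cell's memory kept free (assumed value).
    min_cells is a floor on the remaining cell count (assumed value).
    """
    if not 0 <= headroom < 1:
        raise ValueError("headroom must be in [0, 1)")
    usable_per_cell = cell_memory_gb * (1 - headroom)
    # Cells required to hold the remaining memory use, never below the floor.
    needed = max(min_cells, math.ceil(used_memory_gb / usable_per_cell))
    # Never report a negative number if the foundation is already tight.
    return max(0, cell_count - needed)
```

For example, 50 cells of 32 GB with 400 GB still in use and 20% headroom would need 16 cells, so 34 could be scaled down until Monday morning.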
Improving Efficiency & Reliability - Internal blobstore to External blobstore
- Using internal blobstore == NFS running on VM
- Cost of VM+persistent disk > cost of Azure blob storage (GRS, us-east1, hot) [1,2,3]
- Decrease platform upgrade times while increasing availability
- NFS VM is not HA [4]
- Large NFS persistent disks lead to long upgrade times
- Long upgrade times lead to dev outages and potential issues with auto scaling/healing.
- Decrease backup time and reduce outage (cloud controller [CC] lock)
- e.g. Sandbox: 3 hrs backup, 1hr 14 min CC lock → 13 min backup, 1 min CC lock
- Locked CC leads to dev outages (no app pushes, no delete, no autoscale)
- Possible to do more frequent backups to meet or improve the Recovery Point Objective (RPO)
Improving Efficiency & Reliability - Internal blobstore to External blobstore
- Prepare blob storage and required processes:
- Azure service endpoints speed up access to blob storage
- Set up firewalls; ensure the CC can access blob storage, plus Windows Diego cells for buildpacks
- 4 x blob storage containers: Buildpacks, Images, Droplets, Resources
- First migration, 5 days before switchover:
- Use rclone [5] to transfer blob objects from the NFS internal blobstore in PCF
- Check for routing issues and slow transfer speeds
- Incremental migration, 1 day before switchover:
- Copy only newer blobs. Will be much faster
- Perform full switchover
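The idea behind the incremental migration, copying only blobs that are new or changed since the first migration, can be sketched as below. This is an illustration of the selection logic only, not how rclone is implemented; the `Blob` type and its fields are hypothetical, and in practice rclone performs this comparison itself.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Blob:
    key: str         # object path inside the blobstore
    modified: float  # last-modified time as a POSIX timestamp
    size: int        # size in bytes


def blobs_to_copy(source, destination):
    """Return source blobs that are missing from, or differ from, the destination."""
    existing = {blob.key: blob for blob in destination}
    pending = []
    for blob in source:
        copied = existing.get(blob.key)
        # Copy when the blob was never transferred, or changed since the last run.
        if copied is None or blob.modified > copied.modified or blob.size != copied.size:
            pending.append(blob)
    return pending
```

Because most blobs were transferred in the first migration, the pending list on the later runs is short, which is why the incremental pass is much faster.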
Improving Efficiency & Reliability - Internal blobstore to External blobstore
Performing the full switchover
[Diagram: NFS internal blobstore, CC and PCF foundation; Azure service endpoints speed up access to the 4 blob storage containers]
1. Conduct cf push performance timing tests* and record results. Use candidate apps on the platform, e.g. Linux + Windows. Do at least 2 cf pushes for cache.
2. Lock down the cloud controller using cf cli. No app pushes will be possible.
3. Perform a final copy migration from NFS.
4. Take a snapshot of the NFS persistent disk. (Original NFS disk will remain orphaned for 5 days after switchover.)
5. Configure and apply changes for switchover.
6. Perform at least 2 performance tests and compare results from step 1. Check for network issues.
7. Unlock the cloud controller using cf cli.
* https://github.com/dawu415/PCFToolkit/tree/master/tests/cfpush
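Comparing the push timings recorded before the switchover with those taken afterwards can be reduced to a small check. The sketch below is an assumption about how one might do it, not the PCFToolkit script: the function name and the 10% tolerance are hypothetical.

```python
from statistics import mean


def push_time_change(before_seconds, after_seconds, tolerance=0.10):
    """Compare mean cf push durations before and after the switchover.

    Returns (relative_change, regressed), where regressed is True when the
    mean push time grew by more than the tolerance (an assumed threshold).
    """
    baseline = mean(before_seconds)
    current = mean(after_seconds)
    change = (current - baseline) / baseline
    return change, change > tolerance
```

A regression flagged here would be a prompt to check for the network issues mentioned in the steps above before unlocking the cloud controller.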
Improving Efficiency & Reliability - Changing and Tuning Diego Cell VM types
- Optimize utilization of VM resources. e.g., memory & disk
- Memory optimized VMs - Esv3 series [6]
- Cost of Dsv2 (default) > cost of Esv3 VMs [1]
- Reduce disk space use
- Potential to decrease VM count
- More VM memory in Esv3: Dsv2 - 28 GB RAM vs Esv3 - 32 GB RAM
- Increased app density per cell (don't overdo it)
- Improve reliability by tuning diego cells
- Ensure you are using sufficient resources to support current needs and future growth
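The potential to decrease VM count follows from the memory difference alone. A rough sketch, assuming total cell memory is the only constraint and ignoring per-cell overhead (the function name is hypothetical):

```python
import math


def cells_after_switch(current_cells, current_gb=28, new_gb=32):
    """Cells of the new size needed to provide the same total memory."""
    total_memory_gb = current_cells * current_gb
    return math.ceil(total_memory_gb / new_gb)
```

Under these assumptions, 100 cells at 28 GB provide the same memory as 88 cells at 32 GB. Disk, CPU and evacuation time should still be checked before reducing the count.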
Improving Efficiency & Reliability - Changing and Tuning Diego Cell VM types
Tuning diego cells:
- Questions we want quantifiable answers to
- How many diego cells do I need to fit x AIs?
- Are we under-provisioned or appropriately over-provisioned on diego cells?
- What’s the minimum disk space per cell? Azure charges based on upper tier.
- Information and statistics we need to know
- What else is running on the cells, and how much memory does it need? e.g. anti-virus
- Number of AIs, average memory used per AI and average disk quota used per AI
- Can use the cf API to get a snapshot of the raw information.
- cf applist script from Rakutentech (https://github.com/rakutentech/cf-tools/blob/master/cf-applist.sh)
./cf-applist.sh -s Instances -s Instances -f Name,State,Instances,Memory,Disk_quota > applist_<env>.txt
Improving Efficiency & Reliability - Changing and Tuning Diego Cell VM types
- CF capacity behaviour in TAS container and architecture docs [7,8]
- Use our capacity planning spreadsheet:
→ Worksheets at https://github.com/dawu415/PCFToolkit
→ Get all AI memory used and disk quota raw snapshot
→ Get average AI memory and disk quota used. These will be in MB.
→ Divide by 1024 to get GB and input to worksheet.
- Capacity information:
- Build capacity monitoring dashboards and alerts
- Understand usage behaviours and do simulations
e.g. what if 25% of app instances switch to higher disk use?
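The averaging and unit conversion above, plus a first answer to "how many diego cells do I need to fit x AIs?", can be sketched as follows. This is a simplified estimate and not the capacity planning worksheet: the tuple layout, function names and the reserved-memory parameter are assumptions, and real Diego placement works per container rather than on averages.

```python
import math


def average_per_ai_gb(apps):
    """apps: list of (instances, memory_mb, disk_quota_mb) tuples, one per app.

    Returns (total_ais, avg_memory_gb, avg_disk_gb), weighted by instance count.
    """
    total_ais = sum(instances for instances, _, _ in apps)
    if total_ais == 0:
        raise ValueError("no app instances in snapshot")
    memory_mb = sum(instances * mem for instances, mem, _ in apps)
    disk_mb = sum(instances * disk for instances, _, disk in apps)
    # Snapshot values are in MB; divide by 1024 to get GB.
    return total_ais, memory_mb / total_ais / 1024, disk_mb / total_ais / 1024


def cells_needed(ai_count, avg_memory_gb, avg_disk_gb,
                 cell_memory_gb, cell_disk_gb, reserved_memory_gb=0.0):
    """Cells needed for ai_count AIs, limited by memory or disk, whichever is larger."""
    by_memory = math.ceil(ai_count * avg_memory_gb / (cell_memory_gb - reserved_memory_gb))
    by_disk = math.ceil(ai_count * avg_disk_gb / cell_disk_gb)
    return max(by_memory, by_disk)
```

A what-if simulation is then a matter of changing the inputs, for example recomputing the average disk figure with 25% of instances at a higher disk quota and calling `cells_needed` again.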
Improving Efficiency & Reliability - Changing and Tuning Diego Cell VM types
Additional things to consider:
- VM Disk and Memory Capacity
- For on-prem and Availability Zone (AZ) customers:
- Ensure 1/N % extra in IaaS memory and disk to cover 1 AZ failure in an N AZ setup.
- Factor additional resource per cell reserved for addons
- Warning level in your monitoring.
- CPU vs Memory
- If possible, avoid packing too many AIs into a single diego cell.
- Too many means longer evacuation time leading to longer recovery and VM update times.
- Too many also can hinder application performance. CPU share is based off the AI memory.
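One way to read the 1/N rule above: if 1/N of total capacity is kept free, the remaining N-1 zones can still carry the full load when one zone fails. A minimal sketch under that reading, assuming load is spread evenly across zones (function names are hypothetical):

```python
def max_safe_utilization(az_count):
    """Highest overall utilization that still survives the loss of one AZ."""
    if az_count < 2:
        raise ValueError("need at least 2 availability zones")
    return (az_count - 1) / az_count


def survives_az_loss(used, total, az_count):
    """True when current use fits in the capacity left after one AZ fails."""
    return used <= total * max_safe_utilization(az_count)
```

With 3 AZs this gives a ceiling of roughly 67% utilization, which is one candidate for the warning level in monitoring.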
Improving Efficiency & Reliability - Changing and Tuning Diego Cell VM types
Changing VM Types:
- Ensure you have historical metrics for CPU, memory and disk bandwidth usage
- More than 30 days is preferable, including seasonal periods of heavy usage
- IaaS level may have historical data to help with this.
- If there is not enough data, the blue-green algorithm can be used.
- Does the new VM type support your needs?
- Correlate heavy usage peak metrics to
- CPU load
- Memory utilization and Disk IOPS
- Network egress bandwidth
- Write disk IOPS test scripts using FIO [9] to verify. Can provision an empty bosh VM to run the tests.*
*https://www.starkandwayne.com/blog/hey-bosh-gimme-a-vm/
Improving Efficiency & Reliability - Changing and Tuning Diego Cell VM types
- How to really change the VM type?
- Change VM type in operations manager/platform automation configuration
- Opsman 2.10.3 supports new generation of Azure VM types (without Availability Sets) [10]
- Otherwise add these in automation/manual via opsman api
- But wait... the IaaS might not allow the change, or may have constraints on what can be changed
- Azure: Cannot switch Availability Sets (AS) sitting in old HW clusters.
- Switching an AS is only possible when it contains 1 VM, e.g. a diego cell AS with only 1 diego cell VM.
- Before using the algorithm, check that your
- IaaS VM quotas and subnets have capacity
- At least double your current TAS plus all existing iso seg diego cell count.
- Firewall is set up.
- We used the same subnet as the existing diego cells to avoid creating new firewall rules, since it is temporary for us.
Improving Efficiency & Reliability - Changing and Tuning Diego Cell VM types
- Zero Downtime Blue/Green Diego cell VM Switchover algorithm
- Use isolation segments to extend diego pool (replicate tile with unique name).
- Share same GoRouters as TAS.
- Leave the segment name blank to ‘extend’ the pool of TAS diego cells [11]
- Use the segment name to extend the pool of an existing iso seg diego cell
Legend:
- Blue - current running apps (TAS Diego cells: Dsv2)
- White - no running apps (extended TAS Diego cells in Iso Seg: Esv3)
- Green - running apps in new Esv3
Steps:
- Initial: TAS Diego cells on Dsv2
- Set up Iso Seg with diego cells
- Scale down TAS diego cells (diego cell = 1). Apps auto relocate to the new Diego cells. Adjust max-in-flight for diego cells to speed up scale down, but beware and monitor BBS load.
- Convert TAS diego cell to Esv3
- Scale up TAS diego cells
- Scale down Iso Seg diego cells to 0
- Delete Iso Seg tile
- Result: switched TAS Diego cells to Esv3
With thanks to D. Stevenson for initial discussion and idea
Improving Efficiency & Reliability - Foundation Configuration Tuning
What we are trying to achieve:
- Optimize utilization of VM resources.
- Decrease VM count for over-provisioned resources
- Increase VM count to better platform health/HA
Improving Efficiency & Reliability - Foundation Configuration Tuning
- Get your monitoring ready
- Will need to monitor platform metrics to help guide decisions
- Build dashboards based on KPIs [12, 13, 14, 15]
- Cloud Controller KPIs
- BBS CPU Load and Memory, Diego Cells: CPU load, memory, disk capacity
- UAA
- Loggregator: Doppler, Traffic controller and nozzles
- Overall platform health
- MySQL Server: CPU Load, Memory and Disk
- Go Router: CPU Load, Requests Per Second
Improving Efficiency & Reliability - Foundation Configuration Tuning
- Table out the Changes
- Maintain a record of where you are vs where you want to be
- Maintain notes and comments of decisions
- Helps review cost benefit decisions on a bigger picture
[Table image: change-tracking worksheet. Sample only, not real data]
Get worksheet at https://github.com/dawu415/PCFToolkit
Improving Efficiency & Reliability - Foundation Configuration Tuning
- General VM metrics [12]
- CPU load < 80% - 90% ( < 60% GoRouter )
- Memory Utilization < 80%
- Persistent Disk < 80%-90%
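These thresholds translate directly into an alert check. The sketch below takes the lower end of each band (80%) as the limit, which is an assumption, and the metric and role names are hypothetical rather than actual platform metric names.

```python
# Limits as fractions; the lower end of the 80%-90% bands is assumed.
DEFAULT_LIMITS = {"cpu": 0.80, "memory": 0.80, "persistent_disk": 0.80}
GOROUTER_CPU_LIMIT = 0.60


def breached_metrics(role, metrics):
    """Return the sorted names of metrics at or above their limit for a VM role."""
    limits = dict(DEFAULT_LIMITS)
    if role == "gorouter":
        # GoRouter gets a stricter CPU limit than other VMs.
        limits["cpu"] = GOROUTER_CPU_LIMIT
    return sorted(
        name for name, value in metrics.items()
        if name in limits and value >= limits[name]
    )
```

For instance, 65% CPU is fine for a diego cell but is reported as a breach for a GoRouter.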
Improving Efficiency & Reliability - Foundation Configuration Tuning
- Loggregator [15]
- Understand the loggregator architecture and use the loggregator guide to assist [16]
- Doppler maximum effectiveness using horizontal VM scaling is 40 VMs (v1 + v2 configuration)
- Maintain a 2:1:1 ratio (doppler : traffic controller : nozzle)
- Check for dropped messages in doppler, connection loss of TC and high resource utilization of nozzles. Should be part of monitoring
[Diagram: Loggregator Agents, Dopplers, Traffic Controllers/RLP, Nozzles]
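The 2:1:1 ratio and the 40-VM ceiling for Dopplers can be combined into a small sizing helper. This is a sketch of the rule of thumb stated above, with hypothetical names; rounding odd Doppler counts up is an assumption.

```python
import math

MAX_EFFECTIVE_DOPPLERS = 40  # horizontal scaling limit quoted on the slide


def loggregator_counts(dopplers):
    """Return VM counts keeping a 2:1:1 doppler : traffic controller : nozzle ratio."""
    if dopplers < 1:
        raise ValueError("need at least 1 doppler")
    dopplers = min(dopplers, MAX_EFFECTIVE_DOPPLERS)
    # One traffic controller and one nozzle per two dopplers, rounded up.
    per_pair = math.ceil(dopplers / 2)
    return {"doppler": dopplers, "traffic_controller": per_pair, "nozzle": per_pair}
```

Requests beyond 40 Dopplers are capped, since adding more is not expected to help.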
Improving Efficiency & Reliability - Foundation Configuration Tuning
- Cloud Controller [14]
- CPU load < 80% - 90% ( < 60% GoRouter )
- Memory Utilization < 80%
- Persistent Disk < 80% - 90%
- Determine your usage pattern plus head room, and simulate it with parallel cf pushes in a script. https://github.com/dawu415/PCFToolkit/tree/master/tests/cfpush
- This exercises the cloud controller API when you can't really test it in dev. Run in parallel: time cf push, time cf delete. Review cloud controller metrics. Abnormally slow response times indicate that the CC or its workers are under-scaled.
- Review and integrate changes into the dev environment. Monitor cloud controller metrics.
- Understand the nozzles that query the CC API to get app names to insert into logs.
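Running timed commands in parallel, as described above, can be sketched with the Python standard library. This is not the PCFToolkit script: the function names are hypothetical, and the command to run (for example a cf push with its arguments) is supplied by the caller. A harmless stand-in command is included so the sketch runs without a foundation.

```python
import subprocess
import sys
import time
from concurrent.futures import ThreadPoolExecutor

# Stand-in command that does nothing; replace with the real CLI invocation.
SMOKE_COMMAND = [sys.executable, "-c", "pass"]


def timed_run(command):
    """Run one command and return (elapsed_seconds, return_code)."""
    start = time.perf_counter()
    result = subprocess.run(command, capture_output=True)
    return time.perf_counter() - start, result.returncode


def run_parallel(commands):
    """Run all commands concurrently, one thread per command, and time each."""
    with ThreadPoolExecutor(max_workers=len(commands)) as pool:
        return list(pool.map(timed_run, commands))
```

Each parallel push would normally target a different app name, so the caller passes one command list per push. The elapsed times can then be reviewed alongside the cloud controller metrics.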
Tallying the Savings
Changes:
- Diego cell Scheduling
- Switching blobstore to external
- Switch diego cells VM + tuning
- Total
Savings (as listed on the slide):
- ~$40,000 p.a.
- ~$21,500 p.a.
- ~$1.06 million p.a.
- ~$1 million p.a.
Notes:
- Originally calculated to be ~$730k but actual ~$1 million
- ~30 VMs/foundation deleted and recreated per weekend.
- Savings from 2 North America foundations
References
[1] Virtual Machine Pricing, https://azure.microsoft.com/en-us/pricing/details/virtual-machines/windows/
[2] Managed Disk Pricing, https://azure.microsoft.com/en-us/pricing/details/managed-disks/
[3] Blob storage Pricing, https://azure.microsoft.com/en-us/pricing/details/storage/blobs/
[4] CF Blobstore High Availability, https://docs.cloudfoundry.org/concepts/high-availability.html#blobstore
[5] rclone tool download, https://rclone.org/downloads/
[6] Ev3 and Esv3-series, https://docs.microsoft.com/en-us/azure/virtual-machines/ev3-esv3-series
[7] TAS Container Mechanics, https://docs.pivotal.io/application-service/concepts/container-security.html#mechanics
[8] TAS Diego Architecture, https://docs.pivotal.io/application-service/concepts/diego/diego-architecture.html
[9] Azure, Benchmark a disk, https://docs.microsoft.com/en-us/azure/virtual-machines/disks-benchmarks
[10] Ops Manager v2.10.3 release notes, https://docs.pivotal.io/ops-manager/2-10/release-notes.html#2-10-3
[11] Isolation Segment - app container, https://docs.pivotal.io/application-service/2-7/operating/installing-pcf-is.html#application_containers
[12] Key Performance Indicators, https://docs.pivotal.io/application-service/operating/monitoring/key-cap-scaling.html
[13] Key Capacity Scaling Indicators, https://docs.pivotal.io/application-service/operating/monitoring/key-cap-scaling.html
[14] Scaling Cloud Controller, https://docs.cloudfoundry.org/running/managing-cf/scaling-cloud-controller.html
[15] Loggregator Guide for CF operators, https://docs.cloudfoundry.org/loggregator/log-ops-guide.html
[16] Loggregator Architecture, https://docs.cloudfoundry.org/loggregator/architecture.html
Acknowledgements
We would like to thank the following people for their involvement and efforts to make things happen
- Piotr Chomiak
- Richard Garro
- Dan Buchko
- John Calabrese
- David Stevenson
- Michael Chung
- Kelvin Li
- Jonathan Leung
- Lok Wong
- Haydon Ryan
- John Tan
Q & A

Contenu connexe

Tendances

VMworld 2013: Maximize Database Performance in Your Software-Defined Data Center
VMworld 2013: Maximize Database Performance in Your Software-Defined Data CenterVMworld 2013: Maximize Database Performance in Your Software-Defined Data Center
VMworld 2013: Maximize Database Performance in Your Software-Defined Data CenterVMworld
 
HTTP Session Replication with Oracle Coherence, GlassFish, WebLogic
HTTP Session Replication with Oracle Coherence, GlassFish, WebLogicHTTP Session Replication with Oracle Coherence, GlassFish, WebLogic
HTTP Session Replication with Oracle Coherence, GlassFish, WebLogicOracle
 
20150716 introduction to apache spark v3
20150716 introduction to apache spark v3 20150716 introduction to apache spark v3
20150716 introduction to apache spark v3 Andrey Vykhodtsev
 
Enable greater data reduction, storage performance, and manageability with De...
Enable greater data reduction, storage performance, and manageability with De...Enable greater data reduction, storage performance, and manageability with De...
Enable greater data reduction, storage performance, and manageability with De...Principled Technologies
 
The benefits of value SAS and data center NVMe drives with Dell EMC PowerEdge
The benefits of value SAS and data center NVMe drives with Dell EMC PowerEdgeThe benefits of value SAS and data center NVMe drives with Dell EMC PowerEdge
The benefits of value SAS and data center NVMe drives with Dell EMC PowerEdgePrincipled Technologies
 
KoprowskiT_SPBizConference_2AMaDisasterJustBegan
KoprowskiT_SPBizConference_2AMaDisasterJustBeganKoprowskiT_SPBizConference_2AMaDisasterJustBegan
KoprowskiT_SPBizConference_2AMaDisasterJustBeganTobias Koprowski
 
Upgrade to Dell EMC PowerEdge R940 servers with VMware vSphere 7.0 and gain g...
Upgrade to Dell EMC PowerEdge R940 servers with VMware vSphere 7.0 and gain g...Upgrade to Dell EMC PowerEdge R940 servers with VMware vSphere 7.0 and gain g...
Upgrade to Dell EMC PowerEdge R940 servers with VMware vSphere 7.0 and gain g...Principled Technologies
 
Benefity Oracle Cloudu (4/4): Storage
Benefity Oracle Cloudu (4/4): StorageBenefity Oracle Cloudu (4/4): Storage
Benefity Oracle Cloudu (4/4): StorageMarketingArrowECS_CZ
 
J2EE Performance And Scalability Bp
J2EE Performance And Scalability BpJ2EE Performance And Scalability Bp
J2EE Performance And Scalability BpChris Adkin
 
Database Provisioning in EM12c: Provision me a Database Now!
Database Provisioning in EM12c: Provision me a Database Now!Database Provisioning in EM12c: Provision me a Database Now!
Database Provisioning in EM12c: Provision me a Database Now!Maaz Anjum
 
Scaling xen desktop and xenapp with solid state drives in healthcare
Scaling xen desktop and xenapp with solid state drives in healthcareScaling xen desktop and xenapp with solid state drives in healthcare
Scaling xen desktop and xenapp with solid state drives in healthcareIntel® Software
 
Power BI with Essbase in the Oracle Cloud
Power BI with Essbase in the Oracle CloudPower BI with Essbase in the Oracle Cloud
Power BI with Essbase in the Oracle CloudKellyn Pot'Vin-Gorman
 
Delphix Platform Overview
Delphix Platform OverviewDelphix Platform Overview
Delphix Platform OverviewFranco_Dagosto
 
EDB 13 - New Enhancements for Security and Usability - APJ
EDB 13 - New Enhancements for Security and Usability - APJEDB 13 - New Enhancements for Security and Usability - APJ
EDB 13 - New Enhancements for Security and Usability - APJEDB
 
Business-critical applications on VMware vSphere 6, VMware Virtual SAN, and V...
Business-critical applications on VMware vSphere 6, VMware Virtual SAN, and V...Business-critical applications on VMware vSphere 6, VMware Virtual SAN, and V...
Business-critical applications on VMware vSphere 6, VMware Virtual SAN, and V...Principled Technologies
 
Pass Summit Linux Scripting for the Microsoft Professional
Pass Summit Linux Scripting for the Microsoft ProfessionalPass Summit Linux Scripting for the Microsoft Professional
Pass Summit Linux Scripting for the Microsoft ProfessionalKellyn Pot'Vin-Gorman
 
SQLSaturday#290_Kiev_WindowsAzureDatabaseForBeginners
SQLSaturday#290_Kiev_WindowsAzureDatabaseForBeginnersSQLSaturday#290_Kiev_WindowsAzureDatabaseForBeginners
SQLSaturday#290_Kiev_WindowsAzureDatabaseForBeginnersTobias Koprowski
 

Tendances (20)

VMworld 2013: Maximize Database Performance in Your Software-Defined Data Center
VMworld 2013: Maximize Database Performance in Your Software-Defined Data CenterVMworld 2013: Maximize Database Performance in Your Software-Defined Data Center
VMworld 2013: Maximize Database Performance in Your Software-Defined Data Center
 
Queues, Pools, Caches
Queues, Pools, CachesQueues, Pools, Caches
Queues, Pools, Caches
 
HTTP Session Replication with Oracle Coherence, GlassFish, WebLogic
HTTP Session Replication with Oracle Coherence, GlassFish, WebLogicHTTP Session Replication with Oracle Coherence, GlassFish, WebLogic
HTTP Session Replication with Oracle Coherence, GlassFish, WebLogic
 
20150716 introduction to apache spark v3
20150716 introduction to apache spark v3 20150716 introduction to apache spark v3
20150716 introduction to apache spark v3
 
Enable greater data reduction, storage performance, and manageability with De...
Enable greater data reduction, storage performance, and manageability with De...Enable greater data reduction, storage performance, and manageability with De...
Enable greater data reduction, storage performance, and manageability with De...
 
The benefits of value SAS and data center NVMe drives with Dell EMC PowerEdge
The benefits of value SAS and data center NVMe drives with Dell EMC PowerEdgeThe benefits of value SAS and data center NVMe drives with Dell EMC PowerEdge
The benefits of value SAS and data center NVMe drives with Dell EMC PowerEdge
 
KoprowskiT_SPBizConference_2AMaDisasterJustBegan
KoprowskiT_SPBizConference_2AMaDisasterJustBeganKoprowskiT_SPBizConference_2AMaDisasterJustBegan
KoprowskiT_SPBizConference_2AMaDisasterJustBegan
 
Upgrade to Dell EMC PowerEdge R940 servers with VMware vSphere 7.0 and gain g...
Upgrade to Dell EMC PowerEdge R940 servers with VMware vSphere 7.0 and gain g...Upgrade to Dell EMC PowerEdge R940 servers with VMware vSphere 7.0 and gain g...
Upgrade to Dell EMC PowerEdge R940 servers with VMware vSphere 7.0 and gain g...
 
Azure and cloud design patterns
Azure and cloud design patternsAzure and cloud design patterns
Azure and cloud design patterns
 
Benefity Oracle Cloudu (4/4): Storage
Benefity Oracle Cloudu (4/4): StorageBenefity Oracle Cloudu (4/4): Storage
Benefity Oracle Cloudu (4/4): Storage
 
J2EE Performance And Scalability Bp
J2EE Performance And Scalability BpJ2EE Performance And Scalability Bp
J2EE Performance And Scalability Bp
 
Database Provisioning in EM12c: Provision me a Database Now!
Database Provisioning in EM12c: Provision me a Database Now!Database Provisioning in EM12c: Provision me a Database Now!
Database Provisioning in EM12c: Provision me a Database Now!
 
Scaling xen desktop and xenapp with solid state drives in healthcare
Scaling xen desktop and xenapp with solid state drives in healthcareScaling xen desktop and xenapp with solid state drives in healthcare
Scaling xen desktop and xenapp with solid state drives in healthcare
 
IaaS for DBAs in Azure
IaaS for DBAs in AzureIaaS for DBAs in Azure
IaaS for DBAs in Azure
 
Power BI with Essbase in the Oracle Cloud
Power BI with Essbase in the Oracle CloudPower BI with Essbase in the Oracle Cloud
Power BI with Essbase in the Oracle Cloud
 
Delphix Platform Overview
Delphix Platform OverviewDelphix Platform Overview
Delphix Platform Overview
 
EDB 13 - New Enhancements for Security and Usability - APJ
EDB 13 - New Enhancements for Security and Usability - APJEDB 13 - New Enhancements for Security and Usability - APJ
EDB 13 - New Enhancements for Security and Usability - APJ
 
Business-critical applications on VMware vSphere 6, VMware Virtual SAN, and V...
Business-critical applications on VMware vSphere 6, VMware Virtual SAN, and V...Business-critical applications on VMware vSphere 6, VMware Virtual SAN, and V...
Business-critical applications on VMware vSphere 6, VMware Virtual SAN, and V...
 
Pass Summit Linux Scripting for the Microsoft Professional
Pass Summit Linux Scripting for the Microsoft ProfessionalPass Summit Linux Scripting for the Microsoft Professional
Pass Summit Linux Scripting for the Microsoft Professional
 
SQLSaturday#290_Kiev_WindowsAzureDatabaseForBeginners
SQLSaturday#290_Kiev_WindowsAzureDatabaseForBeginnersSQLSaturday#290_Kiev_WindowsAzureDatabaseForBeginners
SQLSaturday#290_Kiev_WindowsAzureDatabaseForBeginners
 

Similaire à How Manulife Saved Over $730K by Improving PCF Efficiency and Reliability

Azure + DataStax Enterprise (DSE) Powers Office365 Per User Store
Azure + DataStax Enterprise (DSE) Powers Office365 Per User StoreAzure + DataStax Enterprise (DSE) Powers Office365 Per User Store
Azure + DataStax Enterprise (DSE) Powers Office365 Per User StoreDataStax Academy
 
NetBackup Appliance Family presentation
NetBackup Appliance Family presentationNetBackup Appliance Family presentation
NetBackup Appliance Family presentationSymantec
 
Testing Delphix: easy data virtualization
Testing Delphix: easy data virtualizationTesting Delphix: easy data virtualization
Testing Delphix: easy data virtualizationFranck Pachot
 
gDBClone - Database Clone “onecommand Automation Tool”
gDBClone - Database Clone “onecommand Automation Tool”gDBClone - Database Clone “onecommand Automation Tool”
gDBClone - Database Clone “onecommand Automation Tool”Ruggero Citton
 
20171122 aws usergrp_coretech-spn-cicd-aws-v01
20171122 aws usergrp_coretech-spn-cicd-aws-v0120171122 aws usergrp_coretech-spn-cicd-aws-v01
20171122 aws usergrp_coretech-spn-cicd-aws-v01Scott Miao
 
2689 - Exploring IBM PureApplication System and IBM Workload Deployer Best Pr...
2689 - Exploring IBM PureApplication System and IBM Workload Deployer Best Pr...2689 - Exploring IBM PureApplication System and IBM Workload Deployer Best Pr...
2689 - Exploring IBM PureApplication System and IBM Workload Deployer Best Pr...Hendrik van Run
 
SRV402 Deep Dive on Amazon EC2 Instances, Featuring Performance Optimization ...
SRV402 Deep Dive on Amazon EC2 Instances, Featuring Performance Optimization ...SRV402 Deep Dive on Amazon EC2 Instances, Featuring Performance Optimization ...
SRV402 Deep Dive on Amazon EC2 Instances, Featuring Performance Optimization ...Amazon Web Services
 
TECHNICAL BRIEF▶ Backup Exec 15 Blueprint for Large Installations
TECHNICAL BRIEF▶ Backup Exec 15 Blueprint for Large InstallationsTECHNICAL BRIEF▶ Backup Exec 15 Blueprint for Large Installations
TECHNICAL BRIEF▶ Backup Exec 15 Blueprint for Large InstallationsSymantec
 
Apache Kudu: Technical Deep Dive


Apache Kudu: Technical Deep Dive

Apache Kudu: Technical Deep Dive


Apache Kudu: Technical Deep Dive

Cloudera, Inc.
 
Ibm spectrum scale_backup_n_archive_v03_ash
Ibm spectrum scale_backup_n_archive_v03_ashIbm spectrum scale_backup_n_archive_v03_ash
Ibm spectrum scale_backup_n_archive_v03_ashAshutosh Mate
 
Володимир Цап "Constraint driven infrastructure - scale or tune?"
Володимир Цап "Constraint driven infrastructure - scale or tune?"Володимир Цап "Constraint driven infrastructure - scale or tune?"
Володимир Цап "Constraint driven infrastructure - scale or tune?"Fwdays
 
SRV402 Deep Dive on Amazon EC2 Instances, Featuring Performance Optimization ...
SRV402 Deep Dive on Amazon EC2 Instances, Featuring Performance Optimization ...SRV402 Deep Dive on Amazon EC2 Instances, Featuring Performance Optimization ...
SRV402 Deep Dive on Amazon EC2 Instances, Featuring Performance Optimization ...Amazon Web Services
 
Building big data pipelines with Kafka and Kubernetes
Building big data pipelines with Kafka and KubernetesBuilding big data pipelines with Kafka and Kubernetes
Building big data pipelines with Kafka and KubernetesVenu Ryali
 
The Experience of Java on Kubernetes with Microservices from HackFest
The Experience of Java on Kubernetes with Microservices from HackFestThe Experience of Java on Kubernetes with Microservices from HackFest
The Experience of Java on Kubernetes with Microservices from HackFestYoshio Terada
 
Oracle Exadata Exam Dump
Oracle Exadata Exam DumpOracle Exadata Exam Dump
Oracle Exadata Exam DumpPooja C
 
Champion Fas Deduplication
Champion Fas DeduplicationChampion Fas Deduplication
Champion Fas DeduplicationMichael Hudak
 
Ceph Day Beijing - Ceph all-flash array design based on NUMA architecture
Ceph Day Beijing - Ceph all-flash array design based on NUMA architectureCeph Day Beijing - Ceph all-flash array design based on NUMA architecture
Ceph Day Beijing - Ceph all-flash array design based on NUMA architectureCeph Community
 
Ceph Day Beijing - Ceph All-Flash Array Design Based on NUMA Architecture
Ceph Day Beijing - Ceph All-Flash Array Design Based on NUMA ArchitectureCeph Day Beijing - Ceph All-Flash Array Design Based on NUMA Architecture
Ceph Day Beijing - Ceph All-Flash Array Design Based on NUMA ArchitectureDanielle Womboldt
 
Dealing with kubesprawl tetris style !
Dealing with kubesprawl   tetris style !Dealing with kubesprawl   tetris style !
Dealing with kubesprawl tetris style !Taco Scargo
 
Ceph Day Beijing - SPDK for Ceph
Ceph Day Beijing - SPDK for CephCeph Day Beijing - SPDK for Ceph
Ceph Day Beijing - SPDK for CephDanielle Womboldt
 

Similaire à How Manulife Saved Over $730K by Improving PCF Efficiency and Reliability (20)

Azure + DataStax Enterprise (DSE) Powers Office365 Per User Store
Azure + DataStax Enterprise (DSE) Powers Office365 Per User StoreAzure + DataStax Enterprise (DSE) Powers Office365 Per User Store
Azure + DataStax Enterprise (DSE) Powers Office365 Per User Store
 
NetBackup Appliance Family presentation
NetBackup Appliance Family presentationNetBackup Appliance Family presentation
NetBackup Appliance Family presentation
 
Testing Delphix: easy data virtualization
Testing Delphix: easy data virtualizationTesting Delphix: easy data virtualization
Testing Delphix: easy data virtualization
 
gDBClone - Database Clone “onecommand Automation Tool”
gDBClone - Database Clone “onecommand Automation Tool”gDBClone - Database Clone “onecommand Automation Tool”
gDBClone - Database Clone “onecommand Automation Tool”
 
20171122 aws usergrp_coretech-spn-cicd-aws-v01
20171122 aws usergrp_coretech-spn-cicd-aws-v0120171122 aws usergrp_coretech-spn-cicd-aws-v01
20171122 aws usergrp_coretech-spn-cicd-aws-v01
 
2689 - Exploring IBM PureApplication System and IBM Workload Deployer Best Pr...
2689 - Exploring IBM PureApplication System and IBM Workload Deployer Best Pr...2689 - Exploring IBM PureApplication System and IBM Workload Deployer Best Pr...
2689 - Exploring IBM PureApplication System and IBM Workload Deployer Best Pr...
 
SRV402 Deep Dive on Amazon EC2 Instances, Featuring Performance Optimization ...
SRV402 Deep Dive on Amazon EC2 Instances, Featuring Performance Optimization ...SRV402 Deep Dive on Amazon EC2 Instances, Featuring Performance Optimization ...
SRV402 Deep Dive on Amazon EC2 Instances, Featuring Performance Optimization ...
 
TECHNICAL BRIEF▶ Backup Exec 15 Blueprint for Large Installations
TECHNICAL BRIEF▶ Backup Exec 15 Blueprint for Large InstallationsTECHNICAL BRIEF▶ Backup Exec 15 Blueprint for Large Installations
TECHNICAL BRIEF▶ Backup Exec 15 Blueprint for Large Installations
 
Apache Kudu: Technical Deep Dive


Apache Kudu: Technical Deep Dive

Apache Kudu: Technical Deep Dive


Apache Kudu: Technical Deep Dive


 
Ibm spectrum scale_backup_n_archive_v03_ash

How Manulife Saved Over $730K by Improving PCF Efficiency and Reliability

  • 1. Start Counting - How we unlocked platform efficiency and reliability, while saving over $730,000 David Wu, Ph.D - Senior Staff Solutions Architect - VMware Tanzu Labs David Filippelli - Lead Site Reliability Engineer - Manulife Alvin Kwame Coch - Senior Site Reliability Engineer - Manulife 2nd September 2021 $1 Million
  • 2. Content - PCF @ Manulife - Efficiency vs Reliability - Improving Efficiency & Reliability - Tallying The Savings - References - Acknowledgements - Q & A
  • 3. PCF @ Manulife - 7 PCF Foundations in Azure North America and Azure Asia - Total: ~8000 AIs and 540 Diego Cells across all foundations - Problem Statement: - How can we improve the efficiency and reliability of our platforms while saving money?
  • 4. Efficiency vs Reliability - Efficiency - Make the platform run better and, in turn, make platform engineers' lives easier - Optimize resources used - Reduce the cost and time needed to perform tasks - Improve monitoring strategies - Reliability - Deliver an exceptional customer-first experience - Increase application uptime - Increase service availability - Improve Recovery Point Objective (RPO)
  • 5. Improving Efficiency & Reliability - Diego Cell Scheduler - Switching TAS Internal blobstore to External blobstore - Changing and Tuning Diego Cell VM types - Foundation Configuration Tuning
  • 6. Improving Efficiency & Reliability – Diego Cell Scheduler - Manulife Scheduler App - Developers use self-service to choose which apps to stop/start and when (day of week and time). - Incentive to developers: save costs on chargeback
  • 7. Improving Efficiency & Reliability - Diego Cell Scheduler - Dev and Sandbox environments are not fully utilized on weekends, from Friday evening until Monday morning. - Costs are incurred by both app teams and the platform team. How to save money? - A Concourse pipeline queries diego cell memory utilization after apps are stopped, to determine how many diego cells to scale down by. A pipeline reverts to the original diego cell count on Monday morning. [Chart: diego cell count by day of week]
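The scale-down decision on this slide can be sketched as follows. This is a minimal sketch and not the Manulife pipeline: the function name, the headroom fraction, and the minimum cell count are assumptions for illustration, and a real pipeline would read the memory figure from platform metrics after the scheduled apps have stopped.

```python
import math


def cells_to_scale_down(total_cells, cell_memory_gb, used_memory_gb,
                        headroom=0.25, min_cells=3):
    """Return how many diego cells can be removed while keeping headroom.

    All inputs are hypothetical; headroom and min_cells are assumed
    safety margins, not values from the talk.
    """
    # Memory that must stay available, including spare headroom.
    required_gb = used_memory_gb * (1 + headroom)
    # Cells needed to hold that memory, never below the safety floor.
    cells_needed = max(min_cells, math.ceil(required_gb / cell_memory_gb))
    # Never report a negative number of cells to remove.
    return max(0, total_cells - cells_needed)
```

For example, a foundation with 60 cells of 32 GB each and 640 GB still in use after the weekend stop would keep 25 cells at 25% headroom and release the rest until Monday.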
  • 8. Improving Efficiency & Reliability - Internal blobstore to External blobstore - Using the internal blobstore means NFS running on a VM - Cost of VM + persistent disk > cost of Azure blob storage (GRS, us-east1, hot) [1,2,3] - Decrease platform upgrade times while increasing availability - NFS VM is not HA [4] - Large NFS persistent disks lead to long upgrade times - Long upgrade times lead to dev outages and potential issues with auto scaling/healing. - Decrease backup time and reduce outage (cloud controller [CC] lock) - e.g. Sandbox: 3 hrs backup, 1 hr 14 min CC lock → 13 min backup, 1 min CC lock - A locked CC leads to dev outages (no app pushes, no delete, no autoscale) - Possible to do more backups to meet or improve the Recovery Point Objective (RPO)
  • 9. Improving Efficiency & Reliability - Internal blobstore to External blobstore
      - Prepare blob storage and required processes
          - Azure service endpoints speed up access to blob storage
          - Set up firewalls; ensure the CC and the Windows Diego cells (for buildpacks) can access the storage
          - 4 x blob storage containers: Buildpacks, Images, Droplets, Resources
      - First migration, 5 days before switchover
          - Use rclone [5] to transfer blob objects from the NFS internal blobstore in PCF
          - Check for routing issues and slow transfer speeds
      - Incremental migration, 1 day before switchover
          - Copy only newer blobs. Will be much faster
      - Perform full switchover
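The two migration passes can be sketched as one set of rclone commands run twice. This is a sketch under assumptions: the remote names (`nfs`, `azblob`) are hypothetical entries in an rclone configuration, and the container names are passed in by the caller. `rclone copy` skips objects that are already identical on the destination, so re-running the same commands for the incremental pass transfers only new or changed blobs.

```python
import subprocess


def rclone_copy_commands(src_remote, dst_remote, containers, transfers=8):
    """Build one `rclone copy` command per blob container.

    src_remote and dst_remote are hypothetical rclone remote names.
    """
    return [
        ["rclone", "copy", f"{src_remote}:{name}", f"{dst_remote}:{name}",
         "--transfers", str(transfers), "--progress"]
        for name in containers
    ]


def run_all(commands, dry_run=True):
    """Print the commands, or execute them when dry_run is False."""
    for cmd in commands:
        if dry_run:
            print(" ".join(cmd))
        else:
            subprocess.run(cmd, check=True)
```

Running with `dry_run=True` first makes it easy to review the source and destination pairs before any data moves.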
  • 10. Improving Efficiency & Reliability - Internal blobstore to External blobstore
      Performing the full switchover [Diagram: NFS, CC, PCF foundation, 4 x blob storage containers; Azure service endpoints speed up access to blob storage]
      1. Conduct cf push performance timing tests* and record results. Use candidate apps on the platform, e.g. Linux + Windows. Do at least 2 cf pushes for cache.
      2. Lock down the Cloud Controller using the cf CLI. No app pushes will be possible.
      3. Perform a final copy migration.
      4. Take a snapshot of the NFS persistent disk. (The original NFS disk will remain orphaned for 5 days after switchover.)
      5. Configure and apply changes for switchover.
      6. Perform at least 2 performance tests and compare results with step 1. Check for network issues.
      7. Unlock the Cloud Controller using the cf CLI.
      * https://github.com/dawu415/PCFToolkit/tree/master/tests/cfpush
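Comparing the timing results recorded before and after the switchover can be sketched like this. The function name and the 10% tolerance are assumptions for illustration; the timings themselves would come from the cf push tests referenced on the slide.

```python
from statistics import median


def compare_push_timings(before_s, after_s, tolerance=0.10):
    """Compare median cf push durations (seconds) before and after.

    Returns (before_median, after_median, regressed). `regressed` is True
    when the after-median exceeds the before-median by more than the
    tolerance, which is an assumed threshold.
    """
    before_median = median(before_s)
    after_median = median(after_s)
    regressed = after_median > before_median * (1 + tolerance)
    return before_median, after_median, regressed
```

Using the median rather than the mean keeps a single slow, cache-cold push from dominating the comparison.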
  • 11. Improving Efficiency & Reliability - Changing and Tuning Diego Cell VM types - Optimize utilization of VM resources, e.g. memory & disk - Memory optimized VMs - Esv3 series [6] - Cost of Dsv2 (default) > cost of Esv3 VMs [1] - Reduce disk space use - Potential to decrease VM count - More VM memory in Esv3: Dsv2 - 28 GB RAM vs Esv3 - 32 GB RAM - Increased app density per cell (don’t overdo it) - Improve reliability by tuning diego cells - Ensure you are using sufficient resources to support current needs and future growth
  • 12. Improving Efficiency & Reliability - Changing and Tuning Diego Cell VM types Tuning diego cells: - Questions we want quantifiable answers to - How many diego cells do I need to fit x AIs? - Are we under-provisioned or appropriately over-provisioned on diego cells? - What’s the minimum disk space per cell? Azure charges based on upper tier. - Information and statistics we need to know - What else is running on the cells and how much memory does it need? e.g. anti-virus - How many AIs, average memory used per AI and disk quota used per AI - Can use the cf API to get a snapshot of raw information. - cf applist script from Rakutentech (https://github.com/rakutentech/cf-tools/blob/master/cf-applist.sh) ./cf-applist.sh -s Instances -s Instances -f Name,State,Instances,Memory,Disk_quota > applist_<env>.txt
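The first question on this slide, how many diego cells are needed to fit x AIs, can be approximated with simple arithmetic. This is a simplified sketch, not the capacity worksheet from the talk: it ignores placement fragmentation and any overcommit settings, and all parameter names are assumptions.

```python
import math


def cells_needed(ai_count, avg_ai_memory_gb, avg_ai_disk_gb,
                 cell_memory_gb, cell_disk_gb, reserved_memory_gb=0.0):
    """Estimate diego cells needed to fit `ai_count` app instances.

    The result is driven by whichever resource (memory or disk) runs out
    first. reserved_memory_gb models add-ons such as anti-virus.
    """
    usable_memory_gb = cell_memory_gb - reserved_memory_gb
    by_memory = math.ceil(ai_count * avg_ai_memory_gb / usable_memory_gb)
    by_disk = math.ceil(ai_count * avg_ai_disk_gb / cell_disk_gb)
    return max(by_memory, by_disk)
```

Comparing the memory-bound and disk-bound counts also shows which resource is the constraint, which is useful when choosing the minimum disk size per cell.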
  • 13. Improving Efficiency & Reliability - Changing and Tuning Diego Cell VM types - CF capacity behaviour is described in the TAS container and architecture docs [7,8] - Use our capacity planning spreadsheet: → Worksheets at https://github.com/dawu415/PCFToolkit → Get a raw snapshot of all AI memory used and disk quota → Get the average AI memory and disk quota used. These will be in MB. → Divide by 1024 to get GB and input to the worksheet. - Capacity information: - Build capacity monitoring dashboards and alerts - Understand usage behaviours and do simulations, e.g. what if 25% of app instances switch to higher disk use?
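The unit conversion and the "what if" question on this slide can be worked through in a few lines. This is an illustrative sketch; the function names and the example figures are assumptions and do not come from the worksheet.

```python
def mb_to_gb(value_mb):
    """Convert MB to GB by dividing by 1024, as the slide suggests."""
    return value_mb / 1024


def average_disk_after_shift(ai_count, current_avg_disk_gb,
                             shifted_fraction, higher_disk_gb):
    """Average disk quota per AI if a fraction of AIs move to a higher quota."""
    shifted = ai_count * shifted_fraction
    unchanged = ai_count - shifted
    total_gb = unchanged * current_avg_disk_gb + shifted * higher_disk_gb
    return total_gb / ai_count
```

With 8000 AIs averaging 1 GB of disk, moving 25% of them to 2 GB raises the average to 1.25 GB per AI, a figure that can then be fed back into a cell-count estimate.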
  • 14. Improving Efficiency & Reliability - Changing and Tuning Diego Cell VM types Additional things to consider: - VM Disk and Memory Capacity - For on-prem and Availability Zone (AZ) customers: - Ensure 1/N % extra in IaaS memory and disk to cover 1 AZ failure in an N AZ setup. - Factor in additional resources per cell reserved for add-ons - Set a warning level in your monitoring. - CPU vs Memory - If possible, avoid packing too many AIs into a single diego cell. - Too many means longer evacuation time, leading to longer recovery and VM update times. - Too many can also hinder application performance. CPU share is based on the AI memory.
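One way to read the 1/N guideline: if load is spread evenly over N AZs and one fails, the remaining N-1 AZs must absorb the full load, which leaves 1/N of the provisioned total as spare capacity. The sketch below works through that arithmetic; the function name and figures are assumptions for illustration.

```python
def capacity_for_one_az_failure(required_gb, az_count):
    """Total capacity to provision so N-1 AZs can carry the full load.

    Returns (total_gb, spare_fraction_of_total). With an even spread
    across AZs, the spare share of the provisioned total is 1/N.
    """
    if az_count < 2:
        raise ValueError("need at least 2 AZs to survive an AZ failure")
    per_az_gb = required_gb / (az_count - 1)
    total_gb = per_az_gb * az_count
    return total_gb, (total_gb - required_gb) / total_gb
```

For a workload needing 600 GB across 3 AZs, each AZ would hold 300 GB, for 900 GB provisioned in total with one third of it spare.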
  • 15. Improving Efficiency & Reliability - Changing and Tuning Diego Cell VM types Changing VM Types: - Ensure you have historical metrics for CPU, memory and disk bandwidth usage - More than 30 days is preferable, including seasonal periods of heavy usage - The IaaS level may have historical data to help with this. - If there is not enough data, you can use the blue-green algorithm. - Does the new VM type support your needs? - Correlate heavy usage peak metrics to - CPU load - Memory utilization and Disk IOPS - Network egress bandwidth - Write disk IOPS test scripts using FIO [9] to verify. You can provision an empty bosh VM to run the tests.* *https://www.starkandwayne.com/blog/hey-bosh-gimme-a-vm/
  • 16. Improving Efficiency & Reliability - Changing and Tuning Diego Cell VM types - How to really change the VM type? - Change the VM type in the operations manager/platform automation configuration - Opsman 2.10.3 supports the new generation of Azure VM types (without Availability Sets) [10] - Otherwise add these in automation or manually via the opsman API - But wait... the IaaS might not allow the change or may have constraints on what can be changed - Azure: Cannot switch Availability Sets (AS) sitting in old HW clusters. - Switching an AS is only possible when it contains 1 VM, e.g. diego cell AS, only 1 diego cell VM. - Before using the algorithm, check that - IaaS VM quotas and subnets have capacity - At least double your current TAS plus all existing iso seg diego cell count. - The firewall is set up. - We used the same subnet as the existing diego cells to avoid creating new firewall rules, since it is temporary for us.
  • 17. Improving Efficiency & Reliability - Changing and Tuning Diego Cell VM types
      - Zero Downtime Blue/Green Diego cell VM Switchover algorithm
          - Use isolation segments to extend the diego pool (replicate tile with unique name).
          - Share the same GoRouters as TAS.
          - Leave the segment name blank to ‘extend’ the pool of TAS diego cells [11]
          - Use the segment name to extend the pool of an existing iso seg diego cell
      - Diagram legend: Blue - current running apps (TAS Diego cells: Dsv2); Green - running apps in new Esv3; White - no running apps
      - Diagram steps:
          1. Initial: TAS Diego cells on Dsv2
          2. Set up Iso Seg with diego cells (extended TAS Diego cells in Iso Seg: Esv3)
          3. Scale down TAS diego cell (Diego cell = 1). Apps auto relocate to new Diego cells. Adjust max-in-flight for diego cells to speed up scale down, but beware and monitor BBS load.
          4. Convert TAS diego cell to Esv3 (switched TAS Diego cell to Esv3)
          5. Scale up TAS diego cell
          6. Scale down Iso Seg diego cell to 0
          7. Delete Iso Seg tile
      - With thanks to D. Stevenson for initial discussion and idea
  • 18. Improving Efficiency & Reliability - Foundation Configuration Tuning What we are trying to achieve: - Optimize utilization of VM resources. - Decrease VM count for over-provisioned resources - Increase VM count to improve platform health/HA
  • 19. Improving Efficiency & Reliability - Foundation Configuration Tuning - Get your monitoring ready - Will need to monitor platform metrics to help guide decisions - Build dashboards based on KPIs [12, 13, 14, 15] - Cloud Controller KPIs - BBS CPU Load and Memory, Diego Cells: CPU load, memory, disk capacity - UAA - Loggregator: Doppler, Traffic controller and nozzles - Overall platform health - MySQL Server: CPU Load, Memory and Disk - Go Router: CPU Load, Requests Per Second
  • 20. Improving Efficiency & Reliability - Foundation Configuration Tuning - Table out the Changes - Maintain a record of where you are vs where you want to be - Maintain notes and comments of decisions - Helps review cost benefit decisions on a bigger picture Sample only, not real data Get worksheet at https://github.com/dawu415/PCFToolkit
  • 21. Improving Efficiency & Reliability - Foundation Configuration Tuning - General VM metrics [12] - CPU load < 80% - 90% ( < 60% GoRouter ) - Memory Utilization < 80% - Persistent Disk < 80%-90%
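These thresholds lend themselves to a simple check. The sketch below is illustrative only: where the slide gives a range (80% - 90%) the lower, more conservative bound is assumed, and the job and metric names are hypothetical rather than real metric identifiers.

```python
# Upper bounds from the slide; the lower end of each quoted range is
# assumed here as the alerting threshold.
THRESHOLDS = {"cpu": 80.0, "memory": 80.0, "persistent_disk": 80.0}
GOROUTER_CPU = 60.0


def breaches(job, metrics):
    """Return the names of metrics at or above their threshold for a VM.

    `job` and the metric keys are hypothetical labels for illustration.
    """
    limits = dict(THRESHOLDS)
    if job == "gorouter":
        # GoRouter has a stricter CPU guideline than other VMs.
        limits["cpu"] = GOROUTER_CPU
    return sorted(name for name, value in metrics.items()
                  if name in limits and value >= limits[name])
```

A GoRouter at 65% CPU is flagged while a diego cell at the same load is not, which mirrors the stricter GoRouter guideline.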
  • 22. Improving Efficiency & Reliability - Foundation Configuration Tuning - Loggregator [15] - Understand the Loggregator architecture and use the Loggregator guide to assist [16] - Doppler horizontal VM scaling is effective up to a maximum of 40 VMs (v1 + v2 configuration) - Maintain a 2:1:1 ratio (doppler : traffic controller : nozzle) - Check for dropped messages in doppler, connection loss of TC and high resource utilization of nozzles. These should be part of monitoring. [Diagram: Loggregator Agents, Dopplers, Traffic Controllers/RLP, Nozzles]
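The 2:1:1 ratio and the 40-VM ceiling can be expressed as a small helper. This is a sketch under assumptions: rounding up for odd doppler counts is a choice made here, not something stated on the slide.

```python
import math

MAX_DOPPLERS = 40  # horizontal-scaling ceiling quoted on the slide


def loggregator_counts(dopplers):
    """Derive traffic controller and nozzle counts from a doppler count
    using the 2:1:1 ratio, capping dopplers at the quoted maximum."""
    dopplers = min(dopplers, MAX_DOPPLERS)
    # Round up so an odd doppler count is never under-served (assumption).
    others = math.ceil(dopplers / 2)
    return {"doppler": dopplers,
            "traffic_controller": others,
            "nozzle": others}
```

If the calculation hits the cap, adding more dopplers is unlikely to help and the dropped-message metrics become the better guide.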
  • 23.
  • 24. Improving Efficiency & Reliability - Foundation Configuration Tuning - Cloud Controller [14] - CPU load < 80% - 90% ( < 60% GoRouter ) - Memory Utilization < 80% - Persistent Disk < 80% - 90% - Determine your usage pattern plus headroom and simulate it with parallel cf pushes in a script. https://github.com/dawu415/PCFToolkit/tree/master/tests/cfpush - This exercises the Cloud Controller API when you can’t really test it in dev. Run in parallel: time cf push, time cf delete. Review Cloud Controller metrics. Abnormally slow response times can indicate that the CC or its workers are under-scaled. - Review and integrate changes into the dev environment. Monitor Cloud Controller metrics. - Understand the nozzles that query the CC API to get app names to insert into logs.
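The parallel push test can be sketched as below. This is not the script from the linked repository: the app names and directory are hypothetical, and the `runner` parameter is an assumption added so the logic can be exercised without a live foundation. `cf push NAME -p PATH` and `cf delete NAME -f` are standard cf CLI commands.

```python
import subprocess
import time
from concurrent.futures import ThreadPoolExecutor


def timed(cmd, runner=subprocess.run):
    """Run a command and return its wall-clock duration in seconds."""
    start = time.perf_counter()
    runner(cmd, check=True)
    return time.perf_counter() - start


def parallel_push_test(app_dir, count, runner=subprocess.run):
    """Push `count` uniquely named test apps in parallel, then delete them.

    Returns two lists of durations: one for pushes, one for deletes.
    """
    names = [f"cc-load-test-{i}" for i in range(count)]
    with ThreadPoolExecutor(max_workers=count) as pool:
        push_times = list(pool.map(
            lambda n: timed(["cf", "push", n, "-p", app_dir], runner), names))
        delete_times = list(pool.map(
            lambda n: timed(["cf", "delete", n, "-f"], runner), names))
    return push_times, delete_times
```

Raising `count` step by step while watching the Cloud Controller metrics gives a rough picture of where response times start to degrade.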
  • 25.
  • 26. Tallying the Savings
      Table columns: Change, Savings, Notes
      - Changes: Diego cell scheduling; Switching blobstore to external; Switch diego cells VM + tuning; Total
      - Savings (as listed): ~$40,000.00 p.a.; ~$21,500.00 p.a.; ~$1.06 million p.a.; ~$1 million p.a.
      - Notes: Originally calculated to be ~$730k but actual ~$1 million. ~30 VMs/foundation deleted and recreated per weekend. Savings from 2 North America foundations.
  • 27. References
      [1] Virtual Machine Pricing, https://azure.microsoft.com/en-us/pricing/details/virtual-machines/windows/
      [2] Managed Disk Pricing, https://azure.microsoft.com/en-us/pricing/details/managed-disks/
      [3] Blob storage Pricing, https://azure.microsoft.com/en-us/pricing/details/storage/blobs/
      [4] CF Blobstore High Availability, https://docs.cloudfoundry.org/concepts/high-availability.html#blobstore
      [5] rclone tool download, https://rclone.org/downloads/
      [6] Ev3 and Esv3-series, https://docs.microsoft.com/en-us/azure/virtual-machines/ev3-esv3-series
      [7] TAS Container Mechanics, https://docs.pivotal.io/application-service/concepts/container-security.html#mechanics
      [8] TAS Diego Architecture, https://docs.pivotal.io/application-service/concepts/diego/diego-architecture.html
      [9] Azure, Benchmark a disk, https://docs.microsoft.com/en-us/azure/virtual-machines/disks-benchmarks
      [10] Ops Manager v2.10.3 release notes, https://docs.pivotal.io/ops-manager/2-10/release-notes.html#2-10-3
      [11] Isolation Segment - app container, https://docs.pivotal.io/application-service/2-7/operating/installing-pcf-is.html#application_containers
      [12] Key Performance Indicators, https://docs.pivotal.io/application-service/operating/monitoring/key-cap-scaling.html
      [13] Key Capacity Scaling Indicators, https://docs.pivotal.io/application-service/operating/monitoring/key-cap-scaling.html
      [14] Scaling Cloud Controller, https://docs.cloudfoundry.org/running/managing-cf/scaling-cloud-controller.html
      [15] Loggregator Guide for CF operators, https://docs.cloudfoundry.org/loggregator/log-ops-guide.html
      [16] Loggregator Architecture, https://docs.cloudfoundry.org/loggregator/architecture.html
  • 28. Acknowledgements We would like to thank the following people for their involvement and efforts to make things happen - Piotr Chomiak - Richard Garro - Dan Buchko - John Calabrese - David Stevenson - Michael Chung - Kelvin Li - Jonathan Leung - Lok Wong - Haydon Ryan - John Tan
  • 29. Q & A