SlideShare une entreprise Scribd logo
1  sur  41
Applying Hyper-scale
Design Patterns to Routing
Hannes Gredler, CTO RtBrick Inc.
DEVNET-2064
© 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public
Who am I ?
• CTO at RtBrick, Inc.
• Past stint: Distinguished Engineer
with the “other router-vendor”
• 18 Years working experience,
developing, deploying and
supporting Routing Software
• Expertise
• BGP, IS-IS, MPLS
• 20+ Patents
• 20+ Proposed Standards
http://www.arkko.com/tools/allstats/hannesgredler.html
• IETF WG co-chair (IS-IS)
DEVNET-2064 2
© 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public
>2013 exposure to Data Center Networks & SR
New large-scale data-center
network model emerging
 [draft-ietf-rtgwg-bgp-routing-large-dc]
 End-to-End Layer-3 routing
 Fixes issues with L2 switching data
plane.
 Hierarchical Topology
 CLOS-based
 Max 5-stages
 Use of aggregation at TORs
DEVNET-2064 3
© 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public
Got a couple of inconvenient insights …
• Networks have become Anti-Moore
• direct sourcing from OEM manufacturers in Taiwan
• Hardware is a Commodity
• Cost per Bit dropping sharply (USD 400 / 100GBE port)
• Boutique ASICs viable in 5 years from now ?
• Curated Software Release models approaching EOL
• Modularization or Custom package selection desired (no-PIM, no RSVP, etc.)
• Pay per-use
• Different model (node vs. system) for Resiliency
• Open sourcing of components the new normal
• Integration of components becomes core competency
DEVNET-2064 4
© 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public 5DEVNET-2064
John Gage
Sun Microsystems
1) “The network is the computer”
© 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public 6DEVNET-2064
Hannes Gredler, 2015
2) “Is it possible to construct a router
based on the web 2.0 mindset ?”
• Introduction
• Multi-Level Architecture
• Micro-services & APIs
• Commoditization & Unit Economics
• Resiliency, system coupling and state recovery
• Open Source Development & Test
• Conclusion
Agenda
Multi-Level Architecture
© 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public
Hyper-scale Multi-level Architecture
9DEVNET-2064
© 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public
Forwarding node
• Translates RIB Objects to local OS
representation
• Tables
• Routes
• Nexthops
• Hardware Prefix Caching
• Aggregate FIB table
• (filter specifics)
• Localize fwd table
• VPNs
DEVNET-2064 10
© 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public
Protocol I/O node
• Schema driven protocol
serializer /de-serializer
• Keep alive delegation
/absorption
• Terminal Communication point
for Sockets, stdio & file I/O
• Pre-processing protocol stream
(filter BGP PA128)
• Queuing machinery & Routing
Protocol update generation
• Interface state handling
DEVNET-2064 11
© 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public
Application (route computation) node
APPd
input_proc
add: bgp_filter_input
chg: bgp_filter_input
del: -
IB: pre-filter Nbr
Neighbor 193.203.0.40
pre-filter Ipv4 RIB
Data Structure
Schema
• Schema driven Data Structure
Server
• Stores Applications Objects
• Routes, Nexthops, Tables
• Triggered execution (Add, Chg, Del)
of internal/external Application code
• Python functions
• C/C++ library calls
• Executables vfork()
DEVNET-2064 12
© 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public
Putting it all together
13DEVNET-2064
Micro-Services & APIs
© 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public
Build a system of little components
• Micro-service architecture is like a UNIX pipeline model
• Small pieces of software, serving a unique purpose
• Easy transfer of state from one brick to next
SortFilterSource Filter Sink
curl http://192.168.1.1/bds/object | grep “Received-From:” | sort | grep “foo” >
out
DEVNET-2064 15
© 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public
REST/JSON based APIs
16DEVNET-2064
© 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public
bds://local/bgp.neighbor
bds://local/isis.ad
j
bds://local/isis.lsdb.l2
bds://217.160.181.216/bgp.rib-in
PUBSUB
Database centric / Distributed Data Store
DEVNET-2064 17
© 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public
Open IPC format = BSON/JSON
• Binary JSON for memory and I/O
efficiency
• JSON conversion on the fly possible
DEVNET-2064 18
© 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public
Table replication & state flow within a system
19DEVNET-2064
Commoditization & Unit Economics
© 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public
Compute Strategy: Yahoo vs. Google
21DEVNET-2064
Few Big vs. Many Small
© 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public
• Economy of scale will ultimately render custom-ASICs obsolete
• FY2016 systems shipping: 100GB, > 128K FIB entries
• Disintegration happening
• soon to enter the Edge Router Business …
• For ease of integration makes no Hardware, no locality, no OS assumptions
• Unbounded Configuration Possibilities:
• Single Switch, Cluster of Switches, Co-located x86 Rack Servers ….
• Large FIBs, Small FIBs, SW-based forwarders & Combos thereof
Commodity data plane = White-boxes
DEVNET-2064 22
© 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public
• 16-32 CPU Cores, 64 GB RAM, Solid State Disks
• app USD 3000
• Runs stock Ubuntu / Centos
• Linux Containers (LXC)
• dependency management
• Para-Virtualization
Commodity control plane = 1RU Rack Servers
DEVNET-2064 23
Resiliency, system coupling and
state recovery
© 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public
Hyper-scale Multi-level Architecture
25DEVNET-2064
© 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public
Resiliency
26DEVNET-2064
© 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public 27DEVNET-2064
Resiliency – snapshot DB to disk
© 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public
Resiliency (2) – restart based on disk snapshot
28DEVNET-2064
Open Source
Development & Test
© 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public
Open Development
• Open Source
• 100 eyes better than 4 eyes, Network effects
• Long term Maintenance
• Open Source means sharing of not just Code:
• Code
• Test
• Build
• Documentation
30DEVNET-2064
© 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public
Open development (1)
• Use what is usable
• No needs to re-invent Linux, event-loops, memory managers
• Kernel based networking stacks are not usable for a router
• Debugging hard (GDB live attachment)
• Experimental forwarding code with no fault-domains in your kernel, really ?
• TCP snapshots / restart.
• In 2016Q1 we did not have a packet forwarding core
• Cisco did release fd.io / VPP
• User space DPDK design aligned with our (religious) believes
• Most feature complete open-source L3 forwarder
• Engineered for performance and maintainability
31DEVNET-2064
© 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public 32DEVNET-2064
Open development (2)
• Kick-ass VPP crew
• Helped us to implement necessary core-features (indirect next-hop) within two weeks.
• Good balance between Stability and feature velocity
• Excellent Continuous Integration & Test Automation (untypical for FLOSS projects)
© 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public
Open Development (3)
VPP Internet stream generator
33DEVNET-2064
Conclusion
© 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public
In Conclusion
• Network Equipment design got to be
• Distributed, Multi-level Architecture
• Micro-service based
• Running on Commodity Hardware
• “System” Resilient
• Open Development / Open Test
• Cisco Vector Packet Processing (VPP)
• Best code in the industry (why is this free ?)
• Good Code Governance
• Establishment of an innovative ecosystem around VPP underway
35DEVNET-2064
rtbrick demo at fd.io booth
© 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public
Demo hosted at EC2 instance
37DEVNET-2064
© 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public
• 39Million IPv4 / 3M IPv6 route entries
• BGP table snapshots from RIPE RIS server
• route-processing / update / restart performance
• 20x compared to JUNOS | IOS-XR
• Full-bringup time 180s
• Resync time 26s
• Full fault domain isolation
• Blast radius within a protocol-process of an address family
• Process restart
• Preservation of TCP session
• Fast, robust Re-sync of state
• Everything versioned
Demo SCALE
© 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public
Process restart using snapshots (3)
Normal appd restart
resync time for 2.5M
RIB entries / 28s
snapshot appd restart
resync time for 2.5M
RIB entries / 6s
Thank you
Applying Hyper-scale Design Patterns to Routing

Contenu connexe

Tendances

Cisco catalyst3750presspresentation
Cisco catalyst3750presspresentationCisco catalyst3750presspresentation
Cisco catalyst3750presspresentationho nguyen
 
How to Implement SDN Technology in ITB
How to Implement SDN Technology in ITBHow to Implement SDN Technology in ITB
How to Implement SDN Technology in ITBSDNRG ITB
 
Enterprise Networks - Cisco Digital Network Architecture - Introducing the Ne...
Enterprise Networks - Cisco Digital Network Architecture - Introducing the Ne...Enterprise Networks - Cisco Digital Network Architecture - Introducing the Ne...
Enterprise Networks - Cisco Digital Network Architecture - Introducing the Ne...Cisco Canada
 
The Changing Data Center Landscape
The Changing Data Center LandscapeThe Changing Data Center Landscape
The Changing Data Center LandscapeCisco Canada
 
Tap DANZing - Arista Networks Redefining the Cost of Accessing Network Traffic
Tap DANZing - Arista Networks Redefining the Cost of Accessing Network TrafficTap DANZing - Arista Networks Redefining the Cost of Accessing Network Traffic
Tap DANZing - Arista Networks Redefining the Cost of Accessing Network TrafficEmulex Corporation
 
The Evolution of the Data Centre
The Evolution of the Data CentreThe Evolution of the Data Centre
The Evolution of the Data CentreCisco Canada
 
TechWiseTV Workshop: Software-Defined Access
TechWiseTV Workshop: Software-Defined AccessTechWiseTV Workshop: Software-Defined Access
TechWiseTV Workshop: Software-Defined AccessRobb Boyd
 
Successfully Interconnecting Data Centers
Successfully Interconnecting Data CentersSuccessfully Interconnecting Data Centers
Successfully Interconnecting Data CentersCisco Canada
 
Cisco DC Networking: Gain Insight and Programmability with
Cisco DC Networking: Gain Insight and Programmability with Cisco DC Networking: Gain Insight and Programmability with
Cisco DC Networking: Gain Insight and Programmability with Cisco Canada
 
The Data Center Network Evolution
The Data Center Network EvolutionThe Data Center Network Evolution
The Data Center Network EvolutionCisco Canada
 
Brocade Software Networking (SDN NFV Day ITB 2016)
Brocade Software Networking (SDN NFV Day ITB 2016)Brocade Software Networking (SDN NFV Day ITB 2016)
Brocade Software Networking (SDN NFV Day ITB 2016)SDNRG ITB
 
NFV orchestration for cloud and virtual branch services
NFV orchestration for cloud and virtual branch servicesNFV orchestration for cloud and virtual branch services
NFV orchestration for cloud and virtual branch servicesCisco Canada
 
Understanding Cisco’s Next Generation SD-WAN Solution with Viptela
Understanding Cisco’s Next Generation SD-WAN Solution with ViptelaUnderstanding Cisco’s Next Generation SD-WAN Solution with Viptela
Understanding Cisco’s Next Generation SD-WAN Solution with ViptelaCisco Canada
 
Speed Hybrid WAN Deployment with the New Cisco Intelligent WAN Design Guide -...
Speed Hybrid WAN Deployment with the New Cisco Intelligent WAN Design Guide -...Speed Hybrid WAN Deployment with the New Cisco Intelligent WAN Design Guide -...
Speed Hybrid WAN Deployment with the New Cisco Intelligent WAN Design Guide -...Cisco Enterprise Networks
 
Gain Insight and Programmability with Cisco DC Networking
Gain Insight and Programmability with Cisco DC NetworkingGain Insight and Programmability with Cisco DC Networking
Gain Insight and Programmability with Cisco DC NetworkingCisco Canada
 
Atf 3 q15-7 - delivering cloud scale workflow automation control and visibili...
Atf 3 q15-7 - delivering cloud scale workflow automation control and visibili...Atf 3 q15-7 - delivering cloud scale workflow automation control and visibili...
Atf 3 q15-7 - delivering cloud scale workflow automation control and visibili...Mason Mei
 
Architecture of NFV Platform for Orchestrating Cloud-based & vBranch Managed ...
Architecture of NFV Platform for Orchestrating Cloud-based & vBranch Managed ...Architecture of NFV Platform for Orchestrating Cloud-based & vBranch Managed ...
Architecture of NFV Platform for Orchestrating Cloud-based & vBranch Managed ...Cisco Canada
 
Juniper Unified SDN Technical Presentation (SDN Day ITB 2016)
Juniper  Unified SDN Technical Presentation (SDN Day ITB 2016)Juniper  Unified SDN Technical Presentation (SDN Day ITB 2016)
Juniper Unified SDN Technical Presentation (SDN Day ITB 2016)SDNRG ITB
 

Tendances (20)

Cisco catalyst3750presspresentation
Cisco catalyst3750presspresentationCisco catalyst3750presspresentation
Cisco catalyst3750presspresentation
 
How to Implement SDN Technology in ITB
How to Implement SDN Technology in ITBHow to Implement SDN Technology in ITB
How to Implement SDN Technology in ITB
 
Enterprise Networks - Cisco Digital Network Architecture - Introducing the Ne...
Enterprise Networks - Cisco Digital Network Architecture - Introducing the Ne...Enterprise Networks - Cisco Digital Network Architecture - Introducing the Ne...
Enterprise Networks - Cisco Digital Network Architecture - Introducing the Ne...
 
The Changing Data Center Landscape
The Changing Data Center LandscapeThe Changing Data Center Landscape
The Changing Data Center Landscape
 
Tap DANZing - Arista Networks Redefining the Cost of Accessing Network Traffic
Tap DANZing - Arista Networks Redefining the Cost of Accessing Network TrafficTap DANZing - Arista Networks Redefining the Cost of Accessing Network Traffic
Tap DANZing - Arista Networks Redefining the Cost of Accessing Network Traffic
 
The Evolution of the Data Centre
The Evolution of the Data CentreThe Evolution of the Data Centre
The Evolution of the Data Centre
 
TechWiseTV Workshop: Software-Defined Access
TechWiseTV Workshop: Software-Defined AccessTechWiseTV Workshop: Software-Defined Access
TechWiseTV Workshop: Software-Defined Access
 
Successfully Interconnecting Data Centers
Successfully Interconnecting Data CentersSuccessfully Interconnecting Data Centers
Successfully Interconnecting Data Centers
 
ACI Hands-on Lab
ACI Hands-on LabACI Hands-on Lab
ACI Hands-on Lab
 
Cisco DC Networking: Gain Insight and Programmability with
Cisco DC Networking: Gain Insight and Programmability with Cisco DC Networking: Gain Insight and Programmability with
Cisco DC Networking: Gain Insight and Programmability with
 
The Data Center Network Evolution
The Data Center Network EvolutionThe Data Center Network Evolution
The Data Center Network Evolution
 
Brocade Software Networking (SDN NFV Day ITB 2016)
Brocade Software Networking (SDN NFV Day ITB 2016)Brocade Software Networking (SDN NFV Day ITB 2016)
Brocade Software Networking (SDN NFV Day ITB 2016)
 
NFV orchestration for cloud and virtual branch services
NFV orchestration for cloud and virtual branch servicesNFV orchestration for cloud and virtual branch services
NFV orchestration for cloud and virtual branch services
 
Understanding Cisco’s Next Generation SD-WAN Solution with Viptela
Understanding Cisco’s Next Generation SD-WAN Solution with ViptelaUnderstanding Cisco’s Next Generation SD-WAN Solution with Viptela
Understanding Cisco’s Next Generation SD-WAN Solution with Viptela
 
Speed Hybrid WAN Deployment with the New Cisco Intelligent WAN Design Guide -...
Speed Hybrid WAN Deployment with the New Cisco Intelligent WAN Design Guide -...Speed Hybrid WAN Deployment with the New Cisco Intelligent WAN Design Guide -...
Speed Hybrid WAN Deployment with the New Cisco Intelligent WAN Design Guide -...
 
The Cloudification of the Data Center Network
The Cloudification of the Data Center NetworkThe Cloudification of the Data Center Network
The Cloudification of the Data Center Network
 
Gain Insight and Programmability with Cisco DC Networking
Gain Insight and Programmability with Cisco DC NetworkingGain Insight and Programmability with Cisco DC Networking
Gain Insight and Programmability with Cisco DC Networking
 
Atf 3 q15-7 - delivering cloud scale workflow automation control and visibili...
Atf 3 q15-7 - delivering cloud scale workflow automation control and visibili...Atf 3 q15-7 - delivering cloud scale workflow automation control and visibili...
Atf 3 q15-7 - delivering cloud scale workflow automation control and visibili...
 
Architecture of NFV Platform for Orchestrating Cloud-based & vBranch Managed ...
Architecture of NFV Platform for Orchestrating Cloud-based & vBranch Managed ...Architecture of NFV Platform for Orchestrating Cloud-based & vBranch Managed ...
Architecture of NFV Platform for Orchestrating Cloud-based & vBranch Managed ...
 
Juniper Unified SDN Technical Presentation (SDN Day ITB 2016)
Juniper  Unified SDN Technical Presentation (SDN Day ITB 2016)Juniper  Unified SDN Technical Presentation (SDN Day ITB 2016)
Juniper Unified SDN Technical Presentation (SDN Day ITB 2016)
 

Similaire à Applying Hyper-scale Design Patterns to Routing

Open Device Programmability: Hands-on Intro to RESTCONF (and a bit of NETCONF)
Open Device Programmability: Hands-on Intro to RESTCONF (and a bit of NETCONF)Open Device Programmability: Hands-on Intro to RESTCONF (and a bit of NETCONF)
Open Device Programmability: Hands-on Intro to RESTCONF (and a bit of NETCONF)Cisco DevNet
 
BRKSPG-2069-64bit-package.pdf
BRKSPG-2069-64bit-package.pdfBRKSPG-2069-64bit-package.pdf
BRKSPG-2069-64bit-package.pdfHeng30
 
PLNOG14: Service orchestration in provider network, Tail-f - Przemysław Borek
PLNOG14: Service orchestration in provider network, Tail-f - Przemysław BorekPLNOG14: Service orchestration in provider network, Tail-f - Przemysław Borek
PLNOG14: Service orchestration in provider network, Tail-f - Przemysław BorekPROIDEA
 
Cisco's Open Device Programmability Strategy: Open Discussion
Cisco's Open Device Programmability Strategy: Open DiscussionCisco's Open Device Programmability Strategy: Open Discussion
Cisco's Open Device Programmability Strategy: Open DiscussionCisco DevNet
 
The Hitch-Hikers Guide to Data Centre Virtualization and Workload Consolidation:
The Hitch-Hikers Guide to Data Centre Virtualization and Workload Consolidation:The Hitch-Hikers Guide to Data Centre Virtualization and Workload Consolidation:
The Hitch-Hikers Guide to Data Centre Virtualization and Workload Consolidation:Cisco Canada
 
IPv6IntegrationBestPracticesfinal.pdf
IPv6IntegrationBestPracticesfinal.pdfIPv6IntegrationBestPracticesfinal.pdf
IPv6IntegrationBestPracticesfinal.pdfCPUHogg
 
BRKDCN-2670 Day2 operations for Datacenter VxLAN EVPN fabrics.pdf
BRKDCN-2670 Day2 operations for Datacenter VxLAN EVPN fabrics.pdfBRKDCN-2670 Day2 operations for Datacenter VxLAN EVPN fabrics.pdf
BRKDCN-2670 Day2 operations for Datacenter VxLAN EVPN fabrics.pdfHarryH11
 
Open coud networking at full speed - Avi Alkobi
Open coud networking at full speed - Avi AlkobiOpen coud networking at full speed - Avi Alkobi
Open coud networking at full speed - Avi AlkobiOpenInfra Days Poland 2019
 
Cisco Connect Vancouver 2017 - Understanding Cisco next gen SD-WAN
Cisco Connect Vancouver 2017 - Understanding Cisco next gen SD-WANCisco Connect Vancouver 2017 - Understanding Cisco next gen SD-WAN
Cisco Connect Vancouver 2017 - Understanding Cisco next gen SD-WANCisco Canada
 
Citrix Master Class - Live Upgrade from XenApp 6.5 to 7.6
Citrix Master Class - Live Upgrade from XenApp 6.5 to 7.6Citrix Master Class - Live Upgrade from XenApp 6.5 to 7.6
Citrix Master Class - Live Upgrade from XenApp 6.5 to 7.6Lee Bushen
 
WAN Automation Engine API Deep Dive
WAN Automation Engine API Deep DiveWAN Automation Engine API Deep Dive
WAN Automation Engine API Deep DiveCisco DevNet
 
Design and Deployment of Enterprise WLANs
Design and Deployment of Enterprise WLANsDesign and Deployment of Enterprise WLANs
Design and Deployment of Enterprise WLANsFab Fusaro
 
Triangle Kubernetes Meetup: Container cloud networking - Contiv for K8S & Ope...
Triangle Kubernetes Meetup: Container cloud networking - Contiv for K8S & Ope...Triangle Kubernetes Meetup: Container cloud networking - Contiv for K8S & Ope...
Triangle Kubernetes Meetup: Container cloud networking - Contiv for K8S & Ope...Sanjeev Rampal
 
Software Innovations and Control Plane Evolution in the new SDN Transport Arc...
Software Innovations and Control Plane Evolution in the new SDN Transport Arc...Software Innovations and Control Plane Evolution in the new SDN Transport Arc...
Software Innovations and Control Plane Evolution in the new SDN Transport Arc...Cisco Canada
 
Flexpod with SAP HANA and SAP Applications
Flexpod with SAP HANA and SAP ApplicationsFlexpod with SAP HANA and SAP Applications
Flexpod with SAP HANA and SAP ApplicationsLishantian
 

Similaire à Applying Hyper-scale Design Patterns to Routing (20)

Open Device Programmability: Hands-on Intro to RESTCONF (and a bit of NETCONF)
Open Device Programmability: Hands-on Intro to RESTCONF (and a bit of NETCONF)Open Device Programmability: Hands-on Intro to RESTCONF (and a bit of NETCONF)
Open Device Programmability: Hands-on Intro to RESTCONF (and a bit of NETCONF)
 
BRKSPG-2069-64bit-package.pdf
BRKSPG-2069-64bit-package.pdfBRKSPG-2069-64bit-package.pdf
BRKSPG-2069-64bit-package.pdf
 
BRKCRS-2110.pdf
BRKCRS-2110.pdfBRKCRS-2110.pdf
BRKCRS-2110.pdf
 
PLNOG14: Service orchestration in provider network, Tail-f - Przemysław Borek
PLNOG14: Service orchestration in provider network, Tail-f - Przemysław BorekPLNOG14: Service orchestration in provider network, Tail-f - Przemysław Borek
PLNOG14: Service orchestration in provider network, Tail-f - Przemysław Borek
 
Cisco's Open Device Programmability Strategy: Open Discussion
Cisco's Open Device Programmability Strategy: Open DiscussionCisco's Open Device Programmability Strategy: Open Discussion
Cisco's Open Device Programmability Strategy: Open Discussion
 
The Hitch-Hikers Guide to Data Centre Virtualization and Workload Consolidation:
The Hitch-Hikers Guide to Data Centre Virtualization and Workload Consolidation:The Hitch-Hikers Guide to Data Centre Virtualization and Workload Consolidation:
The Hitch-Hikers Guide to Data Centre Virtualization and Workload Consolidation:
 
IPv6IntegrationBestPracticesfinal.pdf
IPv6IntegrationBestPracticesfinal.pdfIPv6IntegrationBestPracticesfinal.pdf
IPv6IntegrationBestPracticesfinal.pdf
 
BRKDCN-2670 Day2 operations for Datacenter VxLAN EVPN fabrics.pdf
BRKDCN-2670 Day2 operations for Datacenter VxLAN EVPN fabrics.pdfBRKDCN-2670 Day2 operations for Datacenter VxLAN EVPN fabrics.pdf
BRKDCN-2670 Day2 operations for Datacenter VxLAN EVPN fabrics.pdf
 
Open coud networking at full speed - Avi Alkobi
Open coud networking at full speed - Avi AlkobiOpen coud networking at full speed - Avi Alkobi
Open coud networking at full speed - Avi Alkobi
 
CISCO DCNM.pdf
CISCO DCNM.pdfCISCO DCNM.pdf
CISCO DCNM.pdf
 
Cisco Connect Vancouver 2017 - Understanding Cisco next gen SD-WAN
Cisco Connect Vancouver 2017 - Understanding Cisco next gen SD-WANCisco Connect Vancouver 2017 - Understanding Cisco next gen SD-WAN
Cisco Connect Vancouver 2017 - Understanding Cisco next gen SD-WAN
 
Citrix Master Class - Live Upgrade from XenApp 6.5 to 7.6
Citrix Master Class - Live Upgrade from XenApp 6.5 to 7.6Citrix Master Class - Live Upgrade from XenApp 6.5 to 7.6
Citrix Master Class - Live Upgrade from XenApp 6.5 to 7.6
 
WAN Automation Engine API Deep Dive
WAN Automation Engine API Deep DiveWAN Automation Engine API Deep Dive
WAN Automation Engine API Deep Dive
 
Design and Deployment of Enterprise WLANs
Design and Deployment of Enterprise WLANsDesign and Deployment of Enterprise WLANs
Design and Deployment of Enterprise WLANs
 
5 cisco open_stack
5 cisco open_stack5 cisco open_stack
5 cisco open_stack
 
Open v ran
Open v ranOpen v ran
Open v ran
 
Triangle Kubernetes Meetup: Container cloud networking - Contiv for K8S & Ope...
Triangle Kubernetes Meetup: Container cloud networking - Contiv for K8S & Ope...Triangle Kubernetes Meetup: Container cloud networking - Contiv for K8S & Ope...
Triangle Kubernetes Meetup: Container cloud networking - Contiv for K8S & Ope...
 
Software Innovations and Control Plane Evolution in the new SDN Transport Arc...
Software Innovations and Control Plane Evolution in the new SDN Transport Arc...Software Innovations and Control Plane Evolution in the new SDN Transport Arc...
Software Innovations and Control Plane Evolution in the new SDN Transport Arc...
 
BEST REST in OpenStack
BEST REST in OpenStackBEST REST in OpenStack
BEST REST in OpenStack
 
Flexpod with SAP HANA and SAP Applications
Flexpod with SAP HANA and SAP ApplicationsFlexpod with SAP HANA and SAP Applications
Flexpod with SAP HANA and SAP Applications
 

Dernier

『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书rnrncn29
 
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书rnrncn29
 
Film cover research (1).pptxsdasdasdasdasdasa
Film cover research (1).pptxsdasdasdasdasdasaFilm cover research (1).pptxsdasdasdasdasdasa
Film cover research (1).pptxsdasdasdasdasdasa494f574xmv
 
SCM Symposium PPT Format Customer loyalty is predi
SCM Symposium PPT Format Customer loyalty is prediSCM Symposium PPT Format Customer loyalty is predi
SCM Symposium PPT Format Customer loyalty is predieusebiomeyer
 
Top 10 Interactive Website Design Trends in 2024.pptx
Top 10 Interactive Website Design Trends in 2024.pptxTop 10 Interactive Website Design Trends in 2024.pptx
Top 10 Interactive Website Design Trends in 2024.pptxDyna Gilbert
 
TRENDS Enabling and inhibiting dimensions.pptx
TRENDS Enabling and inhibiting dimensions.pptxTRENDS Enabling and inhibiting dimensions.pptx
TRENDS Enabling and inhibiting dimensions.pptxAndrieCagasanAkio
 
Company Snapshot Theme for Business by Slidesgo.pptx
Company Snapshot Theme for Business by Slidesgo.pptxCompany Snapshot Theme for Business by Slidesgo.pptx
Company Snapshot Theme for Business by Slidesgo.pptxMario
 
ETHICAL HACKING dddddddddddddddfnandni.pptx
ETHICAL HACKING dddddddddddddddfnandni.pptxETHICAL HACKING dddddddddddddddfnandni.pptx
ETHICAL HACKING dddddddddddddddfnandni.pptxNIMMANAGANTI RAMAKRISHNA
 
办理多伦多大学毕业证成绩单|购买加拿大UTSG文凭证书
办理多伦多大学毕业证成绩单|购买加拿大UTSG文凭证书办理多伦多大学毕业证成绩单|购买加拿大UTSG文凭证书
办理多伦多大学毕业证成绩单|购买加拿大UTSG文凭证书zdzoqco
 
Unidad 4 – Redes de ordenadores (en inglés).pptx
Unidad 4 – Redes de ordenadores (en inglés).pptxUnidad 4 – Redes de ordenadores (en inglés).pptx
Unidad 4 – Redes de ordenadores (en inglés).pptxmibuzondetrabajo
 
IP addressing and IPv6, presented by Paul Wilson at IETF 119
IP addressing and IPv6, presented by Paul Wilson at IETF 119IP addressing and IPv6, presented by Paul Wilson at IETF 119
IP addressing and IPv6, presented by Paul Wilson at IETF 119APNIC
 

Dernier (11)

『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书
 
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书
 
Film cover research (1).pptxsdasdasdasdasdasa
Film cover research (1).pptxsdasdasdasdasdasaFilm cover research (1).pptxsdasdasdasdasdasa
Film cover research (1).pptxsdasdasdasdasdasa
 
SCM Symposium PPT Format Customer loyalty is predi
SCM Symposium PPT Format Customer loyalty is prediSCM Symposium PPT Format Customer loyalty is predi
SCM Symposium PPT Format Customer loyalty is predi
 
Top 10 Interactive Website Design Trends in 2024.pptx
Top 10 Interactive Website Design Trends in 2024.pptxTop 10 Interactive Website Design Trends in 2024.pptx
Top 10 Interactive Website Design Trends in 2024.pptx
 
TRENDS Enabling and inhibiting dimensions.pptx
TRENDS Enabling and inhibiting dimensions.pptxTRENDS Enabling and inhibiting dimensions.pptx
TRENDS Enabling and inhibiting dimensions.pptx
 
Company Snapshot Theme for Business by Slidesgo.pptx
Company Snapshot Theme for Business by Slidesgo.pptxCompany Snapshot Theme for Business by Slidesgo.pptx
Company Snapshot Theme for Business by Slidesgo.pptx
 
ETHICAL HACKING dddddddddddddddfnandni.pptx
ETHICAL HACKING dddddddddddddddfnandni.pptxETHICAL HACKING dddddddddddddddfnandni.pptx
ETHICAL HACKING dddddddddddddddfnandni.pptx
 
办理多伦多大学毕业证成绩单|购买加拿大UTSG文凭证书
办理多伦多大学毕业证成绩单|购买加拿大UTSG文凭证书办理多伦多大学毕业证成绩单|购买加拿大UTSG文凭证书
办理多伦多大学毕业证成绩单|购买加拿大UTSG文凭证书
 
Unidad 4 – Redes de ordenadores (en inglés).pptx
Unidad 4 – Redes de ordenadores (en inglés).pptxUnidad 4 – Redes de ordenadores (en inglés).pptx
Unidad 4 – Redes de ordenadores (en inglés).pptx
 
IP addressing and IPv6, presented by Paul Wilson at IETF 119
IP addressing and IPv6, presented by Paul Wilson at IETF 119IP addressing and IPv6, presented by Paul Wilson at IETF 119
IP addressing and IPv6, presented by Paul Wilson at IETF 119
 

Applying Hyper-scale Design Patterns to Routing

  • 1. Applying Hyper-scale Design Patterns to Routing Hannes Gredler, CTO RtBrick Inc. DEVNET-2064
  • 2. © 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public Who am I ? • CTO at RtBrick, Inc. • Past stint: Distinguished Engineer with the “other router-vendor” • 18 Years working experience, developing, deploying and supporting Routing Software • Expertise • BGP, IS-IS, MPLS • 20+ Patents • 20+ Proposed Standards http://www.arkko.com/tools/allstats/hannesgredler.html • IETF WG co-chair (IS-IS) DEVNET-2064 2
  • 3. © 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public >2013 exposure to Data Center Networks & SR New large-scale data-center network model emerging  [draft-ietf-rtgwg-bgp-routing-large-dc]  End-to-End Layer-3 routing  Fixes issues with L2 switching data plane.  Hierarchical Topology  CLOS-based  Max 5-stages  Use of aggregation at TORs DEVNET-2064 3
  • 4. © 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public Got a couple of inconvenient insights … • Networks have become Anti-Moore • direct sourcing from OEM manufacturers in Taiwan • Hardware is a Commodity • Cost per Bit dropping sharply (USD 400 / 100GBE port) • Boutique ASICs viable in 5 years from now ? • Curated Software Release models approaching EOL • Modularization or Custom package selection desired (no-PIM, no RSVP, etc.) • Pay per-use • Different model (node vs. system) for Resiliency • Open sourcing of components the new normal • Integration of components becomes core competency DEVNET-2064 4
  • 5. © 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public 5DEVNET-2064 John Gage Sun Microsystems 1) “The network is the computer”
  • 6. © 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public 6DEVNET-2064 Hannes Gredler, 2015 2) “Is it possible to construct a router based on the web 2.0 mindset ?”
  • 7. • Introduction • Multi-Level Architecture • Micro-services & APIs • Commoditization & Unit Economics • Resiliency, system coupling and state recovery • Open Source Development & Test • Conclusion Agenda
  • 9. © 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public Hyper-scale Multi-level Architecture 9DEVNET-2064
  • 10. © 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public Forwarding node • Translates RIB Objects to local OS representation • Tables • Routes • Nexthops • Hardware Prefix Caching • Aggregate FIB table • (filter specifics) • Localize fwd table • VPNs DEVNET-2064 10
  • 11. © 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public Protocol I/O node • Schema driven protocol serializer /de-serializer • Keep alive delegation /absorption • Terminal Communication point for Sockets, stdio & file I/O • Pre-processing protocol stream (filter BGP PA128) • Queuing machinery & Routing Protocol update generation • Interface state handling DEVNET-2064 11
  • 12. © 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public Application (route computation) node APPd input_proc add: bgp_filter_input chg: bgp_filter_input del: - IB: pre-filter Nbr Neighbor 193.203.0.40 pre-filter Ipv4 RIB Data Structure Schema • Schema driven Data Structure Server • Stores Applications Objects • Routes, Nexthops, Tables • Triggered execution (Add, Chg, Del) of internal/external Application code • Python functions • C/C++ library calls • Executables vfork() DEVNET-2064 12
  • 13. © 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public Putting it all together 13DEVNET-2064
  • 15. © 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public Build a system of little components • Micro-service architecture is like a UNIX pipeline model • Small pieces of software, serving a unique purpose • Easy transfer of state from one brick to next SortFilterSource Filter Sink curl http://192.168.1.1/bds/object | grep “Received-From:” | sort | grep “foo” > out DEVNET-2064 15
  • 16. © 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public REST/JSON based APIs 16DEVNET-2064
  • 17. © 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public bds://local/bgp.neighbor bds://local/isis.ad j bds://local/isis.lsdb.l2 bds://217.160.181.216/bgp.rib-in PUBSUB Database centric / Distributed Data Store DEVNET-2064 17
  • 18. © 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public Open IPC format = BSON/JSON • Binary JSON for memory and I/O efficiency • JSON conversion on the fly possible DEVNET-2064 18
  • 19. © 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public Table replication & state flow within a system 19DEVNET-2064
  • 21. © 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public Compute Strategy: Yahoo vs. Google 21DEVNET-2064 Few Big vs. Many Small
  • 22. © 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public • Economy of scale will ultimately render custom-ASICs obsolete • FY2016 systems shipping: 100GB, > 128K FIB entries • Disintegration happening • soon to enter the Edge Router Business … • For ease of integration makes no Hardware, no locality, no OS assumptions • Unbounded Configuration Possibilities: • Single Switch, Cluster of Switches, Co-located x86 Rack Servers …. • Large FIBs, Small FIBs, SW-based forwarders & Combos thereof Commodity data plane = White-boxes DEVNET-2064 22
  • 23. © 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public • 16-32 CPU Cores, 64 GB RAM, Solid State Disks • app USD 3000 • Runs stock Ubuntu / Centos • Linux Containers (LXC) • dependency management • Para-Virtualization Commodity control plane = 1RU Rack Servers DEVNET-2064 23
  • 24. Resiliency, system coupling and state recovery
  • 25. © 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public Hyper-scale Multi-level Architecture 25DEVNET-2064
  • 26. © 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public Resiliency 26DEVNET-2064
  • 27. © 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public 27DEVNET-2064 Resiliency – snapshot DB to disk
  • 28. © 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public Resiliency (2) – restart based on disk snapshot 28DEVNET-2064
  • 30. © 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public Open Development • Open Source • 100 eyes better than 4 eyes, Network effects • Long term Maintenance • Open Source means sharing of not just Code: • Code • Test • Build • Documentation 30DEVNET-2064
  • 31. © 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public Open development (1) • Use what is usable • No needs to re-invent Linux, event-loops, memory managers • Kernel based networking stacks are not usable for a router • Debugging hard (GDB live attachment) • Experimental forwarding code with no fault-domains in your kernel, really ? • TCP snapshots / restart. • In 2016Q1 we did not have a packet forwarding core • Cisco did release fd.io / VPP • User space DPDK design aligned with our (religious) believes • Most feature complete open-source L3 forwarder • Engineered for performance and maintainability 31DEVNET-2064
  • 32. © 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public 32DEVNET-2064 Open development (2) • Kick-ass VPP crew • Helped us to implement necessary core-features (indirect next-hop) within two weeks. • Good balance between Stability and feature velocity • Excellent Continuous Integration & Test Automation (untypical for FLOSS projects)
  • 33. © 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public Open Development (3) VPP Internet stream generator 33DEVNET-2064
  • 35. © 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public In Conclusion • Network Equipment design got to be • Distributed, Multi-level Architecture • Micro-service based • Running on Commodity Hardware • “System” Resilient • Open Development / Open Test • Cisco Vector Packet Processing (VPP) • Best code in the industry (why is this free ?) • Good Code Governance • Establishment of an innovative ecosystem around VPP underway 35DEVNET-2064
  • 36. rtbrick demo at fd.io booth
  • 37. © 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public Demo hosted at EC2 instance 37DEVNET-2064
  • 38. © 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public • 39Million IPv4 / 3M IPv6 route entries • BGP table snapshots from RIPE RIS server • route-processing / update / restart performance • 20x compared to JUNOS | IOS-XR • Full-bringup time 180s • Resync time 26s • Full fault domain isolation • Blast radius within a protocol-process of an address family • Process restart • Preservation of TCP session • Fast, robust Re-sync of state • Everything versioned Demo SCALE
  • 39. © 2016 Cisco and/or its affiliates. All rights reserved. Cisco Public Process restart using snapshots (3) Normal appd restart resync time for 2.5M RIB entries / 28s snapshot appd restart resync time for 2.5M RIB entries / 6s

Notes de l'éditeur

  1. I am Hannes Gredler currently exiting my engagement with Juniper Networks, Inc. the worlds second largest router-vendor. Throughout my entire professional life I have been supporting and building routers and routing software. It takes 10000 hours of experience to master any art. – In the Routing world I have accumulated experience of 18000 hours of deploying and supporting routers and another 18000 hours for developing software and making system design choices. At Juniper I am in charge of providing architectural guidance for Junipers Development center in Bangalore, India. In my role I oversee the routing-protoccol development at the intersection of BGP, IS-IS and MPLS. As a voluntary community service I do chair the IETF working group standardazing extensions to the IS-IS Link-state routing protocol. In addition I have authored/co-authored the following documents http://www.arkko.com/tools/allstats/hannesgredler.html
  2. The core of the system is a distributed data store which holds every state in the system. Irrespective if it is interface information, IS-IS adjacencies, BGP RIBs or forwarding tables – every state is stored in the back store. Brick Data Store (BDS) it is a model-driven indexing and data replication vehicle. BDS has been designed for speed and consistency – data insertion can progress as fast as 1M updates per second per CPU core. The back store is configured by defining tables, objects and its attributes akin to a SQL server using a json config file. The back store also allows to quickly locate who is the originator of a an object and generate local replicas of that data for local (in-situ) processing. BDS is fully horizontal scalable, so if there are large tables (like for example RIB-ins) then it can shard the workload across a set of worker processes) all of the shard-ing can happen without changing a single-line of code The database centric design allows us to do all the cool-HA things, like doing live-software upgrades, restart of components without loss of service, etc. – Furthermore this design paradigm greatly minimizes the amount of boiler-plate code that one has to do to develop new networking code.