SlideShare une entreprise Scribd logo
1  sur  47
Télécharger pour lire hors ligne
Cedrick Lunven
Director of developer Advocacy
Top 10 present and future
innovations in the NoSQL
Cassandra ecosystem
Cédrick
Lunven
Developer Advocates, Events, Community, Partners
2
@clunven
Directeur Developer Relations
➢ Speaker
➢ Streamer
➢ Author
➢ Training
➢ Community Manager
➢ Application developer
➢ Technical Architect
➢ Presales
➢ Support
Agenda
1. Why Cassandra
2. Top 10 innovations in the Cassandra Ecosystem
2.1. Cassandra 4.1 - CEP-03 = System-wide Guardrails
2.2. Cassandra 4.1 - CEP-13 = Denylisting Partition Keys
2.3. Cassandra 4.1 - CEP-18 = Pluggable Extension Points
2.4. Cassandra 4.2 - CEP-07 = Storage Indexed Storage
2.5. Cassandra 4.2 - CEP-19 = Tries Memtables
2.6. Cassandra 4.2 - CEP-21 = Transactional Cluster MetaData
2.7. Cassandra 4.2 - CEP-20 = Dynamic Data Masking
2.8. Cassandra 4.2 - CEP-15 = Acid Transaction
2.9. Communicate with any application using Stargate
2.10. Deploy Cassandra in Kubernetes
3. Resources
3
Why Cassandra ?
4
Relational Databases are Versatile
5
Fast Queries
“Customer-facing”
High number of transactions
Usually Hot / Live Data
High SLA Requirements
(Response Time / Availability)
OLTP: Online Transaction Processing OLAP: Online Analytical Processing
Complex Queries
Historical
High volume of data
Often “Cold Data”
Used by / for Analytics
Volume
of Data
Response
Time
Relational Databases have limited scalability
6
They were designed to run on a single machine
●
●
●
Apache Cassandra = NoSQL Distributed Decentralised DBMS
7
NODE
NODE
NODE
NODE
NODE NODE
NODE
1 Installation = 1 NODE
✔ Capacity = ~ 2-4TB
✔ Throughput = LOTS Tx/sec/core
Communication:
✔ Gossiping
DataCenter | Ring
Distributed ?
8
• Geographic Distribution • Hybrid-Cloud and Multi-Cloud
On-premise
Always On
Every second of downtime
translates into lost revenue
High Availability
Apache
Cassandra
Built as a
Technical
Necessity
Linear Scalability
of Fortune 100
companies
have adopted
90%
Hyper-Scalability
Millions of operations per day,
hour, or second
Faster Pace
Every millisecond of latency
has consequence
Low Latency
Data Everywhere
On-premises, hybrid, multi-cloud,
centralized, or edge
Global Distribution
Use Cases
10
High Throughput
High Volume
Heavy Writes
Heavy Reads
Event Streaming Log Analytics
Internet of Things Other Time Series
Mission-Critical
No Data Loss
Always-on
Scalability
Availability
Distributed
Cloud-native
Caching Pricing
Market Data Inventory
Banking Retail
Tracking /
Logistics
Customer
Experience
API Layer Hybrid-cloud
Enterprise Data
Layer
Multi-cloud
Modern Cloud
Applications
Global Presence
Workload Mobility
Compliance /
GDPR
Cassandra 4.1 (2022)
● CEP-03 = System-wide Guardrails
● CEP-13 = Denylisting Partition Keys
● CEP-18 = Pluggable Extension Points
11
Cassandra 4.0 (2021)
Is a big Deal !
12
● Wait for 6 years
● Production Ready day 1
● Features
○ 25% faster
○ Faster Big Clusters
○ Incremental Repairs
○ Virtual Tables
○ Transient Replicas
Cassandra 4.1 - What’s in it?
● Themes
● Major yearly release
● Usability
● Security
● Pluggability
● Features
● Configuration Improvements
● System-wide Guardrails
● Denylisting Partition Keys
● Pluggable Extension Points
● QA building blocks
13
Improved Configuration
14
● Standard and Intuitive names
○ noun_verb
● Units in values
System-wide Guardrails (CEP-3)
● Hard and soft limits on magnitudes
● Disabling features entirely
● Disallowing specific configuration values
● Administrative users exempted
● Typically checked at the CQL layer
● Based on work in Astra
15
https://cassandra.apache.org/_/blog/Apache-Cassandra-4.1-Features-Guardrails-Framework.html
Cassandra 4.1 - Disabling Features
● Example: ALLOW FILTERING
● Configure in cassandra.yaml :
● Verify setting in cqlsh:
● Developer experience:
16
Cassandra 4.1 - Hard and Soft Limits
● Example: ALLOW FILTERING
● Configure in cassandra.yaml :
● Developer experience (soft limit aka warning):
17
Denylisting Partition Keys (CEP-13)
18
https://cassandra.apache.org/_/blog/Apache-Cassandra-4.1-Denylisting-Partitions.html
● Overloaded partitions break clusters
○ compactions, streaming, GC, etc
○ Bad data modeling choices
○ Unintended usage or attacks
● The solution: operators can identify
partition keys for which queries will be
disallowed overloaded keys
https://unsplash.com/@nevenkrcmarek
Pluggable extension points
● Valuable mechanism to add new features
without disrupting the core
○ Internal APIs and code restructuring
● Pluggable storage (memtables)
○ Next: SSTables
● Pluggable network encryption
○ External key providers
● Pluggable authentication
○ Extend cqlsh via Python modules
● External schema storage
○ etcd for Kubernetes environments
19
Updating to Cassandra 4.1
20
Upgrading to 4.1
● In place upgrade from 4.0
● No application changes
● Simple steps, same as last time
● 1. Verify your software versions
● 2. Perform a cluster-wide snapshot
● 3. Upgrade the first node and verify
● 4. Continue to each node in your
cluster
21
https://foojay.io/today/have-you-upgraded-to-cassandra-4-0/
If you need help preparing for an upgrade to 4.1
● For a limited time — until July 31, 2022 — our award-winning team of Apache Cassandra™
contributors are available to help evaluate the health of your clusters, including conducting a
comprehensive health check, at a reduced price of $200 USD a month.*
● This special offer includes:
● Best practices, performance and security improvements, plus SLA achievement
● Expert-led health check of your Apache Cassandra environment
● On-going support for 1 year via DataStax Luna
● Expertise and peace of mind for your team
● How can you learn more and request this offer?
https://www.datastax.com/lp/luna-apache-cassandra
22
Cassandra 4.2 (2023)
● CEP-07 = Storage Attached Index
● CEP-19 = Tries Memtables
● CEP-21 = Transactional Cluster MetaData
● CEP-20 = Dynamic Data Masking
● CEP-15 = Acid Transaction
23
Storage Attached Index
24
Intended to replace both 2i and SASI
Trie Memtables (CEP-19)
25
Transactional Cluster MetaData
26
ACID Transactions (CEP-15)
27
Cassandra Relationship With Developers
28
Easy Hard
● Scale
● Resilience
● Distribution
● Atomicity
● Serialized changes
● Complicated State
Built-in Data Modeling
& Code
Accord
29
Node 1
Node 4 Node 2
Node 3
● Every node has a Reorder Buffer
● Clock skew is cool
● Leaderless timestamp protocol
● Fast Path Electorates: Fault tolerance
● TL;DR One Round Trip - ish
Reorder
Buffer
Reorder
Buffer
Reorder
Buffer
Reorder
Buffer
Good for Cassandra
● Leaderless
● Scales like Cassandra
● Failure modes match
More Tools
● Communicate with any application using Stargate
● Deploy Cassandra in Kubernetes
30
Stargate Overview
An open source API framework for data
Stargate makes it easy to use a
database for any application workload
by adding plugin support for new APIs,
data types, and access methods
MICROSERVICES
DEVELOPERS
31
Astra DB
gRPC
CQL
GraphQL Document
API
Data
API
Cassandra
3.x
Cassandra
4.x DSE 6.x
API Extensions and Persistence Extensions
32
Stargate
Cassandra Query
Language
GraphQL REST Document
SQL like Table Model
Structured Data
Key-Value Data
Strong Types
Minimal query overhead
Hierarchy of
types and fields
Structured Data
Key-Value Data
Low query overhead
Row based
Structured Data
Key-Value Data
Weaker Types
High query overhead
JSON Documents
Semi-Structured Data
Weaker Types
High query overhead
Drivers Open API
More Performant More Flexible
gRPC
Structured Data (CQL)
Lighter weight
Native driver alternative
Low query overhead
33
Connecting to your cluster (Before)
NODE
NODE
NODE
NODE
NODE NODE
NODE
developers
Apps
CQL
CQL
CQL
CQL
CQL
Drivers
CQL
CQL
34
Connecting to your cluster (with Stargates)
NODE
NODE
NODE
NODE
NODE NODE
NODE
developers
Apps
SDK
CQL
CQL
CQL
CQL
CQL
Stargate
Stargate
Stargate
CQL
CQL
CQL
CQL
CQL
Load
Balancer
(HTTP)
35
+ =
36
Apache
Cassandra®
Cass-Operator
37
Apache
Cassandra®
Cass-Operator
Metrics
Collector
38
Apache
Cassandra®
Cass-Operator
Cassandra
Medusa
(backup/restore)
Metrics
Collector
📁S3, GCP,...
Repear
(repair)
39
Apache
Cassandra®
Cass-Operator
Cassandra
Medusa
(backup/restore)
Metrics
Collector
📁S3, GCP,...
Repear
(repair)
Stargate
Resources
41
Cassandra Community Events
42
● 10/11 - London
● 10/13 - Amsterdam
● 11/08 - Hanoi
● 11/10 - Jakarta
● 11/10 - Santa Clara
● 11/10 - Houston
● 11/10 - Seattle
● 11/15 - Singapore
Every
Wednesday
12pm ET
Cassandra Community Resources
Planet Cassandra is back!
https://www.youtube.com/channel/UCvP-AXuCr
-naAeEccCfKwUA
43
Developer Community
https://www.datastax.com/dev/community
Ecosystem Updates
● Try out 4.1 features
including guardrails
44
Coming soon:
● Cassandra 4.1 support
● Stargate 2.0 release
● New Docs API
● More APIs
Coming soon:
● Cassandra 4.1 support
● K8ssandra Operator v1.2
Cassandra: The Definitive Guide, Revised Third Edition
45
https://www.datastax.com/resources/ebook/oreilly-cassandra-the-definitive-guide
Questions
46
Thank You!
47
@clunven

Contenu connexe

Similaire à Top 10 present and future innovations in the NoSQL Cassandra ecosystem (2022)

iland Internet Solutions: Leveraging Cassandra for real-time multi-datacenter...
iland Internet Solutions: Leveraging Cassandra for real-time multi-datacenter...iland Internet Solutions: Leveraging Cassandra for real-time multi-datacenter...
iland Internet Solutions: Leveraging Cassandra for real-time multi-datacenter...DataStax Academy
 
Cloud Computing Was Built for Web Developers—What Does v2 Look Like for Deep...
 Cloud Computing Was Built for Web Developers—What Does v2 Look Like for Deep... Cloud Computing Was Built for Web Developers—What Does v2 Look Like for Deep...
Cloud Computing Was Built for Web Developers—What Does v2 Look Like for Deep...Databricks
 
Containers and Kubernetes
Containers and KubernetesContainers and Kubernetes
Containers and KubernetesAltoros
 
OSMC 2023 | What’s new with Grafana Labs’s Open Source Observability stack by...
OSMC 2023 | What’s new with Grafana Labs’s Open Source Observability stack by...OSMC 2023 | What’s new with Grafana Labs’s Open Source Observability stack by...
OSMC 2023 | What’s new with Grafana Labs’s Open Source Observability stack by...NETWAYS
 
Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with A...
Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with A...Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with A...
Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with A...confluent
 
CN Asturias - Stateful application for kubernetes
CN Asturias -  Stateful application for kubernetes CN Asturias -  Stateful application for kubernetes
CN Asturias - Stateful application for kubernetes Cédrick Lunven
 
Using Kubernetes to make cellular data plans cheaper for 50M users
Using Kubernetes to make cellular data plans cheaper for 50M usersUsing Kubernetes to make cellular data plans cheaper for 50M users
Using Kubernetes to make cellular data plans cheaper for 50M usersMirantis
 
Dataservices based on mesos and kafka kostiantyn bokhan dataconf 21 04 18
Dataservices based on mesos and kafka kostiantyn bokhan dataconf 21 04 18Dataservices based on mesos and kafka kostiantyn bokhan dataconf 21 04 18
Dataservices based on mesos and kafka kostiantyn bokhan dataconf 21 04 18Olga Zinkevych
 
NetflixOSS Meetup season 3 episode 1
NetflixOSS Meetup season 3 episode 1NetflixOSS Meetup season 3 episode 1
NetflixOSS Meetup season 3 episode 1Ruslan Meshenberg
 
Cassandra Tools and Distributed Administration (Jeffrey Berger, Knewton) | C*...
Cassandra Tools and Distributed Administration (Jeffrey Berger, Knewton) | C*...Cassandra Tools and Distributed Administration (Jeffrey Berger, Knewton) | C*...
Cassandra Tools and Distributed Administration (Jeffrey Berger, Knewton) | C*...DataStax
 
Achieve high throughput: A case study using a Pensando Distributed Services C...
Achieve high throughput: A case study using a Pensando Distributed Services C...Achieve high throughput: A case study using a Pensando Distributed Services C...
Achieve high throughput: A case study using a Pensando Distributed Services C...Principled Technologies
 
Сloud Webinar #1 “Architecture of Highly Loaded Geo-Distributed Applications”
Сloud Webinar #1 “Architecture of Highly Loaded Geo-Distributed Applications”Сloud Webinar #1 “Architecture of Highly Loaded Geo-Distributed Applications”
Сloud Webinar #1 “Architecture of Highly Loaded Geo-Distributed Applications”GlobalLogic Ukraine
 
Patterns and Pains of Migrating Legacy Applications to Kubernetes
Patterns and Pains of Migrating Legacy Applications to KubernetesPatterns and Pains of Migrating Legacy Applications to Kubernetes
Patterns and Pains of Migrating Legacy Applications to KubernetesQAware GmbH
 
Patterns and Pains of Migrating Legacy Applications to Kubernetes
Patterns and Pains of Migrating Legacy Applications to KubernetesPatterns and Pains of Migrating Legacy Applications to Kubernetes
Patterns and Pains of Migrating Legacy Applications to KubernetesJosef Adersberger
 
Unleash the Power of Open Networking
Unleash the Power of Open NetworkingUnleash the Power of Open Networking
Unleash the Power of Open NetworkingCumulus Networks
 
Calum McCrea, Software Engineer at Kx Systems, "Kx: How Wall Street Tech can ...
Calum McCrea, Software Engineer at Kx Systems, "Kx: How Wall Street Tech can ...Calum McCrea, Software Engineer at Kx Systems, "Kx: How Wall Street Tech can ...
Calum McCrea, Software Engineer at Kx Systems, "Kx: How Wall Street Tech can ...Dataconomy Media
 
Best Practices for Building Hybrid-Cloud Architectures | Hans Jespersen
Best Practices for Building Hybrid-Cloud Architectures | Hans JespersenBest Practices for Building Hybrid-Cloud Architectures | Hans Jespersen
Best Practices for Building Hybrid-Cloud Architectures | Hans Jespersenconfluent
 
Scylla Summit 2019 Keynote - Avi Kivity
Scylla Summit 2019 Keynote - Avi KivityScylla Summit 2019 Keynote - Avi Kivity
Scylla Summit 2019 Keynote - Avi KivityScyllaDB
 
Orchestrating Cassandra with Kubernetes Operator and PaaSTA
Orchestrating Cassandra with Kubernetes Operator and PaaSTAOrchestrating Cassandra with Kubernetes Operator and PaaSTA
Orchestrating Cassandra with Kubernetes Operator and PaaSTARaghavendra Prabhu
 

Similaire à Top 10 present and future innovations in the NoSQL Cassandra ecosystem (2022) (20)

iland Internet Solutions: Leveraging Cassandra for real-time multi-datacenter...
iland Internet Solutions: Leveraging Cassandra for real-time multi-datacenter...iland Internet Solutions: Leveraging Cassandra for real-time multi-datacenter...
iland Internet Solutions: Leveraging Cassandra for real-time multi-datacenter...
 
Multi-cluster k8ssandra
Multi-cluster k8ssandraMulti-cluster k8ssandra
Multi-cluster k8ssandra
 
Cloud Computing Was Built for Web Developers—What Does v2 Look Like for Deep...
 Cloud Computing Was Built for Web Developers—What Does v2 Look Like for Deep... Cloud Computing Was Built for Web Developers—What Does v2 Look Like for Deep...
Cloud Computing Was Built for Web Developers—What Does v2 Look Like for Deep...
 
Containers and Kubernetes
Containers and KubernetesContainers and Kubernetes
Containers and Kubernetes
 
OSMC 2023 | What’s new with Grafana Labs’s Open Source Observability stack by...
OSMC 2023 | What’s new with Grafana Labs’s Open Source Observability stack by...OSMC 2023 | What’s new with Grafana Labs’s Open Source Observability stack by...
OSMC 2023 | What’s new with Grafana Labs’s Open Source Observability stack by...
 
Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with A...
Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with A...Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with A...
Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with A...
 
CN Asturias - Stateful application for kubernetes
CN Asturias -  Stateful application for kubernetes CN Asturias -  Stateful application for kubernetes
CN Asturias - Stateful application for kubernetes
 
Using Kubernetes to make cellular data plans cheaper for 50M users
Using Kubernetes to make cellular data plans cheaper for 50M usersUsing Kubernetes to make cellular data plans cheaper for 50M users
Using Kubernetes to make cellular data plans cheaper for 50M users
 
Dataservices based on mesos and kafka kostiantyn bokhan dataconf 21 04 18
Dataservices based on mesos and kafka kostiantyn bokhan dataconf 21 04 18Dataservices based on mesos and kafka kostiantyn bokhan dataconf 21 04 18
Dataservices based on mesos and kafka kostiantyn bokhan dataconf 21 04 18
 
NetflixOSS Meetup season 3 episode 1
NetflixOSS Meetup season 3 episode 1NetflixOSS Meetup season 3 episode 1
NetflixOSS Meetup season 3 episode 1
 
Cassandra Tools and Distributed Administration (Jeffrey Berger, Knewton) | C*...
Cassandra Tools and Distributed Administration (Jeffrey Berger, Knewton) | C*...Cassandra Tools and Distributed Administration (Jeffrey Berger, Knewton) | C*...
Cassandra Tools and Distributed Administration (Jeffrey Berger, Knewton) | C*...
 
Achieve high throughput: A case study using a Pensando Distributed Services C...
Achieve high throughput: A case study using a Pensando Distributed Services C...Achieve high throughput: A case study using a Pensando Distributed Services C...
Achieve high throughput: A case study using a Pensando Distributed Services C...
 
Сloud Webinar #1 “Architecture of Highly Loaded Geo-Distributed Applications”
Сloud Webinar #1 “Architecture of Highly Loaded Geo-Distributed Applications”Сloud Webinar #1 “Architecture of Highly Loaded Geo-Distributed Applications”
Сloud Webinar #1 “Architecture of Highly Loaded Geo-Distributed Applications”
 
Patterns and Pains of Migrating Legacy Applications to Kubernetes
Patterns and Pains of Migrating Legacy Applications to KubernetesPatterns and Pains of Migrating Legacy Applications to Kubernetes
Patterns and Pains of Migrating Legacy Applications to Kubernetes
 
Patterns and Pains of Migrating Legacy Applications to Kubernetes
Patterns and Pains of Migrating Legacy Applications to KubernetesPatterns and Pains of Migrating Legacy Applications to Kubernetes
Patterns and Pains of Migrating Legacy Applications to Kubernetes
 
Unleash the Power of Open Networking
Unleash the Power of Open NetworkingUnleash the Power of Open Networking
Unleash the Power of Open Networking
 
Calum McCrea, Software Engineer at Kx Systems, "Kx: How Wall Street Tech can ...
Calum McCrea, Software Engineer at Kx Systems, "Kx: How Wall Street Tech can ...Calum McCrea, Software Engineer at Kx Systems, "Kx: How Wall Street Tech can ...
Calum McCrea, Software Engineer at Kx Systems, "Kx: How Wall Street Tech can ...
 
Best Practices for Building Hybrid-Cloud Architectures | Hans Jespersen
Best Practices for Building Hybrid-Cloud Architectures | Hans JespersenBest Practices for Building Hybrid-Cloud Architectures | Hans Jespersen
Best Practices for Building Hybrid-Cloud Architectures | Hans Jespersen
 
Scylla Summit 2019 Keynote - Avi Kivity
Scylla Summit 2019 Keynote - Avi KivityScylla Summit 2019 Keynote - Avi Kivity
Scylla Summit 2019 Keynote - Avi Kivity
 
Orchestrating Cassandra with Kubernetes Operator and PaaSTA
Orchestrating Cassandra with Kubernetes Operator and PaaSTAOrchestrating Cassandra with Kubernetes Operator and PaaSTA
Orchestrating Cassandra with Kubernetes Operator and PaaSTA
 

Plus de Cédrick Lunven

BigData Paris 2022 - Innovations récentes et futures autour du NoSQL Apache ...
BigData Paris 2022 - Innovations récentes et futures autour du NoSQL Apache ...BigData Paris 2022 - Innovations récentes et futures autour du NoSQL Apache ...
BigData Paris 2022 - Innovations récentes et futures autour du NoSQL Apache ...Cédrick Lunven
 
Avoiding Pitfalls for Cassandra.pdf
Avoiding Pitfalls for Cassandra.pdfAvoiding Pitfalls for Cassandra.pdf
Avoiding Pitfalls for Cassandra.pdfCédrick Lunven
 
Unlock cassandra data for application developers using graphQL
Unlock cassandra data for application developers using graphQLUnlock cassandra data for application developers using graphQL
Unlock cassandra data for application developers using graphQLCédrick Lunven
 
An oss api layer for your cassandra
An oss api layer for your cassandraAn oss api layer for your cassandra
An oss api layer for your cassandraCédrick Lunven
 
Xebicon2019 m icroservices
Xebicon2019   m icroservicesXebicon2019   m icroservices
Xebicon2019 m icroservicesCédrick Lunven
 
Reactive Programming with Cassandra
Reactive Programming with CassandraReactive Programming with Cassandra
Reactive Programming with CassandraCédrick Lunven
 
VoxxedDays Luxembourg FF4J
VoxxedDays Luxembourg FF4JVoxxedDays Luxembourg FF4J
VoxxedDays Luxembourg FF4JCédrick Lunven
 
VoxxedDays Luxembourg 2019
VoxxedDays Luxembourg 2019VoxxedDays Luxembourg 2019
VoxxedDays Luxembourg 2019Cédrick Lunven
 
Create API for your Databases
Create API for your DatabasesCreate API for your Databases
Create API for your DatabasesCédrick Lunven
 
Leveraging Feature Toggles for your Microservices (VoxxeddaysMicroservices Pa...
Leveraging Feature Toggles for your Microservices (VoxxeddaysMicroservices Pa...Leveraging Feature Toggles for your Microservices (VoxxeddaysMicroservices Pa...
Leveraging Feature Toggles for your Microservices (VoxxeddaysMicroservices Pa...Cédrick Lunven
 
Streaming, Analytics and Reactive Applications with Apache Cassandra
Streaming, Analytics and Reactive Applications with Apache CassandraStreaming, Analytics and Reactive Applications with Apache Cassandra
Streaming, Analytics and Reactive Applications with Apache CassandraCédrick Lunven
 
Riviera jug apicassandra
Riviera jug apicassandraRiviera jug apicassandra
Riviera jug apicassandraCédrick Lunven
 
Paris Meetup Jhispter #9 - Generator FF4j for Jhipster
Paris Meetup Jhispter #9 - Generator FF4j for JhipsterParis Meetup Jhispter #9 - Generator FF4j for Jhipster
Paris Meetup Jhispter #9 - Generator FF4j for JhipsterCédrick Lunven
 
Introduction to Feature Toggle and FF4J
Introduction to Feature Toggle and FF4JIntroduction to Feature Toggle and FF4J
Introduction to Feature Toggle and FF4JCédrick Lunven
 

Plus de Cédrick Lunven (18)

BigData Paris 2022 - Innovations récentes et futures autour du NoSQL Apache ...
BigData Paris 2022 - Innovations récentes et futures autour du NoSQL Apache ...BigData Paris 2022 - Innovations récentes et futures autour du NoSQL Apache ...
BigData Paris 2022 - Innovations récentes et futures autour du NoSQL Apache ...
 
Avoiding Pitfalls for Cassandra.pdf
Avoiding Pitfalls for Cassandra.pdfAvoiding Pitfalls for Cassandra.pdf
Avoiding Pitfalls for Cassandra.pdf
 
Unlock cassandra data for application developers using graphQL
Unlock cassandra data for application developers using graphQLUnlock cassandra data for application developers using graphQL
Unlock cassandra data for application developers using graphQL
 
An oss api layer for your cassandra
An oss api layer for your cassandraAn oss api layer for your cassandra
An oss api layer for your cassandra
 
Xebicon2019 m icroservices
Xebicon2019   m icroservicesXebicon2019   m icroservices
Xebicon2019 m icroservices
 
DevFestBdm2019
DevFestBdm2019DevFestBdm2019
DevFestBdm2019
 
Reactive Programming with Cassandra
Reactive Programming with CassandraReactive Programming with Cassandra
Reactive Programming with Cassandra
 
Shift Dev Conf API
Shift Dev Conf APIShift Dev Conf API
Shift Dev Conf API
 
VoxxedDays Luxembourg FF4J
VoxxedDays Luxembourg FF4JVoxxedDays Luxembourg FF4J
VoxxedDays Luxembourg FF4J
 
VoxxedDays Luxembourg 2019
VoxxedDays Luxembourg 2019VoxxedDays Luxembourg 2019
VoxxedDays Luxembourg 2019
 
Design API - SnowCampIO
Design API - SnowCampIODesign API - SnowCampIO
Design API - SnowCampIO
 
Create API for your Databases
Create API for your DatabasesCreate API for your Databases
Create API for your Databases
 
Leveraging Feature Toggles for your Microservices (VoxxeddaysMicroservices Pa...
Leveraging Feature Toggles for your Microservices (VoxxeddaysMicroservices Pa...Leveraging Feature Toggles for your Microservices (VoxxeddaysMicroservices Pa...
Leveraging Feature Toggles for your Microservices (VoxxeddaysMicroservices Pa...
 
Streaming, Analytics and Reactive Applications with Apache Cassandra
Streaming, Analytics and Reactive Applications with Apache CassandraStreaming, Analytics and Reactive Applications with Apache Cassandra
Streaming, Analytics and Reactive Applications with Apache Cassandra
 
Riviera jug apicassandra
Riviera jug apicassandraRiviera jug apicassandra
Riviera jug apicassandra
 
Riviera JUG ff4j
Riviera JUG ff4jRiviera JUG ff4j
Riviera JUG ff4j
 
Paris Meetup Jhispter #9 - Generator FF4j for Jhipster
Paris Meetup Jhispter #9 - Generator FF4j for JhipsterParis Meetup Jhispter #9 - Generator FF4j for Jhipster
Paris Meetup Jhispter #9 - Generator FF4j for Jhipster
 
Introduction to Feature Toggle and FF4J
Introduction to Feature Toggle and FF4JIntroduction to Feature Toggle and FF4J
Introduction to Feature Toggle and FF4J
 

Dernier

Event-Driven Architecture Masterclass: Challenges in Stream Processing
Event-Driven Architecture Masterclass: Challenges in Stream ProcessingEvent-Driven Architecture Masterclass: Challenges in Stream Processing
Event-Driven Architecture Masterclass: Challenges in Stream ProcessingScyllaDB
 
Continuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on ThanabotsContinuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on ThanabotsLeah Henrickson
 
Microsoft CSP Briefing Pre-Engagement - Questionnaire
Microsoft CSP Briefing Pre-Engagement - QuestionnaireMicrosoft CSP Briefing Pre-Engagement - Questionnaire
Microsoft CSP Briefing Pre-Engagement - QuestionnaireExakis Nelite
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
Design and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data ScienceDesign and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data SciencePaolo Missier
 
CORS (Kitworks Team Study 양다윗 발표자료 240510)
CORS (Kitworks Team Study 양다윗 발표자료 240510)CORS (Kitworks Team Study 양다윗 발표자료 240510)
CORS (Kitworks Team Study 양다윗 발표자료 240510)Wonjun Hwang
 
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...panagenda
 
Frisco Automating Purchase Orders with MuleSoft IDP- May 10th, 2024.pptx.pdf
Frisco Automating Purchase Orders with MuleSoft IDP- May 10th, 2024.pptx.pdfFrisco Automating Purchase Orders with MuleSoft IDP- May 10th, 2024.pptx.pdf
Frisco Automating Purchase Orders with MuleSoft IDP- May 10th, 2024.pptx.pdfAnubhavMangla3
 
State of the Smart Building Startup Landscape 2024!
State of the Smart Building Startup Landscape 2024!State of the Smart Building Startup Landscape 2024!
State of the Smart Building Startup Landscape 2024!Memoori
 
ChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps ProductivityChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps ProductivityVictorSzoltysek
 
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....rightmanforbloodline
 
The Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and InsightThe Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and InsightSafe Software
 
WebRTC and SIP not just audio and video @ OpenSIPS 2024
WebRTC and SIP not just audio and video @ OpenSIPS 2024WebRTC and SIP not just audio and video @ OpenSIPS 2024
WebRTC and SIP not just audio and video @ OpenSIPS 2024Lorenzo Miniero
 
Portal Kombat : extension du réseau de propagande russe
Portal Kombat : extension du réseau de propagande russePortal Kombat : extension du réseau de propagande russe
Portal Kombat : extension du réseau de propagande russe中 央社
 
Design Guidelines for Passkeys 2024.pptx
Design Guidelines for Passkeys 2024.pptxDesign Guidelines for Passkeys 2024.pptx
Design Guidelines for Passkeys 2024.pptxFIDO Alliance
 
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...ScyllaDB
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Victor Rentea
 
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)Samir Dash
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontologyjohnbeverley2021
 
Introduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDMIntroduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDMKumar Satyam
 

Dernier (20)

Event-Driven Architecture Masterclass: Challenges in Stream Processing
Event-Driven Architecture Masterclass: Challenges in Stream ProcessingEvent-Driven Architecture Masterclass: Challenges in Stream Processing
Event-Driven Architecture Masterclass: Challenges in Stream Processing
 
Continuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on ThanabotsContinuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
 
Microsoft CSP Briefing Pre-Engagement - Questionnaire
Microsoft CSP Briefing Pre-Engagement - QuestionnaireMicrosoft CSP Briefing Pre-Engagement - Questionnaire
Microsoft CSP Briefing Pre-Engagement - Questionnaire
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Design and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data ScienceDesign and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data Science
 
CORS (Kitworks Team Study 양다윗 발표자료 240510)
CORS (Kitworks Team Study 양다윗 발표자료 240510)CORS (Kitworks Team Study 양다윗 발표자료 240510)
CORS (Kitworks Team Study 양다윗 발표자료 240510)
 
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
 
Frisco Automating Purchase Orders with MuleSoft IDP- May 10th, 2024.pptx.pdf
Frisco Automating Purchase Orders with MuleSoft IDP- May 10th, 2024.pptx.pdfFrisco Automating Purchase Orders with MuleSoft IDP- May 10th, 2024.pptx.pdf
Frisco Automating Purchase Orders with MuleSoft IDP- May 10th, 2024.pptx.pdf
 
State of the Smart Building Startup Landscape 2024!
State of the Smart Building Startup Landscape 2024!State of the Smart Building Startup Landscape 2024!
State of the Smart Building Startup Landscape 2024!
 
ChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps ProductivityChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps Productivity
 
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
 
The Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and InsightThe Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and Insight
 
WebRTC and SIP not just audio and video @ OpenSIPS 2024
WebRTC and SIP not just audio and video @ OpenSIPS 2024WebRTC and SIP not just audio and video @ OpenSIPS 2024
WebRTC and SIP not just audio and video @ OpenSIPS 2024
 
Portal Kombat : extension du réseau de propagande russe
Portal Kombat : extension du réseau de propagande russePortal Kombat : extension du réseau de propagande russe
Portal Kombat : extension du réseau de propagande russe
 
Design Guidelines for Passkeys 2024.pptx
Design Guidelines for Passkeys 2024.pptxDesign Guidelines for Passkeys 2024.pptx
Design Guidelines for Passkeys 2024.pptx
 
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
Introduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDMIntroduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDM
 

Top 10 present and future innovations in the NoSQL Cassandra ecosystem (2022)

  • 1. Cedrick Lunven Director of developer Advocacy Top 10 present and future innovations in the NoSQL Cassandra ecosystem
  • 2. Cédrick Lunven Developer Advocates, Events, Community, Partners 2 @clunven Directeur Developer Relations ➢ Speaker ➢ Streamer ➢ Author ➢ Training ➢ Community Manager ➢ Application developer ➢ Technical Architect ➢ Presales ➢ Support
  • 3. Agenda 1. Why Cassandra 2. Top 10 innovations in the Cassandra Ecosystem 2.1. Cassandra 4.1 - CEP-03 = System-wide Guardrails 2.2. Cassandra 4.1 - CEP-13 = Denylisting Partition Keys 2.3. Cassandra 4.1 - CEP-18 = Pluggable Extension Points 2.4. Cassandra 4.2 - CEP-07 = Storage Indexed Storage 2.5. Cassandra 4.2 - CEP-19 = Tries Memtables 2.6. Cassandra 4.2 - CEP-21 = Transactional Cluster MetaData 2.7. Cassandra 4.2 - CEP-20 = Dynamic Data Masking 2.8. Cassandra 4.2 - CEP-15 = Acid Transaction 2.9. Communicate with any application using Stargate 2.10. Deploy Cassandra in Kubernetes 3. Resources 3
  • 5. Relational Databases are Versatile 5 Fast Queries “Customer-facing” High number of transactions Usually Hot / Live Data High SLA Requirements (Response Time / Availability) OLTP: Online Transaction Processing OLAP: Online Analytical Processing Complex Queries Historical High volume of data Often “Cold Data” Used by / for Analytics Volume of Data Response Time
  • 6. Relational Databases have limited scalability 6 They were designed to run on a single machine ● ● ●
  • 7. Apache Cassandra = NoSQL Distributed Decentralised DBMS 7 NODE NODE NODE NODE NODE NODE NODE 1 Installation = 1 NODE ✔ Capacity = ~ 2-4TB ✔ Throughput = LOTS Tx/sec/core Communication: ✔ Gossiping DataCenter | Ring
  • 8. Distributed ? 8 • Geographic Distribution • Hybrid-Cloud and Multi-Cloud On-premise
  • 9. Always On Every second of downtime translates into lost revenue High Availability Apache Cassandra Built as a Technical Necessity Linear Scalability of Fortune 100 companies have adopted 90% Hyper-Scalability Millions of operations per day, hour, or second Faster Pace Every millisecond of latency has consequence Low Latency Data Everywhere On-premises, hybrid, multi-cloud, centralized, or edge Global Distribution
  • 10. Use Cases 10 High Throughput High Volume Heavy Writes Heavy Reads Event Streaming Log Analytics Internet of Things Other Time Series Mission-Critical No Data Loss Always-on Scalability Availability Distributed Cloud-native Caching Pricing Market Data Inventory Banking Retail Tracking / Logistics Customer Experience API Layer Hybrid-cloud Enterprise Data Layer Multi-cloud Modern Cloud Applications Global Presence Workload Mobility Compliance / GDPR
  • 11. Cassandra 4.1 (2022) ● CEP-03 = System-wide Guardrails ● CEP-13 = Denylisting Partition Keys ● CEP-18 = Pluggable Extension Points 11
  • 12. Cassandra 4.0 (2021) Is a big Deal ! 12 ● Wait for 6 years ● Production Ready day 1 ● Features ○ 25% faster ○ Faster Big Clusters ○ Incremental Repairs ○ Virtual Tables ○ Transient Replicas
  • 13. Cassandra 4.1 - What’s in it? ● Themes ● Major yearly release ● Usability ● Security ● Pluggability ● Features ● Configuration Improvements ● System-wide Guardrails ● Denylisting Partition Keys ● Pluggable Extension Points ● QA building blocks 13
  • 14. Improved Configuration 14 ● Standard and Intuitive names ○ noun_verb ● Units in values
  • 15. System-wide Guardrails (CEP-3) ● Hard and soft limits on magnitudes ● Disabling features entirely ● Disallowing specific configuration values ● Administrative users exempted ● Typically checked at the CQL layer ● Based on work in Astra 15 https://cassandra.apache.org/_/blog/Apache-Cassandra-4.1-Features-Guardrails-Framework.html
  • 16. Cassandra 4.1 - Disabling Features ● Example: ALLOW FILTERING ● Configure in cassandra.yaml : ● Verify setting in cqlsh: ● Developer experience: 16
  • 17. Cassandra 4.1 - Hard and Soft Limits ● Example: ALLOW FILTERING ● Configure in cassandra.yaml : ● Developer experience (soft limit aka warning): 17
  • 18. Denylisting Partition Keys (CEP-13) 18 https://cassandra.apache.org/_/blog/Apache-Cassandra-4.1-Denylisting-Partitions.html ● Overloaded partitions break clusters ○ compactions, streaming, GC, etc ○ Bad data modeling choices ○ Unintended usage or attacks ● The solution: operators can identify partition keys for which queries will be disallowed overloaded keys
  • 19. https://unsplash.com/@nevenkrcmarek Pluggable extension points ● Valuable mechanism to add new features without disrupting the core ○ Internal APIs and code restructuring ● Pluggable storage (memtables) ○ Next: SSTables ● Pluggable network encryption ○ External key providers ● Pluggable authentication ○ Extend cqlsh via Python modules ● External schema storage ○ etcd for Kubernetes environments 19
  • 21. Upgrading to 4.1 ● In place upgrade from 4.0 ● No application changes ● Simple steps, same as last time ● 1. Verify your software versions ● 2. Perform a cluster-wide snapshot ● 3. Upgrade the first node and verify ● 4. Continue to each node in your cluster 21 https://foojay.io/today/have-you-upgraded-to-cassandra-4-0/
  • 22. If you need help preparing for an upgrade to 4.1 ● For a limited time — until July 31, 2022 — our award-winning team of Apache Cassandra™ contributors are available to help evaluate the health of your clusters, including conducting a comprehensive health check, at a reduced price of $200 USD a month.* ● This special offer includes: ● Best practices, performance and security improvements, plus SLA achievement ● Expert-led health check of your Apache Cassandra environment ● On-going support for 1 year via DataStax Luna ● Expertise and peace of mind for your team ● How can you learn more and request this offer? https://www.datastax.com/lp/luna-apache-cassandra 22
  • 23. Cassandra 4.2 (2023) ● CEP-07 = Storage Attached Index ● CEP-19 = Tries Memtables ● CEP-21 = Transactional Cluster MetaData ● CEP-20 = Dynamic Data Masking ● CEP-15 = Acid Transaction 23
  • 24. Storage Attached Index 24 Intended to replace both 2i and SASI
  • 28. Cassandra Relationship With Developers 28 Easy Hard ● Scale ● Resilience ● Distribution ● Atomicity ● Serialized changes ● Complicated State Built-in Data Modeling & Code
  • 29. Accord 29 Node 1 Node 4 Node 2 Node 3 ● Every node has a Reorder Buffer ● Clock skew is cool ● Leaderless timestamp protocol ● Fast Path Electorates: Fault tolerance ● TL;DR One Round Trip - ish Reorder Buffer Reorder Buffer Reorder Buffer Reorder Buffer Good for Cassandra ● Leaderless ● Scales like Cassandra ● Failure modes match
  • 30. More Tools ● Communicate with any application using Stargate ● Deploy Cassandra in Kubernetes 30
  • 31. Stargate Overview An open source API framework for data Stargate makes it easy to use a database for any application workload by adding plugin support for new APIs, data types, and access methods MICROSERVICES DEVELOPERS 31
  • 32. Astra DB gRPC CQL GraphQL Document API Data API Cassandra 3.x Cassandra 4.x DSE 6.x API Extensions and Persistence Extensions 32
  • 33. Stargate Cassandra Query Language GraphQL REST Document SQL like Table Model Structured Data Key-Value Data Strong Types Minimal query overhead Hierarchy of types and fields Structured Data Key-Value Data Low query overhead Row based Structured Data Key-Value Data Weaker Types High query overhead JSON Documents Semi-Structured Data Weaker Types High query overhead Drivers Open API More Performant More Flexible gRPC Structured Data (CQL) Lighter weight Native driver alternative Low query overhead 33
  • 34. Connecting to your cluster (Before) NODE NODE NODE NODE NODE NODE NODE developers Apps CQL CQL CQL CQL CQL Drivers CQL CQL 34
  • 35. Connecting to your cluster (with Stargates) NODE NODE NODE NODE NODE NODE NODE developers Apps SDK CQL CQL CQL CQL CQL Stargate Stargate Stargate CQL CQL CQL CQL CQL Load Balancer (HTTP) 35
  • 42. Cassandra Community Events 42 ● 10/11 - London ● 10/13 - Amsterdam ● 11/08 - Hanoi ● 11/10 - Jakarta ● 11/10 - Santa Clara ● 11/10 - Houston ● 11/10 - Seattle ● 11/15 - Singapore Every Wednesday 12pm ET
  • 43. Cassandra Community Resources Planet Cassandra is back! https://www.youtube.com/channel/UCvP-AXuCr -naAeEccCfKwUA 43 Developer Community https://www.datastax.com/dev/community
  • 44. Ecosystem Updates ● Try out 4.1 features including guardrails 44 Coming soon: ● Cassandra 4.1 support ● Stargate 2.0 release ● New Docs API ● More APIs Coming soon: ● Cassandra 4.1 support ● K8ssandra Operator v1.2
  • 45. Cassandra: The Definitive Guide, Revised Third Edition 45 https://www.datastax.com/resources/ebook/oreilly-cassandra-the-definitive-guide