Kafka is a scalable, distributed publish/subscribe messaging system that's used as a data transmission backbone in many data-intensive digital businesses. Couchbase Server is a scalable, flexible document database that's fast, agile, and elastic. Because they both appeal to the same type of customers, Couchbase and Kafka are often used together.
This presentation from a meetup in Mountain View describes Kafka's design and why people use it, Couchbase Server and its uses, and the use cases for both together. Also covered is a description and demo of Couchbase Server writing documents to a Kafka topic and consuming messages from a Kafka topic using the Couchbase Kafka Connector.
If you tweet about Kafka, you may be followed by this weird Franz Kafka bot…
… but the real Franz Kafka died of tuberculosis in 1924…
Interestingly, this Franz Kafka bot sounds quite a bit like the real Franz Kafka. Not overly cheery… I thought perhaps it was just real quotes from his work
But no – it just sounds like it could be him…
It helps you decouple systems in time – systems can be asynchronous, but this is more than that
Consuming systems don’t have to be on or even exist at the time that producers are making messages
Schema registry that describes what is known about data produced in different systems
Publish / subscribe system
Hooking up application server logs, caches, databases, and so forth
You don’t want each system to have to be matched or hand-integrated with every other system and service, with different adapter code, different error handling behavior, logging, etc. And how do you share metadata? What do you do with different revisions of the schema?
That’s madness
Imagine trying to add a new service that needs to read from 10 other services….
Organizationally that’s difficult, and every team has the potential to make different decisions about what systems to use…
Kafka helps mitigate different expectations of speed and size of data being ingested in various systems
Hadoop – HDFS can take tons of data, but not in tiny pieces – it’s a batch oriented system
NoSQL databases like Couchbase can scale to billions of users with sub-millisecond response times, but they aren’t designed for bulk loads
Compare application server logs, a bulk database extraction, processing a stream of Twitter messages
There can be issues with integrations where the slowest system becomes the bottleneck
Vision is to have a scalable, low-latency pub/sub message queue as the standard interface for real-time streaming data
Hadoop, HDFS specifically, fills this role for batch systems and led to a large ecosystem of useful tools that can interoperate via Hadoop data storage
Kafka does the same for real-time data, and can scale to handle your entire organization’s data. Kafka acts as the hub and applications hang off of it, exchanging data through Kafka
We refer to this architecture as a stream data platform.
Reminder: On this slide – you need to talk about the differences between Couchbase and Hadoop – they are complementary, they solve different problems
Messaging: Decouple data processing from data producers
Log Aggregation: A log as stream of messages
Stream Processing: Consume data from one topic and put the filtered/transformed data into another one
Click Stream Analysis: Page views/searches as real-time publish-subscribe feeds
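The stream processing use case above can be sketched with a minimal, in-memory stand-in for Kafka topics (the topic names and the "slow page" filter here are hypothetical, purely for illustration):

```python
from collections import defaultdict

# In-memory stand-in for Kafka topics: topic name -> ordered message list.
topics = defaultdict(list)

def produce(topic, message):
    topics[topic].append(message)

def consume(topic, offset):
    """Read everything after `offset`; return (messages, new offset)."""
    messages = topics[topic][offset:]
    return messages, offset + len(messages)

# Producer writes raw click events to the "clicks" topic.
produce("clicks", {"page": "/home", "ms": 42})
produce("clicks", {"page": "/checkout", "ms": 950})

# Stream processor: consume "clicks", keep only slow page loads,
# and publish the filtered stream to a second topic.
offset = 0
events, offset = consume("clicks", offset)
for e in events:
    if e["ms"] > 500:
        produce("slow-clicks", e)

print(topics["slow-clicks"])  # [{'page': '/checkout', 'ms': 950}]
```

The key shape is consume → transform → produce into another topic; real stream processors (Kafka Streams, Spark, etc.) add partitioning, fault tolerance, and state on top of this loop.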
Publish Subscribe
Broker
Stores messages
Failover: Leader vs. Follower
Load balanced
Producer
Publish data/messages to the topic
Consumer
Applications/processes/threads that are subscribed to the topic
Can be grouped (consumer groups) in order to process messages in parallel
Multiple consumer instances can load balance reading the partitions of a topic
Consumer groups are elastic and fault tolerant
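The load-balancing idea — each partition of a topic is read by exactly one consumer in the group — can be sketched as a simple round-robin assignment (a simplified stand-in for Kafka’s pluggable assignment strategies):

```python
def assign_partitions(partitions, consumers):
    """Round-robin assignment of topic partitions to the consumers in a
    group: each partition is owned by exactly one consumer, and partitions
    are spread evenly, so reads are load balanced."""
    assignment = {c: [] for c in consumers}
    for i, p in enumerate(partitions):
        assignment[consumers[i % len(consumers)]].append(p)
    return assignment

# 6 partitions shared by 2 consumers -> 3 partitions each.
print(assign_partitions(list(range(6)), ["c1", "c2"]))
# {'c1': [0, 2, 4], 'c2': [1, 3, 5]}
```

Elasticity falls out of this: when a consumer joins or leaves the group, Kafka just re-runs the assignment over the surviving members.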
Topic
Distributed and partitioned message queue
Topics are partitioned so they can scale across multiple servers
Partitions are also replicated for fault tolerance
This is what the producers actually write to and what the consumers actually read
Scales the Kafka brokers
High performance: Log
Kafka operates only on logs – they are always append-only logs, and messages are always read sequentially
Kafka does not track per-message read state – it doesn’t need to, because access is sequential
Retention based on policy – either time based or size based
Not keeping per message state
Multiple consumers reading from the same log means that multiple consumers can do what they need to do (they know where they left off, Kafka doesn’t need to). This is like DCP in Couchbase
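The design above can be sketched in a few lines: the broker is just an append-only list, and each consumer keeps its own offset (the offsets and messages here are illustrative):

```python
class Log:
    """Append-only log: the broker stores messages in order; it keeps no
    per-message read state. Each consumer tracks its own read offset."""
    def __init__(self):
        self.messages = []

    def append(self, msg):
        self.messages.append(msg)
        return len(self.messages) - 1  # offset of the new message

    def read(self, offset):
        """Sequential read of everything at or after `offset`."""
        return self.messages[offset:]

log = Log()
for m in ["a", "b", "c"]:
    log.append(m)

# Two independent consumers at different positions in the same log:
fast_offset, slow_offset = 3, 1
print(log.read(slow_offset))  # ['b', 'c'] -- the slow consumer catches up
print(log.read(fast_offset))  # []         -- the fast consumer is caught up
```

Because consumers own their offsets, a slow consumer never blocks a fast one, and retention can be a simple policy (drop messages older than N days, or beyond N bytes) rather than bookkeeping per message.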
ZooKeeper – distributed synchronization and configuration store
Needed to partition topics and to support consumer groups (where multiple consumers work together to process/ingest a Kafka topic in parallel)
Multiple data models
N1QL - SQL-like query language
Multiple indexes
SDKs, ODBC / JDBC drivers and frameworks
Push-button scalability
Consistent high-performance
Always on 24x7 with HA/DR
Easy Administration with Web UI, Rest API and CLI
KEY POINT: COUCHBASE HAS YOU COVERED FOR YOUR GENERAL PURPOSE DB NEEDS. FROM CACHING TO KV STORE, TO JSON DOCUMENT STORE, TO MOBILE APPS. NO OTHER NOSQL DB VENDOR HAS THIS BREADTH AND DEPTH OF TECHNOLOGY
The purpose of this slide is to discuss the high level concepts of Couchbase, and if the SE wants to discuss what parts of Couchbase make up each concept. It is not to go over specific technologies like N1QL, ODBC, etc
KEY POINT: YOU HAVE THE OPTION TO REPRESENT DATA QUITE DIFFERENTLY USING JSON AS OPPOSED TO A RELATIONAL DATABASE.
- Where in relational databases you might have to have multiple tables to best represent your data, in JSON you can model your data like an object might already be in your programming language of choice. No ORM (Object-Relational Mapping) needed.
You can do relationships in Couchbase, but they are different than in a relational database and outside of the scope of an intro call normally.
Make sure to stress that normalization is still something that can be done in Couchbase where it makes sense for the application, but this diagram is something that helps people coming from relational understand what is possible for JSON.
Work people do in these systems -
Training ML models
ETL / Data wrangling
Aggregations
Reporting / BI
Kafka is a data multiplexer – some people are still going to want to do this, but it’s designed for higher-latency applications with known high complexity (e.g. eBay – many different consumers for the same information)
Traditional data warehouse – definitely will be a different programming language – how do you make sense of the data feed? You get into the problem that making changes on one side introduces tons of complexity on the other
Downsides – maturity is not 100% on the Spark side, still in active development on the Couchbase side
KV / N1QL
KEY POINT: ENTERPRISES ARE USING COUCHBASE ACROSS A RANGE OF MISSION CRITICAL USE CASES.
As the slide shows, Couchbase supports a wide range of use cases, from Profile Management to HA Cache.
Each use case has its own set of requirements – some need very high performance, some need very high availability, some need flexibility of the data model.
The ability to meet all of these requirements is what has driven adoption of Couchbase by large enterprise companies
You should memorize a few things about a customer use per case so you can quickly go through these. What you want is a sound bite per use case.
1. All your data is managed in Couchbase and the other systems record these changes – for example, a user’s purchase might be logged
2. A user’s web session is being stored in a Couchbase bucket and you want to react on it – for example – delete the session in another system like people do in Single Sign On
Couchbase can handle hundreds of thousands of operations per second
3. Real Time data integration
For example, you want to do a quick check on purchases to see if there’s anything suspicious about them – that may be done in another system
4. In this case, it’s important to note that Couchbase can be a Kafka Consumer or a Kafka Producer, so doing tasks like ML – flow data out, train models and flow data back into Couchbase. This is similar to number 3, but the difference is you’re loading something back into Couchbase so that users can quickly interact with it. You may have systems that build recommendations but then flow those back into Couchbase so that the next visitors get a slightly better mix of product offers
Write data to a topic, process it with a framework and load it back into another separate bucket to serve users
Skip if short of time – don’t need to cover anything besides DCP
Punchline is, this mechanism allows Couchbase to scale elastically and without downtime while still enabling any client to find exactly where the active copy of a piece of data is (using the cluster map)
Multiple buckets can exist within a single cluster of nodes
(1, 2 or 3 extra copies)
Each data set has 1024 Virtual Buckets (vBuckets)
Each vBucket contains 1/1024th portion of the data set
vBuckets do not have a fixed physical server location
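A minimal sketch of key-to-vBucket mapping: Couchbase clients hash the document key with a CRC32-based function into one of the 1024 vBuckets, then look the vBucket up in the cluster map to find the node holding the active copy. The exact bit-twiddling and the node names below are simplified/hypothetical:

```python
import zlib

NUM_VBUCKETS = 1024  # Couchbase uses 1024 vBuckets per bucket

def vbucket_for_key(key: str) -> int:
    """Map a document key to a vBucket id. Real clients use a CRC32-based
    hash like this; the exact formula is simplified here."""
    return zlib.crc32(key.encode()) % NUM_VBUCKETS

# Hypothetical cluster map: vBucket id -> node owning the active copy.
# Rebalancing just reassigns vBuckets to nodes and publishes a new map.
cluster_map = {vb: f"node{vb % 4}" for vb in range(NUM_VBUCKETS)}

vb = vbucket_for_key("user::1234")
print(vb, cluster_map[vb])  # every client computes the same vBucket
```

This is why the cluster can rebalance without downtime: keys never move between vBuckets, only vBuckets move between nodes, and clients just pick up the new cluster map.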
What can possibly go wrong if you write your own connector with DCP?
A lot –
First of all, you need to be able to drink from the firehose.
Couchbase can emit hundreds of thousands of messages per second – the Kafka brokers sit there and soak up those messages and can write them out the other end at whatever speed your consuming systems are capable of
DCP is written for memory to memory type replication – if you’re writing to a system that can’t keep up, the client needs to do some fancy footwork to make everything come out ok
This just does the work of creating some messages – for demo purposes, I can type things in here and see them show up as documents in Couchbase
The keys are random – and limited to a set of 10
DCP streams mutations, so sometimes we are going to overwrite existing docs and sometimes we will end up making new docs
Overwrites are captured as sequence numbers (when combined with a document key, you have version information) similar to the offset in Kafka
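The sequence-number idea can be sketched as follows (a toy model, not the Couchbase implementation): each mutation in a vBucket gets a monotonically increasing sequence number, so (key, seqno) identifies a document version, much like a partition offset in Kafka:

```python
import itertools

class VBucket:
    """Toy model of DCP sequencing: every mutation in a vBucket gets the
    next sequence number, so (key, seqno) is version information."""
    def __init__(self):
        self._seq = itertools.count(1)
        self.latest = {}  # key -> (value, seqno) of the newest mutation

    def mutate(self, key, value):
        seqno = next(self._seq)
        self.latest[key] = (value, seqno)
        return seqno

vb = VBucket()
vb.mutate("doc1", "v1")     # seqno 1
vb.mutate("doc2", "hello")  # seqno 2
vb.mutate("doc1", "v2")     # seqno 3 -- an overwrite gets a newer seqno
print(vb.latest["doc1"])    # ('v2', 3)
```

A DCP client (like the Kafka connector) can remember the last seqno it saw per vBucket and resume the stream from there, just as a Kafka consumer resumes from its last offset.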
Producer is going to grab the docs and send them to Kafka
This filter that we’re using prints out the dcpEvent to the console so we can read it but otherwise does no filtering
You can add logic to the filter to mark events false, in which case they won’t be written to Kafka
Finally, this is attaching to my Kafka vagrant image on port 9092 and subscribing to the topic default, partition 0 (we only have one partition)
Fully transparent cluster and bucket management, including direct access if needed