SlideShare une entreprise Scribd logo
1  sur  24
Télécharger pour lire hors ligne
1.5 Million Log Lines per Second 
Big Data Everywhere Chicago 2014 
Mike Keane 
mkeane@conversant.com 
Building and maintaining Flume flows at Conversant
•Quicker insight into production data 
•Reduce complexity of administering/managing new servers, data centers, etc. 
•Scalable 
•No data loss or duplication 
•Replace TSV files with Avro objects 
•Able to be monitored by Network Operations Center (NOC) 
•Able to recover from downtime quickly 
R 
SLA for Event Driven Logging with Flume
•A Flume Flow is a series of flume agents data follows from origination to final destination 
•Data on a Flume Flow is packaged in FlumeEvent Avro objects 
•A FlumeEvent is composed of 
•Headers – A map of string value pairs 
•Body – A byte array 
•A FlumeEvent is an atomic unit of data 
•FlumeEvents are sent in batches 
•When a batch of FlumeEvents only partially makes it to the next flume agent in the flow, the entire batch is resent resulting in duplicates 
R 
Simplistic Flume Overview
R 
Simplistic Flume Overview 
Flume Agent
R 
Simplistic Flume Overview 
EmbeddedAgent 
Compressor 
Agent 
Landing 
Agent
Overview of existing network topology 
•3 data centers divided into 12 lanes participating in the OpenRTB market 
•6 lanes in the east coast data center 
•4 lanes in the west coast data center 
•2 lanes in the European data center 
•Each lane has approximately 75 servers handling OpenRTB operations. 
•30 different logs 
•Over 60,000,000,000 log lines per day
Overview of existing network topology.
•2 Server Flume Flow from East Coast (IAD) to Chicago (ORD) with over 250K TSV lines per second 
•No Data Loss 
•Failover 
•Compression performance 
P.O.C. Can Flume handle our log volume reliably?
P.O.C. Overview
P.O.C. passes 
•Larger Batch sizes helped, but could not reach 250K per second 
•Multiple TSV lines Per FlumeEvent hits over 360K per second 
•Failover passed with duplicates 
•Compression passed but needed to parallelize 7X sinks
Taking Flume to Production 
•Embedding the EmbeddedAgent in existing servers 
•Modify EmbeddedAgent 
•Properties from existing infrastructure 
•Implement Monitoring 
•Create “Flume”Implementation of proprietary logging interface 
•Replace POJO to TSV with Avro to AvroDataFile 
•Preventing duplicates, not removing 
•Add LogType header
Taking Flume to Production 
•Custom Sink for AvroDataFile body (based on HDFSEventSink) 
•Check if UUID header is in HBase 
•Yes – increment duplicate count metric 
•No 
•Write AvroDataFile body to HDFS using Custom Writer 
•Put UUID to HBase
Taking Flume to Production 
•Custom Selector based on MultiplexingChannelSelector 
•Route FlumeEvents to channels by log type or groups of log types 
•Bifurcate to multiple locations each log and each location with its own percentage of data to bifurcate
Configuring Flume Flows 
•Configuring Flume can be tedious, use a templating engine 
•In Q2 2014 Conversant expanded from 7 lanes in 2 data centers to 12 lanes in 3 data centers (~400 more servers to configure). 
•Static headers useful for tracking flows 
•15 minutes to configure all Q2 expansion CompressorLane('iad6', [CompressorAgent("dtiad06flm01p"), CompressorAgent("dtiad06flm02p"), CompressorAgent("dtiad06flm03p")]) compressor.list = dtiad06flm01p, dtiad06flm02p,dtiad06flm03p
Monitoring the Flume Flows 
•Flume metrics are available by JMX or Json over HTTP 
•Metrics to monitor 
•ChannelFillPercentage 
•Rate of change on EventDrainSuccessCount on failover sinks 
•FLUME-2307 – File channel deletes fail after timeout (fixed 1.5) 
•Publishing metrics to TSDB provides great visual insight
Monitoring the Flume Flows 
ChannelFillPercentage
Monitoring the Flume Flows 
Rate of taking events off “Critical Logs” file channel
Monitoring the Flume Flows 
Rate of Flume Events by data center East Coast, West Coast, Europe
Monitoring the Flume Flows 
Monitoring by Groups
Benefits of migrating to Flume 
•Business has insight into data in under 10 minutes 
•Configuring expansion trivial 
•Failover enables automatic recovery from down time 
•Bifurcation 
•enables scaled constant regression lane(s) 
•Subset of data to analytics development cluster
Benefits of migrating to Flume 
5 minute aggregations to business within 10 minutes
Gotchas… 
•Scaling for Compression 
•Auto reloading of properties inconsistent 
•“It is recommended (though not required) to use a separate disk for the File Channel checkpoint.” RAID-6 raid array, Force Write Back 
•Bad configurations not easy to see, not always clear in log file. 
•NetcatSource – Not too useful beyond trivial usage
Gotchas… 
•POM file edits 
•JUnits are not deterministic 
•Hadoop jars added to classpath by startup script – IDE 
•Avoiding cost of Avro schema evolution
What is next 
•Upgrade to Flume 1.5 
•Bifurcate to micro batch (Storm? Spark?) 
•Disable sink switch

Contenu connexe

Dernier

Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxMike Bennett
 
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...Unveiling the Role of Social Media Suspect Investigators in Preventing Online...
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...Milind Agarwal
 
Principles and Practices of Data Visualization
Principles and Practices of Data VisualizationPrinciples and Practices of Data Visualization
Principles and Practices of Data VisualizationKianJazayeri1
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Seán Kennedy
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBoston Institute of Analytics
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max PrincetonTimothy Spann
 
Networking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxNetworking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxHimangsuNath
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...Amil Baba Dawood bangali
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxaleedritatuxx
 
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Boston Institute of Analytics
 
Decoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectDecoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectBoston Institute of Analytics
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryJeremy Anderson
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Boston Institute of Analytics
 
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptxThe Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptxTasha Penwell
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 217djon017
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024Susanna-Assunta Sansone
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our WorldEduminds Learning
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksdeepakthakur548787
 

Dernier (20)

Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptx
 
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...Unveiling the Role of Social Media Suspect Investigators in Preventing Online...
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...
 
Principles and Practices of Data Visualization
Principles and Practices of Data VisualizationPrinciples and Practices of Data Visualization
Principles and Practices of Data Visualization
 
Data Analysis Project: Stroke Prediction
Data Analysis Project: Stroke PredictionData Analysis Project: Stroke Prediction
Data Analysis Project: Stroke Prediction
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max Princeton
 
Networking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxNetworking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptx
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
 
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
 
Decoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectDecoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis Project
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data Story
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
 
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptxThe Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our World
 
Insurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis ProjectInsurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis Project
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing works
 

En vedette

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by HubspotMarius Sescu
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTExpeed Software
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsPixeldarts
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthThinkNow
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfmarketingartwork
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024Neil Kimberley
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)contently
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024Albert Qian
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsKurio // The Social Media Age(ncy)
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summarySpeakerHub
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next Tessa Mero
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best PracticesVit Horky
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project managementMindGenius
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...RachelPearson36
 

En vedette (20)

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPT
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 

Big Data Everywhere Chicago: 1.5 Million Log Lines Per Second: Building and Maintaining Flume Flows at Coversant (Conversant)

  • 1. 1.5 Million Log Lines per Second Big Data Everywhere Chicago 2014 Mike Keane mkeane@conversant.com Building and maintaining Flume flows at Conversant
  • 2. •Quicker insight into production data •Reduce complexity of administering/managing new servers, data centers, etc. •Scalable •No data loss or duplication •Replace TSV files with Avro objects •Able to be monitored by Network Operations Center (NOC) •Able to recover from downtime quickly R SLA for Event Driven Logging with Flume
  • 3. •A Flume Flow is a series of flume agents data follows from origination to final destination •Data on a Flume Flow is packaged in FlumeEvent Avro objects •A FlumeEvent is composed of •Headers – A map of string value pairs •Body – A byte array •A FlumeEvent is an atomic unit of data •FlumeEvents are sent in batches •When a batch of FlumeEvents only partially makes it to the next flume agent in the flow, the entire batch is resent resulting in duplicates R Simplistic Flume Overview
  • 4. R Simplistic Flume Overview Flume Agent
  • 5. R Simplistic Flume Overview EmbeddedAgent Compressor Agent Landing Agent
  • 6. Overview of existing network topology •3 data centers divided into 12 lanes participating in the OpenRTB market •6 lanes in the east coast data center •4 lanes in the west coast data center •2 lanes in the European data center •Each lane has approximately 75 servers handling OpenRTB operations. •30 different logs •Over 60,000,000,000 log lines per day
  • 7. Overview of existing network topology.
  • 8. •2 Server Flume Flow from East Coast (IAD) to Chicago (ORD) with over 250K TSV lines per second •No Data Loss •Failover •Compression performance P.O.C. Can Flume handle our log volume reliably?
  • 10. P.O.C. passes •Larger Batch sizes helped, but could not reach 250K per second •Multiple TSV lines Per FlumeEvent hits over 360K per second •Failover passed with duplicates •Compression passed but needed to parallelize 7X sinks
  • 11. Taking Flume to Production •Embedding the EmbeddedAgent in existing servers •Modify EmbeddedAgent •Properties from existing infrastructure •Implement Monitoring •Create “Flume”Implementation of proprietary logging interface •Replace POJO to TSV with Avro to AvroDataFile •Preventing duplicates, not removing •Add LogType header
  • 12. Taking Flume to Production •Custom Sink for AvroDataFile body (based on HDFSEventSink) •Check if UUID header is in HBase •Yes – increment duplicate count metric •No •Write AvroDataFile body to HDFS using Custom Writer •Put UUID to HBase
  • 13. Taking Flume to Production •Custom Selector based on MultiplexingChannelSelector •Route FlumeEvents to channels by log type or groups of log types •Bifurcate to multiple locations each log and each location with its own percentage of data to bifurcate
  • 14. Configuring Flume Flows •Configuring Flume can be tedious, use a templating engine •In Q2 2014 Conversant expanded from 7 lanes in 2 data centers to 12 lanes in 3 data centers (~400 more servers to configure). •Static headers useful for tracking flows •15 minutes to configure all Q2 expansion CompressorLane('iad6', [CompressorAgent("dtiad06flm01p"), CompressorAgent("dtiad06flm02p"), CompressorAgent("dtiad06flm03p")]) compressor.list = dtiad06flm01p, dtiad06flm02p,dtiad06flm03p
  • 15. Monitoring the Flume Flows •Flume metrics are available by JMX or Json over HTTP •Metrics to monitor •ChannelFillPercentage •Rate of change on EventDrainSuccessCount on failover sinks •FLUME-2307 – File channel deletes fail after timeout (fixed 1.5) •Publishing metrics to TSDB provides great visual insight
  • 16. Monitoring the Flume Flows ChannelFillPercentage
  • 17. Monitoring the Flume Flows Rate of taking events off “Critical Logs” file channel
  • 18. Monitoring the Flume Flows Rate of Flume Events by data center East Coast, West Coast, Europe
  • 19. Monitoring the Flume Flows Monitoring by Groups
  • 20. Benefits of migrating to Flume •Business has insight into data in under 10 minutes •Configuring expansion trivial •Failover enables automatic recovery from down time •Bifurcation •enables scaled constant regression lane(s) •Subset of data to analytics development cluster
  • 21. Benefits of migrating to Flume 5 minute aggregations to business within 10 minutes
  • 22. Gotchas… •Scaling for Compression •Auto reloading of properties inconsistent •“It is recommended (though not required) to use a separate disk for the File Channel checkpoint.” RAID-6 raid array, Force Write Back •Bad configurations not easy to see, not always clear in log file. •NetcatSource – Not too useful beyond trivial usage
  • 23. Gotchas… •POM file edits •JUnits are not deterministic •Hadoop jars added to classpath by startup script – IDE •Avoiding cost of Avro schema evolution
  • 24. What is next •Upgrade to Flume 1.5 •Bifurcate to micro batch (Storm? Spark?) •Disable sink switch