Soumettre la recherche
Mettre en ligne
Handling Data in Mega Scale Web Systems
•
Télécharger en tant que PPT, PDF
•
7 j'aime
•
1,063 vues
V
Vineet Gupta
Suivre
Technologie
Signaler
Partager
Signaler
Partager
1 sur 54
Télécharger maintenant
Recommandé
Spark
Spark
Nitish Upreti
Hadoop tutorial for beginners-tibacademy.in
Hadoop tutorial for beginners-tibacademy.in
TIB Academy
Distributed Computing with Apache Hadoop: Technology Overview
Distributed Computing with Apache Hadoop: Technology Overview
Konstantin V. Shvachko
The Secrets of Building Realtime Big Data Systems
The Secrets of Building Realtime Big Data Systems
nathanmarz
Advanced Data Science on Spark-(Reza Zadeh, Stanford)
Advanced Data Science on Spark-(Reza Zadeh, Stanford)
Spark Summit
Bhupeshbansal bigdata
Bhupeshbansal bigdata
Bhupesh Bansal
Real-Time Big Data at In-Memory Speed, Using Storm
Real-Time Big Data at In-Memory Speed, Using Storm
Nati Shalom
Meetup ml spark_ppt
Meetup ml spark_ppt
Snehal Nagmote
Recommandé
Spark
Spark
Nitish Upreti
Hadoop tutorial for beginners-tibacademy.in
Hadoop tutorial for beginners-tibacademy.in
TIB Academy
Distributed Computing with Apache Hadoop: Technology Overview
Distributed Computing with Apache Hadoop: Technology Overview
Konstantin V. Shvachko
The Secrets of Building Realtime Big Data Systems
The Secrets of Building Realtime Big Data Systems
nathanmarz
Advanced Data Science on Spark-(Reza Zadeh, Stanford)
Advanced Data Science on Spark-(Reza Zadeh, Stanford)
Spark Summit
Bhupeshbansal bigdata
Bhupeshbansal bigdata
Bhupesh Bansal
Real-Time Big Data at In-Memory Speed, Using Storm
Real-Time Big Data at In-Memory Speed, Using Storm
Nati Shalom
Meetup ml spark_ppt
Meetup ml spark_ppt
Snehal Nagmote
Jstorm introduction-0.9.6
Jstorm introduction-0.9.6
longda feng
Hadoop
Hadoop
Ramakrishna Reddy Bijjam
PHP Backends for Real-Time User Interaction using Apache Storm.
PHP Backends for Real-Time User Interaction using Apache Storm.
DECK36
getFamiliarWithHadoop
getFamiliarWithHadoop
AmirReza Mohammadi
Hadoop fault tolerance
Hadoop fault tolerance
Pallav Jha
Sector Sphere 2009
Sector Sphere 2009
lilyco
HUG Nov 2010: HDFS Raid - Facebook
HUG Nov 2010: HDFS Raid - Facebook
Yahoo Developer Network
Hdfs high availability
Hdfs high availability
Hadoop User Group
Hadoop training-in-hyderabad
Hadoop training-in-hyderabad
sreehari orienit
IBM Spark Technology Center: Real-time Advanced Analytics and Machine Learnin...
IBM Spark Technology Center: Real-time Advanced Analytics and Machine Learnin...
DataStax Academy
Spark vs storm
Spark vs storm
Trong Ton
S4: Distributed Stream Computing Platform
S4: Distributed Stream Computing Platform
Farzad Nozarian
Real time big data analytics with Storm by Ron Bodkin of Think Big Analytics
Real time big data analytics with Storm by Ron Bodkin of Think Big Analytics
Data Con LA
Presentation on Hadoop Technology
Presentation on Hadoop Technology
OpenDev
Building & Operating High-Fidelity Data Streams - QCon Plus 2021
Building & Operating High-Fidelity Data Streams - QCon Plus 2021
Sid Anand
Deep Learning on Apache® Spark™ : Workflows and Best Practices
Deep Learning on Apache® Spark™ : Workflows and Best Practices
Jen Aman
Yahoo compares Storm and Spark
Yahoo compares Storm and Spark
Chicago Hadoop Users Group
Document Similarity with Cloud Computing
Document Similarity with Cloud Computing
Bryan Bende
Distributed Caching - Cache Unleashed
Distributed Caching - Cache Unleashed
Avishek Patra
Handling Data in Mega Scale Systems
Handling Data in Mega Scale Systems
Directi Group
Hpts 2011 flexible_oltp
Hpts 2011 flexible_oltp
Jags Ramnarayan
Reduce Side Joins
Reduce Side Joins
Edureka!
Contenu connexe
Tendances
Jstorm introduction-0.9.6
Jstorm introduction-0.9.6
longda feng
Hadoop
Hadoop
Ramakrishna Reddy Bijjam
PHP Backends for Real-Time User Interaction using Apache Storm.
PHP Backends for Real-Time User Interaction using Apache Storm.
DECK36
getFamiliarWithHadoop
getFamiliarWithHadoop
AmirReza Mohammadi
Hadoop fault tolerance
Hadoop fault tolerance
Pallav Jha
Sector Sphere 2009
Sector Sphere 2009
lilyco
HUG Nov 2010: HDFS Raid - Facebook
HUG Nov 2010: HDFS Raid - Facebook
Yahoo Developer Network
Hdfs high availability
Hdfs high availability
Hadoop User Group
Hadoop training-in-hyderabad
Hadoop training-in-hyderabad
sreehari orienit
IBM Spark Technology Center: Real-time Advanced Analytics and Machine Learnin...
IBM Spark Technology Center: Real-time Advanced Analytics and Machine Learnin...
DataStax Academy
Spark vs storm
Spark vs storm
Trong Ton
S4: Distributed Stream Computing Platform
S4: Distributed Stream Computing Platform
Farzad Nozarian
Real time big data analytics with Storm by Ron Bodkin of Think Big Analytics
Real time big data analytics with Storm by Ron Bodkin of Think Big Analytics
Data Con LA
Presentation on Hadoop Technology
Presentation on Hadoop Technology
OpenDev
Building & Operating High-Fidelity Data Streams - QCon Plus 2021
Building & Operating High-Fidelity Data Streams - QCon Plus 2021
Sid Anand
Deep Learning on Apache® Spark™ : Workflows and Best Practices
Deep Learning on Apache® Spark™ : Workflows and Best Practices
Jen Aman
Yahoo compares Storm and Spark
Yahoo compares Storm and Spark
Chicago Hadoop Users Group
Document Similarity with Cloud Computing
Document Similarity with Cloud Computing
Bryan Bende
Distributed Caching - Cache Unleashed
Distributed Caching - Cache Unleashed
Avishek Patra
Tendances
(19)
Jstorm introduction-0.9.6
Jstorm introduction-0.9.6
Hadoop
Hadoop
PHP Backends for Real-Time User Interaction using Apache Storm.
PHP Backends for Real-Time User Interaction using Apache Storm.
getFamiliarWithHadoop
getFamiliarWithHadoop
Hadoop fault tolerance
Hadoop fault tolerance
Sector Sphere 2009
Sector Sphere 2009
HUG Nov 2010: HDFS Raid - Facebook
HUG Nov 2010: HDFS Raid - Facebook
Hdfs high availability
Hdfs high availability
Hadoop training-in-hyderabad
Hadoop training-in-hyderabad
IBM Spark Technology Center: Real-time Advanced Analytics and Machine Learnin...
IBM Spark Technology Center: Real-time Advanced Analytics and Machine Learnin...
Spark vs storm
Spark vs storm
S4: Distributed Stream Computing Platform
S4: Distributed Stream Computing Platform
Real time big data analytics with Storm by Ron Bodkin of Think Big Analytics
Real time big data analytics with Storm by Ron Bodkin of Think Big Analytics
Presentation on Hadoop Technology
Presentation on Hadoop Technology
Building & Operating High-Fidelity Data Streams - QCon Plus 2021
Building & Operating High-Fidelity Data Streams - QCon Plus 2021
Deep Learning on Apache® Spark™ : Workflows and Best Practices
Deep Learning on Apache® Spark™ : Workflows and Best Practices
Yahoo compares Storm and Spark
Yahoo compares Storm and Spark
Document Similarity with Cloud Computing
Document Similarity with Cloud Computing
Distributed Caching - Cache Unleashed
Distributed Caching - Cache Unleashed
En vedette
Handling Data in Mega Scale Systems
Handling Data in Mega Scale Systems
Directi Group
Hpts 2011 flexible_oltp
Hpts 2011 flexible_oltp
Jags Ramnarayan
Reduce Side Joins
Reduce Side Joins
Edureka!
Introduction to Tokenization
Introduction to Tokenization
Nabeel Yoosuf
Denormalization
Denormalization
Sohail Haider
Efficient Duplicate Detection Over Massive Data Sets
Efficient Duplicate Detection Over Massive Data Sets
Pradeeban Kathiravelu, Ph.D.
What is Payment Tokenization?
What is Payment Tokenization?
Rambus Inc
Cassandra @ Sony: The good, the bad, and the ugly part 2
Cassandra @ Sony: The good, the bad, and the ugly part 2
DataStax Academy
Overview of AWS Services for your Enterprise
Overview of AWS Services for your Enterprise
Blazeclan Technologies Private Limited
Tuple map reduce: beyond classic mapreduce
Tuple map reduce: beyond classic mapreduce
datasalt
MySQL Visual Analysis and Scale-out Strategy definition - Webinar deck
MySQL Visual Analysis and Scale-out Strategy definition - Webinar deck
Vladi Vexler
En vedette
(11)
Handling Data in Mega Scale Systems
Handling Data in Mega Scale Systems
Hpts 2011 flexible_oltp
Hpts 2011 flexible_oltp
Reduce Side Joins
Reduce Side Joins
Introduction to Tokenization
Introduction to Tokenization
Denormalization
Denormalization
Efficient Duplicate Detection Over Massive Data Sets
Efficient Duplicate Detection Over Massive Data Sets
What is Payment Tokenization?
What is Payment Tokenization?
Cassandra @ Sony: The good, the bad, and the ugly part 2
Cassandra @ Sony: The good, the bad, and the ugly part 2
Overview of AWS Services for your Enterprise
Overview of AWS Services for your Enterprise
Tuple map reduce: beyond classic mapreduce
Tuple map reduce: beyond classic mapreduce
MySQL Visual Analysis and Scale-out Strategy definition - Webinar deck
MySQL Visual Analysis and Scale-out Strategy definition - Webinar deck
Similaire à Handling Data in Mega Scale Web Systems
Front Range PHP NoSQL Databases
Front Range PHP NoSQL Databases
Jon Meredith
Modeling data and best practices for the Azure Cosmos DB.
Modeling data and best practices for the Azure Cosmos DB.
Mohammad Asif
Azure Cosmos DB - Technical Deep Dive
Azure Cosmos DB - Technical Deep Dive
Andre Essing
Distributed Systems: scalability and high availability
Distributed Systems: scalability and high availability
Renato Lucindo
Pnuts
Pnuts
Ruchika Mehresh
PNUTS
PNUTS
Ruchika Mehresh
Pnuts Review
Pnuts Review
Ruchika Mehresh
Cloud storage
Cloud storage
Zeeshan Bilal
CS 542 Parallel DBs, NoSQL, MapReduce
CS 542 Parallel DBs, NoSQL, MapReduce
J Singh
Basics of Distributed Systems - Distributed Storage
Basics of Distributed Systems - Distributed Storage
Nilesh Salpe
Tech-Spark: Exploring the Cosmos DB
Tech-Spark: Exploring the Cosmos DB
Ralph Attard
Design Patterns For Distributed NO-reational databases
Design Patterns For Distributed NO-reational databases
lovingprince58
Big data serving: Processing and inference at scale in real time
Big data serving: Processing and inference at scale in real time
Itai Yaffe
[db tech showcase Tokyo 2019] Azure Cosmos DB Deep Dive ~ Partitioning, Globa...
[db tech showcase Tokyo 2019] Azure Cosmos DB Deep Dive ~ Partitioning, Globa...
Naoki (Neo) SATO
17-NoSQL.pptx
17-NoSQL.pptx
levichan1
NOSQL Database: Apache Cassandra
NOSQL Database: Apache Cassandra
Folio3 Software
Getting Started with Amazon Redshift - AWS July 2016 Webinar Series
Getting Started with Amazon Redshift - AWS July 2016 Webinar Series
Amazon Web Services
MYSQL
MYSQL
gilashikwa
Design Patterns for Distributed Non-Relational Databases
Design Patterns for Distributed Non-Relational Databases
guestdfd1ec
Need for Time series Database
Need for Time series Database
Pramit Choudhary
Similaire à Handling Data in Mega Scale Web Systems
(20)
Front Range PHP NoSQL Databases
Front Range PHP NoSQL Databases
Modeling data and best practices for the Azure Cosmos DB.
Modeling data and best practices for the Azure Cosmos DB.
Azure Cosmos DB - Technical Deep Dive
Azure Cosmos DB - Technical Deep Dive
Distributed Systems: scalability and high availability
Distributed Systems: scalability and high availability
Pnuts
Pnuts
PNUTS
PNUTS
Pnuts Review
Pnuts Review
Cloud storage
Cloud storage
CS 542 Parallel DBs, NoSQL, MapReduce
CS 542 Parallel DBs, NoSQL, MapReduce
Basics of Distributed Systems - Distributed Storage
Basics of Distributed Systems - Distributed Storage
Tech-Spark: Exploring the Cosmos DB
Tech-Spark: Exploring the Cosmos DB
Design Patterns For Distributed NO-reational databases
Design Patterns For Distributed NO-reational databases
Big data serving: Processing and inference at scale in real time
Big data serving: Processing and inference at scale in real time
[db tech showcase Tokyo 2019] Azure Cosmos DB Deep Dive ~ Partitioning, Globa...
[db tech showcase Tokyo 2019] Azure Cosmos DB Deep Dive ~ Partitioning, Globa...
17-NoSQL.pptx
17-NoSQL.pptx
NOSQL Database: Apache Cassandra
NOSQL Database: Apache Cassandra
Getting Started with Amazon Redshift - AWS July 2016 Webinar Series
Getting Started with Amazon Redshift - AWS July 2016 Webinar Series
MYSQL
MYSQL
Design Patterns for Distributed Non-Relational Databases
Design Patterns for Distributed Non-Relational Databases
Need for Time series Database
Need for Time series Database
Dernier
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
HarshalMandlekar2
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and Insights
Ravi Sanghani
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
Ingrid Airi González
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
Curtis Poe
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
LoriGlavin3
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
Inflectra
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
Pixlogix Infotech
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024
Hiroshi SHIBATA
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
LoriGlavin3
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
ThousandEyes
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examples
Kari Kakkonen
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL Router
Mydbops
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
ThousandEyes
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance Audit
Skynet Technologies
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
LoriGlavin3
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
Neo4j
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
BookNet Canada
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
LoriGlavin3
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
Alan Dix
Dernier
(20)
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and Insights
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examples
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL Router
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance Audit
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
Handling Data in Mega Scale Web Systems
1.
Vineet Gupta |
GM – Software Engineering | Directi http://www.vineetgupta.com Licensed under Creative Commons Attribution Sharealike Noncommercial Intelligent People. Uncommon Ideas.
2.
3.
4.
5.
6.
7.
8.
Host App Server
DB Server RAM CPU CPU CPU RAM RAM
9.
Sunfire X4640 M2
8 x 6-core 2.6 GHz $ 27k to $ 170k PowerEdge R200 Dual core 2.8 GHz Around $ 550
10.
11.
T1, T2, T3,
T4 App Layer
12.
13.
14.
15.
16.
17.
18.
T1, T2, T3,
T4, T5 App Layer
19.
20.
T3 App Layer
T4 T5 T2 T1 First million rows T3 T4 T5 T2 T1 Second million rows T3 T4 T5 T2 T1 Third million rows
21.
22.
Source:
http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.20.1495
23.
24.
25.
26.
27.
28.
29.
30.
31.
32.
33.
34.
35.
36.
37.
38.
39.
40.
41.
42.
43.
44.
45.
46.
47.
48.
49.
50.
51.
52.
53.
54.
Intelligent People. Uncommon
Ideas. Licensed under Creative Commons Attribution Sharealike Noncommercial
Télécharger maintenant