SlideShare une entreprise Scribd logo
1  sur  2
Télécharger pour lire hors ligne
12/22/11                                                    Project Voldemort



                            Project Voldemort
                            A distributed database.
                            zoie · bobo · cleo · decomposer · norbert · voldemort · kafka · kamikaze · krati · sensei · azkaban · datafu ·
                            blog


           quickstart               Voldemort is a distributed key-value storage system
           design
           source                         Data is automatically replicated over multiple servers.
           mailing list                   Data is automatically partitioned so each server contains only a subset of the total data
           download                       Server failure is handled transparently
           snapshot build                 Pluggable serialization is supported to allow rich keys and values including lists and tuples with
           configuration                  named fields, as well as to integrate with common serialization frameworks like Protocol Buffers,
           javadoc                        Thrift, Avro and Java Serialization
           developer info                 Data items are versioned to maximize data integrity in failure scenarios without compromising
           fun projects                   availability of the system
           performance                    Each node is independent of other nodes with no central point of failure or coordination
           bugs                           Good single node performance: you can expect 10-20k operations per second depending on the
           wiki                           machines, the network, the disk system, and the data replication factor
                                          Support for pluggable data placement strategies to support things like distribution across data centers
                                          that are geographically far apart.

                                    It is used at LinkedIn for certain high-scalability storage problems where simple functional partitioning is not
                                    sufficient. It is still a new system which has rough edges, bad error messages, and probably plenty of
                                    uncaught bugs. Let us know if you find one of these, so we can fix it.

                                    Comparison to relational databases

                                    Voldemort is not a relational database, it does not attempt to satisfy arbitrary relations while satisfying ACID
                                    properties. Nor is it an object database that attempts to transparently map object reference graphs. Nor
                                    does it introduce a new abstraction such as document-orientation. It is basically just a big, distributed,
                                    persistent, fault-tolerant hash table. For applications that can use an O/R mapper like active-record or
                                    hibernate this will provide horizontal scalability and much higher availability but at great loss of convenience.
                                    For large applications under internet-type scalability pressure, a system may likely consists of a number of
                                    functionally partitioned services or apis, which may manage storage resources across multiple data centers
                                    using storage systems which may themselves be horizontally partitioned. For applications in this space,
                                    arbitrary in-database joins are already impossible since all the data is not available in any single database. A
                                    typical pattern is to introduce a caching layer which will require hashtable semantics anyway. For these
                                    applications Voldemort offers a number of advantages:

                                          Voldemort combines in memory caching with the storage system so that a separate caching tier is not
                                          required (instead the storage system itself is just fast)
                                          Unlike MySQL replication, both reads and writes scale horizontally
                                          Data portioning is transparent, and allows for cluster expansion without rebalancing all data
                                          Data replication and placement is decided by a simple API to be able to accommodate a wide range of
                                          application specific strategies
                                          The storage layer is completely mockable so development and unit testing can be done against a
                                          throw-away in-memory storage system without needing a real cluster (or even a real storage system)
                                          for simple testing

                                    The source code is available under the Apache 2.0 license. We are actively looking for contributors so if you
                                    have ideas, code, bug reports, or fixes you would like to contribute please do so.



project-voldemort.com                                                                                                                                  1/2
12/22/11                                      Project Voldemort

                        For help please see the discussion group, or the IRC channel chat.us.freenode.net #voldemort.




project-voldemort.com                                                                                                   2/2

Contenu connexe

Tendances

Getting started with postgresql
Getting started with postgresqlGetting started with postgresql
Getting started with postgresqlbotsplash.com
 
Voldemort on Solid State Drives
Voldemort on Solid State DrivesVoldemort on Solid State Drives
Voldemort on Solid State DrivesVinoth Chandar
 
Hadoop Meetup Jan 2019 - Overview of Ozone
Hadoop Meetup Jan 2019 - Overview of OzoneHadoop Meetup Jan 2019 - Overview of Ozone
Hadoop Meetup Jan 2019 - Overview of OzoneErik Krogen
 
Running Cassandra on Amazon EC2
Running Cassandra on Amazon EC2Running Cassandra on Amazon EC2
Running Cassandra on Amazon EC2Dave Gardner
 
Sql vs NoSQL
Sql vs NoSQLSql vs NoSQL
Sql vs NoSQLRTigger
 
Data Warehousing with Amazon Redshift
Data Warehousing with Amazon RedshiftData Warehousing with Amazon Redshift
Data Warehousing with Amazon RedshiftAmazon Web Services
 
Introduction to cassandra
Introduction to cassandraIntroduction to cassandra
Introduction to cassandraNguyen Quang
 
Cassandra at Instagram 2016 (Dikang Gu, Facebook) | Cassandra Summit 2016
Cassandra at Instagram 2016 (Dikang Gu, Facebook) | Cassandra Summit 2016Cassandra at Instagram 2016 (Dikang Gu, Facebook) | Cassandra Summit 2016
Cassandra at Instagram 2016 (Dikang Gu, Facebook) | Cassandra Summit 2016DataStax
 
Cassandra TK 2014 - Large Nodes
Cassandra TK 2014 - Large NodesCassandra TK 2014 - Large Nodes
Cassandra TK 2014 - Large Nodesaaronmorton
 
Migration to Redshift from SQL Server
Migration to Redshift from SQL ServerMigration to Redshift from SQL Server
Migration to Redshift from SQL Serverjoeharris76
 
ScyllaDB: What could you do with Cassandra compatibility at 1.8 million reque...
ScyllaDB: What could you do with Cassandra compatibility at 1.8 million reque...ScyllaDB: What could you do with Cassandra compatibility at 1.8 million reque...
ScyllaDB: What could you do with Cassandra compatibility at 1.8 million reque...Data Con LA
 
Cassandra implementation for collecting data and presenting data
Cassandra implementation for collecting data and presenting dataCassandra implementation for collecting data and presenting data
Cassandra implementation for collecting data and presenting dataChen Robert
 
ClustrixDB: how distributed databases scale out
ClustrixDB: how distributed databases scale outClustrixDB: how distributed databases scale out
ClustrixDB: how distributed databases scale outMariaDB plc
 
NoSQL Now! NoSQL Architecture Patterns
NoSQL Now! NoSQL Architecture PatternsNoSQL Now! NoSQL Architecture Patterns
NoSQL Now! NoSQL Architecture PatternsDATAVERSITY
 
MariaDB ColumnStore
MariaDB ColumnStoreMariaDB ColumnStore
MariaDB ColumnStoreMariaDB plc
 
NoSQL Databases: An Introduction and Comparison between Dynamo, MongoDB and C...
NoSQL Databases: An Introduction and Comparison between Dynamo, MongoDB and C...NoSQL Databases: An Introduction and Comparison between Dynamo, MongoDB and C...
NoSQL Databases: An Introduction and Comparison between Dynamo, MongoDB and C...Vivek Adithya Mohankumar
 
Webinar: DataStax Training - Everything you need to become a Cassandra Rockstar
Webinar: DataStax Training - Everything you need to become a Cassandra RockstarWebinar: DataStax Training - Everything you need to become a Cassandra Rockstar
Webinar: DataStax Training - Everything you need to become a Cassandra RockstarDataStax
 

Tendances (20)

Getting started with postgresql
Getting started with postgresqlGetting started with postgresql
Getting started with postgresql
 
Voldemort on Solid State Drives
Voldemort on Solid State DrivesVoldemort on Solid State Drives
Voldemort on Solid State Drives
 
Hadoop Meetup Jan 2019 - Overview of Ozone
Hadoop Meetup Jan 2019 - Overview of OzoneHadoop Meetup Jan 2019 - Overview of Ozone
Hadoop Meetup Jan 2019 - Overview of Ozone
 
NoSQL_Night
NoSQL_NightNoSQL_Night
NoSQL_Night
 
Running Cassandra on Amazon EC2
Running Cassandra on Amazon EC2Running Cassandra on Amazon EC2
Running Cassandra on Amazon EC2
 
Sql vs NoSQL
Sql vs NoSQLSql vs NoSQL
Sql vs NoSQL
 
Data Warehousing with Amazon Redshift
Data Warehousing with Amazon RedshiftData Warehousing with Amazon Redshift
Data Warehousing with Amazon Redshift
 
Introduction to cassandra
Introduction to cassandraIntroduction to cassandra
Introduction to cassandra
 
Cassandra at Instagram 2016 (Dikang Gu, Facebook) | Cassandra Summit 2016
Cassandra at Instagram 2016 (Dikang Gu, Facebook) | Cassandra Summit 2016Cassandra at Instagram 2016 (Dikang Gu, Facebook) | Cassandra Summit 2016
Cassandra at Instagram 2016 (Dikang Gu, Facebook) | Cassandra Summit 2016
 
Cassandra TK 2014 - Large Nodes
Cassandra TK 2014 - Large NodesCassandra TK 2014 - Large Nodes
Cassandra TK 2014 - Large Nodes
 
Migration to Redshift from SQL Server
Migration to Redshift from SQL ServerMigration to Redshift from SQL Server
Migration to Redshift from SQL Server
 
ScyllaDB: What could you do with Cassandra compatibility at 1.8 million reque...
ScyllaDB: What could you do with Cassandra compatibility at 1.8 million reque...ScyllaDB: What could you do with Cassandra compatibility at 1.8 million reque...
ScyllaDB: What could you do with Cassandra compatibility at 1.8 million reque...
 
Cassandra implementation for collecting data and presenting data
Cassandra implementation for collecting data and presenting dataCassandra implementation for collecting data and presenting data
Cassandra implementation for collecting data and presenting data
 
ClustrixDB: how distributed databases scale out
ClustrixDB: how distributed databases scale outClustrixDB: how distributed databases scale out
ClustrixDB: how distributed databases scale out
 
Dynamo db
Dynamo dbDynamo db
Dynamo db
 
Cassandra NoSQL Tutorial
Cassandra NoSQL TutorialCassandra NoSQL Tutorial
Cassandra NoSQL Tutorial
 
NoSQL Now! NoSQL Architecture Patterns
NoSQL Now! NoSQL Architecture PatternsNoSQL Now! NoSQL Architecture Patterns
NoSQL Now! NoSQL Architecture Patterns
 
MariaDB ColumnStore
MariaDB ColumnStoreMariaDB ColumnStore
MariaDB ColumnStore
 
NoSQL Databases: An Introduction and Comparison between Dynamo, MongoDB and C...
NoSQL Databases: An Introduction and Comparison between Dynamo, MongoDB and C...NoSQL Databases: An Introduction and Comparison between Dynamo, MongoDB and C...
NoSQL Databases: An Introduction and Comparison between Dynamo, MongoDB and C...
 
Webinar: DataStax Training - Everything you need to become a Cassandra Rockstar
Webinar: DataStax Training - Everything you need to become a Cassandra RockstarWebinar: DataStax Training - Everything you need to become a Cassandra Rockstar
Webinar: DataStax Training - Everything you need to become a Cassandra Rockstar
 

En vedette

Composing and Executing Parallel Data Flow Graphs wth Shell Pipes
Composing and Executing Parallel Data Flow Graphs wth Shell PipesComposing and Executing Parallel Data Flow Graphs wth Shell Pipes
Composing and Executing Parallel Data Flow Graphs wth Shell PipesVinoth Chandar
 
Voldemort : Prototype to Production
Voldemort : Prototype to ProductionVoldemort : Prototype to Production
Voldemort : Prototype to ProductionVinoth Chandar
 
Introducción a Voldemort - Innova4j
Introducción a Voldemort - Innova4jIntroducción a Voldemort - Innova4j
Introducción a Voldemort - Innova4jInnova4j
 

En vedette (6)

Voldemort
VoldemortVoldemort
Voldemort
 
Bluetube
BluetubeBluetube
Bluetube
 
Composing and Executing Parallel Data Flow Graphs wth Shell Pipes
Composing and Executing Parallel Data Flow Graphs wth Shell PipesComposing and Executing Parallel Data Flow Graphs wth Shell Pipes
Composing and Executing Parallel Data Flow Graphs wth Shell Pipes
 
Project Voldemort
Project VoldemortProject Voldemort
Project Voldemort
 
Voldemort : Prototype to Production
Voldemort : Prototype to ProductionVoldemort : Prototype to Production
Voldemort : Prototype to Production
 
Introducción a Voldemort - Innova4j
Introducción a Voldemort - Innova4jIntroducción a Voldemort - Innova4j
Introducción a Voldemort - Innova4j
 

Similaire à Project Voldemort

Cidr11 paper32
Cidr11 paper32Cidr11 paper32
Cidr11 paper32jujukoko
 
Megastore providing scalable, highly available storage for interactive services
Megastore providing scalable, highly available storage for interactive servicesMegastore providing scalable, highly available storage for interactive services
Megastore providing scalable, highly available storage for interactive servicesJoão Gabriel Lima
 
AWS Summit 2011: Architecting in the cloud
AWS Summit 2011: Architecting in the cloudAWS Summit 2011: Architecting in the cloud
AWS Summit 2011: Architecting in the cloudAmazon Web Services
 
JCConf 2016 - Cloud Computing Applications - Hazelcast, Spark and Ignite
JCConf 2016 - Cloud Computing Applications - Hazelcast, Spark and IgniteJCConf 2016 - Cloud Computing Applications - Hazelcast, Spark and Ignite
JCConf 2016 - Cloud Computing Applications - Hazelcast, Spark and IgniteJoseph Kuo
 
State of the Container Ecosystem
State of the Container EcosystemState of the Container Ecosystem
State of the Container EcosystemVinay Rao
 
Coding serbia meetup 29.09.2015.
Coding serbia meetup 29.09.2015.Coding serbia meetup 29.09.2015.
Coding serbia meetup 29.09.2015.Matija Gobec
 
The New Stack Container Summit Talk
The New Stack Container Summit TalkThe New Stack Container Summit Talk
The New Stack Container Summit TalkThe New Stack
 
Introduction to Apache Mesos and DC/OS
Introduction to Apache Mesos and DC/OSIntroduction to Apache Mesos and DC/OS
Introduction to Apache Mesos and DC/OSSteve Wong
 
Presentation on Databases in the Cloud
Presentation on Databases in the CloudPresentation on Databases in the Cloud
Presentation on Databases in the Cloudmoshfiq
 
Infinit: Modern Storage Platform for Container Environments
Infinit: Modern Storage Platform for Container EnvironmentsInfinit: Modern Storage Platform for Container Environments
Infinit: Modern Storage Platform for Container EnvironmentsDocker, Inc.
 
NoSQL Database
NoSQL DatabaseNoSQL Database
NoSQL DatabaseSteve Min
 
Scalable POSIX File Systems in the Cloud
Scalable POSIX File Systems in the CloudScalable POSIX File Systems in the Cloud
Scalable POSIX File Systems in the CloudRed_Hat_Storage
 
LogicalDOC Clustering
LogicalDOC ClusteringLogicalDOC Clustering
LogicalDOC ClusteringLogicalDOC
 
JPJ1406 Distributed, Concurrent, and Independent Access to Encrypted Cloud ...
JPJ1406   Distributed, Concurrent, and Independent Access to Encrypted Cloud ...JPJ1406   Distributed, Concurrent, and Independent Access to Encrypted Cloud ...
JPJ1406 Distributed, Concurrent, and Independent Access to Encrypted Cloud ...chennaijp
 
Highly scalable caching service on cloud - Redis
Highly scalable caching service on cloud - RedisHighly scalable caching service on cloud - Redis
Highly scalable caching service on cloud - RedisKrishna-Kumar
 
EOUG95 - Client Server Very Large Databases - Paper
EOUG95 - Client Server Very Large Databases - PaperEOUG95 - Client Server Very Large Databases - Paper
EOUG95 - Client Server Very Large Databases - PaperDavid Walker
 
Connecting Docker for Cloud IaaS (Speech at CSDN-Oct18
Connecting Docker for Cloud IaaS (Speech at CSDN-Oct18Connecting Docker for Cloud IaaS (Speech at CSDN-Oct18
Connecting Docker for Cloud IaaS (Speech at CSDN-Oct18DaoliCloud Ltd
 

Similaire à Project Voldemort (20)

Cidr11 paper32
Cidr11 paper32Cidr11 paper32
Cidr11 paper32
 
Megastore providing scalable, highly available storage for interactive services
Megastore providing scalable, highly available storage for interactive servicesMegastore providing scalable, highly available storage for interactive services
Megastore providing scalable, highly available storage for interactive services
 
AWS Summit 2011: Architecting in the cloud
AWS Summit 2011: Architecting in the cloudAWS Summit 2011: Architecting in the cloud
AWS Summit 2011: Architecting in the cloud
 
JCConf 2016 - Cloud Computing Applications - Hazelcast, Spark and Ignite
JCConf 2016 - Cloud Computing Applications - Hazelcast, Spark and IgniteJCConf 2016 - Cloud Computing Applications - Hazelcast, Spark and Ignite
JCConf 2016 - Cloud Computing Applications - Hazelcast, Spark and Ignite
 
State of the Container Ecosystem
State of the Container EcosystemState of the Container Ecosystem
State of the Container Ecosystem
 
Coding serbia meetup 29.09.2015.
Coding serbia meetup 29.09.2015.Coding serbia meetup 29.09.2015.
Coding serbia meetup 29.09.2015.
 
The New Stack Container Summit Talk
The New Stack Container Summit TalkThe New Stack Container Summit Talk
The New Stack Container Summit Talk
 
Datastores
DatastoresDatastores
Datastores
 
Introduction to Apache Mesos and DC/OS
Introduction to Apache Mesos and DC/OSIntroduction to Apache Mesos and DC/OS
Introduction to Apache Mesos and DC/OS
 
Presentation on Databases in the Cloud
Presentation on Databases in the CloudPresentation on Databases in the Cloud
Presentation on Databases in the Cloud
 
Infinit: Modern Storage Platform for Container Environments
Infinit: Modern Storage Platform for Container EnvironmentsInfinit: Modern Storage Platform for Container Environments
Infinit: Modern Storage Platform for Container Environments
 
NoSQL Database
NoSQL DatabaseNoSQL Database
NoSQL Database
 
Scalable POSIX File Systems in the Cloud
Scalable POSIX File Systems in the CloudScalable POSIX File Systems in the Cloud
Scalable POSIX File Systems in the Cloud
 
LogicalDOC Clustering
LogicalDOC ClusteringLogicalDOC Clustering
LogicalDOC Clustering
 
JPJ1406 Distributed, Concurrent, and Independent Access to Encrypted Cloud ...
JPJ1406   Distributed, Concurrent, and Independent Access to Encrypted Cloud ...JPJ1406   Distributed, Concurrent, and Independent Access to Encrypted Cloud ...
JPJ1406 Distributed, Concurrent, and Independent Access to Encrypted Cloud ...
 
Highly scalable caching service on cloud - Redis
Highly scalable caching service on cloud - RedisHighly scalable caching service on cloud - Redis
Highly scalable caching service on cloud - Redis
 
Ceph as software define storage
Ceph as software define storageCeph as software define storage
Ceph as software define storage
 
Apache ignite v1.3
Apache ignite v1.3Apache ignite v1.3
Apache ignite v1.3
 
EOUG95 - Client Server Very Large Databases - Paper
EOUG95 - Client Server Very Large Databases - PaperEOUG95 - Client Server Very Large Databases - Paper
EOUG95 - Client Server Very Large Databases - Paper
 
Connecting Docker for Cloud IaaS (Speech at CSDN-Oct18
Connecting Docker for Cloud IaaS (Speech at CSDN-Oct18Connecting Docker for Cloud IaaS (Speech at CSDN-Oct18
Connecting Docker for Cloud IaaS (Speech at CSDN-Oct18
 

Plus de Gregory Pence

Cloud Computing As A Threat To Older Tech Companies Ny Times
Cloud Computing As A Threat To Older Tech Companies   Ny TimesCloud Computing As A Threat To Older Tech Companies   Ny Times
Cloud Computing As A Threat To Older Tech Companies Ny TimesGregory Pence
 
2011 120211 Gartner Predictions 253709
2011 120211 Gartner Predictions 2537092011 120211 Gartner Predictions 253709
2011 120211 Gartner Predictions 253709Gregory Pence
 
Cloud 101 What The Heck Do Iaa S, Paa S And Saa S Companies Do Venture Beat
Cloud 101  What The Heck Do Iaa S, Paa S And Saa S Companies Do    Venture BeatCloud 101  What The Heck Do Iaa S, Paa S And Saa S Companies Do    Venture Beat
Cloud 101 What The Heck Do Iaa S, Paa S And Saa S Companies Do Venture BeatGregory Pence
 
10 Disruptive Cloud Companies We’Re Excited About Venture Beat
10 Disruptive Cloud Companies We’Re Excited About   Venture Beat10 Disruptive Cloud Companies We’Re Excited About   Venture Beat
10 Disruptive Cloud Companies We’Re Excited About Venture BeatGregory Pence
 
Tidemark Enterprise Disruption In The Cloud Zd Net
Tidemark  Enterprise Disruption In The Cloud   Zd NetTidemark  Enterprise Disruption In The Cloud   Zd Net
Tidemark Enterprise Disruption In The Cloud Zd NetGregory Pence
 
Www.Sas.Com Resources Whitepaper Wp 33890
Www.Sas.Com Resources Whitepaper Wp 33890Www.Sas.Com Resources Whitepaper Wp 33890
Www.Sas.Com Resources Whitepaper Wp 33890Gregory Pence
 

Plus de Gregory Pence (8)

On Demand Int
On Demand IntOn Demand Int
On Demand Int
 
Data Man Imperitive
Data Man ImperitiveData Man Imperitive
Data Man Imperitive
 
Cloud Computing As A Threat To Older Tech Companies Ny Times
Cloud Computing As A Threat To Older Tech Companies   Ny TimesCloud Computing As A Threat To Older Tech Companies   Ny Times
Cloud Computing As A Threat To Older Tech Companies Ny Times
 
2011 120211 Gartner Predictions 253709
2011 120211 Gartner Predictions 2537092011 120211 Gartner Predictions 253709
2011 120211 Gartner Predictions 253709
 
Cloud 101 What The Heck Do Iaa S, Paa S And Saa S Companies Do Venture Beat
Cloud 101  What The Heck Do Iaa S, Paa S And Saa S Companies Do    Venture BeatCloud 101  What The Heck Do Iaa S, Paa S And Saa S Companies Do    Venture Beat
Cloud 101 What The Heck Do Iaa S, Paa S And Saa S Companies Do Venture Beat
 
10 Disruptive Cloud Companies We’Re Excited About Venture Beat
10 Disruptive Cloud Companies We’Re Excited About   Venture Beat10 Disruptive Cloud Companies We’Re Excited About   Venture Beat
10 Disruptive Cloud Companies We’Re Excited About Venture Beat
 
Tidemark Enterprise Disruption In The Cloud Zd Net
Tidemark  Enterprise Disruption In The Cloud   Zd NetTidemark  Enterprise Disruption In The Cloud   Zd Net
Tidemark Enterprise Disruption In The Cloud Zd Net
 
Www.Sas.Com Resources Whitepaper Wp 33890
Www.Sas.Com Resources Whitepaper Wp 33890Www.Sas.Com Resources Whitepaper Wp 33890
Www.Sas.Com Resources Whitepaper Wp 33890
 

Project Voldemort

  • 1. 12/22/11 Project Voldemort Project Voldemort A distributed database. zoie · bobo · cleo · decomposer · norbert · voldemort · kafka · kamikaze · krati · sensei · azkaban · datafu · blog quickstart Voldemort is a distributed key-value storage system design source Data is automatically replicated over multiple servers. mailing list Data is automatically partitioned so each server contains only a subset of the total data download Server failure is handled transparently snapshot build Pluggable serialization is supported to allow rich keys and values including lists and tuples with configuration named fields, as well as to integrate with common serialization frameworks like Protocol Buffers, javadoc Thrift, Avro and Java Serialization developer info Data items are versioned to maximize data integrity in failure scenarios without compromising fun projects availability of the system performance Each node is independent of other nodes with no central point of failure or coordination bugs Good single node performance: you can expect 10-20k operations per second depending on the wiki machines, the network, the disk system, and the data replication factor Support for pluggable data placement strategies to support things like distribution across data centers that are geographically far apart. It is used at LinkedIn for certain high-scalability storage problems where simple functional partitioning is not sufficient. It is still a new system which has rough edges, bad error messages, and probably plenty of uncaught bugs. Let us know if you find one of these, so we can fix it. Comparison to relational databases Voldemort is not a relational database, it does not attempt to satisfy arbitrary relations while satisfying ACID properties. Nor is it an object database that attempts to transparently map object reference graphs. Nor does it introduce a new abstraction such as document-orientation. It is basically just a big, distributed, persistent, fault-tolerant hash table. For applications that can use an O/R mapper like active-record or hibernate this will provide horizontal scalability and much higher availability but at great loss of convenience. For large applications under internet-type scalability pressure, a system may likely consists of a number of functionally partitioned services or apis, which may manage storage resources across multiple data centers using storage systems which may themselves be horizontally partitioned. For applications in this space, arbitrary in-database joins are already impossible since all the data is not available in any single database. A typical pattern is to introduce a caching layer which will require hashtable semantics anyway. For these applications Voldemort offers a number of advantages: Voldemort combines in memory caching with the storage system so that a separate caching tier is not required (instead the storage system itself is just fast) Unlike MySQL replication, both reads and writes scale horizontally Data portioning is transparent, and allows for cluster expansion without rebalancing all data Data replication and placement is decided by a simple API to be able to accommodate a wide range of application specific strategies The storage layer is completely mockable so development and unit testing can be done against a throw-away in-memory storage system without needing a real cluster (or even a real storage system) for simple testing The source code is available under the Apache 2.0 license. We are actively looking for contributors so if you have ideas, code, bug reports, or fixes you would like to contribute please do so. project-voldemort.com 1/2
  • 2. 12/22/11 Project Voldemort For help please see the discussion group, or the IRC channel chat.us.freenode.net #voldemort. project-voldemort.com 2/2