High Scalability -


Performance data for LevelDB, Berkeley DB and BangDB for Random Operations

Thursday, November 29, 2012 at 9:15AM

This is a guest post by Sachin Sinha, Founder of Iqlect and developer of BangDB.

The goal of this paper is to provide performance data for the following embedded databases under various scenarios for random operations such as write and read. The data is presented graphically to make it largely self-explanatory.

LevelDB:
LevelDB is a fast key-value storage library written at Google that provides an ordered mapping from string keys to string values. LevelDB is based on an LSM (Log-Structured Merge) tree and uses SSTables and a MemTable for the database implementation. It is written in C++ and available under the BSD license. LevelDB treats keys and values as arbitrary byte arrays and stores keys in sorted order. It uses Snappy for data compression. Writes and reads are concurrent, but writes perform best with a single thread, whereas reads scale with the number of cores.
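To make the LSM idea concrete, here is a minimal Python sketch of a memtable that is flushed into sorted, immutable runs. It is only an illustration: the class name `TinyLSM`, the `memtable_limit` parameter, and the in-memory lists standing in for SSTables are all assumptions, and it omits everything that makes LevelDB fast in practice (write-ahead logging, compaction, compression, on-disk files).

```python
import bisect


class TinyLSM:
    """Toy LSM tree: an in-memory memtable plus immutable sorted runs."""

    def __init__(self, memtable_limit=2):
        self.memtable = {}      # most recent writes, unsorted
        self.sstables = []      # sorted [(key, value), ...] runs, newest last
        self.memtable_limit = memtable_limit

    def put(self, key, value):
        self.memtable[key] = value
        if len(self.memtable) >= self.memtable_limit:
            self._flush()

    def _flush(self):
        # Freeze the memtable into a sorted, immutable run (the "SSTable").
        self.sstables.append(sorted(self.memtable.items()))
        self.memtable = {}

    def get(self, key):
        if key in self.memtable:
            return self.memtable[key]
        # Search the newest run first so later writes shadow earlier ones.
        for run in reversed(self.sstables):
            i = bisect.bisect_left(run, (key,))
            if i < len(run) and run[i][0] == key:
                return run[i][1]
        return None

    def items(self):
        # Merge all runs oldest to newest, then return pairs in key order.
        merged = {}
        for run in self.sstables:
            merged.update(run)
        merged.update(self.memtable)
        return sorted(merged.items())


db = TinyLSM(memtable_limit=2)
db.put(b"b", b"1")
db.put(b"a", b"2")   # second entry triggers a flush
db.put(b"b", b"3")   # newer value lives in the memtable
print(db.get(b"b"), db.items())
```

Writes only touch the small in-memory table and sequential flushes, which is one reason LSM designs favor write throughput; reads may have to consult several runs.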

BerkeleyDB:
Berkeley DB (BDB) is a library that provides a high-performance embedded database for key/value data. It is one of the most widely used database libraries, with millions of deployed copies. BDB can be configured to run as anything from a concurrent data store to a transactional data store to a fully ACID-compliant database. It is written in C and available under the Sleepycat Public License. BDB treats keys and values as arbitrary byte arrays and can store keys either in sorted order using BTREE or unordered using HASH. Writes and reads are concurrent and scale well with the number of cores, especially reads.

BangDB:
BangDB is a high-performance embedded database for key-value data. It is a new entrant into the embedded database space. It is written in C++ and available under the BSD license. BangDB treats keys and values as arbitrary byte arrays and can store keys either in sorted order using BTREE or unordered using HASH. Writes and reads are concurrent and scale well with the number of cores.

The comparison was run on as similar a footing as possible for all the databases so that the measurements are as crisp and accurate as possible.

The test results show BangDB to be faster for both reads and writes:


Comet - An Example of the New Key-Code Databases

Thursday, January 27, 2011 at 9:10AM

Comet is an active distributed key-value store built at the University of Washington. The paper describing Comet is Comet: An active distributed key-value store; there are also slides and an MP3 of a presentation given at OSDI '10. Here's a succinct overview of Comet:

Today's cloud storage services, such as Amazon S3 or peer-to-peer DHTs, are highly inflexible and impose a variety of constraints on their clients: specific replication and consistency schemes, fixed data timeouts, limited logging, etc. We witnessed such inflexibility first-hand as part of our Vanish work, where we used a DHT to store encryption keys temporarily. To address this issue, we built Comet, an extensible storage service that allows clients to inject snippets of code that control their data's behavior inside the storage service.

I found this paper quite interesting because it takes the initial steps of collocating code with a key-value store, which turns it into what might be called a key-code store. This is something I've been exploring as a way of moving behavior to data in order to overcome network limitations in the cloud and provide other benefits. An innovator in this area is the Alchemy Database, which has already combined Redis and Lua. A good platform for this sort of thing might be Node.js integrated with V8. This would allow complex JavaScript programs to run in an efficient evented container. This sort of architecture has a lot of implications (more about that later), but the Comet paper describes a very interesting start.
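As a rough illustration of what a key-code store means, the Python sketch below attaches a small piece of code to a stored value and runs it inside the store on every read. This is not Comet's API: the names `KeyCodeStore`, `on_get`, and `read_limited` are hypothetical, and a real system would sandbox the injected code and bound its resource use.

```python
class KeyCodeStore:
    """Toy key-value store whose values carry optional handler code."""

    def __init__(self):
        self._objects = {}

    def put(self, key, value, on_get=None):
        # on_get is a hypothetical hook that runs inside the store on each read.
        self._objects[key] = {"value": value, "on_get": on_get, "reads": 0}

    def get(self, key):
        obj = self._objects.get(key)
        if obj is None:
            return None
        obj["reads"] += 1
        handler = obj["on_get"]
        if handler is None:
            return obj["value"]
        # The handler decides what the caller sees; returning None hides the value.
        return handler(obj)


def read_limited(max_reads):
    """Build a handler that serves the value only for the first max_reads reads."""
    def handler(obj):
        return obj["value"] if obj["reads"] <= max_reads else None
    return handler


store = KeyCodeStore()
store.put("session-key", "s3cret", on_get=read_limited(2))
print([store.get("session-key") for _ in range(3)])
```

The client never polls or cleans up; the policy travels with the data, which is the kind of behavior a fixed-function store cannot offer.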

From the abstract and conclusion:


Pomegranate - Storing Billions and Billions of Tiny Little Files

Monday, August 30, 2010 at 7:03AM

Pomegranate is a novel distributed file system built over distributed tabular storage that acts an awful lot like a NoSQL system. It's targeted at increasing the performance of tiny object access in order to support applications like online photo and micro-blog services, which require high concurrency, high throughput, and low latency. Their tests seem to indicate it works:

We have demonstrated that a file system over tabular storage performs well for highly concurrent access. In our test cluster, we observed throughput increase linearly to more than 100,000 aggregate read and write requests served per second (RPS).

Rather than sitting atop the file system like almost every other K-V store, Pomegranate is baked into the file system. The idea is that the file system API is common to every platform, so using it wouldn't require a separate API. Every application could use it out of the box.

The features of Pomegranate are:

It handles billions of small files efficiently, even in one directory;
It provides a separate and scalable caching layer, which can be snapshotted;
The storage layer uses a log-structured store to absorb small file writes and make good use of disk bandwidth;
It builds a global namespace for both small files and large files;
Columnar storage exploits temporal and spatial locality;
A distributed extendible hash indexes metadata;
Snapshot-able and reconfigurable caching increases parallelism and tolerates failures;
Pomegranate should be the first file system built over tabular storage, and the experience of building it should be valuable to the file system community.
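The log-structured idea in the feature list can be sketched in a few lines of Python: many small files are appended to one large log, and an in-memory index remembers where each one lives. This is an illustrative toy, not Pomegranate's implementation; the class name `SmallFileLog` is hypothetical and an in-memory `io.BytesIO` buffer stands in for the on-disk log.

```python
import io


class SmallFileLog:
    """Toy log-structured store: many small files packed into one append-only log."""

    def __init__(self):
        self._log = io.BytesIO()   # stands in for one large on-disk log file
        self._index = {}           # path -> (offset, length) of the latest version

    def write(self, path, data):
        # Always append at the tail, so small writes become sequential I/O.
        offset = self._log.seek(0, io.SEEK_END)
        self._log.write(data)
        self._index[path] = (offset, len(data))

    def read(self, path):
        offset, length = self._index[path]
        self._log.seek(offset)
        return self._log.read(length)


log = SmallFileLog()
log.write("/photos/a.jpg", b"hello")
log.write("/photos/b.jpg", b"hi")
log.write("/photos/a.jpg", b"bye")   # overwrite appends a new version
print(log.read("/photos/a.jpg"))
```

Overwrites leave dead bytes behind in the log, so a real design also needs garbage collection or compaction, which this sketch leaves out.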

Can Ma, who leads the research on Pomegranate, was kind enough to agree to a short interview.


A Yes for a NoSQL Taxonomy

Thursday, November 5, 2009 at 7:50AM

NorthScale's Steven Yen, in his highly entertaining NoSQL is a Horseless Carriage presentation, has come up with a NoSQL taxonomy that thankfully focuses a little more on what NoSQL is than on what it isn't:

key‐value‐cache
- memcached, repcached, coherence, infinispan, eXtreme scale, jboss cache, velocity, terracotta
key‐value‐store
- keyspace, flare, schema‐free, RAMCloud
eventually‐consistent key‐value‐store
- dynamo, voldemort, Dynomite, SubRecord, MotionDb, Dovetaildb
ordered‐key‐value‐store
- tokyo tyrant, lightcloud, NMDB, luxio, memcachedb, actord
data‐structures server
- redis
tuple‐store
- gigaspaces, coord, apache river
object database
- ZopeDB, db4o, Shoal
document store
- CouchDB, Mongo, Jackrabbit, XML Databases, ThruDB, CloudKit, Persevere, Riak Basho, Scalaris
wide columnar store
- BigTable, HBase, Cassandra, Hypertable, KAI, OpenNeptune, Qbase, KDI

"Who will win?" Steven asks. He answers: the most approachable API with enough power will win. Steven argues that the contender with the most devastating knockout punch will be document stores because "everyone groks documents." He also expects that there will be just a few winners and that products will converge in functionality.

Steven is banking on the "worse is better" model of dominance, which is hard to argue with as it has been so successful an adoption pattern in our field. The convergence idea is something I also agree with. What we have now are a lot of features masquerading as products. Over time they will merge together to become more full-featured offerings.

The key question, though, is how much power is enough to win. Just getting a value back for a key won't be enough. Who are you putting your money on?


Paper: No Relation: The Mixed Blessings of Non-Relational Databases

Thursday, October 29, 2009 at 9:14AM

This excellent survey of the field was written by Ian Thomas Varley as part of his Master of Science in Engineering program.

The aim of this paper is to explore the conceptual design space of non-relational databases as compared to traditional relational databases. It is clear that the design needs of the two paradigms are different, but how fundamental are the differences, and what strategies can we use to transition our conceptual designs from one to the other?

There are a few things to like about this paper. A running example is used to show the different ways to model data depending on which type of solution you are targeting, covering in particular how many-to-many relationships are modeled, how data integrity is maintained, and how optional attributes are supported. There's also a brief survey of some of the major systems.

The most interesting section of the report is where it tackles the problem of design for non-relational systems. The approach has two different phases: design questions and design strategies.

The questions you should ask yourself about your problem are:


Riak - web-shaped data storage system

Thursday, October 8, 2009 at 8:06AM

Update: A short NYC presentation by Bryan Fink demonstrating the Riak web-shaped data storage engine.

Riak is another new and interesting key-value store entrant. Some of the features it offers are:

Document-oriented
Scalable, decentralized key-value store
Standard get, put, and delete operations.
Distributed, fault-tolerant storage solution.
Configurable levels of consistency, availability, and partition tolerance
Support for Erlang, Ruby, PHP, JavaScript, Java, Python, HTTP
Open source and NoSQL
Pluggable backends
Eventing system
Monitoring
Inter-cluster replication
Links between records that can be traversed.
Map/Reduce. Functions are executed on the data node. One interesting difference is that a list of keys is required to specify which values are operated on, as opposed to running calculations on all values.
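The Map/Reduce point in the list above, that the caller supplies the keys to operate on rather than scanning every value, can be sketched in Python as follows. This is only a single-process illustration and not Riak's API: `map_reduce` is a hypothetical helper and a plain dict stands in for the cluster, whereas Riak runs the functions on the nodes that hold the data.

```python
from functools import reduce


def map_reduce(store, keys, map_fn, reduce_fn, initial):
    """Run map_fn over the values of the listed keys only, then fold the results."""
    mapped = [map_fn(store[key]) for key in keys if key in store]
    return reduce(reduce_fn, mapped, initial)


# A plain dict stands in for the distributed store in this sketch.
users = {
    "u1": {"age": 30},
    "u2": {"age": 40},
    "u3": {"age": 50},
}

# Only u1 and u3 are touched; u2 is never read.
total_age = map_reduce(
    users,
    ["u1", "u3"],
    map_fn=lambda doc: doc["age"],
    reduce_fn=lambda acc, age: acc + age,
    initial=0,
)
print(total_age)  # 80
```

Requiring an explicit key list keeps the cost of a job proportional to the data it names, at the price of the caller needing some other way to discover those keys.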

Hacker News Thread. More juicy details on how Riak compares to Cassandra, mongodb, couchdb, etc.
