Entries in Database (63)

Tuesday
Oct082013

F1 and Spanner Holistically Compared

This article, F1: A Distributed SQL Database That Scales by Srihari Srinivasan, is republished with permission from a blog you really should follow: Systems We Make - Curating Complex Distributed Systems.

With both the F1 and Spanner papers out, it's now possible to understand their interplay a bit more holistically. So let's start by revisiting the key goals of both systems.

Key Goals of F1's design
  • System must be able to scale up by adding resources
  • Ability to re-shard and rebalance data without application changes
  • ACID consistency for transactions
  • Full SQL support, support for indexes
Spanner’s objectives

Click to read more ...

Wednesday
Jul042012

Top Features of a Scalable Database

This is a guest post by Douglas Wilson, EMEA Field Application Engineer at Raima, based on insights from building their Raima Database Manager.

Scalability and Hardware

Scalability is the ability to maintain performance as demands on the system increase, by adding further resources. Normally those resources will be in the form of hardware. Since processor speeds are no longer increasing much, scaling up the hardware normally means adding extra processors or cores, and more memory.

Scalability and Software

However, scalability requires software that can utilize the extra hardware effectively. The software must be designed to allow parallel processing. In the context of a database engine this means that the server component must be multi-threaded, to allow the operating system to schedule parallel tasks on all the cores that are available. Not only that, but the database engine must provide an efficient way to break its workload into as many parallel tasks as there are cores. So, for example, if the database server always uses only four threads then it will make very little difference whether this server runs on a four-core machine or an eight-core machine.
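To make the core-count point concrete, here is a minimal sketch of sizing a worker pool to the machine and splitting a scan into one task per core. Python is used purely for illustration (its threads won't actually speed up CPU-bound work because of the GIL; a real engine uses native threads), and the table and predicate are made up:

    import os
    from concurrent.futures import ThreadPoolExecutor

    def scan_partition(rows, predicate):
        # Scan one slice of the table and return the matching rows.
        return [r for r in rows if predicate(r)]

    def parallel_scan(table, predicate):
        cores = os.cpu_count() or 1
        chunk = max(1, len(table) // cores)   # aim for one slice per core
        slices = [table[i:i + chunk] for i in range(0, len(table), chunk)]
        with ThreadPoolExecutor(max_workers=cores) as pool:
            parts = list(pool.map(scan_partition, slices,
                                  [predicate] * len(slices)))
        return [row for part in parts for row in part]

    # Example: find even values in a toy million-row "table".
    matches = parallel_scan(list(range(1_000_000)), lambda r: r % 2 == 0)

The point of the structure: a server hard-coded to four worker threads would gain nothing from the extra cores; sizing the pool to os.cpu_count() is what lets the same code use a four-core or an eight-core box fully.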

Distributed Design

Click to read more ...

Wednesday
Dec212011

In Memory Data Grid Technologies

In Memory Data Grid Technologies - what are they, who are the leaders, why would I want one, what are the business benefits, and how do you use one?

What is an In Memory Data Grid?

It is not an in-memory relational database, a NoSQL database, or a relational database. It is a different breed of software datastore.

In summary, an IMDG is an ‘off the shelf’ software product that exhibits the following characteristics:

The data model is distributed across many servers in a single location or across multiple locations.  This distribution is known as a data fabric.  This distributed model is known as a ‘shared nothing’ architecture.
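As a toy illustration of the shared-nothing idea (the node names and hashing scheme below are made up, not any vendor's design), each "server" owns a disjoint slice of the keyspace, chosen by hashing the key:

    import hashlib

    NODES = ["server-a", "server-b", "server-c"]   # hypothetical grid members
    stores = {node: {} for node in NODES}          # each node's private memory

    def owner(key):
        # Hash the key to pick exactly one owning node.
        digest = hashlib.md5(key.encode()).hexdigest()
        return NODES[int(digest, 16) % len(NODES)]

    def put(key, value):
        stores[owner(key)][key] = value    # only the owner holds the data

    def get(key):
        return stores[owner(key)].get(key)

    put("order:42", {"total": 99.5})
    assert get("order:42")["total"] == 99.5

Because no two nodes share a key, nodes never contend for each other's data; that independence is what "shared nothing" buys.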

Click to read more ...

Monday
Oct182010

NoCAP

In this post I wanted to spend some time on the CAP theorem, clarify some of the confusion that I often see when people associate CAP with scalability without fully understanding the implications that come with it, and look at the alternative approaches.

You can read the full article here.

Thursday
Sep102009

Building Scalable Databases: Denormalization, the NoSQL Movement and Digg

Database normalization is a technique for designing relational database schemas that ensures that the data is optimal for ad-hoc querying and that modifications such as deletion or insertion of data do not lead to data inconsistency. Database denormalization is the process of optimizing your database for reads by creating redundant data. A consequence of denormalization is that insertions or deletions could cause data inconsistency if not uniformly applied to all redundant copies of the data within the database.
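Here is a tiny sketch of that hazard, with hypothetical table and field names: the author's name is copied into each post (denormalized for fast reads), so a rename must touch every redundant copy or the data silently goes inconsistent:

    users = {1: {"name": "alice"}}
    posts = [
        {"id": 10, "author_id": 1, "author_name": "alice", "body": "hi"},
        {"id": 11, "author_id": 1, "author_name": "alice", "body": "bye"},
    ]

    def rename_user(user_id, new_name):
        users[user_id]["name"] = new_name
        # Skip this loop and the redundant copies go stale.
        for p in posts:
            if p["author_id"] == user_id:
                p["author_name"] = new_name

    rename_user(1, "alicia")
    assert all(p["author_name"] == "alicia" for p in posts)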

Read more on Carnage4life blog...

Wednesday
Aug052009

Anti-RDBMS: A list of distributed key-value stores

Update 8: Introducing MongoDB by Eliot Horowitz.

Update 7: The Future of Scalable Databases by Robin Mathew.

Update 6: NoSQL: If Only it Was that Easy. BJ Clark lays down the law on which databases are scalable: Tokyo - NO, Redis - NO, Voldemort - YES, MongoDB - Not Yet, Cassandra - Probably, Amazon S3 - YES * 2, MySQL - NO. The real thing to point out is that if you are being held back from making something super awesome because you can’t choose a database, you are doing it wrong.
Update 5: Exciting stuff happening in Japan at this Key-Value Storage meeting in Tokyo. Presentations on Groonga, Senna, Lux IO, Tokyo-Cabinet, Tx, repcached, Kai, Cagra, kumofs, ROMA, and Flare.

Update 4: NoSQL and the Relational Model: don’t throw the baby out with the bathwater by Matthew Willson. So my key point is, this kind of modelling is WORTH DOING, regardless of which database tool you end up using for physical storage.
Update 3: Choosing a non-relational database; why we migrated from MySQL to MongoDB. An illuminating article explaining why Boxed Ice move to MongoDB over MySQL and other NoSQL options: easy install, PHP support, replication and master-master support, good doc, auto sharding on the road map. They still use MySQL for billing.
Update 2: They are now called NoSQL databases. So keep up! Eric Lai wrote a good article in Computerworld No to SQL? Anti-database movement gains steam about the phenomena. There was even a NoSQL conference. It was unfortunately full by the time I wanted to sign up, but there are presentations by all the major players. Nice Hacker News thread too.
Update: Some Notes on Distributed Key Stores by Leonard Lin. What's the best way to handle a fast growing system with 100M items that requires low latency and lots of inserts? Leonard takes a trip through several competing systems. The winner was: Tokyo Cabinet.

Richard Jones has put together a very nice list of various key-value stores around the internets. The list includes: Project Voldemort, Ringo, Scalaris, Kai, Dynomite, MemcacheDB, ThruDB, CouchDB, Cassandra, HBase, and Hypertable. Richard also includes some commentary and their basic components (language, fault tolerance, persistence, client protocol, data model, docs, community).

There's an excellent discussion in the comments of Paxos vs Vector Clock techniques for synchronizing writes in the face of network failures.
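For readers who haven't met vector clocks, here is a minimal sketch of the technique being debated (the generic textbook version, not any particular store's implementation): each replica counts its own writes, and two versions conflict when neither clock dominates the other:

    def increment(clock, node):
        # Record one more write observed at this node.
        clock = dict(clock)
        clock[node] = clock.get(node, 0) + 1
        return clock

    def descends(a, b):
        # True if clock a has seen everything clock b has.
        return all(a.get(n, 0) >= c for n, c in b.items())

    def compare(a, b):
        if descends(a, b): return "a-newer-or-equal"
        if descends(b, a): return "b-newer"
        return "concurrent"   # a conflict the application must reconcile

    v1 = increment({}, "node1")    # original write on node1
    v2 = increment(v1, "node2")    # update applied on node2
    v3 = increment(v1, "node3")    # concurrent update on node3
    assert compare(v2, v3) == "concurrent"

Paxos instead reaches agreement on a single order of writes up front; vector clocks accept divergence during a partition and detect it afterwards, which is the trade-off the comment thread argues over.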

Thursday
Jul022009

Product: Facebook's Cassandra - A Massive Distributed Store

Update 2: Presentation from the NoSQL conference: slides, video.
Update: Why you won't be building your killer app on a distributed hash table by Jonathan Ellis. Why I think Cassandra is the most promising of the open-source distributed databases -- you get a relatively rich data model and a distribution model that supports efficient range queries. These are not things that can be grafted on top of a simpler DHT foundation, so Cassandra will be useful for a wider variety of applications.

James Hamilton has published a thorough summary of Facebook's Cassandra, another scalable key-value store for your perusal. It's open source and is described as a "BigTable data model running on a Dynamo-like infrastructure." Cassandra is used in Facebook as an email search system containing 25TB of data and over 100 million mailboxes.

  • Google Code for Cassandra - A Structured Storage System on a P2P Network
  • SIGMOD 2008 Presentation.
  • Video Presentation at Facebook
  • Facebook Engineering Blog for Cassandra
  • Anti-RDBMS: A list of distributed key-value stores
  • Facebook Cassandra Architecture and Design by James Hamilton
Thursday
Jul022009

Product: Project Voldemort - A Distributed Database

Update: Presentation from the NoSQL conference: slides, video 1, video 2.

Project Voldemort is an open source implementation of the basic parts of Dynamo (Amazon's Highly Available Key-value Store), a distributed key-value storage system. LinkedIn is using it in their production environment for "certain high-scalability storage problems where simple functional partitioning is not sufficient."

From their website:

  • Data is automatically replicated over multiple servers.
  • Data is automatically partitioned so each server contains only a subset of the total data.
  • Server failure is handled transparently.
  • Pluggable serialization is supported to allow rich keys and values including lists and tuples with named fields, as well as to integrate with common serialization frameworks like Protocol Buffers, Thrift, and Java Serialization.
  • Data items are versioned to maximize data integrity in failure scenarios without compromising availability of the system.
  • Each node is independent of other nodes with no central point of failure or coordination.
  • Good single node performance: you can expect 10-20k operations per second depending on the machines, the network, and the replication factor.
  • Support for pluggable data placement strategies to support things like distribution across data centers that are geographically far apart.

They also have a nice design page going over some of their architectural choices: key-value store only, no complex queries or joins; consistent hashing is used to assign data to nodes; JSON is used for schema definition; versioning and read-repair for distributed consistency; a strict layered architecture with put, get, and delete as the interface between layers.
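As a rough sketch of the consistent hashing idea mentioned there (the standard textbook ring, not Voldemort's actual code), a key walks clockwise to the first node point at or past its hash, so adding or removing a node only moves the keys adjacent to that node's points:

    import bisect
    import hashlib

    def h(s):
        # Hash a string onto the ring's integer keyspace.
        return int(hashlib.md5(s.encode()).hexdigest(), 16)

    class Ring:
        def __init__(self, nodes, vnodes=3):
            # Several virtual points per node smooth out the distribution.
            self.points = sorted((h(f"{node}#{i}"), node)
                                 for node in nodes for i in range(vnodes))
            self.hashes = [p for p, _ in self.points]

        def node_for(self, key):
            # Walk clockwise to the first point at or past the key's hash.
            i = bisect.bisect(self.hashes, h(key)) % len(self.points)
            return self.points[i][1]

    ring = Ring(["n1", "n2", "n3"])
    print(ring.node_for("user:1001"))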

Just a hint when naming a project: don't name it after one of the most popular key words in muggledom. The only way someone will find your genius via search is with a dark spell. As I am a Good Witch I couldn't find much on Voldemort in the real world. But the idea is great and is very much in line with current thinking on scalable database design. Worth a look.

Related Articles

  • The CouchDB Project
Saturday
Jun132009

Neo4j - a Graph Database that Kicks Buttox

Update: Social networks in the database: using a graph database. A nice post on representing, traversing, and performing other common social network operations using a graph database.

If you are Digg or LinkedIn you can build your own speedy graph database to represent your complex social network relationships. For those of more modest means Neo4j, a graph database, is a good alternative.

A graph is a collection of nodes (things) and edges (relationships) that connect pairs of nodes. Slap properties (key-value pairs) on nodes and relationships and you have a surprisingly powerful way to represent most anything you can think of. In a graph database "relationships are first-class citizens. They connect two nodes and both nodes and relationships can hold an arbitrary amount of key-value pairs. So you can look at a graph database as a key-value store, with full support for relationships."
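Here is about the smallest possible property-graph sketch matching that description (plain Python data structures for illustration, not Neo4j's API), with key-value pairs on both nodes and relationships:

    # Nodes carry properties.
    nodes = {
        "alice": {"age": 31},
        "bob":   {"age": 27},
    }
    # Relationships carry properties too: (from, type, to, properties).
    relationships = [
        ("alice", "KNOWS", "bob", {"since": 2007}),
    ]

    def neighbors(node, rel_type=None):
        # Follow outgoing relationships, optionally filtered by type.
        return [dst for src, t, dst, _ in relationships
                if src == node and (rel_type is None or t == rel_type)]

    assert neighbors("alice", "KNOWS") == ["bob"]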

For examples of what graphs look like, take a look at the Graph Image Gallery.

Here's a good summary by Emil Eifrem, founder of Neo4j, making the case for why graph databases rule:

Most applications today handle data that is deeply associative, i.e. structured as graphs (networks). The most obvious example of this is social networking sites, but even tagging systems, content management systems and wikis deal with inherently hierarchical or graph-shaped data.

This turns out to be a problem because it’s difficult to deal with recursive data structures in traditional relational databases. In essence, each traversal along a link in a graph is a join, and joins are known to be very expensive. Furthermore, with user-driven content, it is difficult to pre-conceive the exact schema of the data that will be handled. Unfortunately, the relational model requires upfront schemas and makes it difficult to fit this more dynamic and ad-hoc data.

A graph database uses nodes, relationships between nodes and key-value properties instead of tables to represent information. This model is typically substantially faster for associative data sets and uses a schema-less, bottom-up model that is ideal for capturing ad-hoc and rapidly changing data.

So relational databases can't handle complex relationships. Graph systems are opaque, unmaintainable, and inflexible. OO databases lose flexibility by combining logic and data. Key-value stores require the programmer to maintain all relationships. There, everybody sucks :-)

Neo4j's Key Characteristics

  • Dual license: open source and commercial.
  • Well suited for many web use cases such as tagging, metadata annotations, social networks, wikis and other network-shaped or hierarchical data sets.
  • An intuitive graph-oriented model for data representation. Instead of static and rigid tables, rows and columns, you work with a flexible graph network consisting of nodes, relationships and properties.
  • Decent documentation, active and responsive email list, a few releases, good buzz. All a good sign for something that has a chance to last a while.
  • Has bindings for a number of languages: Python, Jython, Ruby, and Clojure. No binding for .Net yet. The recommendation is to access it using a REST interface.
  • Disk-based, native storage manager completely optimized for storing graph structures for maximum performance and scalability. SSD ready.
  • Massive scalability. Neo4j can handle graphs of several billion nodes/relationships/properties on a single machine.
  • Frequently outperforms relational backends by >1000x for many increasingly important use cases.
  • Powerful traversal framework for high-speed traversals in the node space.
  • Small footprint. Neo4j is a single <500k jar with one dependency (the Java Transaction API).
  • Simple and convenient object-oriented API.
  • Retrieving children is trivial in a graph database.
  • No need to flatten and serialize an object graph as graphs are native to a graph database.
  • Fully transactional like a real database. Supports JTA/JTS, XA, 2PC, Tx recovery, deadlock detection, etc.
  • Current implementation is built to handle large graphs that don't fit in memory with durability. It's not a cache, it's a fully persistent transactional store.
  • No events or triggers. Planned in a future release.
  • No sharding. A suggestion for how one might shard is here.
  • Some common graph calculations are missing. For example, in a social network finding a common friend for a set of users.
  • Separates data and logic with a more "natural" representation than tables. This makes it easy to use Neo4j as the storage tier for OO code while keeping behaviour and state separate.
  • Neo4j traverses depths of 1000 levels and beyond at millisecond speed. That's many orders of magnitude faster than relational systems (see the traversal sketch after this list).
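Here is the promised traversal sketch: a generic breadth-first walk (plain Python, not Neo4j's actual traversal framework) showing why deep traversal is cheap when relationships are first-class: each hop is a direct list lookup rather than a relational join:

    from collections import deque

    # Adjacency list: node -> directly connected nodes.
    graph = {0: [1, 2], 1: [3], 2: [3], 3: [4], 4: []}

    def traverse(start, max_depth):
        seen, frontier = {start}, deque([(start, 0)])
        while frontier:
            node, depth = frontier.popleft()
            if depth == max_depth:
                continue
            for nxt in graph[node]:          # one hop = one list lookup
                if nxt not in seen:
                    seen.add(nxt)
                    frontier.append((nxt, depth + 1))
        return seen

    # Depth is bounded by the graph itself, not by a chain of joins.
    print(traverse(0, 1000))

In a relational schema the same walk would be one self-join per level, which is where the orders-of-magnitude gap in the claim above comes from.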

Neo4j vs Hadoop

This post makes an illuminating comparison of Neo4j and Hadoop:

In principle, Hadoop and other Key-Value stores are mostly concerned with relatively flat data structures. That is, they are extremely fast and scalable regarding retrieval of simple objects, like values, documents or even objects.

However, if you want to do deeper traversal of e.g. a graph, you will have to retrieve the nodes for every traversal step (very fast) and then match them yourself in some manner (e.g. in Java or so) - slow.

Neo4j in contrast is built around the concept of "deep" data structures. This gives you almost unlimited flexibility regarding the layout of your data and domain object graph and very fast deep traversals (hops over several nodes) since they are handled natively by the Neo4j engine down to the storage layer and not your client code. The drawback is that for huge data amounts (>1 billion nodes) the clustering and partitioning of the graph becomes non-trivial, which is one of the areas we are working on.

Then of course there are differences in the transaction models, consistency and others, but I hope this gives you a very short philosophical answer :)

It would have never occurred to me to compare the two, but the comparison shows why we need multiple complementary views of data. Hadoop scales the data grid and the compute grid and is more flexible in how data are queried and combined. Neo4j has far lower latencies for complex navigation problems. It's not a zero-sum game.

Related Articles

  • Neo4j -- or why graph dbs kick ass
  • The current database debate and graph databases by Anders Nawroth
  • On Building a Stupidly Fast Graph Database by Scott Wheeler and the Hacker News Thread
  • Network Model from wikipedia
  • Databases as a service: FathomDB
  • Using Neo4J to load and query OWL ontologies by Sujit Pal
  • Graph Databases and the Future of Large-Scale Knowledge Management by Marko A. Rodriguez
  • Memo To The Semantic Web: Drop “Semantic” And Become The “Graph Web” by Hank Williams
  • Is the Relational Database Doomed? by Tony Bain
  • Neo Database Introduction
  • Neo4j Email List
  • flare Data Visualization for the Web
  • Giant Global Graph by Tim Berners-Lee
  • Tim Berners-Lee -- Linked Data at TED
  • Drop ACID and Think About Data by Bob Ippolito
  • Analyzing and adapting graph algorithms for large persistent graphs by Patrik Larsson
Tuesday
May262009

Database Optimize patterns


Most websites and enterprise applications rely on the database backing them to store application and customer data, so at some point the database could become the main performance and scalability bottleneck for your system. I'm here today to cure this! Key points:
• Database supporters and resisters:
  • Database supporters: MySQL, SQL Server, and PostgreSQL
  • Database resisters: HBase, MongoDB, Redis, and others
• Database optimizing patterns:
  • What to store in the database?
  • Field data types
  • The primary key and the indexes
  • Data retrieval, stored procedures, and ad-hoc queries
  • Caching (see the sketch after this outline)
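Here is the sketch for the caching item: a generic read-through query cache (the function names and the TTL below are made up for illustration, not from the post) that serves repeated queries from memory and falls back to the database on a miss:

    import time

    _cache = {}          # query -> (expires_at, rows)
    TTL_SECONDS = 30

    def run_query(sql):
        # Stand-in for a real database call.
        return [("row for", sql)]

    def cached_query(sql):
        entry = _cache.get(sql)
        if entry and entry[0] > time.time():
            return entry[1]                    # hit: no database round trip
        rows = run_query(sql)                  # miss: read through to the DB
        _cache[sql] = (time.time() + TTL_SECONDS, rows)
        return rows

    cached_query("SELECT * FROM orders")       # populates the cache
    cached_query("SELECT * FROM orders")       # served from memory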

Click to read more ...
