Friday, November 5, 2010 at 7:44AM
HighScalability Team in hot links
So much good stuff this week...
Adrian Cockcroft Compares NoSQL Availability Models. Let's risk feeding the CAP trolls, and try to get some insight into the differences between the many NoSQL contenders. Adrian asks how each NoSQL product will add a movie to its favorites list, read it back, and how this works across availability zones. Much trickier than it sounds with multiple writers. Cassandra and MongoDB answer back.
Stuff the Internet Says:
@jerng: Reading up on scalability. WHY THE HELL FOR? Because I want to know the future.
@freerangedata: The #nosql options are the micro brews/craft beers of data stores. So many good ones, so little time to try them all.
@edward_ribeiro: Soon, Darwinism will start to play its role on #NoSQL systems. You know, only the fittest will survive.
@connectionreq: I'm always wowed when I hear how Facebook abuses their MySQL databases in crazy ways
@louismrose: This is the kind of scalability we should be working on... http://yfrog.com/59qb0oj
@jkalucki: I don't know. Scale, stability and centrality bring the sexy to infrastructure.
@Taggerz: evented I/O is not magic scalability pixie dust, and like anything, there is a tradeoff.
@CBVirtualPA: Instant scalability - How good is it to know that if a big job comes in, you’ve got the back up to handle it. Priceless.
Redis at Superfeedr. Each of our redis servers process on average 3500 queries per second.
NoSQL In The Wild: How We Do It at Tekpub. We split out the duties of our persistence story cleanly into two camps: reports of things we needed to know to make decisions for the business and data users needed to use our site. Ironically these two different ways of storing data have guided us to do what's natural: put the application data into a high-read, high-availability environment (MongoDb) - put the historical, reporting data into a system that is built to answer questions: a relational data store.
Understanding Throughput-Oriented Architectures by Michael Garland, David B. Kirk. For workloads with abundant parallelism, GPUs deliver higher peak computational throughput than latency-oriented CPUs.
GFS: Evolution on Fast-Forward. Kirk McKusick and Sean Quinlan discuss the origin and evolution of the Google File System.
Building Scalable Database Solution with SQL Azure - Introducing Federation in SQL Azure by Cihan Biyikoglu. Last week, we introduced the Federation concept in SQL Azure in this talk. In the next few posts, I’ll detail the upcoming concept and how to build applications using federations in SQL Azure. This will help you be ready when this technology ships in future versions of SQL Azure.
Google continues its quest to speed up the Internet by releasing mod_pagespeed for Apache, which performs many speed optimizations automatically. We’re starting with more than 15 on-the-fly optimizations that address various aspects of web performance.
On Quora, Ian McAllisterhas a good explanation of Amazon't working backwards approach to product development. Write a press release first and if doesn't sound interesting then you have some work to do.
Matthew Fowler gives a good example of using a compute grid to handle the load caused by consolidation of worldwide processing through a single system in order to save operational costs.
Scriptable Object Cache by Kuch Bhi Kabhi Bhi. What we have done here is inverted the responsibility. Instead of doing stuff in the code and putting it in the cache, what we can do now is do the stuff atomically in the cache itself.
NoSQL, Heroku, and You by Adam Wiggin. Database-as-as-service is one of the coming decade’s most promising business models. Already services like MongoHQ (MongoDB), Cloudant (CouchDB), and Amazon RDS (MySQL) are offering fully hosted and managed databases to apps running in EC2.
Redis in Practice: Who’s Online? by Luke Melia. Great explanation of how to use the advanced data structure capabilities of Redis to solve practical problems like figuring out who is online.
The case for a cloud computing price war. The long list of issues associated with sessions at the various cloud conferences from security to picking a hypervisor remain moot until the cloud industry becomes more price competitive with on premise computing options.
Google with classes onExploring computational thinking. Let's try to remember computational thinking isn't the only kind of thinking we need, but it does seem to be the only kind emphasized.
NoSQL Database Architectures & Hypertable by Doug Judd of Hypertable who presents an overview of NoSQL database architectures and, in particular, Hypertable. Hypertable is an implementation of BigTable.