Entries by HighScalability Team (1576)

Friday
Feb242012

Stuff The Internet Says On Scalability For February 24, 2012

This is not your father's HighScalability:

  • 13,000 times the world’s GDP: Cost of the Death Star
  • Quotable quotes:
    • @chrissalzman: Scalability is the enemy of right now.
    • @resatsch: I like our IT team: "We used Redis before Youporn did it"
    • @virtual_bill: Mixing flash and spinning disk to balance cost is like strapping a rocket to a turtle.
    • @jaksprats: HDDs got slower at random access as they got bigger, cuz disk seeks stayed almost the same, similar phenomenon w/ Flash
  • Priam, king of Troy, begat a daughter, Cassandra, and Netflix, king of true distributed Amazon infrastructure, begat a co-processor for Cassandra, Priam, used for Backup and recovery, Bootstrapping, Centralized configuration management, and RESTful monitoring and metrics. This is why Troy was never actually destroyed, it was simply backedup in-situ to another region.
  • Evernote is everfaithful to SQL because SQL gives it all the ACID it needs to keep its billion Notes and almost 2 billion Resource files in order. But is keeping an attribute map to facilitate painless schema upgrades and partitioning users really being faithful to SQL?
  • With Amazon's Simple Workflow service Amazon is moving up stack, leaving traditional IaaS behind, and heading directly into enterprise PaaS territory. It's an interesting choice. Workflow systems are a highly researched area that tend to not do well in practice because real life state machines and dependencies quickly outstrip the expressiveness and capabilities of the underlying workflow engine. Typically using a queuing system, application logic, and publish-subscribe event notification, will get you where you need to go. But enterprises need workflow and approval processes and that requirement may be one of the tethers being cut with a workflow service. We'll see if users are willing to put that much of their application structure into lock down.
Don't miss all that the Internet has to say on Scalability, click below and become eventually consistent with all scalability knowledge...

Click to read more ...

Tuesday
Feb212012

Pixable Architecture - Crawling, Analyzing, and Ranking 20 Million Photos a Day

This is a guest post by Alberto Lopez Toledo, PHD, CTO of Pixable, and Julio Viera, VP of Engineering at Pixable.

Pixable aggregates photos from across your different social networks and finds the best ones so you never miss an important moment. That means currently processing the metadata of more than 20 million new photos per day: crawling, analyzing, ranking, and sorting them along with the other 5+ billion that are already stored in our database. Making sense of all that data has challenges, but two in particular rise above the rest:

  1. How to access millions of photos per day from Facebook, Twitter, Instagram, and other services in the most efficient manner.
  2. How to process, organize, index, and store all the meta-data related to those photos.

Sure, Pixable’s infrastructure is changing continuously, but there are some things that we have learned over the last year. As a result, we have been able to build a scalable infrastructure that takes advantage of today’s tools, languages and cloud service, all running on Amazon Web Services where we have more than 80 servers running. This document provides a brief introduction to those lessons:

Click to read more ...

Monday
Feb202012

Berkeley DB Architecture - NoSQL Before NoSQL was Cool

After the filesystem and simple library packages like dbmBerkeley DB was the original luxury embedded database widely used by applications as their core database engine. NoSQL before NoSQL was cool. The hidden secret making complex applications sing. If you want to dispense with all the network overhead of a server based system, it's still a a good choice.

There's a great writeup for the architecture behind Berkeley DB in the book The Architecture of Open Source Applications. If you want to understand more about how a database works or if you are pondering how to build your own, it's rich in detail, explanations, and lessons. Here's the Berkeley DB  chapter from the book. It covers topics like: Architectural Overview; The Access Methods: Btree, Hash, Recno, Queue; The Library Interface Layer; The Buffer Manager: Mpool; Write-ahead Logging; The Lock Manager: Lock; The Log Manager: Log; The Transaction Manager: Txn. 

Click to read more ...

Friday
Feb172012

Stuff The Internet Says On Scalability For February 17, 2012

HighScalability Tested, Cyborg Approved:

  • Google's DNS: 70 billion requests a day; Superexponential: the rate of tech progress;  Akka: 48 cores / 20 million messages a second; 1 minute: intervals for health; Santa Tracker: 1.6 Million Requests per Second; 70x: MySQL cluster performance improvement
  • Quotable Quotes
    • @joeweinman: Lightbody at #ccevent : "Rule #1: architect with price structure in mind
    • Techdirt: Nothing Scales Like Stupidity
    • @brainvat: The IRS of Spain has a columnar database with 100,000 columns and over a trillion rows
  • Zynga is now 20% Amazon and 80% their own cloud, reversing their previous approach of launching in Amazon for the spiky growth phase of a product and then folding it back in when growth rates stabilized. Follow the money...

Don't miss all that the Internet has to say on Scalability, click below and become eventually consistent with all scalability knowledge...

Click to read more ...

Thursday
Feb162012

A Super Short on the Youporn Stack - 300K QPS and 100 Million Page Views Per Day

Eric Pickup from Youporn.com posted on a news group that Youporn is now 100% Redis based and will soon be revealing more about their architecture at the ConFoo conference. Some stunning, but not surprising numbers were revealed:

  • 100 million page views per day
  • A cluster of Redis slaves are handling over 300k queries per second.

Some additional nuggets:

  • Additional Redis nodes were added because the network cards couldn't keep up with Redis.
  • Impressed with Redis' performance.
  • All reads come from Redis;  we are maintaining MySQL just to allow us to build new sorted sets as our requirement change
  • Most data is found in hashes with ordered sets used to know what data to show.   
    • A typical lookup would be an zInterStore on: videos:filters:released, Videos:filters:orientation:straight,Videos:filters:categories:{category_id}, Videos:ordering:rating
    • Then perform a zRange to get the pages we want and get the list of video_ids back. 
    • Then start a pipeline and get all the videos from hashes.
    • Do use some key/value lookups and some lists, but the majority of our operations are using the above pattern.

Not much to see yet, but hopefully we'll learn more after their talk. 

Thursday
Feb162012

A Short on the Pinterest Stack for Handling 3+ Million Users

Pinterest co-founder Paul Sciarra shared a bit about their stack on Quora:

  • Python + heavily-modified Django at the application layer
  • Tornado and (very selectively) node.js as web-servers.  
  • Memcached and membase / redis for object- and logical-caching, respectively.  
  • RabbitMQ as a message queue.  
  • Nginx, HAproxy and Varnish for static-delivery and load-balancing.
  • Persistent data storage using MySQL.  
  • MrJob on EMR for map-reduce.
  • Git.

Alex Popescu has created a cool diagram of the setup and provided some thoughtful analysis as well.

Tuesday
Feb142012

Sponsored Post: Percona Live, AiCache, Next Big Sound, ElasticHosts, Red 5 Studios, Logic Monitor, New Relic, AppDynamics, CloudSigma, ManageEngine, Site24x7

Who's Hiring?

  • Anybody interested in helping manage a 100+ Linux server deployment? Next Big Sound is a an analytics company for the music industry and is looking someone to help them scale.
  • Red 5 Studios. Wanted: DBAs and Programmers interested in MySQL scalability and replication. If interested, please see us here

Fun and Informative Events

  • The Percona Live MySQL Conference & Expo features 60+ speakers, 72 breakout sessions, and keynotes from HP, Facebook, Box, Eucalyptus Systems, and more. April 10-12 in Santa Clara

Cool Products and Services

  • aiCache creates a better user experience by increasing the speed scale and stability of your web-site.
  • ElasticHosts award winning cloud server hosting launches across North America. Adding data centers in Los Angeles and Toronto. Free trial.
  • LogicMonitor - Hosted monitoring of your entire technology stack. Dashboards, trending graphs, alerting. Try it free and be up and running in just 15 minutes.
  • New Relic - real user monitoring optimize for humans, not bots. Live application stats, SQL/NoSQL performance, web transactions, proactive notifications. Take 2 minutes to sign up for a free trial.
  • AppDynamics is the very first free product designed for troubleshooting Java performance while getting full visibility in production environments. Visit http://www.appdynamics.com/free.
  • CloudSigma. Utility style high performance cloud servers in the US and Europe delivered on all 10GigE networking. Run any OS, take advantage of SSD storage and tailored infrastructure options.
  • ManageEngine Applications Manager : Monitor physical, virtual and Cloud Applications.
  • www.site24x7.com : Monitor End User Experience from a global monitoring network.

For a longer description of each sponsor, please read more below...

Click to read more ...

Monday
Feb132012

Tumblr Architecture - 15 Billion Page Views a Month and Harder to Scale than Twitter

With over 15 billion page views a month Tumblr has become an insanely popular blogging platform. Users may like Tumblr for its simplicity, its beauty, its strong focus on user experience, or its friendly and engaged community, but like it they do.

Growing at over 30% a month has not been without challenges. Some reliability problems among them. It helps to realize that Tumblr operates at surprisingly huge scales: 500 million page views a day, a peak rate of ~40k requests per second, ~3TB of new data to store a day, all running on 1000+ servers.

One of the common patterns across successful startups is the perilous chasm crossing from startup to wildly successful startup. Finding people, evolving infrastructures, servicing old infrastructures, while handling huge month over month increases in traffic, all with only four engineers, means you have to make difficult choices about what to work on. This was Tumblr’s situation. Now with twenty engineers there’s enough energy to work on issues and develop some very interesting solutions.

Tumblr started as a fairly typical large LAMP application. The direction they are moving in now is towards a distributed services model built around Scala, HBase, Redis, Kafka, Finagle,  and an intriguing cell based architecture for powering their Dashboard. Effort is now going into fixing short term problems in their PHP application, pulling things out, and doing it right using services.

The theme at Tumblr is transition at massive scale. Transition from a LAMP stack to a somewhat bleeding edge stack. Transition from a small startup team to a fully armed and ready development team churning out new features and infrastructure. To help us understand how Tumblr is living this theme is startup veteran Blake Matheny, Distributed Systems Engineer at Tumblr. Here’s what Blake has to say about the House of Tumblr:

Click to read more ...

Friday
Feb102012

Stuff The Internet Says On Scalability For February 10, 2012

HighScalability Tested, Mother Approved:

Don't miss all that the Internet has to say on Scalability, click below and become eventually consistent with all scalability knowledge...

Click to read more ...

Friday
Feb032012

Stuff The Internet Says On Scalability For February 3, 2012

I'm only here for the HighScalability:

  • 762 billion: objects stored on S3; $1B/Quarter: Google spend on servers; 100 Petabytes: Storage for Facebook's photos and videos.
  • Quotable Quotes:
    • @knorth2: #IPO filing says #Facebook is "dependent on our ability to maintain and scale our technical infrastructure"
    • @debuggist: Scalability trumps politics.
    • @cagedether: Hype of #Hadoop is driving pressure on people to keep everything
    • @nanreh: My MongoDB t shirt has never helped me get laid. This is typical with #nosql databases.
    • @lusis: I kenna do it, Capt'n. IO is pegged, disk is saturated…I lost 3 good young men when the cache blew up!
    • Kenton Varda: Jeff Dean puts his pants on one leg at a time, but if he had more than two legs, you'd see that his approach is actually O(log n)
  • One upon a time manufacturing located near rivers for power. Likewise software will be located next to storage, CPU, and analytics resources in a small cartel of clouds. That's the contention of Here Come the Cloud Cartels. This tributary system (pun intended) will be Amazon, Cisco Systems, Google, I.B.M., Microsoft, Oracle and a few competitors. Supposedly the benefit will be cheap computing, but when has a cartel ever lead to cheap anything?
Don't miss all that the Internet has to say on Scalability, click below or stay ignorant...

Click to read more ...