Tachyon - Fault Tolerant Distributed File System with 300 Times Higher Throughput than HDFS

Top

Tachyon - Fault Tolerant Distributed File System with 300 Times Higher Throughput than HDFS

Wednesday, April 17, 2013 at 9:25AM

Tachyon (github) is interesting new filesystem brought to by the folks at the UC Berkeley AMP Lab:

Tachyon is a fault tolerant distributed ﬁle system enabling reliable file sharing at memory-speed across cluster frameworks, such as Spark and MapReduce.It offers up to 300 times higher throughput than HDFS, by leveraging lineage information and using memory aggressively. Tachyon caches working set files in memory, and enables different jobs/queries and frameworks to access cached files at memory speed. Thus, Tachyon avoids going to disk to load datasets that is frequently read.

It has a Java-like File API, native support for raw tables, a pluggable file system, and it works with Hadoop with no modifications.

It might work well for streaming media too as you wouldn't have to wait for the complete file to hit the disk before rendering.

Discuss on Hacker News

HighScalability Team |

Reader Comments

There are no comments for this journal entry. To create a new comment, use the form below.

Post a New Comment

Enter your information below to add a new comment.

Author:

Author Email (optional):

Author URL (optional):

Post:

↓ | ↑

Some HTML allowed: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <code> <em> <i> <strike> <strong>