Facebook Slashes Data Replication With HDFS RAID

Avoiding replication is a key component of efficient data storage, and one method Facebook uses to accomplish this task is HDFS RAID, which it detailed in a post on its engineering blog.

HDFSRAID650Avoiding replication is a key component of efficient data storage, and one method Facebook uses to accomplish this task is HDFS RAID, which it detailed in a post on its engineering blog.

Facebook credited members of its data infrastructure team with the development of HDFS RAID — including Dikang Gu, Peter Knowles, and Guogiang Jerry Chen — and it offered some background on the technology in its developer blog post:

The default replication of a file in HDFS is three, which can lead to a lot of space overhead.

AW+

WORK SMARTER - LEARN, GROW AND BE INSPIRED.

Subscribe today!

To Read the Full Story Become an Adweek+ Subscriber

View Subscription Options

Already a member? Sign in