Facebook credited members of its data infrastructure team with the development of HDFS RAID — including Dikang Gu, Peter Knowles, and Guoqiang Jerry Chen — and it offered some background on the technology in its developer blog post:
The default replication of a file in HDFS is three, which can lead to a lot of space overhead. HDFS RAID reduces this space overhead by reducing the effective replication of data. The replication factor of the original file is reduced, but data safety guarantees are maintained by creating parity data.
There are two erasure codes implemented in HDFS RAID: XOR and Reed-Solomon. The XOR implementation sets the replication factor of the file to two. It maintains data safety by creating one parity block for every 10 blocks of source data. The parity blocks are stored in a parity file, also at replication two. Thus for 10 blocks of the source data, we have 20 replicas for the source file and two replicas for the parity block, which accounts for a total of 22 replicas. The effective replication of the XOR scheme is 22/10 = 2.2.
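The XOR scheme's recovery property can be sketched in a few lines: XOR-ing all the source blocks produces a parity block, and XOR-ing the survivors with that parity reconstructs any single lost block. This is a toy illustration with tiny byte strings standing in for HDFS blocks, not actual HDFS RAID code.

```python
# Toy sketch of XOR parity (hypothetical 4-byte "blocks", not HDFS code).
source_blocks = [b"\x01\x02\x03\x04", b"\x10\x20\x30\x40", b"\x0a\x0b\x0c\x0d"]

def xor_parity(blocks):
    """XOR all blocks together byte-by-byte to form one parity block."""
    parity = bytearray(len(blocks[0]))
    for block in blocks:
        for i, byte in enumerate(block):
            parity[i] ^= byte
    return bytes(parity)

parity = xor_parity(source_blocks)

# If any single block is lost, XOR-ing the surviving blocks with the
# parity block reconstructs it (XOR is its own inverse).
lost = source_blocks[1]
recovered = xor_parity([source_blocks[0], source_blocks[2], parity])
assert recovered == lost
```

Because a single parity block can only repair one missing block per stripe, the XOR scheme still keeps two replicas of everything, whereas Reed-Solomon's four parity blocks tolerate more failures and allow replication one.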
The Reed-Solomon implementation sets the replication factor of the file to one. It maintains data safety by creating four parity blocks for every 10 blocks of source data. The parity blocks are stored in a parity file at replication one. Thus for 10 blocks of the source data, we have 10 replicas for the source file and four replicas for the parity blocks, which accounts for a total of 14 replicas. The effective replication of the Reed-Solomon scheme is 14/10 = 1.4.
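The arithmetic behind both figures is the same: replicas stored per stripe divided by source blocks per stripe. A small sketch (the function name and parameters are illustrative, not HDFS RAID's API):

```python
def effective_replication(data_rep, stripe_len, parity_blocks, parity_rep):
    """Replicas stored per stripe divided by source blocks per stripe."""
    total_replicas = stripe_len * data_rep + parity_blocks * parity_rep
    return total_replicas / stripe_len

# XOR: source at replication 2, 1 parity block per 10 source blocks,
# parity also at replication 2 -> (10*2 + 1*2) / 10
print(effective_replication(2, 10, 1, 2))  # 2.2

# Reed-Solomon: source at replication 1, 4 parity blocks per 10,
# parity at replication 1 -> (10*1 + 4*1) / 10
print(effective_replication(1, 10, 4, 1))  # 1.4
```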
Theoretically we should be able to get a 2.2x replication factor for XOR RAID files and a 1.4x replication factor for Reed-Solomon RAID files.
For much more on HDFS RAID — including how Facebook addressed the challenges of data corruption, of implementing RAID across a large directory, and of handling directory changes — please see the engineering blog post.