Facebook Slashes Data Replication With HDFS RAID
Avoiding replication is a key component of efficient data storage, and one method Facebook uses to accomplish this task is HDFS RAID, which it detailed in a post on its engineering blog.
Avoiding replication is a key component of efficient data storage, and one method Facebook uses to accomplish this task is HDFS RAID, which it detailed in a post on its engineering blog.
Facebook credited members of its data infrastructure team with the development of HDFS RAID — including Dikang Gu, Peter Knowles, and Guogiang Jerry Chen — and it offered some background on the technology in its developer blog post:
The default replication of a file in HDFS is three, which can lead to a lot of space overhead.
WORK SMARTER - LEARN, GROW AND BE INSPIRED.
Subscribe today!
To Read the Full Story Become an Adweek+ Subscriber
Already a member? Sign in