Erasure Coding for Better Storage Efficiency

Ceph's default replication level provides excellent protection against data loss by storing three copies of your data on different OSDs. The chance of losing all three disks that contain the same objects, within the period that it takes Ceph to rebuild from a failed disk, is verging on the extreme edge of probability. However, storing three copies of data vastly increases both the purchase cost of the hardware and also associated operational costs such as power and cooling. Furthermore, storing copies also means that for every client write, the backend storage must write three times the amount of data. In some scenarios, either of these drawbacks may mean that Ceph is not a viable option.

Erasure codes are designed to offer a solution. Much like how RAID 5 and 6 offer increased usable storage capacity over RAID 1, erasure coding allows Ceph to provide more usable storage from the same raw capacity. However, also like the parity-based RAID levels, erasure coding brings its own set of disadvantages.

In this chapter you will learn the following:

  • What is erasure coding and how does it work?
  • Details around Ceph's implementation of erasure coding
  • How to create and tune an erasure-coded RADOS pool
  • A look into the future features of erasure coding with the Ceph Kraken release
..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
52.14.172.93