I. Introduction
In the last decade, we witnessed a stark increase in data generated by various sources, e.g., social networks, Internet of Things (IoT), and persistently stored on public clouds (an estimated 175 ZB by 2025 [1]). Cloud-based storage systems put under enormous pressure on their storage backends, as they must continuously expand the storage infrastructure to cope with such ever-growing demand. There exist different techniques to lower storage footprint: (a) compression [2] and deduplication [3], [4], (b) erasure coding [5], [6] as an alternative to store multiple replicas for reliability, or (c) a combination of them [7].