- Shifted dedup-able data.
  Allow dedup buffer generation to shift contents by a random number
  of sectors (filling the gaps with incompressible data). Some storage
  subsystems have updated their deduplication detection algorithms to
  look for shifted data as well. For example, some databases prepend a
  timestamp to written blocks, which leaves the underlying data
  dedup-able at a different alignment. FIO should be able to simulate
  such a workload.

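The shifted-data item can be illustrated with a minimal sketch. This is not
FIO code: the function name, the 512-byte sector size, and the use of
os.urandom as the source of gap filler are all assumptions made for the
example. It places a shared payload at a random sector offset inside a
larger buffer and fills the rest with random bytes.

```python
import os
import random

SECTOR_SIZE = 512  # assumed sector size in bytes


def shifted_dedupe_buffer(payload, buf_sectors, rng):
    """Return (buffer, shift): `payload` placed at a random sector offset.

    Hypothetical helper for illustration only. The gaps before and after
    the payload are filled with random bytes, which do not compress well.
    """
    if len(payload) % SECTOR_SIZE:
        raise ValueError("payload must be a whole number of sectors")
    payload_sectors = len(payload) // SECTOR_SIZE
    if payload_sectors > buf_sectors:
        raise ValueError("payload does not fit in the buffer")

    # Pick how many sectors to shift the payload from the buffer start.
    shift = rng.randrange(buf_sectors - payload_sectors + 1)

    head = os.urandom(shift * SECTOR_SIZE)
    tail = os.urandom((buf_sectors - payload_sectors - shift) * SECTOR_SIZE)
    return head + payload + tail, shift
```

Calling this repeatedly with the same payload yields buffers whose common
content sits at different sector alignments, which is the pattern a
shift-aware deduplication engine would be expected to catch.
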
- Generation of similar (but not identical) data.
  Detecting similar data is a rising trend in enterprise storage
  systems. Generating "similar" data means producing random
  incompressible buffers that differ from each other by a few
  (a configurable number of) bits. The storage subsystem usually
  identifies similar buffers using locality-sensitive hashing or
  other methods.
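
The similar-data item can be sketched in the same spirit. Again, this is
not FIO code: similar_buffer and hamming_distance are hypothetical names,
and flipping distinct, randomly chosen bit positions is just one possible
way to make buffers differ by a configurable number of bits.

```python
import os
import random


def similar_buffer(base, nbits, rng):
    """Return a copy of `base` with exactly `nbits` distinct bits flipped.

    Hypothetical helper for illustration only.
    """
    total_bits = len(base) * 8
    if not 0 <= nbits <= total_bits:
        raise ValueError("nbits must be between 0 and the buffer size in bits")

    out = bytearray(base)
    # Sample without replacement so no bit is flipped twice.
    for pos in rng.sample(range(total_bits), nbits):
        out[pos // 8] ^= 1 << (pos % 8)
    return bytes(out)


def hamming_distance(a, b):
    """Count the bits that differ between two equal-length buffers."""
    return sum(bin(x ^ y).count("1") for x, y in zip(a, b))


base = os.urandom(4096)  # random, effectively incompressible base buffer
variant = similar_buffer(base, 5, random.Random(0))
```

Here variant differs from base in exactly 5 bits, so the two buffers do
not match under exact-match deduplication but remain close enough for a
similarity-based scheme to relate them.
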