git.kernel.dk Git - linux-2.6-block.git/commit

author	Qu Wenruo <wqu@suse.com>
	Sun, 13 Feb 2022 07:42:33 +0000 (15:42 +0800)
committer	David Sterba <dsterba@suse.com>
	Wed, 23 Feb 2022 16:55:08 +0000 (17:55 +0100)
commit	2ac3e062af024e5f5ad21afecf677becbaed9ed8
tree	6616c35b1dcfaefbdbfd8c61c473ee023fd9d9e7	tree
parent	26fbac2517fcad34fa3f950151fd4c0240fb2935	commit \| diff

btrfs: reduce extent threshold for autodefrag

There is a big gap between inode_should_defrag() and autodefrag extent
size threshold.  For inode_should_defrag() it has a flexible
@small_write value. For compressed extent is 16K, and for non-compressed
extent it's 64K.

However for autodefrag extent size threshold, it's always fixed to the
default value (256K).

This means, the following write sequence will trigger autodefrag to
defrag ranges which didn't trigger autodefrag:

  pwrite 0 8k
  sync
  pwrite 8k 128K
  sync

The latter 128K write will also be considered as a defrag target (if
other conditions are met). While only that 8K write is really
triggering autodefrag.

Such behavior can cause extra IO for autodefrag.

Close the gap, by copying the @small_write value into inode_defrag, so
that later autodefrag can use the same @small_write value which
triggered autodefrag.

With the existing transid value, this allows autodefrag really to scan
the ranges which triggered autodefrag.

Although this behavior change is mostly reducing the extent_thresh value
for autodefrag, I believe in the future we should allow users to specify
the autodefrag extent threshold through mount options, but that's an
other problem to consider in the future.

CC: stable@vger.kernel.org # 5.16
Signed-off-by: Qu Wenruo <wqu@suse.com>
Signed-off-by: David Sterba <dsterba@suse.com>

fs/btrfs/ctree.h		diff \| blob \| blame \| history
fs/btrfs/file.c		diff \| blob \| blame \| history
fs/btrfs/inode.c		diff \| blob \| blame \| history