mm: page_alloc: fix move_freepages_block() range error
author Johannes Weiner <hannes@cmpxchg.org>
Wed, 20 Mar 2024 18:02:10 +0000 (14:02 -0400)
committer Andrew Morton <akpm@linux-foundation.org>
Fri, 26 Apr 2024 03:56:03 +0000 (20:56 -0700)

When a block is partially outside the zone of the cursor page, the
function clamps the start of the range to the cursor page instead of the
zone start.  This can leave large parts of the block behind, which
encourages incompatible page mixing down the line (ask for one type, get
another), and thus long-term fragmentation.

This triggers reliably on the first block in the DMA zone, whose start_pfn
is 1.  The block is stolen, but everything before the cursor page (often
hundreds of pages) is left on the old list.
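
To make the arithmetic concrete, here is a minimal userspace sketch of the old clamping logic. It is not kernel code: struct toy_zone, the toy_* helpers, old_range_start() and pages_left_behind() are hypothetical names invented for illustration, and the 512-page pageblock size and the zone end used in the usage note are assumptions. Only the start_pfn of 1 comes from the description above.

```c
#include <assert.h>
#include <stdbool.h>

/* Assumption: 512 pages per pageblock (order 9); the real value is config dependent. */
#define PAGEBLOCK_NR_PAGES 512UL

/* Hypothetical stand-in for struct zone: the half-open pfn range [start_pfn, end_pfn). */
struct toy_zone {
	unsigned long start_pfn;
	unsigned long end_pfn;
};

static bool toy_zone_spans_pfn(const struct toy_zone *z, unsigned long pfn)
{
	return z->start_pfn <= pfn && pfn < z->end_pfn;
}

/* First pfn of the pageblock that contains pfn. */
static unsigned long toy_block_start(unsigned long pfn)
{
	return pfn & ~(PAGEBLOCK_NR_PAGES - 1);
}

/* Old behaviour: if the block start is outside the zone, start at the cursor page. */
static unsigned long old_range_start(const struct toy_zone *z, unsigned long pfn)
{
	unsigned long start = toy_block_start(pfn);

	assert(toy_zone_spans_pfn(z, pfn)); /* the cursor page itself is in the zone */
	if (!toy_zone_spans_pfn(z, start))
		start = pfn;
	return start;
}

/* Number of in-zone pfns of the block that the old range skipped. */
static unsigned long pages_left_behind(const struct toy_zone *z, unsigned long pfn)
{
	unsigned long block_start = toy_block_start(pfn);
	unsigned long first_in_zone =
		block_start < z->start_pfn ? z->start_pfn : block_start;

	return old_range_start(z, pfn) - first_in_zone;
}
```

With a toy zone of { .start_pfn = 1, .end_pfn = 4096 } and a cursor at pfn 300, old_range_start() returns 300 and pages_left_behind() returns 299: pfns 1 through 299 belong to the zone and to the block, yet fall outside the moved range.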

Link: https://lkml.kernel.org/r/20240320180429.678181-6-hannes@cmpxchg.org
Signed-off-by: Johannes Weiner <hannes@cmpxchg.org>
Reviewed-by: Vlastimil Babka <vbabka@suse.cz>
Tested-by: Baolin Wang <baolin.wang@linux.alibaba.com>
Cc: David Hildenbrand <david@redhat.com>
Cc: "Huang, Ying" <ying.huang@intel.com>
Cc: Mel Gorman <mgorman@techsingularity.net>
Cc: Zi Yan <ziy@nvidia.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
diff --git a/mm/page_alloc.c b/mm/page_alloc.c
index dcacb86efd29854eef70acbf755aaf2fa460ac94..0e223d5b94fa5bbc78db93c5470fa2aa563dd885 100644
--- a/mm/page_alloc.c
+++ b/mm/page_alloc.c
@@ -1650,9 +1650,15 @@ int move_freepages_block(struct zone *zone, struct page *page,
        start_pfn = pageblock_start_pfn(pfn);
        end_pfn = pageblock_end_pfn(pfn) - 1;
 
-       /* Do not cross zone boundaries */
+       /*
+        * The caller only has the lock for @zone, don't touch ranges
+        * that straddle into other zones. While we could move part of
+        * the range that's inside the zone, this call is usually
+        * accompanied by other operations such as migratetype updates
+        * which also should be locked.
+        */
        if (!zone_spans_pfn(zone, start_pfn))
-               start_pfn = pfn;
+               return 0;
        if (!zone_spans_pfn(zone, end_pfn))
                return 0;
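
The two checks in the hunk above amount to a single predicate: the block is touched only if both its first and its last pfn lie inside the zone. Below is a minimal userspace sketch of that predicate, not the kernel implementation. struct toy_zone, the toy_* helpers and block_inside_zone() are hypothetical names, and the 512-page pageblock size is an assumption.

```c
#include <stdbool.h>

/* Assumption: 512 pages per pageblock (order 9); the real value is config dependent. */
#define PAGEBLOCK_NR_PAGES 512UL

/* Hypothetical stand-in for struct zone: the half-open pfn range [start_pfn, end_pfn). */
struct toy_zone {
	unsigned long start_pfn;
	unsigned long end_pfn;
};

static bool toy_zone_spans_pfn(const struct toy_zone *z, unsigned long pfn)
{
	return z->start_pfn <= pfn && pfn < z->end_pfn;
}

/* First pfn of the pageblock that contains pfn. */
static unsigned long toy_block_first(unsigned long pfn)
{
	return pfn & ~(PAGEBLOCK_NR_PAGES - 1);
}

/* Last pfn of the pageblock that contains pfn (inclusive). */
static unsigned long toy_block_last(unsigned long pfn)
{
	return toy_block_first(pfn) + PAGEBLOCK_NR_PAGES - 1;
}

/*
 * Mirrors the shape of the patched checks: a block that straddles
 * either zone boundary is rejected as a whole instead of being
 * partially processed.
 */
static bool block_inside_zone(const struct toy_zone *z, unsigned long pfn)
{
	if (!toy_zone_spans_pfn(z, toy_block_first(pfn)))
		return false;
	if (!toy_zone_spans_pfn(z, toy_block_last(pfn)))
		return false;
	return true;
}
```

For a toy zone spanning pfns 1 to 4095, block_inside_zone() rejects a cursor at pfn 300, because the block starts at pfn 0, and accepts a cursor at pfn 600, whose block covers pfns 512 to 1023.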