[v2,1/2] mm: migrate: Fix return value if all subpages of THPs are migrated successfully
Message ID | fca6bb0bd48a0292a0ace2fadd0f44579a060cbb.1666335603.git.baolin.wang@linux.alibaba.com |
---|---|
State | New |
Headers |
Return-Path: <linux-kernel-owner@vger.kernel.org> Delivered-To: ouuuleilei@gmail.com Received: by 2002:a5d:4242:0:0:0:0:0 with SMTP id s2csp610801wrr; Fri, 21 Oct 2022 03:18:16 -0700 (PDT) X-Google-Smtp-Source: AMsMyM5hEgqqv/1itfOq8ZWs79JCnkn3wGpm9vuy2AIOpWbenra2UWWnTiqU6mQQhXrFjxhEfwan X-Received: by 2002:a17:907:6d9b:b0:78d:f24b:e358 with SMTP id sb27-20020a1709076d9b00b0078df24be358mr14703228ejc.714.1666347496627; Fri, 21 Oct 2022 03:18:16 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1666347496; cv=none; d=google.com; s=arc-20160816; b=mbbxLSVp/9H2AmQzKSIjVO+PuxsR6TPJFYkZRPsDnOVBZ+eAtelbJ1Ukb3x6nxUjtE 3SzASFsu81m3R8CcCZrq2FQrKwHHD30iYW3Eokf3YUYfODmoscCKxIZLm9lCLzMErrsu iZ1Ww32+2WwjsW3NfgI/xWyy5mDi/9Xn3cdMSBbuZ0icfvBoYGtKI8/VtnasbR+hH4kD GPNw6jTtvgp1CWuk9wPRktt3AIIvTQBwOa0UqByjE1Lh9ltgOzcO6LV11af+GSdlyRH1 kFc4qnT/fnnfh1VFAcubz1zQYf0KTL8uptXTW1m/EcaORMZ7xcgps+EsWbHIGQVF/Xio M12g== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:message-id:date:subject:cc:to:from; bh=VgX9H1qAS+UbCF4uoAVvy1TMCj1OGBBNsSmsDI0cBG4=; b=Xsv8xIbmPTvlMddQiLPdFRnhUisatDdYpAs3PExVLmo1z8a2bttkEJloIeFYapDH9F uaF+PXdMagJYnikOoKLevltH5qHXhQelAt6rZl6SLwQAStyMooh3P5jx0CFZMeODWXr2 h4QvpClBCrH3VVDZUal72vZEXeM3sfiWisNhPqCJq3bmq/lcumtP0bEpE8QMmQ+iNSVg Fjhlw/oCP4ZIbVEBnFecRIQ6U+NLS9n3wLNQL5u243PfnkpuLpSO2PjxVk6tZ+rAxZ+Y 82Ry4n+uw9AzOtccfor1PjO3z/yloemUT9yGcwrLuwGuYp13Xo4Xml5tm9Wm18b+fsjy cVOw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=alibaba.com Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id du6-20020a17090772c600b0078c0c866a18si20818898ejc.19.2022.10.21.03.17.51; Fri, 21 Oct 2022 03:18:16 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=alibaba.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S230446AbiJUKQl (ORCPT <rfc822;pwkd43@gmail.com> + 99 others); Fri, 21 Oct 2022 06:16:41 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:43096 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S230430AbiJUKQi (ORCPT <rfc822;linux-kernel@vger.kernel.org>); Fri, 21 Oct 2022 06:16:38 -0400 Received: from out30-42.freemail.mail.aliyun.com (out30-42.freemail.mail.aliyun.com [115.124.30.42]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 30B401AD680 for <linux-kernel@vger.kernel.org>; Fri, 21 Oct 2022 03:16:36 -0700 (PDT) X-Alimail-AntiSpam: AC=PASS;BC=-1|-1;BR=01201311R181e4;CH=green;DM=||false|;DS=||;FP=0|-1|-1|-1|0|-1|-1|-1;HT=ay29a033018045192;MF=baolin.wang@linux.alibaba.com;NM=1;PH=DS;RN=10;SR=0;TI=SMTPD_---0VSjF-ZZ_1666347392; Received: from localhost(mailfrom:baolin.wang@linux.alibaba.com fp:SMTPD_---0VSjF-ZZ_1666347392) by smtp.aliyun-inc.com; Fri, 21 Oct 2022 18:16:33 +0800 From: Baolin Wang <baolin.wang@linux.alibaba.com> To: akpm@linux-foundation.org Cc: david@redhat.com, ying.huang@intel.com, ziy@nvidia.com, shy828301@gmail.com, apopple@nvidia.com, baolin.wang@linux.alibaba.com, jingshan@linux.alibaba.com, linux-mm@kvack.org, linux-kernel@vger.kernel.org Subject: [PATCH v2 1/2] mm: migrate: Fix return value if all subpages of THPs are migrated successfully Date: Fri, 21 Oct 2022 18:16:23 +0800 Message-Id: <fca6bb0bd48a0292a0ace2fadd0f44579a060cbb.1666335603.git.baolin.wang@linux.alibaba.com> X-Mailer: git-send-email 1.8.3.1 X-Spam-Status: No, score=-9.9 required=5.0 tests=BAYES_00, ENV_AND_HDR_SPF_MATCH,RCVD_IN_DNSWL_NONE,RCVD_IN_MSPIKE_H2, SPF_HELO_NONE,SPF_PASS,UNPARSEABLE_RELAY,USER_IN_DEF_SPF_WL autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: <linux-kernel.vger.kernel.org> X-Mailing-List: linux-kernel@vger.kernel.org X-getmail-retrieved-from-mailbox: =?utf-8?q?INBOX?= X-GMAIL-THRID: =?utf-8?q?1747291992492526484?= X-GMAIL-MSGID: =?utf-8?q?1747291992492526484?= |
Series |
[v2,1/2] mm: migrate: Fix return value if all subpages of THPs are migrated successfully
|
|
Commit Message
Baolin Wang
Oct. 21, 2022, 10:16 a.m. UTC
When THP migration, if THPs are split and all subpages are migrated successfully
, the migrate_pages() will still return the number of THP that were not migrated.
That will confuse the callers of migrate_pages(), for example, which will make
the longterm pinning failed though all pages are migrated successfully.
Thus we should return 0 to indicate all pages are migrated in this case.
Signed-off-by: Baolin Wang <baolin.wang@linux.alibaba.com>
---
Changes from v1:
- Fix the return value of migrate_pages() instead of fixing the
callers' validation.
---
mm/migrate.c | 7 +++++++
1 file changed, 7 insertions(+)
Comments
On Fri, 21 Oct 2022 18:16:23 +0800 Baolin Wang <baolin.wang@linux.alibaba.com> wrote: > When THP migration, if THPs are split and all subpages are migrated successfully > , the migrate_pages() will still return the number of THP that were not migrated. > That will confuse the callers of migrate_pages(), for example, which will make > the longterm pinning failed though all pages are migrated successfully. > > Thus we should return 0 to indicate all pages are migrated in this case. > This had me puzzled for a while. I think this wording is clearer? : During THP migration, if THPs are not migrated but they are split and all : subpages are migrated successfully, migrate_pages() will still return the : number of THP pages that were not migrated. This will confuse the callers : of migrate_pages(). For example, the longterm pinning will failed though : all pages are migrated successfully. : : Thus we should return 0 to indicate that all pages are migrated in this : case. This is a fairly longstanding problem? No Fixes: we can identify? Did you consider the desirability of a -stable backport?
On Fri, Oct 21, 2022 at 11:41 AM Andrew Morton <akpm@linux-foundation.org> wrote: > > On Fri, 21 Oct 2022 18:16:23 +0800 Baolin Wang <baolin.wang@linux.alibaba.com> wrote: > > > When THP migration, if THPs are split and all subpages are migrated successfully > > , the migrate_pages() will still return the number of THP that were not migrated. > > That will confuse the callers of migrate_pages(), for example, which will make > > the longterm pinning failed though all pages are migrated successfully. > > > > Thus we should return 0 to indicate all pages are migrated in this case. > > > > This had me puzzled for a while. I think this wording is clearer? > > : During THP migration, if THPs are not migrated but they are split and all > : subpages are migrated successfully, migrate_pages() will still return the > : number of THP pages that were not migrated. This will confuse the callers > : of migrate_pages(). For example, the longterm pinning will failed though > : all pages are migrated successfully. > : > : Thus we should return 0 to indicate that all pages are migrated in this > : case. > > This is a fairly longstanding problem? No Fixes: we can identify? It doesn't seem like a long standing issue. It seems like commit b5bade978e9b ("mm: migrate: fix the return value of migrate_pages()") fixed one problem, but introduced this new one IIUC. Before this commit, the code did: nr_failed += retry + thp_retry; rc = nr_failed; But retry and thp_retry were actually reset for each retry until the last one. So as long as there is no permanent migration failure and THP split failure, nr_failed should be 0 IIUC. TBH the code is a little bit hard to follow, please correct me if I'm wrong. > > Did you consider the desirability of a -stable backport? >
Yang Shi <shy828301@gmail.com> writes: > On Fri, Oct 21, 2022 at 11:41 AM Andrew Morton > <akpm@linux-foundation.org> wrote: >> >> On Fri, 21 Oct 2022 18:16:23 +0800 Baolin Wang <baolin.wang@linux.alibaba.com> wrote: >> >> > When THP migration, if THPs are split and all subpages are migrated successfully >> > , the migrate_pages() will still return the number of THP that were not migrated. >> > That will confuse the callers of migrate_pages(), for example, which will make >> > the longterm pinning failed though all pages are migrated successfully. >> > >> > Thus we should return 0 to indicate all pages are migrated in this case. >> > >> >> This had me puzzled for a while. I think this wording is clearer? >> >> : During THP migration, if THPs are not migrated but they are split and all >> : subpages are migrated successfully, migrate_pages() will still return the >> : number of THP pages that were not migrated. This will confuse the callers >> : of migrate_pages(). For example, the longterm pinning will failed though >> : all pages are migrated successfully. >> : >> : Thus we should return 0 to indicate that all pages are migrated in this >> : case. >> >> This is a fairly longstanding problem? No Fixes: we can identify? > > It doesn't seem like a long standing issue. It seems like commit > b5bade978e9b ("mm: migrate: fix the return value of migrate_pages()") > fixed one problem, but introduced this new one IIUC. > > Before this commit, the code did: > > nr_failed += retry + thp_retry; > rc = nr_failed; > > But retry and thp_retry were actually reset for each retry until the > last one. So as long as there is no permanent migration failure and > THP split failure, nr_failed should be 0 IIUC. TBH the code is a > little bit hard to follow, please correct me if I'm wrong. I think that you are correct. We can added Fixes: b5bade978e9b ("mm: migrate: fix the return value of migrate_pages()") >> Did you consider the desirability of a -stable backport? I think this can be backport to -stable. Best Regards, Huang, Ying
Baolin Wang <baolin.wang@linux.alibaba.com> writes: > When THP migration, if THPs are split and all subpages are migrated successfully > , the migrate_pages() will still return the number of THP that were not migrated. > That will confuse the callers of migrate_pages(), for example, which will make > the longterm pinning failed though all pages are migrated successfully. > > Thus we should return 0 to indicate all pages are migrated in this case. > > Signed-off-by: Baolin Wang <baolin.wang@linux.alibaba.com> > --- > Changes from v1: > - Fix the return value of migrate_pages() instead of fixing the > callers' validation. > --- > mm/migrate.c | 7 +++++++ > 1 file changed, 7 insertions(+) > > diff --git a/mm/migrate.c b/mm/migrate.c > index 8e5eb6e..1da0dbc 100644 > --- a/mm/migrate.c > +++ b/mm/migrate.c > @@ -1582,6 +1582,13 @@ int migrate_pages(struct list_head *from, new_page_t get_new_page, > */ > list_splice(&ret_pages, from); > > + /* > + * Return 0 in case all subpages of fail-to-migrate THPs are > + * migrated successfully. > + */ > + if (nr_thp_split && list_empty(from)) > + rc = 0; Why do you need to check nr_thp_split? Wouldn't list_empty(from) == True imply success? And if it doesn't imply success wouldn't it be possible to end up with nr_thp_split && list_empty(from) whilst still having pages that failed to migrate? The list management and return code logic from unmap_and_move() has gotten pretty difficult to follow and could do with some rework IMHO. > count_vm_events(PGMIGRATE_SUCCESS, nr_succeeded); > count_vm_events(PGMIGRATE_FAIL, nr_failed_pages); > count_vm_events(THP_MIGRATION_SUCCESS, nr_thp_succeeded);
On 10/24/2022 9:56 AM, Huang, Ying wrote: > Yang Shi <shy828301@gmail.com> writes: > >> On Fri, Oct 21, 2022 at 11:41 AM Andrew Morton >> <akpm@linux-foundation.org> wrote: >>> >>> On Fri, 21 Oct 2022 18:16:23 +0800 Baolin Wang <baolin.wang@linux.alibaba.com> wrote: >>> >>>> When THP migration, if THPs are split and all subpages are migrated successfully >>>> , the migrate_pages() will still return the number of THP that were not migrated. >>>> That will confuse the callers of migrate_pages(), for example, which will make >>>> the longterm pinning failed though all pages are migrated successfully. >>>> >>>> Thus we should return 0 to indicate all pages are migrated in this case. >>>> >>> >>> This had me puzzled for a while. I think this wording is clearer? >>> >>> : During THP migration, if THPs are not migrated but they are split and all >>> : subpages are migrated successfully, migrate_pages() will still return the >>> : number of THP pages that were not migrated. This will confuse the callers >>> : of migrate_pages(). For example, the longterm pinning will failed though >>> : all pages are migrated successfully. >>> : >>> : Thus we should return 0 to indicate that all pages are migrated in this >>> : case. >>> >>> This is a fairly longstanding problem? No Fixes: we can identify? >> >> It doesn't seem like a long standing issue. It seems like commit >> b5bade978e9b ("mm: migrate: fix the return value of migrate_pages()") >> fixed one problem, but introduced this new one IIUC. >> >> Before this commit, the code did: >> >> nr_failed += retry + thp_retry; >> rc = nr_failed; >> >> But retry and thp_retry were actually reset for each retry until the >> last one. So as long as there is no permanent migration failure and >> THP split failure, nr_failed should be 0 IIUC. TBH the code is a >> little bit hard to follow, please correct me if I'm wrong. > > I think that you are correct. We can added > > Fixes: b5bade978e9b ("mm: migrate: fix the return value of migrate_pages()") I think so too. Thanks Yang and Ying for pointing it out. > >>> Did you consider the desirability of a -stable backport? > > I think this can be backport to -stable. Agree. Andrew, could you help to add the Fixes tag and cc -stable? Thanks.
On 10/24/2022 10:36 AM, Alistair Popple wrote: > > Baolin Wang <baolin.wang@linux.alibaba.com> writes: > >> When THP migration, if THPs are split and all subpages are migrated successfully >> , the migrate_pages() will still return the number of THP that were not migrated. >> That will confuse the callers of migrate_pages(), for example, which will make >> the longterm pinning failed though all pages are migrated successfully. >> >> Thus we should return 0 to indicate all pages are migrated in this case. >> >> Signed-off-by: Baolin Wang <baolin.wang@linux.alibaba.com> >> --- >> Changes from v1: >> - Fix the return value of migrate_pages() instead of fixing the >> callers' validation. >> --- >> mm/migrate.c | 7 +++++++ >> 1 file changed, 7 insertions(+) >> >> diff --git a/mm/migrate.c b/mm/migrate.c >> index 8e5eb6e..1da0dbc 100644 >> --- a/mm/migrate.c >> +++ b/mm/migrate.c >> @@ -1582,6 +1582,13 @@ int migrate_pages(struct list_head *from, new_page_t get_new_page, >> */ >> list_splice(&ret_pages, from); >> >> + /* >> + * Return 0 in case all subpages of fail-to-migrate THPs are >> + * migrated successfully. >> + */ >> + if (nr_thp_split && list_empty(from)) >> + rc = 0; > > Why do you need to check nr_thp_split? Wouldn't list_empty(from) == True Only in the case of THP split, we can meet this abnormal case. So if no THP split, just return the original 'rc' instead of validating the list, since the 'nr_thp_split' validation is cheaper than the list_empty() validation IMHO. > imply success? And if it doesn't imply success wouldn't it be possible > to end up with nr_thp_split && list_empty(from) whilst still having > pages that failed to migrate? > > The list management and return code logic from unmap_and_move() has > gotten pretty difficult to follow and could do with some rework IMHO. Yes, Huang Ying has sent a RFC patchset[1] doing some code refactor, which seems a good start. [1] https://lore.kernel.org/all/20220921060616.73086-1-ying.huang@intel.com/
Baolin Wang <baolin.wang@linux.alibaba.com> writes: > On 10/24/2022 10:36 AM, Alistair Popple wrote: >> Baolin Wang <baolin.wang@linux.alibaba.com> writes: >> >>> When THP migration, if THPs are split and all subpages are migrated successfully >>> , the migrate_pages() will still return the number of THP that were not migrated. >>> That will confuse the callers of migrate_pages(), for example, which will make >>> the longterm pinning failed though all pages are migrated successfully. >>> >>> Thus we should return 0 to indicate all pages are migrated in this case. >>> >>> Signed-off-by: Baolin Wang <baolin.wang@linux.alibaba.com> >>> --- >>> Changes from v1: >>> - Fix the return value of migrate_pages() instead of fixing the >>> callers' validation. >>> --- >>> mm/migrate.c | 7 +++++++ >>> 1 file changed, 7 insertions(+) >>> >>> diff --git a/mm/migrate.c b/mm/migrate.c >>> index 8e5eb6e..1da0dbc 100644 >>> --- a/mm/migrate.c >>> +++ b/mm/migrate.c >>> @@ -1582,6 +1582,13 @@ int migrate_pages(struct list_head *from, new_page_t get_new_page, >>> */ >>> list_splice(&ret_pages, from); >>> >>> + /* >>> + * Return 0 in case all subpages of fail-to-migrate THPs are >>> + * migrated successfully. >>> + */ >>> + if (nr_thp_split && list_empty(from)) >>> + rc = 0; >> Why do you need to check nr_thp_split? Wouldn't list_empty(from) == True > > Only in the case of THP split, we can meet this abnormal case. So if no THP > split, just return the original 'rc' instead of validating the list, since the > 'nr_thp_split' validation is cheaper than the list_empty() validation IMHO. Is it really that much cheaper? We're already retrying migrations multiple times, etc. so surely the difference here would be marginal at best, and IMHO the code would be much clearer if we always set rc = 0 when list_empty(from) = True. >> imply success? And if it doesn't imply success wouldn't it be possible >> to end up with nr_thp_split && list_empty(from) whilst still having >> pages that failed to migrate? >> The list management and return code logic from unmap_and_move() has >> gotten pretty difficult to follow and could do with some rework IMHO. > > Yes, Huang Ying has sent a RFC patchset[1] doing some code refactor, which seems > a good start. Thanks for pointing that out, I looked at it a while back but missed the clean ups. I was kind of waiting for the non-RFC version before taking another closer look. > [1] https://lore.kernel.org/all/20220921060616.73086-1-ying.huang@intel.com/
On 10/24/2022 3:24 PM, Alistair Popple wrote: > > Baolin Wang <baolin.wang@linux.alibaba.com> writes: > >> On 10/24/2022 10:36 AM, Alistair Popple wrote: >>> Baolin Wang <baolin.wang@linux.alibaba.com> writes: >>> >>>> When THP migration, if THPs are split and all subpages are migrated successfully >>>> , the migrate_pages() will still return the number of THP that were not migrated. >>>> That will confuse the callers of migrate_pages(), for example, which will make >>>> the longterm pinning failed though all pages are migrated successfully. >>>> >>>> Thus we should return 0 to indicate all pages are migrated in this case. >>>> >>>> Signed-off-by: Baolin Wang <baolin.wang@linux.alibaba.com> >>>> --- >>>> Changes from v1: >>>> - Fix the return value of migrate_pages() instead of fixing the >>>> callers' validation. >>>> --- >>>> mm/migrate.c | 7 +++++++ >>>> 1 file changed, 7 insertions(+) >>>> >>>> diff --git a/mm/migrate.c b/mm/migrate.c >>>> index 8e5eb6e..1da0dbc 100644 >>>> --- a/mm/migrate.c >>>> +++ b/mm/migrate.c >>>> @@ -1582,6 +1582,13 @@ int migrate_pages(struct list_head *from, new_page_t get_new_page, >>>> */ >>>> list_splice(&ret_pages, from); >>>> >>>> + /* >>>> + * Return 0 in case all subpages of fail-to-migrate THPs are >>>> + * migrated successfully. >>>> + */ >>>> + if (nr_thp_split && list_empty(from)) >>>> + rc = 0; >>> Why do you need to check nr_thp_split? Wouldn't list_empty(from) == True >> >> Only in the case of THP split, we can meet this abnormal case. So if no THP >> split, just return the original 'rc' instead of validating the list, since the >> 'nr_thp_split' validation is cheaper than the list_empty() validation IMHO. > > Is it really that much cheaper? We're already retrying migrations > multiple times, etc. so surely the difference here would be marginal at > best, and IMHO the code would be much clearer if we always set rc = 0 > when list_empty(from) = True. Yeah, the difference is marginal and I have no strong preference. OK, will drop the 'nr_thp_split' in next version. Thanks. >>> imply success? And if it doesn't imply success wouldn't it be possible >>> to end up with nr_thp_split && list_empty(from) whilst still having >>> pages that failed to migrate? >>> The list management and return code logic from unmap_and_move() has >>> gotten pretty difficult to follow and could do with some rework IMHO. >> >> Yes, Huang Ying has sent a RFC patchset[1] doing some code refactor, which seems >> a good start. > > Thanks for pointing that out, I looked at it a while back but missed the > clean ups. I was kind of waiting for the non-RFC version before taking > another closer look. > >> [1] https://lore.kernel.org/all/20220921060616.73086-1-ying.huang@intel.com/
Baolin Wang <baolin.wang@linux.alibaba.com> writes: > On 10/24/2022 3:24 PM, Alistair Popple wrote: >> Baolin Wang <baolin.wang@linux.alibaba.com> writes: >> >>> On 10/24/2022 10:36 AM, Alistair Popple wrote: >>>> Baolin Wang <baolin.wang@linux.alibaba.com> writes: >>>> >>>>> When THP migration, if THPs are split and all subpages are migrated successfully >>>>> , the migrate_pages() will still return the number of THP that were not migrated. >>>>> That will confuse the callers of migrate_pages(), for example, which will make >>>>> the longterm pinning failed though all pages are migrated successfully. >>>>> >>>>> Thus we should return 0 to indicate all pages are migrated in this case. >>>>> >>>>> Signed-off-by: Baolin Wang <baolin.wang@linux.alibaba.com> >>>>> --- >>>>> Changes from v1: >>>>> - Fix the return value of migrate_pages() instead of fixing the >>>>> callers' validation. >>>>> --- >>>>> mm/migrate.c | 7 +++++++ >>>>> 1 file changed, 7 insertions(+) >>>>> >>>>> diff --git a/mm/migrate.c b/mm/migrate.c >>>>> index 8e5eb6e..1da0dbc 100644 >>>>> --- a/mm/migrate.c >>>>> +++ b/mm/migrate.c >>>>> @@ -1582,6 +1582,13 @@ int migrate_pages(struct list_head *from, new_page_t get_new_page, >>>>> */ >>>>> list_splice(&ret_pages, from); >>>>> >>>>> + /* >>>>> + * Return 0 in case all subpages of fail-to-migrate THPs are >>>>> + * migrated successfully. >>>>> + */ >>>>> + if (nr_thp_split && list_empty(from)) >>>>> + rc = 0; >>>> Why do you need to check nr_thp_split? Wouldn't list_empty(from) == True >>> >>> Only in the case of THP split, we can meet this abnormal case. So if no THP >>> split, just return the original 'rc' instead of validating the list, since the >>> 'nr_thp_split' validation is cheaper than the list_empty() validation IMHO. >> Is it really that much cheaper? We're already retrying migrations >> multiple times, etc. so surely the difference here would be marginal at >> best, and IMHO the code would be much clearer if we always set rc = 0 >> when list_empty(from) = True. > > Yeah, the difference is marginal and I have no strong preference. OK, will drop > the 'nr_thp_split' in next version. Thanks. Thanks. With that change feel free to add: Reviewed-by: Alistair Popple <apopple@nvidia.com> >>>> imply success? And if it doesn't imply success wouldn't it be possible >>>> to end up with nr_thp_split && list_empty(from) whilst still having >>>> pages that failed to migrate? >>>> The list management and return code logic from unmap_and_move() has >>>> gotten pretty difficult to follow and could do with some rework IMHO. >>> >>> Yes, Huang Ying has sent a RFC patchset[1] doing some code refactor, which seems >>> a good start. >> Thanks for pointing that out, I looked at it a while back but missed the >> clean ups. I was kind of waiting for the non-RFC version before taking >> another closer look. >> >>> [1] https://lore.kernel.org/all/20220921060616.73086-1-ying.huang@intel.com/
diff --git a/mm/migrate.c b/mm/migrate.c index 8e5eb6e..1da0dbc 100644 --- a/mm/migrate.c +++ b/mm/migrate.c @@ -1582,6 +1582,13 @@ int migrate_pages(struct list_head *from, new_page_t get_new_page, */ list_splice(&ret_pages, from); + /* + * Return 0 in case all subpages of fail-to-migrate THPs are + * migrated successfully. + */ + if (nr_thp_split && list_empty(from)) + rc = 0; + count_vm_events(PGMIGRATE_SUCCESS, nr_succeeded); count_vm_events(PGMIGRATE_FAIL, nr_failed_pages); count_vm_events(THP_MIGRATION_SUCCESS, nr_thp_succeeded);