diff mbox series

mm: vmscan: the dirty folio unmap redundantly

Message ID	20231018013004.1569-1-justinjiang@vivo.com
State	New
Headers	Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.32 as permitted sender) client-ip=23.128.96.32; From: Zhiguo Jiang <justinjiang@vivo.com> To: Andrew Morton <akpm@linux-foundation.org>, linux-mm@kvack.org, linux-kernel@vger.kernel.org Cc: opensource.kernel@vivo.com, Zhiguo Jiang <justinjiang@vivo.com> Subject: [PATCH] mm: vmscan: the dirty folio unmap redundantly Date: Wed, 18 Oct 2023 09:30:03 +0800 Message-ID: <20231018013004.1569-1-justinjiang@vivo.com> Content-Transfer-Encoding: 8bit Content-Type: text/plain MIME-Version: 1.0 Precedence: bulk
Series	mm: vmscan: the dirty folio unmap redundantly \| mm: vmscan: the dirty folio unmap redundantly

Commit Message

Zhiguo Jiang Oct. 18, 2023, 1:30 a.m. UTC

  If the dirty folio is not reclaimed in the shrink process, it do
not need to unmap, which can save shrinking time during traversaling
the dirty folio.

Signed-off-by: Zhiguo Jiang <justinjiang@vivo.com>
---
 mm/vmscan.c | 72 +++++++++++++++++++++++++++--------------------------
 1 file changed, 37 insertions(+), 35 deletions(-)
 mode change 100644 => 100755 mm/vmscan.c

Comments

Matthew Wilcox Oct. 18, 2023, 2:12 p.m. UTC | #1

On Wed, Oct 18, 2023 at 09:30:03AM +0800, Zhiguo Jiang wrote:
> If the dirty folio is not reclaimed in the shrink process, it do
> not need to unmap, which can save shrinking time during traversaling
> the dirty folio.

Don't we have to unmap it first in order to make sure that all the
dirty bits from the PTEs have been transferred to the folio dirty bit?

David Hildenbrand Oct. 19, 2023, 1:03 p.m. UTC | #2

On 18.10.23 03:30, Zhiguo Jiang wrote:
> If the dirty folio is not reclaimed in the shrink process, it do
> not need to unmap, which can save shrinking time during traversaling
> the dirty folio.

Hi,

I really cannot understand what you mean with "the dirty folio unmap 
redundantly". No clue what this patch is supposed to tackle by staring 
at the patch subject.


This patch is supposed to improve performance. Can you provide some 
proof that it does and that we should even care about this change?

> 
> Signed-off-by: Zhiguo Jiang <justinjiang@vivo.com>
> ---
>   mm/vmscan.c | 72 +++++++++++++++++++++++++++--------------------------
>   1 file changed, 37 insertions(+), 35 deletions(-)
>   mode change 100644 => 100755 mm/vmscan.c
> 
> diff --git a/mm/vmscan.c b/mm/vmscan.c
> index 2cc0cb41fb32..cf555cdfcefc
> --- a/mm/vmscan.c
> +++ b/mm/vmscan.c
> @@ -1261,6 +1261,43 @@ static unsigned int shrink_folio_list(struct list_head *folio_list,
>   			enum ttu_flags flags = TTU_BATCH_FLUSH;
>   			bool was_swapbacked = folio_test_swapbacked(folio);
>   
> +			if (folio_test_dirty(folio)) {
> +				/*
> +				 * Only kswapd can writeback filesystem folios
> +				 * to avoid risk of stack overflow. But avoid
> +				 * injecting inefficient single-folio I/O into
> +				 * flusher writeback as much as possible: only
> +				 * write folios when we've encountered many
> +				 * dirty folios, and when we've already scanned
> +				 * the rest of the LRU for clean folios and see
> +				 * the same dirty folios again (with the reclaim
> +				 * flag set).
> +				 */
> +				if (folio_is_file_lru(folio) &&
> +					(!current_is_kswapd() ||
> +					 !folio_test_reclaim(folio) ||
> +					 !test_bit(PGDAT_DIRTY, &pgdat->flags))) {
> +					/*
> +					 * Immediately reclaim when written back.
> +					 * Similar in principle to folio_deactivate()
> +					 * except we already have the folio isolated
> +					 * and know it's dirty
> +					 */
> +					node_stat_mod_folio(folio, NR_VMSCAN_IMMEDIATE,
> +							nr_pages);
> +					folio_set_reclaim(folio);
> +
> +					goto activate_locked;
> +				}
> +
> +				if (references == FOLIOREF_RECLAIM_CLEAN)
> +					goto keep_locked;
> +				if (!may_enter_fs(folio, sc->gfp_mask))
> +					goto keep_locked;
> +				if (!sc->may_writepage)
> +					goto keep_locked;
> +			}
> +
>   			if (folio_test_pmd_mappable(folio))
>   				flags |= TTU_SPLIT_HUGE_PMD;
>   
> @@ -1286,41 +1323,6 @@ static unsigned int shrink_folio_list(struct list_head *folio_list,
>   
>   		mapping = folio_mapping(folio);
>   		if (folio_test_dirty(folio)) {

Can you elaborate why we want to remove below code? It would have made 
sense to me to duplicate the code in an early check before unmap, if the 
folio is already dirty before checking all PTEs. But why can we remove 
that post-unmap code?

Zhiguo Jiang Oct. 19, 2023, 1:23 p.m. UTC | #3

在 2023/10/19 21:03, David Hildenbrand 写道:
> [你通常不会收到来自 david@redhat.com 的电子邮件。请访问 
> https://aka.ms/LearnAboutSenderIdentification，以了解这一点为什么很重要]
>
> On 18.10.23 03:30, Zhiguo Jiang wrote:
>> If the dirty folio is not reclaimed in the shrink process, it do
>> not need to unmap, which can save shrinking time during traversaling
>> the dirty folio.
>
> Hi,
>
> I really cannot understand what you mean with "the dirty folio unmap
> redundantly". No clue what this patch is supposed to tackle by staring
> at the patch subject.
>
>
> This patch is supposed to improve performance. Can you provide some
> proof that it does and that we should even care about this change?

Hi，
What I understand is that in the shrink_folio_list() the sources of the file
dirty folio include two ways below:
1. The dirty folio is from the incoming parameter folio_list,
    which is the inactive file lru.
2. The dirty folio is from the PTE dirty bit transferred by
    the try_to_unmap().

Currently, both sources of dirty pages are determined after unmap
to determine whether they support pageout and recyling.

For the first source of the dirty folio, if the dirty folio does not
support pageout, the dirty folio can skip unmap in advance to reduce
recyling time.

This patch is not well considered.
The v2 new patch will be submitted later, Please help to continue review.

Thanks

Jiang Zhiguo
>
>>
>> Signed-off-by: Zhiguo Jiang <justinjiang@vivo.com>
>> ---
>>   mm/vmscan.c | 72 +++++++++++++++++++++++++++--------------------------
>>   1 file changed, 37 insertions(+), 35 deletions(-)
>>   mode change 100644 => 100755 mm/vmscan.c
>>
>> diff --git a/mm/vmscan.c b/mm/vmscan.c
>> index 2cc0cb41fb32..cf555cdfcefc
>> --- a/mm/vmscan.c
>> +++ b/mm/vmscan.c
>> @@ -1261,6 +1261,43 @@ static unsigned int shrink_folio_list(struct 
>> list_head *folio_list,
>>                       enum ttu_flags flags = TTU_BATCH_FLUSH;
>>                       bool was_swapbacked = 
>> folio_test_swapbacked(folio);
>>
>> +                     if (folio_test_dirty(folio)) {
>> +                             /*
>> +                              * Only kswapd can writeback filesystem 
>> folios
>> +                              * to avoid risk of stack overflow. But 
>> avoid
>> +                              * injecting inefficient single-folio 
>> I/O into
>> +                              * flusher writeback as much as 
>> possible: only
>> +                              * write folios when we've encountered 
>> many
>> +                              * dirty folios, and when we've already 
>> scanned
>> +                              * the rest of the LRU for clean folios 
>> and see
>> +                              * the same dirty folios again (with 
>> the reclaim
>> +                              * flag set).
>> +                              */
>> +                             if (folio_is_file_lru(folio) &&
>> +                                     (!current_is_kswapd() ||
>> + !folio_test_reclaim(folio) ||
>> +                                      !test_bit(PGDAT_DIRTY, 
>> &pgdat->flags))) {
>> +                                     /*
>> +                                      * Immediately reclaim when 
>> written back.
>> +                                      * Similar in principle to 
>> folio_deactivate()
>> +                                      * except we already have the 
>> folio isolated
>> +                                      * and know it's dirty
>> +                                      */
>> +                                     node_stat_mod_folio(folio, 
>> NR_VMSCAN_IMMEDIATE,
>> +                                                     nr_pages);
>> +                                     folio_set_reclaim(folio);
>> +
>> +                                     goto activate_locked;
>> +                             }
>> +
>> +                             if (references == FOLIOREF_RECLAIM_CLEAN)
>> +                                     goto keep_locked;
>> +                             if (!may_enter_fs(folio, sc->gfp_mask))
>> +                                     goto keep_locked;
>> +                             if (!sc->may_writepage)
>> +                                     goto keep_locked;
>> +                     }
>> +
>>                       if (folio_test_pmd_mappable(folio))
>>                               flags |= TTU_SPLIT_HUGE_PMD;
>>
>> @@ -1286,41 +1323,6 @@ static unsigned int shrink_folio_list(struct 
>> list_head *folio_list,
>>
>>               mapping = folio_mapping(folio);
>>               if (folio_test_dirty(folio)) {
>
> Can you elaborate why we want to remove below code? It would have made
> sense to me to duplicate the code in an early check before unmap, if the
> folio is already dirty before checking all PTEs. But why can we remove
> that post-unmap code?
>
> -- 
> Cheers,
>
> David / dhildenb
>

diff mbox series

Patch

diff --git a/mm/vmscan.c b/mm/vmscan.c
index 2cc0cb41fb32..cf555cdfcefc
--- a/mm/vmscan.c
+++ b/mm/vmscan.c
@@ -1261,6 +1261,43 @@  static unsigned int shrink_folio_list(struct list_head *folio_list,
 			enum ttu_flags flags = TTU_BATCH_FLUSH;
 			bool was_swapbacked = folio_test_swapbacked(folio);
 
+			if (folio_test_dirty(folio)) {
+				/*
+				 * Only kswapd can writeback filesystem folios
+				 * to avoid risk of stack overflow. But avoid
+				 * injecting inefficient single-folio I/O into
+				 * flusher writeback as much as possible: only
+				 * write folios when we've encountered many
+				 * dirty folios, and when we've already scanned
+				 * the rest of the LRU for clean folios and see
+				 * the same dirty folios again (with the reclaim
+				 * flag set).
+				 */
+				if (folio_is_file_lru(folio) &&
+					(!current_is_kswapd() ||
+					 !folio_test_reclaim(folio) ||
+					 !test_bit(PGDAT_DIRTY, &pgdat->flags))) {
+					/*
+					 * Immediately reclaim when written back.
+					 * Similar in principle to folio_deactivate()
+					 * except we already have the folio isolated
+					 * and know it's dirty
+					 */
+					node_stat_mod_folio(folio, NR_VMSCAN_IMMEDIATE,
+							nr_pages);
+					folio_set_reclaim(folio);
+
+					goto activate_locked;
+				}
+
+				if (references == FOLIOREF_RECLAIM_CLEAN)
+					goto keep_locked;
+				if (!may_enter_fs(folio, sc->gfp_mask))
+					goto keep_locked;
+				if (!sc->may_writepage)
+					goto keep_locked;
+			}
+
 			if (folio_test_pmd_mappable(folio))
 				flags |= TTU_SPLIT_HUGE_PMD;
 
@@ -1286,41 +1323,6 @@  static unsigned int shrink_folio_list(struct list_head *folio_list,
 
 		mapping = folio_mapping(folio);
 		if (folio_test_dirty(folio)) {
-			/*
-			 * Only kswapd can writeback filesystem folios
-			 * to avoid risk of stack overflow. But avoid
-			 * injecting inefficient single-folio I/O into
-			 * flusher writeback as much as possible: only
-			 * write folios when we've encountered many
-			 * dirty folios, and when we've already scanned
-			 * the rest of the LRU for clean folios and see
-			 * the same dirty folios again (with the reclaim
-			 * flag set).
-			 */
-			if (folio_is_file_lru(folio) &&
-			    (!current_is_kswapd() ||
-			     !folio_test_reclaim(folio) ||
-			     !test_bit(PGDAT_DIRTY, &pgdat->flags))) {
-				/*
-				 * Immediately reclaim when written back.
-				 * Similar in principle to folio_deactivate()
-				 * except we already have the folio isolated
-				 * and know it's dirty
-				 */
-				node_stat_mod_folio(folio, NR_VMSCAN_IMMEDIATE,
-						nr_pages);
-				folio_set_reclaim(folio);
-
-				goto activate_locked;
-			}
-
-			if (references == FOLIOREF_RECLAIM_CLEAN)
-				goto keep_locked;
-			if (!may_enter_fs(folio, sc->gfp_mask))
-				goto keep_locked;
-			if (!sc->may_writepage)
-				goto keep_locked;
-
 			/*
 			 * Folio is dirty. Flush the TLB if a writable entry
 			 * potentially exists to avoid CPU writes after I/O