[5.10,2/3] blk-wbt: call rq_qos_add() after wb_normal is initialized

Message ID 20221018014326.467842-3-yukuai1@huaweicloud.com
State New
Headers
Series wbt stable patches |

Commit Message

Yu Kuai Oct. 18, 2022, 1:43 a.m. UTC
  From: Yu Kuai <yukuai3@huawei.com>

commit 8c5035dfbb9475b67c82b3fdb7351236525bf52b upstream.

Our test found a problem that wbt inflight counter is negative, which
will cause io hang(noted that this problem doesn't exist in mainline):

t1: device create	t2: issue io
add_disk
 blk_register_queue
  wbt_enable_default
   wbt_init
    rq_qos_add
    // wb_normal is still 0
			/*
			 * in mainline, disk can't be opened before
			 * bdev_add(), however, in old kernels, disk
			 * can be opened before blk_register_queue().
			 */
			blkdev_issue_flush
                        // disk size is 0, however, it's not checked
                         submit_bio_wait
                          submit_bio
                           blk_mq_submit_bio
                            rq_qos_throttle
                             wbt_wait
			      bio_to_wbt_flags
                               rwb_enabled
			       // wb_normal is 0, inflight is not increased

    wbt_queue_depth_changed(&rwb->rqos);
     wbt_update_limits
     // wb_normal is initialized
                            rq_qos_track
                             wbt_track
                              rq->wbt_flags |= bio_to_wbt_flags(rwb, bio);
			      // wb_normal is not 0,wbt_flags will be set
t3: io completion
blk_mq_free_request
 rq_qos_done
  wbt_done
   wbt_is_tracked
   // return true
   __wbt_done
    wbt_rqw_done
     atomic_dec_return(&rqw->inflight);
     // inflight is decreased

commit 8235b5c1e8c1 ("block: call bdev_add later in device_add_disk") can
avoid this problem, however it's better to fix this problem in wbt:

1) Lower kernel can't backport this patch due to lots of refactor.
2) Root cause is that wbt call rq_qos_add() before wb_normal is
initialized.

Fixes: e34cbd307477 ("blk-wbt: add general throttling mechanism")
Cc: <stable@vger.kernel.org>
Signed-off-by: Yu Kuai <yukuai3@huawei.com>
Link: https://lore.kernel.org/r/20220913105749.3086243-1-yukuai1@huaweicloud.com
Signed-off-by: Jens Axboe <axboe@kernel.dk>
---
 block/blk-wbt.c | 9 ++++-----
 1 file changed, 4 insertions(+), 5 deletions(-)
  

Comments

Greg KH Oct. 26, 2022, 4:47 p.m. UTC | #1
On Tue, Oct 18, 2022 at 09:43:25AM +0800, Yu Kuai wrote:
> From: Yu Kuai <yukuai3@huawei.com>
> 
> commit 8c5035dfbb9475b67c82b3fdb7351236525bf52b upstream.

I need a 5.15 version of this, and the 3/3 patch in order to be able to
apply the 5.10.y version.

Can you please send that, and then resend the remaining patches here for
5.10.y?

thanks,

greg k-h
  
Yu Kuai Oct. 27, 2022, 11:28 a.m. UTC | #2
Hi,

在 2022/10/27 0:47, Greg KH 写道:
> On Tue, Oct 18, 2022 at 09:43:25AM +0800, Yu Kuai wrote:
>> From: Yu Kuai <yukuai3@huawei.com>
>>
>> commit 8c5035dfbb9475b67c82b3fdb7351236525bf52b upstream.
> 
> I need a 5.15 version of this, and the 3/3 patch in order to be able to
> apply the 5.10.y version.
> 
> Can you please send that, and then resend the remaining patches here for
> 5.10.y?

Yes, I can do that. By the way, just to confirm:

I already saw that patch 2,3 is queued:

[PATCH 5.15 122/530] blk-wbt: call rq_qos_add() after wb_normal is 
initialized
[PATCH 5.15 519/530] blk-wbt: fix that rwb->wc is always set to 1 in 
wbt_init()

Do I still need to send a 5.15 version?

Thanks,
Kuai
> 
> thanks,
> 
> greg k-h
> .
>
  
Greg KH Oct. 27, 2022, 11:49 a.m. UTC | #3
On Thu, Oct 27, 2022 at 07:28:26PM +0800, Yu Kuai wrote:
> Hi,
> 
> 在 2022/10/27 0:47, Greg KH 写道:
> > On Tue, Oct 18, 2022 at 09:43:25AM +0800, Yu Kuai wrote:
> > > From: Yu Kuai <yukuai3@huawei.com>
> > > 
> > > commit 8c5035dfbb9475b67c82b3fdb7351236525bf52b upstream.
> > 
> > I need a 5.15 version of this, and the 3/3 patch in order to be able to
> > apply the 5.10.y version.
> > 
> > Can you please send that, and then resend the remaining patches here for
> > 5.10.y?
> 
> Yes, I can do that. By the way, just to confirm:
> 
> I already saw that patch 2,3 is queued:
> 
> [PATCH 5.15 122/530] blk-wbt: call rq_qos_add() after wb_normal is
> initialized
> [PATCH 5.15 519/530] blk-wbt: fix that rwb->wc is always set to 1 in
> wbt_init()
> 
> Do I still need to send a 5.15 version?

Not if it is already in the tree, no.

thanks,

greg k-h
  

Patch

diff --git a/block/blk-wbt.c b/block/blk-wbt.c
index 4ec0a018a2ad..bafdb8098893 100644
--- a/block/blk-wbt.c
+++ b/block/blk-wbt.c
@@ -840,6 +840,10 @@  int wbt_init(struct request_queue *q)
 	rwb->enable_state = WBT_STATE_ON_DEFAULT;
 	rwb->wc = 1;
 	rwb->rq_depth.default_depth = RWB_DEF_DEPTH;
+	rwb->min_lat_nsec = wbt_default_latency_nsec(q);
+
+	wbt_queue_depth_changed(&rwb->rqos);
+	wbt_set_write_cache(q, test_bit(QUEUE_FLAG_WC, &q->queue_flags));
 
 	/*
 	 * Assign rwb and add the stats callback.
@@ -847,10 +851,5 @@  int wbt_init(struct request_queue *q)
 	rq_qos_add(q, &rwb->rqos);
 	blk_stat_add_callback(q, rwb->cb);
 
-	rwb->min_lat_nsec = wbt_default_latency_nsec(q);
-
-	wbt_queue_depth_changed(&rwb->rqos);
-	wbt_set_write_cache(q, test_bit(QUEUE_FLAG_WC, &q->queue_flags));
-
 	return 0;
 }