From patchwork Mon Jun 12 13:52:17 2023 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Sergei Shtepa X-Patchwork-Id: 10643 Return-Path: Delivered-To: ouuuleilei@gmail.com Received: by 2002:a59:994d:0:b0:3d9:f83d:47d9 with SMTP id k13csp2636436vqr; Mon, 12 Jun 2023 07:41:41 -0700 (PDT) X-Google-Smtp-Source: ACHHUZ5nkg283fVZ/IJLDBGfnDjQN26iLjv5Fy1oG+2CZxIcoPVGr1FiMWlayRA25RUo+0acVMGo X-Received: by 2002:a05:6808:1148:b0:398:36a0:d0c with SMTP id u8-20020a056808114800b0039836a00d0cmr4665249oiu.33.1686580900972; Mon, 12 Jun 2023 07:41:40 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1686580900; cv=none; d=google.com; s=arc-20160816; b=Dyt5uOVd33ZFxzXOGP88KjjMZYXIOSvKSuM0VYPzJlzG6FX54yZb+Q327C0L08OV+J jlO1PuZ4fts6P0tqBCYFmtj7j/pHNZuZnZ6VNTg0VwHDniCwgaPEK9Nq+F9HAPXHixKj c8tVNkgkRhsMfJeDG9sSdYF83ZsMIQqDI1C0ujQvXDwk9k3TC1aExl5cG96s2+UXjQNp tFux7P8CzAW2sOGSjdyoaxyonuFpB1cKnIfCwe7v6jNDV4s781Qspi/eAw9c3DTkCits I5O8AXZTtpGs0220yxmoMf4TTNNnB96InYOw1CJK/62cqzysddVpNaEOZxEhqibXLcHH LzIw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :message-id:date:subject:cc:to:from:dkim-signature; bh=VMupNHJxB/OlqZAdjGEudFPkvX35rQc51mrvvIsSFSU=; b=USJDUMzDlr/DybNSeuxJVVvEgkSYULaxk03VMztuHP+D/7nqTuwUi/9RRm8rvE4G36 nm+C+P7tlGha3ZcRWI9pf31RitIN7y9FQCqa2iWQH5Da2II9eb1K0+Pa7zRThRRv0al7 vIvbjdQzvSTZ+76zC3YdXbhy1NSxcAvZ/JzuJIvwpLUMy6g81LA+RUAhXrhcYYOSN4Lh LlYdeiOHetHZMUNjKEGwtgX/gLMax3B1FlDNUSFWmnszfTACst2F0SWdYJr44Lyfmw6+ b01buRMYg+hH7b8u2j+49d0qnR6f6p6ueEv2xCeOzud8ztKnKuO+DbhwmpA4cdxzsFpf LT3g== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@veeam.com header.s=mx4-2022 header.b=mPRX5pUd; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=veeam.com Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id mh17-20020a17090b4ad100b00259ac7bf8c6si7305379pjb.84.2023.06.12.07.41.21; Mon, 12 Jun 2023 07:41:40 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@veeam.com header.s=mx4-2022 header.b=mPRX5pUd; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=veeam.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S232495AbjFLNwv (ORCPT + 99 others); Mon, 12 Jun 2023 09:52:51 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:40636 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S231319AbjFLNwt (ORCPT ); Mon, 12 Jun 2023 09:52:49 -0400 Received: from mx4.veeam.com (mx4.veeam.com [104.41.138.86]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id E0F52A1; Mon, 12 Jun 2023 06:52:47 -0700 (PDT) Received: from mail.veeam.com (prgmbx01.amust.local [172.24.128.102]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mx4.veeam.com (Postfix) with ESMTPS id 9B78420E2F; Mon, 12 Jun 2023 16:52:45 +0300 (MSK) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=veeam.com; s=mx4-2022; t=1686577965; bh=VMupNHJxB/OlqZAdjGEudFPkvX35rQc51mrvvIsSFSU=; h=From:To:CC:Subject:Date:From; b=mPRX5pUdZUa3Cva+YpjENDfB98Mw07nkySMECMJmqb+UPi3G62sdwhoq49uQsmP4C FUm2Sujfni0MtkmnSeBGTQpF0YkVLT+z07LCkuN7dDKaDlNk/aNG8YWxtTwSQBVpvk x08wjSjuFuGFeTU0869PuhEWmtYlGOfDracbpJaAuurswKWhxcBi5SCuIGOhUjaL1W p6i2j52S2cp7qTA2yL1iBwTpuShYmCPnY9QAtxaMYQZS/APtDPyuN8ArRTEQAloSyJ mS/V80I3WE4KDa90IDNLQ0E85VildOoMGAr+9POr1Srt7zbNYk+oyBhSirpy1bdm7L OZiurO17HNd6Q== Received: from ssh-deb10-ssd-vb.amust.local (172.24.10.107) by prgmbx01.amust.local (172.24.128.102) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.2.1118.26; Mon, 12 Jun 2023 15:52:44 +0200 From: Sergei Shtepa To: , , , CC: , , , , , , , , , , , , Subject: [PATCH v5 00/11] blksnap - block devices snapshots module Date: Mon, 12 Jun 2023 15:52:17 +0200 Message-ID: <20230612135228.10702-1-sergei.shtepa@veeam.com> X-Mailer: git-send-email 2.20.1 MIME-Version: 1.0 X-Originating-IP: [172.24.10.107] X-ClientProxiedBy: prgmbx02.amust.local (172.24.128.103) To prgmbx01.amust.local (172.24.128.102) X-EsetResult: clean, is OK X-EsetId: 37303A29240315546D776B X-Veeam-MMEX: True X-Spam-Status: No, score=-2.1 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_NONE, RCVD_IN_MSPIKE_H2,SPF_HELO_NONE,SPF_PASS,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org X-getmail-retrieved-from-mailbox: =?utf-8?q?INBOX?= X-GMAIL-THRID: =?utf-8?q?1768504537979326928?= X-GMAIL-MSGID: =?utf-8?q?1768508255098337195?= Hi all. I am happy to offer a improved version of the Block Devices Snapshots Module. It allows to create non-persistent snapshots of any block devices. The main purpose of such snapshots is to provide backups of block devices. See more in Documentation/block/blksnap.rst. The Block Device Filtering Mechanism is added to the block layer. This allows to attach and detach block device filters to the block layer. Filters allow to extend the functionality of the block layer. See more in Documentation/block/blkfilter.rst. The tool, library and tests for working with blksnap can be found on github. Link: https://github.com/veeam/blksnap/tree/stable-v2.0 There are few changes in this patch version. The experience of using the out-of-tree version of the blksnap module on real servers was taken into account. v5 changes: - Rebase for "kernel/git/axboe/linux-block.git" branch "for-6.5/block". Link: https://git.kernel.org/pub/scm/linux/kernel/git/axboe/linux-block.git/log/?h=for-6.5/block v4 changes: - Structures for describing the state of chunks are allocated dynamically. This reduces memory consumption, since the struct chunk is allocated only for those blocks for which the snapshot image state differs from the original block device. - The algorithm for calculating the chunk size depending on the size of the block device has been changed. For large block devices, it is now possible to allocate a larger number of chunks, and their size is smaller. - For block devices, a 'filter' file has been added to /sys/block/. It displays the name of the filter that is attached to the block device. - Fixed a problem with the lack of protection against re-adding a block device to a snapshot. - Fixed a bug in the algorithm of allocating the next bio for a chunk. This problem was accurred on large disks, for which a chunk consists of at least two bio. - The ownership mechanism of the diff_area structure has been changed. This fixed the error of prematurely releasing the diff_area structure when destroying the snapshot. - Documentation corrected. - The Sparse analyzer is passed. - Use __u64 type instead pointers in UAPI. v3 changes: - New block device I/O controls BLKFILTER_ATTACH and BLKFILTER_DETACH allow to attach and detach filters. - New block device I/O control BLKFILTER_CTL allow send command to attached block device filter. - The copy-on-write algorithm for processing I/O units has been optimized and has become asynchronous. - The snapshot image reading algorithm has been optimized and has become asynchronous. - Optimized the finite state machine for processing chunks. - Fixed a tracking block size calculation bug. v2 changes: - Added documentation for Block Device Filtering Mechanism. - Added documentation for Block Devices Snapshots Module (blksnap). - The MAINTAINERS file has been updated. - Optimized queue code for snapshot images. - Fixed comments, log messages and code for better readability. v1 changes: - Forgotten "static" declarations have been added. - The text of the comments has been corrected. - It is possible to connect only one filter, since there are no others in upstream. - Do not have additional locks for attach/detach filter. - blksnap.h moved to include/uapi/. - #pragma once and commented code removed. - uuid_t removed from user API. - Removed default values for module parameters from the configuration file. - The debugging code for tracking memory leaks has been removed. - Simplified Makefile. - Optimized work with large memory buffers, CBT tables are now in virtual memory. - The allocation code of minor numbers has been optimized. - The implementation of the snapshot image block device has been simplified, now it is a bio-based block device. - Removed initialization of global variables with null values. - only one bio is used to copy one chunk. - Checked on ppc64le. Thanks for preparing v4 patch: - Christoph Hellwig for his significant contribution to the project. - Fabio Fantoni for his participation in the project, useful advice and faith in the success of the project. - Donald Buczek for researching the module and user-space tool. His fresh look revealed a number of flaw. - Bagas Sanjaya for comments on the documentation. Sergei Shtepa (11): documentation: Block Device Filtering Mechanism block: Block Device Filtering Mechanism documentation: Block Devices Snapshots Module blksnap: header file of the module interface blksnap: module management interface functions blksnap: handling and tracking I/O units blksnap: minimum data storage unit of the original block device blksnap: difference storage blksnap: event queue from the difference storage blksnap: snapshot and snapshot image block device blksnap: Kconfig and Makefile Documentation/block/blkfilter.rst | 64 ++++ Documentation/block/blksnap.rst | 345 +++++++++++++++++ Documentation/block/index.rst | 2 + MAINTAINERS | 17 + block/Makefile | 3 +- block/bdev.c | 1 + block/blk-core.c | 27 ++ block/blk-filter.c | 213 ++++++++++ block/blk.h | 11 + block/genhd.c | 10 + block/ioctl.c | 7 + block/partitions/core.c | 10 + drivers/block/Kconfig | 2 + drivers/block/Makefile | 2 + drivers/block/blksnap/Kconfig | 12 + drivers/block/blksnap/Makefile | 15 + drivers/block/blksnap/cbt_map.c | 227 +++++++++++ drivers/block/blksnap/cbt_map.h | 90 +++++ drivers/block/blksnap/chunk.c | 454 ++++++++++++++++++++++ drivers/block/blksnap/chunk.h | 114 ++++++ drivers/block/blksnap/diff_area.c | 554 +++++++++++++++++++++++++++ drivers/block/blksnap/diff_area.h | 144 +++++++ drivers/block/blksnap/diff_buffer.c | 127 ++++++ drivers/block/blksnap/diff_buffer.h | 37 ++ drivers/block/blksnap/diff_storage.c | 316 +++++++++++++++ drivers/block/blksnap/diff_storage.h | 111 ++++++ drivers/block/blksnap/event_queue.c | 87 +++++ drivers/block/blksnap/event_queue.h | 65 ++++ drivers/block/blksnap/main.c | 483 +++++++++++++++++++++++ drivers/block/blksnap/params.h | 16 + drivers/block/blksnap/snapimage.c | 124 ++++++ drivers/block/blksnap/snapimage.h | 10 + drivers/block/blksnap/snapshot.c | 443 +++++++++++++++++++++ drivers/block/blksnap/snapshot.h | 68 ++++ drivers/block/blksnap/tracker.c | 339 ++++++++++++++++ drivers/block/blksnap/tracker.h | 75 ++++ include/linux/blk-filter.h | 51 +++ include/linux/blk_types.h | 2 + include/linux/blkdev.h | 1 + include/uapi/linux/blk-filter.h | 35 ++ include/uapi/linux/blksnap.h | 421 ++++++++++++++++++++ include/uapi/linux/fs.h | 3 + 42 files changed, 5137 insertions(+), 1 deletion(-) create mode 100644 Documentation/block/blkfilter.rst create mode 100644 Documentation/block/blksnap.rst create mode 100644 block/blk-filter.c create mode 100644 drivers/block/blksnap/Kconfig create mode 100644 drivers/block/blksnap/Makefile create mode 100644 drivers/block/blksnap/cbt_map.c create mode 100644 drivers/block/blksnap/cbt_map.h create mode 100644 drivers/block/blksnap/chunk.c create mode 100644 drivers/block/blksnap/chunk.h create mode 100644 drivers/block/blksnap/diff_area.c create mode 100644 drivers/block/blksnap/diff_area.h create mode 100644 drivers/block/blksnap/diff_buffer.c create mode 100644 drivers/block/blksnap/diff_buffer.h create mode 100644 drivers/block/blksnap/diff_storage.c create mode 100644 drivers/block/blksnap/diff_storage.h create mode 100644 drivers/block/blksnap/event_queue.c create mode 100644 drivers/block/blksnap/event_queue.h create mode 100644 drivers/block/blksnap/main.c create mode 100644 drivers/block/blksnap/params.h create mode 100644 drivers/block/blksnap/snapimage.c create mode 100644 drivers/block/blksnap/snapimage.h create mode 100644 drivers/block/blksnap/snapshot.c create mode 100644 drivers/block/blksnap/snapshot.h create mode 100644 drivers/block/blksnap/tracker.c create mode 100644 drivers/block/blksnap/tracker.h create mode 100644 include/linux/blk-filter.h create mode 100644 include/uapi/linux/blk-filter.h create mode 100644 include/uapi/linux/blksnap.h Acked-by: Christoph Hellwig