[v6,3/3] nvmem: core: Expose cells through sysfs

Message ID 20230717075147.43326-4-miquel.raynal@bootlin.com
State New
Headers
Series NVMEM cells in sysfs |

Commit Message

Miquel Raynal July 17, 2023, 7:51 a.m. UTC
  The binary content of nvmem devices is available to the user so in the
easiest cases, finding the content of a cell is rather easy as it is
just a matter of looking at a known and fixed offset. However, nvmem
layouts have been recently introduced to cope with more advanced
situations, where the offset and size of the cells is not known in
advance or is dynamic. When using layouts, more advanced parsers are
used by the kernel in order to give direct access to the content of each
cell, regardless of its position/size in the underlying
device. Unfortunately, these information are not accessible by users,
unless by fully re-implementing the parser logic in userland.

Let's expose the cells and their content through sysfs to avoid these
situations. Of course the relevant NVMEM sysfs Kconfig option must be
enabled for this support to be available.

Not all nvmem devices expose cells. Indeed, the .bin_attrs attribute
group member will be filled at runtime only when relevant and will
remain empty otherwise. In this case, as the cells attribute group will
be empty, it will not lead to any additional folder/file creation.

Exposed cells are read-only. There is, in practice, everything in the
core to support a write path, but as I don't see any need for that, I
prefer to keep the interface simple (and probably safer). The interface
is documented as being in the "testing" state which means we can later
add a write attribute if though relevant.

There is one limitation though: if a layout is built as a module but is
not properly installed in the system and loaded manually with insmod
while the nvmem device driver was built-in, the cells won't appear in
sysfs. But if done like that, the cells won't be usable by the built-in
kernel drivers anyway.

Signed-off-by: Miquel Raynal <miquel.raynal@bootlin.com>
Reviewed-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
---
 drivers/nvmem/core.c | 101 +++++++++++++++++++++++++++++++++++++++++++
 1 file changed, 101 insertions(+)
  

Comments

Michael Walle July 17, 2023, 12:24 p.m. UTC | #1
Hi,

> There is one limitation though: if a layout is built as a module but is
> not properly installed in the system and loaded manually with insmod
> while the nvmem device driver was built-in, the cells won't appear in
> sysfs. But if done like that, the cells won't be usable by the built-in
> kernel drivers anyway.

What is the difference between manual loading with insmod and automatic
module loading? Or is the limitation, layout as M and device driver as Y
doesn't work?

-michael
  
Greg KH July 17, 2023, 2:32 p.m. UTC | #2
On Mon, Jul 17, 2023 at 09:51:47AM +0200, Miquel Raynal wrote:
> The binary content of nvmem devices is available to the user so in the
> easiest cases, finding the content of a cell is rather easy as it is
> just a matter of looking at a known and fixed offset. However, nvmem
> layouts have been recently introduced to cope with more advanced
> situations, where the offset and size of the cells is not known in
> advance or is dynamic. When using layouts, more advanced parsers are
> used by the kernel in order to give direct access to the content of each
> cell, regardless of its position/size in the underlying
> device. Unfortunately, these information are not accessible by users,
> unless by fully re-implementing the parser logic in userland.
> 
> Let's expose the cells and their content through sysfs to avoid these
> situations. Of course the relevant NVMEM sysfs Kconfig option must be
> enabled for this support to be available.
> 
> Not all nvmem devices expose cells. Indeed, the .bin_attrs attribute
> group member will be filled at runtime only when relevant and will
> remain empty otherwise. In this case, as the cells attribute group will
> be empty, it will not lead to any additional folder/file creation.
> 
> Exposed cells are read-only. There is, in practice, everything in the
> core to support a write path, but as I don't see any need for that, I
> prefer to keep the interface simple (and probably safer). The interface
> is documented as being in the "testing" state which means we can later
> add a write attribute if though relevant.
> 
> There is one limitation though: if a layout is built as a module but is
> not properly installed in the system and loaded manually with insmod
> while the nvmem device driver was built-in, the cells won't appear in
> sysfs. But if done like that, the cells won't be usable by the built-in
> kernel drivers anyway.

Wait, what?  That should not be an issue here, if so, then this change
is not correct and should be fixed as this is NOT an issue for sysfs
(otherwise the whole tree wouldn't work.)

Please fix up your dependancies if this is somehow not working properly.

thanks,

greg k-h
  
Miquel Raynal July 17, 2023, 4:33 p.m. UTC | #3
Hi Greg,

gregkh@linuxfoundation.org wrote on Mon, 17 Jul 2023 16:32:09 +0200:

> On Mon, Jul 17, 2023 at 09:51:47AM +0200, Miquel Raynal wrote:
> > The binary content of nvmem devices is available to the user so in the
> > easiest cases, finding the content of a cell is rather easy as it is
> > just a matter of looking at a known and fixed offset. However, nvmem
> > layouts have been recently introduced to cope with more advanced
> > situations, where the offset and size of the cells is not known in
> > advance or is dynamic. When using layouts, more advanced parsers are
> > used by the kernel in order to give direct access to the content of each
> > cell, regardless of its position/size in the underlying
> > device. Unfortunately, these information are not accessible by users,
> > unless by fully re-implementing the parser logic in userland.
> > 
> > Let's expose the cells and their content through sysfs to avoid these
> > situations. Of course the relevant NVMEM sysfs Kconfig option must be
> > enabled for this support to be available.
> > 
> > Not all nvmem devices expose cells. Indeed, the .bin_attrs attribute
> > group member will be filled at runtime only when relevant and will
> > remain empty otherwise. In this case, as the cells attribute group will
> > be empty, it will not lead to any additional folder/file creation.
> > 
> > Exposed cells are read-only. There is, in practice, everything in the
> > core to support a write path, but as I don't see any need for that, I
> > prefer to keep the interface simple (and probably safer). The interface
> > is documented as being in the "testing" state which means we can later
> > add a write attribute if though relevant.
> > 
> > There is one limitation though: if a layout is built as a module but is
> > not properly installed in the system and loaded manually with insmod
> > while the nvmem device driver was built-in, the cells won't appear in
> > sysfs. But if done like that, the cells won't be usable by the built-in
> > kernel drivers anyway.  
> 
> Wait, what?  That should not be an issue here, if so, then this change
> is not correct and should be fixed as this is NOT an issue for sysfs
> (otherwise the whole tree wouldn't work.)
> 
> Please fix up your dependancies if this is somehow not working properly.

I'm not sure I fully get your point.

There is no way we can describe any dependency between a storage device
driver and an nvmem layout. NVMEM is a pure software abstraction, the
layout that will be chosen depends on the device tree, but if the
layout has not been installed, there is no existing mechanism in
the kernel to prevent it from being loaded (how do you know it's
not on purpose?).

Thanks,
Miquèl
  
Miquel Raynal July 17, 2023, 4:41 p.m. UTC | #4
Hi Michael,

michael@walle.cc wrote on Mon, 17 Jul 2023 14:24:45 +0200:

> Hi,
> 
> > There is one limitation though: if a layout is built as a module but is
> > not properly installed in the system and loaded manually with insmod
> > while the nvmem device driver was built-in, the cells won't appear in
> > sysfs. But if done like that, the cells won't be usable by the built-in
> > kernel drivers anyway.  
> 
> What is the difference between manual loading with insmod and automatic
> module loading? Or is the limitation, layout as M and device driver as Y
> doesn't work?

The nvmem core uses usermodehelper to load the relevant layout
module, but that only works if the module was installed correctly (make
modules_install).

The limitation is:
* Any storage device driver that registers an nvmem interface =y (or =m
  but loaded before the nvmem layout)
* The relevant nvmem layout =m *and* not installed with make
  modules_install

If you see a way to workaround this, let me know, but there is no way
we can enforce Kconfig dependencies between storage drivers and nvmem
layouts IMHO.

Thanks,
Miquèl
  
Greg KH July 17, 2023, 4:59 p.m. UTC | #5
On Mon, Jul 17, 2023 at 06:33:23PM +0200, Miquel Raynal wrote:
> Hi Greg,
> 
> gregkh@linuxfoundation.org wrote on Mon, 17 Jul 2023 16:32:09 +0200:
> 
> > On Mon, Jul 17, 2023 at 09:51:47AM +0200, Miquel Raynal wrote:
> > > The binary content of nvmem devices is available to the user so in the
> > > easiest cases, finding the content of a cell is rather easy as it is
> > > just a matter of looking at a known and fixed offset. However, nvmem
> > > layouts have been recently introduced to cope with more advanced
> > > situations, where the offset and size of the cells is not known in
> > > advance or is dynamic. When using layouts, more advanced parsers are
> > > used by the kernel in order to give direct access to the content of each
> > > cell, regardless of its position/size in the underlying
> > > device. Unfortunately, these information are not accessible by users,
> > > unless by fully re-implementing the parser logic in userland.
> > > 
> > > Let's expose the cells and their content through sysfs to avoid these
> > > situations. Of course the relevant NVMEM sysfs Kconfig option must be
> > > enabled for this support to be available.
> > > 
> > > Not all nvmem devices expose cells. Indeed, the .bin_attrs attribute
> > > group member will be filled at runtime only when relevant and will
> > > remain empty otherwise. In this case, as the cells attribute group will
> > > be empty, it will not lead to any additional folder/file creation.
> > > 
> > > Exposed cells are read-only. There is, in practice, everything in the
> > > core to support a write path, but as I don't see any need for that, I
> > > prefer to keep the interface simple (and probably safer). The interface
> > > is documented as being in the "testing" state which means we can later
> > > add a write attribute if though relevant.
> > > 
> > > There is one limitation though: if a layout is built as a module but is
> > > not properly installed in the system and loaded manually with insmod
> > > while the nvmem device driver was built-in, the cells won't appear in
> > > sysfs. But if done like that, the cells won't be usable by the built-in
> > > kernel drivers anyway.  
> > 
> > Wait, what?  That should not be an issue here, if so, then this change
> > is not correct and should be fixed as this is NOT an issue for sysfs
> > (otherwise the whole tree wouldn't work.)
> > 
> > Please fix up your dependancies if this is somehow not working properly.
> 
> I'm not sure I fully get your point.
> 
> There is no way we can describe any dependency between a storage device
> driver and an nvmem layout. NVMEM is a pure software abstraction, the
> layout that will be chosen depends on the device tree, but if the
> layout has not been installed, there is no existing mechanism in
> the kernel to prevent it from being loaded (how do you know it's
> not on purpose?).

Once a layout has been loaded, the sysfs files should show up, right?
Otherwise what does a "layout" do?  (hint, I have no idea, it's an odd
term to me...)

thanks,

greg k-h
  
Chen-Yu Tsai July 18, 2023, 10:26 a.m. UTC | #6
Hi,

On Mon, Jul 17, 2023 at 09:51:47AM +0200, Miquel Raynal wrote:
> The binary content of nvmem devices is available to the user so in the
> easiest cases, finding the content of a cell is rather easy as it is
> just a matter of looking at a known and fixed offset. However, nvmem
> layouts have been recently introduced to cope with more advanced
> situations, where the offset and size of the cells is not known in
> advance or is dynamic. When using layouts, more advanced parsers are
> used by the kernel in order to give direct access to the content of each
> cell, regardless of its position/size in the underlying
> device. Unfortunately, these information are not accessible by users,
> unless by fully re-implementing the parser logic in userland.
> 
> Let's expose the cells and their content through sysfs to avoid these
> situations. Of course the relevant NVMEM sysfs Kconfig option must be
> enabled for this support to be available.
> 
> Not all nvmem devices expose cells. Indeed, the .bin_attrs attribute
> group member will be filled at runtime only when relevant and will
> remain empty otherwise. In this case, as the cells attribute group will
> be empty, it will not lead to any additional folder/file creation.
> 
> Exposed cells are read-only. There is, in practice, everything in the
> core to support a write path, but as I don't see any need for that, I
> prefer to keep the interface simple (and probably safer). The interface
> is documented as being in the "testing" state which means we can later
> add a write attribute if though relevant.
> 
> There is one limitation though: if a layout is built as a module but is
> not properly installed in the system and loaded manually with insmod
> while the nvmem device driver was built-in, the cells won't appear in
> sysfs. But if done like that, the cells won't be usable by the built-in
> kernel drivers anyway.
> 
> Signed-off-by: Miquel Raynal <miquel.raynal@bootlin.com>
> Reviewed-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
> ---
>  drivers/nvmem/core.c | 101 +++++++++++++++++++++++++++++++++++++++++++
>  1 file changed, 101 insertions(+)
> 
> diff --git a/drivers/nvmem/core.c b/drivers/nvmem/core.c
> index 48659106a1e2..6c04a9cf6919 100644
> --- a/drivers/nvmem/core.c
> +++ b/drivers/nvmem/core.c
> @@ -325,6 +325,43 @@ static umode_t nvmem_bin_attr_is_visible(struct kobject *kobj,
>  	return nvmem_bin_attr_get_umode(nvmem);
>  }
>  
> +static struct nvmem_cell *nvmem_create_cell(struct nvmem_cell_entry *entry,
> +					    const char *id, int index);
> +
> +static ssize_t nvmem_cell_attr_read(struct file *filp, struct kobject *kobj,
> +				    struct bin_attribute *attr, char *buf,
> +				    loff_t pos, size_t count)
> +{
> +	struct nvmem_cell_entry *entry;
> +	struct nvmem_cell *cell = NULL;
> +	size_t cell_sz, read_len;
> +	void *content;
> +
> +	entry = attr->private;
> +	cell = nvmem_create_cell(entry, entry->name, 0);
> +	if (IS_ERR(cell))
> +		return PTR_ERR(cell);
> +
> +	if (!cell)
> +		return -EINVAL;
> +
> +	content = nvmem_cell_read(cell, &cell_sz);
> +	if (IS_ERR(content)) {
> +		read_len = PTR_ERR(content);
> +		goto destroy_cell;
> +	}
> +
> +	read_len = min_t(unsigned int, cell_sz - pos, count);
> +	memcpy(buf, content + pos, read_len);
> +	kfree(content);
> +
> +destroy_cell:
> +	kfree_const(cell->id);
> +	kfree(cell);
> +
> +	return read_len;
> +}
> +
>  /* default read/write permissions */
>  static struct bin_attribute bin_attr_rw_nvmem = {
>  	.attr	= {
> @@ -346,8 +383,14 @@ static const struct attribute_group nvmem_bin_group = {
>  	.is_bin_visible = nvmem_bin_attr_is_visible,
>  };
>  
> +/* Cell attributes will be dynamically allocated */
> +static struct attribute_group nvmem_cells_group = {
> +	.name		= "cells",
> +};
> +
>  static const struct attribute_group *nvmem_dev_groups[] = {
>  	&nvmem_bin_group,
> +	&nvmem_cells_group,
>  	NULL,
>  };
>  
> @@ -406,6 +449,58 @@ static void nvmem_sysfs_remove_compat(struct nvmem_device *nvmem,
>  		device_remove_bin_file(nvmem->base_dev, &nvmem->eeprom);
>  }
>  
> +static int nvmem_populate_sysfs_cells(struct nvmem_device *nvmem)
> +{
> +	struct bin_attribute **cells_attrs, *attrs;
> +	struct nvmem_cell_entry *entry;
> +	unsigned int ncells = 0, i = 0;
> +	int ret = 0;
> +
> +	mutex_lock(&nvmem_mutex);
> +
> +	if (list_empty(&nvmem->cells))
> +		goto unlock_mutex;
> +
> +	/* Allocate an array of attributes with a sentinel */
> +	ncells = list_count_nodes(&nvmem->cells);
> +	cells_attrs = devm_kcalloc(&nvmem->dev, ncells + 1,
> +				   sizeof(struct bin_attribute *), GFP_KERNEL);
> +	if (!cells_attrs) {
> +		ret = -ENOMEM;
> +		goto unlock_mutex;
> +	}
> +
> +	attrs = devm_kcalloc(&nvmem->dev, ncells, sizeof(struct bin_attribute), GFP_KERNEL);
> +	if (!attrs) {
> +		ret = -ENOMEM;
> +		goto unlock_mutex;
> +	}
> +
> +	/* Initialize each attribute to take the name and size of the cell */
> +	list_for_each_entry(entry, &nvmem->cells, node) {
> +		sysfs_bin_attr_init(&attrs[i]);
> +		attrs[i].attr.name = devm_kstrdup(&nvmem->dev, entry->name, GFP_KERNEL);
> +		attrs[i].attr.mode = 0444;
> +		attrs[i].size = entry->bytes;
> +		attrs[i].read = &nvmem_cell_attr_read;
> +		attrs[i].private = entry;
> +		if (!attrs[i].attr.name) {
> +			ret = -ENOMEM;
> +			goto unlock_mutex;
> +		}
> +
> +		cells_attrs[i] = &attrs[i];
> +		i++;
> +	}
> +
> +	nvmem_cells_group.bin_attrs = cells_attrs;
> +
> +unlock_mutex:
> +	mutex_unlock(&nvmem_mutex);
> +
> +	return ret;
> +}
> +
>  #else /* CONFIG_NVMEM_SYSFS */
>  
>  static int nvmem_sysfs_setup_compat(struct nvmem_device *nvmem,
> @@ -1006,6 +1101,12 @@ struct nvmem_device *nvmem_register(const struct nvmem_config *config)
>  	if (rval)
>  		goto err_remove_cells;
>  
> +#ifdef CONFIG_NVMEM_SYSFS
> +	rval = nvmem_populate_sysfs_cells(nvmem);
> +	if (rval)
> +		goto err_remove_cells;

This breaks nvmem / efuse devices with multiple cells that share the
same name. Something like this in DT:

	efuse: efuse@11f10000 {
		compatible = "mediatek,mt8183-efuse",
			     "mediatek,efuse";
		reg = <0 0x11f10000 0 0x1000>;
		#address-cells = <1>;
		#size-cells = <1>;
		thermal_calibration: calib@180 {
			reg = <0x180 0xc>;
		};

		mipi_tx_calibration: calib@190 {
			reg = <0x190 0xc>;
		};

		svs_calibration: calib@580 {
			reg = <0x580 0x64>;
		};
	};

creates three cells, all named DT, and sysfs will complain:

    sysfs: cannot create duplicate filename '/devices/platform/soc/11f10000.efuse/nvmem1/cells/calib'
    mediatek,efuse: probe of 11f10000.efuse failed with error -17

This causes the MT8183-based Chromebooks to lose display capability,
among other things.

The problem lies in the nvmem DT parsing code, where the cell name is
derived from the node name, without including the address portion.
However I'm not sure we can change that, since it could be considered
ABI?


ChenYu

> +#endif
> +
>  	dev_dbg(&nvmem->dev, "Registering nvmem device %s\n", config->name);
>  
>  	rval = device_add(&nvmem->dev);
> -- 
> 2.34.1
>
  
Miquel Raynal July 31, 2023, 3:33 p.m. UTC | #7
Hi Greg,

gregkh@linuxfoundation.org wrote on Mon, 17 Jul 2023 18:59:52 +0200:

> On Mon, Jul 17, 2023 at 06:33:23PM +0200, Miquel Raynal wrote:
> > Hi Greg,
> > 
> > gregkh@linuxfoundation.org wrote on Mon, 17 Jul 2023 16:32:09 +0200:
> >   
> > > On Mon, Jul 17, 2023 at 09:51:47AM +0200, Miquel Raynal wrote:  
> > > > The binary content of nvmem devices is available to the user so in the
> > > > easiest cases, finding the content of a cell is rather easy as it is
> > > > just a matter of looking at a known and fixed offset. However, nvmem
> > > > layouts have been recently introduced to cope with more advanced
> > > > situations, where the offset and size of the cells is not known in
> > > > advance or is dynamic. When using layouts, more advanced parsers are
> > > > used by the kernel in order to give direct access to the content of each
> > > > cell, regardless of its position/size in the underlying
> > > > device. Unfortunately, these information are not accessible by users,
> > > > unless by fully re-implementing the parser logic in userland.
> > > > 
> > > > Let's expose the cells and their content through sysfs to avoid these
> > > > situations. Of course the relevant NVMEM sysfs Kconfig option must be
> > > > enabled for this support to be available.
> > > > 
> > > > Not all nvmem devices expose cells. Indeed, the .bin_attrs attribute
> > > > group member will be filled at runtime only when relevant and will
> > > > remain empty otherwise. In this case, as the cells attribute group will
> > > > be empty, it will not lead to any additional folder/file creation.
> > > > 
> > > > Exposed cells are read-only. There is, in practice, everything in the
> > > > core to support a write path, but as I don't see any need for that, I
> > > > prefer to keep the interface simple (and probably safer). The interface
> > > > is documented as being in the "testing" state which means we can later
> > > > add a write attribute if though relevant.
> > > > 
> > > > There is one limitation though: if a layout is built as a module but is
> > > > not properly installed in the system and loaded manually with insmod
> > > > while the nvmem device driver was built-in, the cells won't appear in
> > > > sysfs. But if done like that, the cells won't be usable by the built-in
> > > > kernel drivers anyway.    
> > > 
> > > Wait, what?  That should not be an issue here, if so, then this change
> > > is not correct and should be fixed as this is NOT an issue for sysfs
> > > (otherwise the whole tree wouldn't work.)
> > > 
> > > Please fix up your dependancies if this is somehow not working properly.  
> > 
> > I'm not sure I fully get your point.
> > 
> > There is no way we can describe any dependency between a storage device
> > driver and an nvmem layout. NVMEM is a pure software abstraction, the
> > layout that will be chosen depends on the device tree, but if the
> > layout has not been installed, there is no existing mechanism in
> > the kernel to prevent it from being loaded (how do you know it's
> > not on purpose?).  
> 
> Once a layout has been loaded, the sysfs files should show up, right?
> Otherwise what does a "layout" do?  (hint, I have no idea, it's an odd
> term to me...)

Sorry for the latency in responding to these questions, I'll try to
clarify the situation.

We have:
- device drivers (like NAND flashes, SPI-NOR flashes or EEPROMs) which
  typically probe and register their devices into the nvmem
  layer to expose their content through NVMEM.
- each registration in NVMEM leads to the creation of the relevant
  NVMEM cells which can then be used by other device drivers
  (typically: a network controller retrieving a MAC address from an
  EEPROM through the generic NVMEM abstraction).

We recently covered a slightly new case: the NVMEM cells can be in
random places in the storage devices so we need a "dynamic" way to
discover them: this is the purpose of the NVMEM layouts. We know cell X
is in the device, we just don't know where it is exactly at compile
time, the layout driver will discover it dynamically for us at runtime.

While the "static cells" parser is built-in the NVMEM subsystem, you
explicitly asked to have the layouts modularized. This means
registering a storage device in nvmem while no layout driver has been
inserted yet is now a scenario. We cannot describe any dependency
between a storage device and a layout driver. We cannot defer the probe
either because device drivers which don't get access to their NVMEM
cell are responsible of choosing what to do (most of the time, the idea
is to fallback to a default value to avoid failing the probe for no
reason).

So to answer your original question:

> Once a layout has been loaded, the sysfs files should show up, right?

No. The layouts are kind of "libraries" that the NVMEM subsystem uses
to try exposing cells *when* a new device is registered in NVMEM (not
later). The registration of an NVMEM layout does not trigger any new
parsing, because that is not how the NVMEM subsystem was designed.

I must emphasize that if the layout driver is installed in
/lib/modules/ there is no problem, it will be loaded with
usermodehelper. But if it is not, we can very well have the layout
driver inserted after, and this case, while in practice possible, is
irrelevant from a driver standpoint. It does not make any sense to have
these cells created "after" because they are mostly used during probes.
An easy workaround would be to unregister/register again the underlying
storage device driver.

Do these explanations clarify the situation?

Thanks,
Miquèl
  
Miquel Raynal July 31, 2023, 4:05 p.m. UTC | #8
Hi Chen-Yu,

> >  static int nvmem_sysfs_setup_compat(struct nvmem_device *nvmem,
> > @@ -1006,6 +1101,12 @@ struct nvmem_device *nvmem_register(const struct nvmem_config *config)
> >  	if (rval)
> >  		goto err_remove_cells;
> >  
> > +#ifdef CONFIG_NVMEM_SYSFS
> > +	rval = nvmem_populate_sysfs_cells(nvmem);
> > +	if (rval)
> > +		goto err_remove_cells;  
> 
> This breaks nvmem / efuse devices with multiple cells that share the
> same name. Something like this in DT:
> 
> 	efuse: efuse@11f10000 {
> 		compatible = "mediatek,mt8183-efuse",
> 			     "mediatek,efuse";
> 		reg = <0 0x11f10000 0 0x1000>;
> 		#address-cells = <1>;
> 		#size-cells = <1>;
> 		thermal_calibration: calib@180 {
> 			reg = <0x180 0xc>;
> 		};
> 
> 		mipi_tx_calibration: calib@190 {
> 			reg = <0x190 0xc>;
> 		};
> 
> 		svs_calibration: calib@580 {
> 			reg = <0x580 0x64>;
> 		};
> 	};
> 
> creates three cells, all named DT, and sysfs will complain:
> 
>     sysfs: cannot create duplicate filename '/devices/platform/soc/11f10000.efuse/nvmem1/cells/calib'
>     mediatek,efuse: probe of 11f10000.efuse failed with error -17
> 
> This causes the MT8183-based Chromebooks to lose display capability,
> among other things.

Sorry for the breakage, I did not identify this case, but you're right
this is incorrectly handled currently.

> The problem lies in the nvmem DT parsing code, where the cell name is
> derived from the node name, without including the address portion.
> However I'm not sure we can change that, since it could be considered
> ABI?

I would be in favor suffixing the cell names anyway as they have not
been exposed yet to userspace at all (well, not more than a couple of
days in -next).

Thanks,
Miquèl
  
Greg KH Aug. 1, 2023, 9:56 a.m. UTC | #9
On Mon, Jul 31, 2023 at 05:33:13PM +0200, Miquel Raynal wrote:
> Hi Greg,
> 
> gregkh@linuxfoundation.org wrote on Mon, 17 Jul 2023 18:59:52 +0200:
> 
> > On Mon, Jul 17, 2023 at 06:33:23PM +0200, Miquel Raynal wrote:
> > > Hi Greg,
> > > 
> > > gregkh@linuxfoundation.org wrote on Mon, 17 Jul 2023 16:32:09 +0200:
> > >   
> > > > On Mon, Jul 17, 2023 at 09:51:47AM +0200, Miquel Raynal wrote:  
> > > > > The binary content of nvmem devices is available to the user so in the
> > > > > easiest cases, finding the content of a cell is rather easy as it is
> > > > > just a matter of looking at a known and fixed offset. However, nvmem
> > > > > layouts have been recently introduced to cope with more advanced
> > > > > situations, where the offset and size of the cells is not known in
> > > > > advance or is dynamic. When using layouts, more advanced parsers are
> > > > > used by the kernel in order to give direct access to the content of each
> > > > > cell, regardless of its position/size in the underlying
> > > > > device. Unfortunately, these information are not accessible by users,
> > > > > unless by fully re-implementing the parser logic in userland.
> > > > > 
> > > > > Let's expose the cells and their content through sysfs to avoid these
> > > > > situations. Of course the relevant NVMEM sysfs Kconfig option must be
> > > > > enabled for this support to be available.
> > > > > 
> > > > > Not all nvmem devices expose cells. Indeed, the .bin_attrs attribute
> > > > > group member will be filled at runtime only when relevant and will
> > > > > remain empty otherwise. In this case, as the cells attribute group will
> > > > > be empty, it will not lead to any additional folder/file creation.
> > > > > 
> > > > > Exposed cells are read-only. There is, in practice, everything in the
> > > > > core to support a write path, but as I don't see any need for that, I
> > > > > prefer to keep the interface simple (and probably safer). The interface
> > > > > is documented as being in the "testing" state which means we can later
> > > > > add a write attribute if though relevant.
> > > > > 
> > > > > There is one limitation though: if a layout is built as a module but is
> > > > > not properly installed in the system and loaded manually with insmod
> > > > > while the nvmem device driver was built-in, the cells won't appear in
> > > > > sysfs. But if done like that, the cells won't be usable by the built-in
> > > > > kernel drivers anyway.    
> > > > 
> > > > Wait, what?  That should not be an issue here, if so, then this change
> > > > is not correct and should be fixed as this is NOT an issue for sysfs
> > > > (otherwise the whole tree wouldn't work.)
> > > > 
> > > > Please fix up your dependancies if this is somehow not working properly.  
> > > 
> > > I'm not sure I fully get your point.
> > > 
> > > There is no way we can describe any dependency between a storage device
> > > driver and an nvmem layout. NVMEM is a pure software abstraction, the
> > > layout that will be chosen depends on the device tree, but if the
> > > layout has not been installed, there is no existing mechanism in
> > > the kernel to prevent it from being loaded (how do you know it's
> > > not on purpose?).  
> > 
> > Once a layout has been loaded, the sysfs files should show up, right?
> > Otherwise what does a "layout" do?  (hint, I have no idea, it's an odd
> > term to me...)
> 
> Sorry for the latency in responding to these questions, I'll try to
> clarify the situation.
> 
> We have:
> - device drivers (like NAND flashes, SPI-NOR flashes or EEPROMs) which
>   typically probe and register their devices into the nvmem
>   layer to expose their content through NVMEM.
> - each registration in NVMEM leads to the creation of the relevant
>   NVMEM cells which can then be used by other device drivers
>   (typically: a network controller retrieving a MAC address from an
>   EEPROM through the generic NVMEM abstraction).


So is a "cell" here a device in the device model?  Or something else?

> We recently covered a slightly new case: the NVMEM cells can be in
> random places in the storage devices so we need a "dynamic" way to
> discover them: this is the purpose of the NVMEM layouts. We know cell X
> is in the device, we just don't know where it is exactly at compile
> time, the layout driver will discover it dynamically for us at runtime.

So you then create the needed device when it is found?

> While the "static cells" parser is built-in the NVMEM subsystem, you
> explicitly asked to have the layouts modularized. This means
> registering a storage device in nvmem while no layout driver has been
> inserted yet is now a scenario. We cannot describe any dependency
> between a storage device and a layout driver. We cannot defer the probe
> either because device drivers which don't get access to their NVMEM
> cell are responsible of choosing what to do (most of the time, the idea
> is to fallback to a default value to avoid failing the probe for no
> reason).
> 
> So to answer your original question:
> 
> > Once a layout has been loaded, the sysfs files should show up, right?
> 
> No. The layouts are kind of "libraries" that the NVMEM subsystem uses
> to try exposing cells *when* a new device is registered in NVMEM (not
> later). The registration of an NVMEM layout does not trigger any new
> parsing, because that is not how the NVMEM subsystem was designed.

So they are a type of "class" right?  Why not just use class devices
then?

> I must emphasize that if the layout driver is installed in
> /lib/modules/ there is no problem, it will be loaded with
> usermodehelper. But if it is not, we can very well have the layout
> driver inserted after, and this case, while in practice possible, is
> irrelevant from a driver standpoint. It does not make any sense to have
> these cells created "after" because they are mostly used during probes.
> An easy workaround would be to unregister/register again the underlying
> storage device driver.

We really do not support any situation where a module is NOT in the
proper place when device discovery happens.  So this shouldn't be an
issue, yet you all mention it?  So how is it happening?

And if you used the class code, would that work better as mentioned
above?

thanks

greg k-h
  
Miquel Raynal Aug. 1, 2023, 4:54 p.m. UTC | #10
Hi Greg,

gregkh@linuxfoundation.org wrote on Tue, 1 Aug 2023 11:56:40 +0200:

> On Mon, Jul 31, 2023 at 05:33:13PM +0200, Miquel Raynal wrote:
> > Hi Greg,
> > 
> > gregkh@linuxfoundation.org wrote on Mon, 17 Jul 2023 18:59:52 +0200:
> >   
> > > On Mon, Jul 17, 2023 at 06:33:23PM +0200, Miquel Raynal wrote:  
> > > > Hi Greg,
> > > > 
> > > > gregkh@linuxfoundation.org wrote on Mon, 17 Jul 2023 16:32:09 +0200:
> > > >     
> > > > > On Mon, Jul 17, 2023 at 09:51:47AM +0200, Miquel Raynal wrote:    
> > > > > > The binary content of nvmem devices is available to the user so in the
> > > > > > easiest cases, finding the content of a cell is rather easy as it is
> > > > > > just a matter of looking at a known and fixed offset. However, nvmem
> > > > > > layouts have been recently introduced to cope with more advanced
> > > > > > situations, where the offset and size of the cells is not known in
> > > > > > advance or is dynamic. When using layouts, more advanced parsers are
> > > > > > used by the kernel in order to give direct access to the content of each
> > > > > > cell, regardless of its position/size in the underlying
> > > > > > device. Unfortunately, these information are not accessible by users,
> > > > > > unless by fully re-implementing the parser logic in userland.
> > > > > > 
> > > > > > Let's expose the cells and their content through sysfs to avoid these
> > > > > > situations. Of course the relevant NVMEM sysfs Kconfig option must be
> > > > > > enabled for this support to be available.
> > > > > > 
> > > > > > Not all nvmem devices expose cells. Indeed, the .bin_attrs attribute
> > > > > > group member will be filled at runtime only when relevant and will
> > > > > > remain empty otherwise. In this case, as the cells attribute group will
> > > > > > be empty, it will not lead to any additional folder/file creation.
> > > > > > 
> > > > > > Exposed cells are read-only. There is, in practice, everything in the
> > > > > > core to support a write path, but as I don't see any need for that, I
> > > > > > prefer to keep the interface simple (and probably safer). The interface
> > > > > > is documented as being in the "testing" state which means we can later
> > > > > > add a write attribute if though relevant.
> > > > > > 
> > > > > > There is one limitation though: if a layout is built as a module but is
> > > > > > not properly installed in the system and loaded manually with insmod
> > > > > > while the nvmem device driver was built-in, the cells won't appear in
> > > > > > sysfs. But if done like that, the cells won't be usable by the built-in
> > > > > > kernel drivers anyway.      
> > > > > 
> > > > > Wait, what?  That should not be an issue here, if so, then this change
> > > > > is not correct and should be fixed as this is NOT an issue for sysfs
> > > > > (otherwise the whole tree wouldn't work.)
> > > > > 
> > > > > Please fix up your dependancies if this is somehow not working properly.    
> > > > 
> > > > I'm not sure I fully get your point.
> > > > 
> > > > There is no way we can describe any dependency between a storage device
> > > > driver and an nvmem layout. NVMEM is a pure software abstraction, the
> > > > layout that will be chosen depends on the device tree, but if the
> > > > layout has not been installed, there is no existing mechanism in
> > > > the kernel to prevent it from being loaded (how do you know it's
> > > > not on purpose?).    
> > > 
> > > Once a layout has been loaded, the sysfs files should show up, right?
> > > Otherwise what does a "layout" do?  (hint, I have no idea, it's an odd
> > > term to me...)  
> > 
> > Sorry for the latency in responding to these questions, I'll try to
> > clarify the situation.
> > 
> > We have:
> > - device drivers (like NAND flashes, SPI-NOR flashes or EEPROMs) which
> >   typically probe and register their devices into the nvmem
> >   layer to expose their content through NVMEM.
> > - each registration in NVMEM leads to the creation of the relevant
> >   NVMEM cells which can then be used by other device drivers
> >   (typically: a network controller retrieving a MAC address from an
> >   EEPROM through the generic NVMEM abstraction).  
> 
> 
> So is a "cell" here a device in the device model?  Or something else?

It is not a device in the device model, but I am wondering if it should
not be one actually. I discussed with Rafal about another issue in the
current design (dependence over a layout driver which might defer
forever a storage device probe) which might be solved if the core was
handling these layouts differently.

> > We recently covered a slightly new case: the NVMEM cells can be in
> > random places in the storage devices so we need a "dynamic" way to
> > discover them: this is the purpose of the NVMEM layouts. We know cell X
> > is in the device, we just don't know where it is exactly at compile
> > time, the layout driver will discover it dynamically for us at runtime.  
> 
> So you then create the needed device when it is found?

We don't create devices, but we match the layouts with the NVMEM
devices thanks to the of_ logic.

> > While the "static cells" parser is built-in the NVMEM subsystem, you
> > explicitly asked to have the layouts modularized. This means
> > registering a storage device in nvmem while no layout driver has been
> > inserted yet is now a scenario. We cannot describe any dependency
> > between a storage device and a layout driver. We cannot defer the probe
> > either because device drivers which don't get access to their NVMEM
> > cell are responsible of choosing what to do (most of the time, the idea
> > is to fallback to a default value to avoid failing the probe for no
> > reason).
> > 
> > So to answer your original question:
> >   
> > > Once a layout has been loaded, the sysfs files should show up, right?  
> > 
> > No. The layouts are kind of "libraries" that the NVMEM subsystem uses
> > to try exposing cells *when* a new device is registered in NVMEM (not
> > later). The registration of an NVMEM layout does not trigger any new
> > parsing, because that is not how the NVMEM subsystem was designed.  
> 
> So they are a type of "class" right?  Why not just use class devices
> then?
> 
> > I must emphasize that if the layout driver is installed in
> > /lib/modules/ there is no problem, it will be loaded with
> > usermodehelper. But if it is not, we can very well have the layout
> > driver inserted after, and this case, while in practice possible, is
> > irrelevant from a driver standpoint. It does not make any sense to have
> > these cells created "after" because they are mostly used during probes.
> > An easy workaround would be to unregister/register again the underlying
> > storage device driver.  
> 
> We really do not support any situation where a module is NOT in the
> proper place when device discovery happens.

Great, I didn't know. Then there is no issue.

>  So this shouldn't be an
> issue, yet you all mention it?  So how is it happening?

Just transparency, I'm giving all details I can.

I'll try to come with something slightly different than what we have
with the current approach.

Thanks,
Miquèl
  

Patch

diff --git a/drivers/nvmem/core.c b/drivers/nvmem/core.c
index 48659106a1e2..6c04a9cf6919 100644
--- a/drivers/nvmem/core.c
+++ b/drivers/nvmem/core.c
@@ -325,6 +325,43 @@  static umode_t nvmem_bin_attr_is_visible(struct kobject *kobj,
 	return nvmem_bin_attr_get_umode(nvmem);
 }
 
+static struct nvmem_cell *nvmem_create_cell(struct nvmem_cell_entry *entry,
+					    const char *id, int index);
+
+static ssize_t nvmem_cell_attr_read(struct file *filp, struct kobject *kobj,
+				    struct bin_attribute *attr, char *buf,
+				    loff_t pos, size_t count)
+{
+	struct nvmem_cell_entry *entry;
+	struct nvmem_cell *cell = NULL;
+	size_t cell_sz, read_len;
+	void *content;
+
+	entry = attr->private;
+	cell = nvmem_create_cell(entry, entry->name, 0);
+	if (IS_ERR(cell))
+		return PTR_ERR(cell);
+
+	if (!cell)
+		return -EINVAL;
+
+	content = nvmem_cell_read(cell, &cell_sz);
+	if (IS_ERR(content)) {
+		read_len = PTR_ERR(content);
+		goto destroy_cell;
+	}
+
+	read_len = min_t(unsigned int, cell_sz - pos, count);
+	memcpy(buf, content + pos, read_len);
+	kfree(content);
+
+destroy_cell:
+	kfree_const(cell->id);
+	kfree(cell);
+
+	return read_len;
+}
+
 /* default read/write permissions */
 static struct bin_attribute bin_attr_rw_nvmem = {
 	.attr	= {
@@ -346,8 +383,14 @@  static const struct attribute_group nvmem_bin_group = {
 	.is_bin_visible = nvmem_bin_attr_is_visible,
 };
 
+/* Cell attributes will be dynamically allocated */
+static struct attribute_group nvmem_cells_group = {
+	.name		= "cells",
+};
+
 static const struct attribute_group *nvmem_dev_groups[] = {
 	&nvmem_bin_group,
+	&nvmem_cells_group,
 	NULL,
 };
 
@@ -406,6 +449,58 @@  static void nvmem_sysfs_remove_compat(struct nvmem_device *nvmem,
 		device_remove_bin_file(nvmem->base_dev, &nvmem->eeprom);
 }
 
+static int nvmem_populate_sysfs_cells(struct nvmem_device *nvmem)
+{
+	struct bin_attribute **cells_attrs, *attrs;
+	struct nvmem_cell_entry *entry;
+	unsigned int ncells = 0, i = 0;
+	int ret = 0;
+
+	mutex_lock(&nvmem_mutex);
+
+	if (list_empty(&nvmem->cells))
+		goto unlock_mutex;
+
+	/* Allocate an array of attributes with a sentinel */
+	ncells = list_count_nodes(&nvmem->cells);
+	cells_attrs = devm_kcalloc(&nvmem->dev, ncells + 1,
+				   sizeof(struct bin_attribute *), GFP_KERNEL);
+	if (!cells_attrs) {
+		ret = -ENOMEM;
+		goto unlock_mutex;
+	}
+
+	attrs = devm_kcalloc(&nvmem->dev, ncells, sizeof(struct bin_attribute), GFP_KERNEL);
+	if (!attrs) {
+		ret = -ENOMEM;
+		goto unlock_mutex;
+	}
+
+	/* Initialize each attribute to take the name and size of the cell */
+	list_for_each_entry(entry, &nvmem->cells, node) {
+		sysfs_bin_attr_init(&attrs[i]);
+		attrs[i].attr.name = devm_kstrdup(&nvmem->dev, entry->name, GFP_KERNEL);
+		attrs[i].attr.mode = 0444;
+		attrs[i].size = entry->bytes;
+		attrs[i].read = &nvmem_cell_attr_read;
+		attrs[i].private = entry;
+		if (!attrs[i].attr.name) {
+			ret = -ENOMEM;
+			goto unlock_mutex;
+		}
+
+		cells_attrs[i] = &attrs[i];
+		i++;
+	}
+
+	nvmem_cells_group.bin_attrs = cells_attrs;
+
+unlock_mutex:
+	mutex_unlock(&nvmem_mutex);
+
+	return ret;
+}
+
 #else /* CONFIG_NVMEM_SYSFS */
 
 static int nvmem_sysfs_setup_compat(struct nvmem_device *nvmem,
@@ -1006,6 +1101,12 @@  struct nvmem_device *nvmem_register(const struct nvmem_config *config)
 	if (rval)
 		goto err_remove_cells;
 
+#ifdef CONFIG_NVMEM_SYSFS
+	rval = nvmem_populate_sysfs_cells(nvmem);
+	if (rval)
+		goto err_remove_cells;
+#endif
+
 	dev_dbg(&nvmem->dev, "Registering nvmem device %s\n", config->name);
 
 	rval = device_add(&nvmem->dev);