[0/3] mm/arch: Fix a few collide definition on private use of VM_FAULT_*

Message ID 20230205231704.909536-1-peterx@redhat.com

Message

Peter Xu Feb. 5, 2023, 11:17 p.m. UTC
I noticed a few colliding uses of VM_FAULT_* definitions in the page fault
path on arm/arm64/s390, where the arch-private VM_FAULT_* values can overlap
with the generic definitions in vm_fault_reason.

The main overlap is with VM_FAULT_HINDEX_MASK, which is used only by
hugetlb hwpoisoning.

I'm not sure whether any of them can have a real impact, but the overlap
does not look intentional.  I didn't copy stable; if anyone thinks I should,
please say so.  Nor did I test the patches in any form - I just moved the
allocations to the top bits and added a comment for each of them.

Please have a look, thanks.

Peter Xu (3):
  mm/arm: Define private VM_FAULT_* reasons from top bits
  mm/arm64: Define private VM_FAULT_* reasons from top bits
  mm/s390: Define private VM_FAULT_* reasons from top bits

 arch/arm/mm/fault.c   |  8 ++++++--
 arch/arm64/mm/fault.c |  8 ++++++--
 arch/s390/mm/fault.c  | 14 +++++++++-----
 3 files changed, 21 insertions(+), 9 deletions(-)
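
For illustration, the collision described in the cover letter can be reproduced in a small user-space sketch. Everything prefixed with SKETCH_ or sketch_ is hypothetical: the constants are modelled on the generic hugetlb index field and on an arch-private code, and their numeric values are assumptions rather than quotes from the patches.

```c
#include <assert.h>
#include <stdint.h>

/* Assumed stand-ins for generic fault bits; values are illustrative only. */
#define SKETCH_HWPOISON_LARGE  UINT32_C(0x00000020)
#define SKETCH_HINDEX_MASK     UINT32_C(0x000f0000)

/* Encode/decode a hugetlb hstate index in bits 16..19 of the result. */
#define SKETCH_SET_HINDEX(x)   ((uint32_t)(x) << 16)
#define SKETCH_GET_HINDEX(x)   (((uint32_t)(x) >> 16) & 0xf)

/* Hypothetical arch-private code taken from the next "free" low bit. */
#define SKETCH_BADMAP_OLD      UINT32_C(0x00010000)
/* The same code allocated from the top bits instead. */
#define SKETCH_BADMAP_NEW      UINT32_C(0x40000000)

/* Compile-time check: the top-bit code stays clear of the index field. */
static_assert((SKETCH_BADMAP_NEW & SKETCH_HINDEX_MASK) == 0,
              "private code must not overlap the hstate index field");

/* Returns nonzero if a private code shares bits with the index field. */
static int sketch_collides(uint32_t private_code)
{
    return (private_code & SKETCH_HINDEX_MASK) != 0;
}

/* Builds a hwpoison-large result that carries hstate index idx. */
static uint32_t sketch_hwpoison_large(unsigned int idx)
{
    return SKETCH_HWPOISON_LARGE | SKETCH_SET_HINDEX(idx);
}
```

With these assumed values, a result built by sketch_hwpoison_large(1) has the same bit set as SKETCH_BADMAP_OLD, so a handler testing for the private code would misread it. SKETCH_BADMAP_NEW cannot be confused with any index value.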
  

Comments

Matthew Wilcox Feb. 6, 2023, 12:10 a.m. UTC | #1
On Sun, Feb 05, 2023 at 06:17:01PM -0500, Peter Xu wrote:
> I noticed a few colliding uses of VM_FAULT_* definitions in the page fault
> path on arm/arm64/s390, where the arch-private VM_FAULT_* values can overlap
> with the generic definitions in vm_fault_reason.
> 
> The main overlap is with VM_FAULT_HINDEX_MASK, which is used only by
> hugetlb hwpoisoning.
> 
> I'm not sure whether any of them can have a real impact, but the overlap
> does not look intentional.  I didn't copy stable; if anyone thinks I should,
> please say so.  Nor did I test the patches in any form - I just moved the
> allocations to the top bits and added a comment for each of them.

This seems like a bad way to do it.  Why not just put these VM_FAULT_*
definitions in linux/mm_types.h?  Then we'll see them when adding new
VM_FAULT codes.  Sure, they won't be used by every architecture, but
so what?
  
Peter Xu Feb. 6, 2023, 12:54 a.m. UTC | #2
On Mon, Feb 06, 2023 at 12:10:53AM +0000, Matthew Wilcox wrote:
> This seems like a bad way to do it.  Why not just put these VM_FAULT_*
> definitions in linux/mm_types.h?  Then we'll see them when adding new
> VM_FAULT codes.  Sure, they won't be used by every architecture, but
> so what?

My initial version actually contained a few VM_FAULT_PRIVATE_N entries there,
but I noticed that only a minority of archs use them, and s390 in particular
takes 5 entries.  I couldn't make up my mind on which way is best, so I
removed them and posted this simpler version, with a comment on each
definition to fix the issue, more in the sense of raising the problem first.

I agree it isn't a problem at all, at least not until all 32 bits are used
up.  But that seems to slightly encourage more archs to use the new private
entries, which I wanted to avoid.

Taking a closer look, we may not really need that many private entries.
With s390, what I read is:

  - VM_FAULT_BADMAP could be replaced directly with VM_FAULT_SIGSEGV?
  - VM_FAULT_PFAULT could be replaced directly with VM_FAULT_BADCONTEXT?

Then, if I'm not wrong, we can already reduce the private entries from 5 to 3.

I didn't make that change directly because I am not 100% confident and I
can't test it myself.  It would be great if arch people could have a look at
s390 and arm to see whether there's a chance for simplification first.  So
the patchset is more about raising the collision issue first, and hopefully
attracting the attention of arch people to refactor on top of it.

I can also try to reduce the private entries and introduce PRIVATE entries
accordingly as you suggested, but I'll need more help with review and
testing than for this series.

Thanks,
  
Matthew Wilcox Feb. 6, 2023, 2:51 a.m. UTC | #3
On Sun, Feb 05, 2023 at 07:54:35PM -0500, Peter Xu wrote:
> My initial version actually contains a few VM_FAULT_PRIVATE_N there, but I

That wasn't what I meant.  I meant putting VM_FAULT_BADMAP and
VM_FAULT_SIGSEGV in mm_types.h.  Not having "Here is a range of reserved
arch private ones".
  
Peter Xu Feb. 6, 2023, 3:18 a.m. UTC | #4
On Mon, Feb 06, 2023 at 02:51:18AM +0000, Matthew Wilcox wrote:
> That wasn't what I meant.  I meant putting VM_FAULT_BADMAP and
> VM_FAULT_SIGSEGV in mm_types.h.  Not having "Here is a range of reserved
> arch private ones".

VM_FAULT_SIGSEGV is there already; I assume you meant adding them all
directly into vm_fault_reason.

In that case I don't think it's a good idea.

Currently vm_fault_reason is a clear interface to handle_mm_fault(), used
not only by arch page fault handlers but also by soft faults like GUP.

If handle_mm_fault() doesn't return VM_FAULT_BADMAP at all, I don't think
we should have it as public API at all.  When arch1 people read the
VM_FAULT_* documentation, they shouldn't need to care about a fault reason
that only happens on arch2.  GUP shouldn't care about it either.

Logically, a new page fault handler should handle all the return values in
vm_fault_reason, afaiu.  That shouldn't include e.g. VM_FAULT_BADMAP either.

Thanks,
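
The separation argued for above can be sketched in user space roughly as follows. All names prefixed with sketch_ or SKETCH_ are hypothetical, the numeric values are assumptions, and the stand-in functions only model which layer is allowed to produce which bits. They are not the kernel's handle_mm_fault() or any arch's handler.

```c
#include <assert.h>
#include <stdint.h>

typedef uint32_t sketch_fault_t;

/* Assumed generic result; the generic handler returns only such codes. */
#define SKETCH_FAULT_OOM      UINT32_C(0x00000001)
/* Hypothetical mask of the bits set aside for generic codes. */
#define SKETCH_FAULT_GENERIC  UINT32_C(0x00ffffff)
/* Arch-private result, meaningful only inside the arch fault handler. */
#define SKETCH_FAULT_BADMAP   UINT32_C(0x40000000)

static_assert((SKETCH_FAULT_BADMAP & SKETCH_FAULT_GENERIC) == 0,
              "private code must live outside the generic range");

/* Stand-in for the generic handler: produces generic codes only. */
static sketch_fault_t sketch_handle_mm_fault(int simulate_oom)
{
    return simulate_oom ? SKETCH_FAULT_OOM : 0;
}

/* Stand-in for an arch handler: BADMAP is decided here, before the
 * generic handler is ever reached. */
static sketch_fault_t sketch_arch_fault(int vma_found, int simulate_oom)
{
    if (!vma_found)
        return SKETCH_FAULT_BADMAP;
    return sketch_handle_mm_fault(simulate_oom);
}

/* Stand-in for a GUP-like caller: it talks to the generic handler
 * directly, so it never needs to know about private bits. */
static int sketch_gup_like(int simulate_oom)
{
    sketch_fault_t ret = sketch_handle_mm_fault(simulate_oom);

    assert((ret & ~SKETCH_FAULT_GENERIC) == 0);
    return ret ? -1 : 0;
}
```

In this model only sketch_arch_fault() can ever observe SKETCH_FAULT_BADMAP, which is the reason given above for keeping such codes out of the shared definition.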
  
Matthew Wilcox Feb. 6, 2023, 5:09 a.m. UTC | #5
On Sun, Feb 05, 2023 at 10:18:30PM -0500, Peter Xu wrote:
> On Mon, Feb 06, 2023 at 02:51:18AM +0000, Matthew Wilcox wrote:
> > That wasn't what I meant.  I meant putting VM_FAULT_BADMAP and
> > VM_FAULT_SIGSEGV in mm_types.h.  Not having "Here is a range of reserved
> > arch private ones".
> 
> VM_FAULT_SIGSEGV is there already; I assume you meant adding them all
> directly into vm_fault_reason.
> 
> In that case I don't think it's a good idea.
> 
> Currently vm_fault_reason is a clear interface to handle_mm_fault(), used
> not only by arch page fault handlers but also by soft faults like GUP.
> 
> If handle_mm_fault() doesn't return VM_FAULT_BADMAP at all, I don't think
> we should have it as public API at all.  When arch1 people read the
> VM_FAULT_* documentation, they shouldn't need to care about a fault reason
> that only happens on arch2.  GUP shouldn't care about it either.
> 
> Logically, a new page fault handler should handle all the return values in
> vm_fault_reason, afaiu.  That shouldn't include e.g. VM_FAULT_BADMAP either.

Hmm, right.  Looking specifically at how s390 uses VM_FAULT_BADMAP,
it just seems to be a badly structured fault.c.  Seems to me that
do_fault_error() should take an extra si_code argument, and
instead of returning VM_FAULT_BADACCESS / VM_FAULT_BADMAP from
various functions, those functions should call do_fault_error()
directly, passing it VM_FAULT_SIGSEGV and the appropriate si_code.

But this is all on the s390 people to fix; I don't want to break their
arch by trying it myself.
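
A rough user-space sketch of the restructuring suggested above might look like this. It is not s390 code: every sketch_ or SKETCH_ name is hypothetical, the si_code constants merely stand in for SEGV_MAPERR and SEGV_ACCERR, and the mapping of "no mapping" and "bad access" onto them is an assumption made for illustration.

```c
#include <assert.h>
#include <stdint.h>

typedef uint32_t sketch_fault_t;

/* Assumed stand-in for the generic SIGSEGV fault code. */
#define SKETCH_FAULT_SIGSEGV  UINT32_C(0x00000040)

/* Stand-ins for the SEGV_MAPERR / SEGV_ACCERR si_code values. */
enum sketch_si_code {
    SKETCH_SEGV_NONE   = 0,
    SKETCH_SEGV_MAPERR = 1,
    SKETCH_SEGV_ACCERR = 2,
};

/* Records what the error path would deliver; a real handler would
 * raise the signal instead of returning a struct. */
struct sketch_signal {
    int sent;
    int si_code;
};

/* Error path taking the si_code as an extra argument. */
static struct sketch_signal sketch_do_fault_error(sketch_fault_t fault,
                                                  int si_code)
{
    struct sketch_signal sig = { 0, SKETCH_SEGV_NONE };

    if (fault & SKETCH_FAULT_SIGSEGV) {
        sig.sent = 1;
        sig.si_code = si_code;
    }
    return sig;
}

/* Fault path calling the error path directly, so no private
 * BADMAP/BADACCESS return codes are needed. */
static struct sketch_signal sketch_fault_path(int vma_found, int access_ok)
{
    if (!vma_found)
        return sketch_do_fault_error(SKETCH_FAULT_SIGSEGV,
                                     SKETCH_SEGV_MAPERR);
    if (!access_ok)
        return sketch_do_fault_error(SKETCH_FAULT_SIGSEGV,
                                     SKETCH_SEGV_ACCERR);
    return sketch_do_fault_error(0, SKETCH_SEGV_NONE);
}
```

The point of the shape is that the distinction between the two failure kinds travels as an si_code argument rather than as extra bits in the fault return value, so nothing private has to be encoded alongside the generic codes.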
  
Heiko Carstens Feb. 9, 2023, 8:04 p.m. UTC | #6
On Mon, Feb 06, 2023 at 05:09:57AM +0000, Matthew Wilcox wrote:
> Hmm, right.  Looking specifically at how s390 uses VM_FAULT_BADMAP,
> it just seems to be a badly structured fault.c.  Seems to me that
> do_fault_error() should take an extra si_code argument, and
> instead of returning VM_FAULT_BADACCESS / VM_FAULT_BADMAP from
> various functions, those functions should call do_fault_error()
> directly, passing it VM_FAULT_SIGSEGV and the appropriate si_code.
> 
> But this is all on the s390 people to fix; I don't want to break their
> arch by trying it myself.

Yes, will take a look at it. For now I will apply Peter's patch in order to
get rid of the collision.