[PATCH6/8] omp: Reorder call for TARGET_SIMD_CLONE_ADJUST (was Re: [PATCH7/8] vect: Add TARGET_SIMD_CLONE_ADJUST_RET_OR_PARAM)
Checks
Commit Message
This patch moves the call to TARGET_SIMD_CLONE_ADJUST until after the
arguments and return types have been transformed into vector types. It
also constructs the adjuments and retval modifications after this call,
allowing targets to alter the types of the arguments and return of the
clone prior to the modifications to the function definition.
Is this OK?
gcc/ChangeLog:
* omp-simd-clone.cc (simd_clone_adjust_return_type): Hoist out
code to create return array and don't return new type.
(simd_clone_adjust_argument_types): Hoist out code that creates
ipa_param_body_adjustments and don't return them.
(simd_clone_adjust): Call TARGET_SIMD_CLONE_ADJUST after return
and argument types have been vectorized, create adjustments and
return array after the hook.
(expand_simd_clones): Call TARGET_SIMD_CLONE_ADJUST after return
and argument types have been vectorized.
On 04/10/2023 13:40, Andre Vieira (lists) wrote:
>
>
> On 04/10/2023 11:41, Richard Biener wrote:
>> On Wed, 4 Oct 2023, Andre Vieira (lists) wrote:
>>
>>>
>>>
>>> On 30/08/2023 14:04, Richard Biener wrote:
>>>> On Wed, 30 Aug 2023, Andre Vieira (lists) wrote:
>>>>
>>>>> This patch adds a new target hook to enable us to adapt the types
>>>>> of return
>>>>> and parameters of simd clones. We use this in two ways, the first
>>>>> one is
>>>>> to
>>>>> make sure we can create valid SVE types, including the SVE type
>>>>> attribute,
>>>>> when creating a SVE simd clone, even when the target options do not
>>>>> support
>>>>> SVE. We are following the same behaviour seen with x86 that
>>>>> creates simd
>>>>> clones according to the ABI rules when no simdlen is provided, even
>>>>> if that
>>>>> simdlen is not supported by the current target options. Note that
>>>>> this
>>>>> doesn't mean the simd clone will be used in auto-vectorization.
>>>>
>>>> You are not documenting the bool parameter of the new hook.
>>>>
>>>> What's wrong with doing the adjustment in TARGET_SIMD_CLONE_ADJUST?
>>>
>>> simd_clone_adjust_argument_types is called after that hook, so by the
>>> time we
>>> call TARGET_SIMD_CLONE_ADJUST the types are still in scalar, not
>>> vector. The
>>> same is true for the return type one.
>>>
>>> Also the changes to the types need to be taken into consideration in
>>> 'adjustments' I think.
>>
>> Nothing in the three existing implementations of TARGET_SIMD_CLONE_ADJUST
>> relies on this ordering I think, how about moving the hook invocation
>> after simd_clone_adjust_argument_types?
>>
>
> But that wouldn't change the 'ipa_param_body_adjustments' for when we
> have a function definition and we need to redo the body.
>> Richard.
>>
>>> PS: I hope the subject line survived, my email client is having a bit
>>> of a
>>> wobble this morning... it's what you get for updating software :(
Comments
Hi Richi,
Friendly ping on this. I'm going away for two weeks end of this week, so
I won't be here for end of stage-1, but I'd still very much like to get
this done for GCC 14.
I don't know if you had a chance to look at this yet when you reviewed
the other patches or if you maybe just missed it? A quick td;lr this
moves around the TARGET_SIMD_CLONE_ADJUST call after we've vectorized
the types in simdclones to avoid having to add the extra target hooks to
change the types. This required some moving around of the code that
constructed the adjustments and the code that constructed the array for
the return value.
Kind regards,
Andre
On 18/10/2023 15:41, Andre Vieira (lists) wrote:
> This patch moves the call to TARGET_SIMD_CLONE_ADJUST until after the
> arguments and return types have been transformed into vector types. It
> also constructs the adjuments and retval modifications after this call,
> allowing targets to alter the types of the arguments and return of the
> clone prior to the modifications to the function definition.
>
> Is this OK?
>
> gcc/ChangeLog:
>
> * omp-simd-clone.cc (simd_clone_adjust_return_type): Hoist out
> code to create return array and don't return new type.
> (simd_clone_adjust_argument_types): Hoist out code that creates
> ipa_param_body_adjustments and don't return them.
> (simd_clone_adjust): Call TARGET_SIMD_CLONE_ADJUST after return
> and argument types have been vectorized, create adjustments and
> return array after the hook.
> (expand_simd_clones): Call TARGET_SIMD_CLONE_ADJUST after return
> and argument types have been vectorized.
>
> On 04/10/2023 13:40, Andre Vieira (lists) wrote:
>>
>>
>> On 04/10/2023 11:41, Richard Biener wrote:
>>> On Wed, 4 Oct 2023, Andre Vieira (lists) wrote:
>>>
>>>>
>>>>
>>>> On 30/08/2023 14:04, Richard Biener wrote:
>>>>> On Wed, 30 Aug 2023, Andre Vieira (lists) wrote:
>>>>>
>>>>>> This patch adds a new target hook to enable us to adapt the types
>>>>>> of return
>>>>>> and parameters of simd clones. We use this in two ways, the first
>>>>>> one is
>>>>>> to
>>>>>> make sure we can create valid SVE types, including the SVE type
>>>>>> attribute,
>>>>>> when creating a SVE simd clone, even when the target options do
>>>>>> not support
>>>>>> SVE. We are following the same behaviour seen with x86 that
>>>>>> creates simd
>>>>>> clones according to the ABI rules when no simdlen is provided,
>>>>>> even if that
>>>>>> simdlen is not supported by the current target options. Note that
>>>>>> this
>>>>>> doesn't mean the simd clone will be used in auto-vectorization.
>>>>>
>>>>> You are not documenting the bool parameter of the new hook.
>>>>>
>>>>> What's wrong with doing the adjustment in TARGET_SIMD_CLONE_ADJUST?
>>>>
>>>> simd_clone_adjust_argument_types is called after that hook, so by
>>>> the time we
>>>> call TARGET_SIMD_CLONE_ADJUST the types are still in scalar, not
>>>> vector. The
>>>> same is true for the return type one.
>>>>
>>>> Also the changes to the types need to be taken into consideration in
>>>> 'adjustments' I think.
>>>
>>> Nothing in the three existing implementations of
>>> TARGET_SIMD_CLONE_ADJUST
>>> relies on this ordering I think, how about moving the hook invocation
>>> after simd_clone_adjust_argument_types?
>>>
>>
>> But that wouldn't change the 'ipa_param_body_adjustments' for when we
>> have a function definition and we need to redo the body.
>>> Richard.
>>>
>>>> PS: I hope the subject line survived, my email client is having a
>>>> bit of a
>>>> wobble this morning... it's what you get for updating software :(
On Wed, 18 Oct 2023, Andre Vieira (lists) wrote:
> This patch moves the call to TARGET_SIMD_CLONE_ADJUST until after the
> arguments and return types have been transformed into vector types. It also
> constructs the adjuments and retval modifications after this call, allowing
> targets to alter the types of the arguments and return of the clone prior to
> the modifications to the function definition.
>
> Is this OK?
OK (I was hoping for Jakub to have a look).
Thanks,
Richard.
> gcc/ChangeLog:
>
> * omp-simd-clone.cc (simd_clone_adjust_return_type): Hoist out
> code to create return array and don't return new type.
> (simd_clone_adjust_argument_types): Hoist out code that creates
> ipa_param_body_adjustments and don't return them.
> (simd_clone_adjust): Call TARGET_SIMD_CLONE_ADJUST after return
> and argument types have been vectorized, create adjustments and
> return array after the hook.
> (expand_simd_clones): Call TARGET_SIMD_CLONE_ADJUST after return
> and argument types have been vectorized.
>
> On 04/10/2023 13:40, Andre Vieira (lists) wrote:
> >
> >
> > On 04/10/2023 11:41, Richard Biener wrote:
> >> On Wed, 4 Oct 2023, Andre Vieira (lists) wrote:
> >>
> >>>
> >>>
> >>> On 30/08/2023 14:04, Richard Biener wrote:
> >>>> On Wed, 30 Aug 2023, Andre Vieira (lists) wrote:
> >>>>
> >>>>> This patch adds a new target hook to enable us to adapt the types of
> >>>>> return
> >>>>> and parameters of simd clones. We use this in two ways, the first one
> >>>>> is
> >>>>> to
> >>>>> make sure we can create valid SVE types, including the SVE type
> >>>>> attribute,
> >>>>> when creating a SVE simd clone, even when the target options do not
> >>>>> support
> >>>>> SVE. We are following the same behaviour seen with x86 that creates
> >>>>> simd
> >>>>> clones according to the ABI rules when no simdlen is provided, even if
> >>>>> that
> >>>>> simdlen is not supported by the current target options. Note that this
> >>>>> doesn't mean the simd clone will be used in auto-vectorization.
> >>>>
> >>>> You are not documenting the bool parameter of the new hook.
> >>>>
> >>>> What's wrong with doing the adjustment in TARGET_SIMD_CLONE_ADJUST?
> >>>
> >>> simd_clone_adjust_argument_types is called after that hook, so by the time
> >>> we
> >>> call TARGET_SIMD_CLONE_ADJUST the types are still in scalar, not vector.
> >>> The
> >>> same is true for the return type one.
> >>>
> >>> Also the changes to the types need to be taken into consideration in
> >>> 'adjustments' I think.
> >>
> >> Nothing in the three existing implementations of TARGET_SIMD_CLONE_ADJUST
> >> relies on this ordering I think, how about moving the hook invocation
> >> after simd_clone_adjust_argument_types?
> >>
> >
> > But that wouldn't change the 'ipa_param_body_adjustments' for when we have a
> > function definition and we need to redo the body.
> >> Richard.
> >>
> >>> PS: I hope the subject line survived, my email client is having a bit of a
> >>> wobble this morning... it's what you get for updating software :(
>
On Tue, Oct 31, 2023 at 07:59:25AM +0000, Richard Biener wrote:
> On Wed, 18 Oct 2023, Andre Vieira (lists) wrote:
>
> > This patch moves the call to TARGET_SIMD_CLONE_ADJUST until after the
> > arguments and return types have been transformed into vector types. It also
> > constructs the adjuments and retval modifications after this call, allowing
> > targets to alter the types of the arguments and return of the clone prior to
> > the modifications to the function definition.
> >
> > Is this OK?
>
> OK (I was hoping for Jakub to have a look).
Sorry for the delay, no objections from me there.
Jakub
@@ -701,10 +701,9 @@ simd_clone_create (struct cgraph_node *old_node, bool force_local)
}
/* Adjust the return type of the given function to its appropriate
- vector counterpart. Returns a simd array to be used throughout the
- function as a return value. */
+ vector counterpart. */
-static tree
+static void
simd_clone_adjust_return_type (struct cgraph_node *node)
{
tree fndecl = node->decl;
@@ -714,7 +713,7 @@ simd_clone_adjust_return_type (struct cgraph_node *node)
/* Adjust the function return type. */
if (orig_rettype == void_type_node)
- return NULL_TREE;
+ return;
t = TREE_TYPE (TREE_TYPE (fndecl));
if (INTEGRAL_TYPE_P (t) || POINTER_TYPE_P (t))
veclen = node->simdclone->vecsize_int;
@@ -737,24 +736,6 @@ simd_clone_adjust_return_type (struct cgraph_node *node)
veclen));
}
TREE_TYPE (TREE_TYPE (fndecl)) = t;
- if (!node->definition)
- return NULL_TREE;
-
- t = DECL_RESULT (fndecl);
- /* Adjust the DECL_RESULT. */
- gcc_assert (TREE_TYPE (t) != void_type_node);
- TREE_TYPE (t) = TREE_TYPE (TREE_TYPE (fndecl));
- relayout_decl (t);
-
- tree atype = build_array_type_nelts (orig_rettype,
- node->simdclone->simdlen);
- if (maybe_ne (veclen, node->simdclone->simdlen))
- return build1 (VIEW_CONVERT_EXPR, atype, t);
-
- /* Set up a SIMD array to use as the return value. */
- tree retval = create_tmp_var_raw (atype, "retval");
- gimple_add_tmp_var (retval);
- return retval;
}
/* Each vector argument has a corresponding array to be used locally
@@ -788,7 +769,7 @@ create_tmp_simd_array (const char *prefix, tree type, poly_uint64 simdlen)
declarations will be remapped. New arguments which are not to be remapped
are marked with USER_FLAG. */
-static ipa_param_body_adjustments *
+static void
simd_clone_adjust_argument_types (struct cgraph_node *node)
{
auto_vec<tree> args;
@@ -798,15 +779,11 @@ simd_clone_adjust_argument_types (struct cgraph_node *node)
else
simd_clone_vector_of_formal_parm_types (&args, node->decl);
struct cgraph_simd_clone *sc = node->simdclone;
- vec<ipa_adjusted_param, va_gc> *new_params = NULL;
- vec_safe_reserve (new_params, sc->nargs);
- unsigned i, j, k;
+ unsigned i, k;
poly_uint64 veclen;
for (i = 0; i < sc->nargs; ++i)
{
- ipa_adjusted_param adj;
- memset (&adj, 0, sizeof (adj));
tree parm = NULL_TREE;
tree parm_type = NULL_TREE;
if(i < args.length())
@@ -815,17 +792,12 @@ simd_clone_adjust_argument_types (struct cgraph_node *node)
parm_type = node->definition ? TREE_TYPE (parm) : parm;
}
- adj.base_index = i;
- adj.prev_clone_index = i;
-
sc->args[i].orig_arg = node->definition ? parm : NULL_TREE;
sc->args[i].orig_type = parm_type;
switch (sc->args[i].arg_type)
{
default:
- /* No adjustment necessary for scalar arguments. */
- adj.op = IPA_PARAM_OP_COPY;
break;
case SIMD_CLONE_ARG_TYPE_LINEAR_UVAL_CONSTANT_STEP:
case SIMD_CLONE_ARG_TYPE_LINEAR_UVAL_VARIABLE_STEP:
@@ -834,7 +806,6 @@ simd_clone_adjust_argument_types (struct cgraph_node *node)
= create_tmp_simd_array (IDENTIFIER_POINTER (DECL_NAME (parm)),
TREE_TYPE (parm_type),
sc->simdlen);
- adj.op = IPA_PARAM_OP_COPY;
break;
case SIMD_CLONE_ARG_TYPE_LINEAR_VAL_CONSTANT_STEP:
case SIMD_CLONE_ARG_TYPE_LINEAR_VAL_VARIABLE_STEP:
@@ -851,28 +822,12 @@ simd_clone_adjust_argument_types (struct cgraph_node *node)
GET_MODE_BITSIZE (SCALAR_TYPE_MODE (parm_type)));
if (multiple_p (veclen, sc->simdlen))
veclen = sc->simdlen;
- adj.op = IPA_PARAM_OP_NEW;
- adj.param_prefix_index = IPA_PARAM_PREFIX_SIMD;
+ tree vtype;
if (POINTER_TYPE_P (parm_type))
- adj.type = build_vector_type (pointer_sized_int_node, veclen);
+ vtype = build_vector_type (pointer_sized_int_node, veclen);
else
- adj.type = build_vector_type (parm_type, veclen);
- sc->args[i].vector_type = adj.type;
- k = vector_unroll_factor (sc->simdlen, veclen);
- for (j = 1; j < k; j++)
- {
- vec_safe_push (new_params, adj);
- if (j == 1)
- {
- memset (&adj, 0, sizeof (adj));
- adj.op = IPA_PARAM_OP_NEW;
- adj.user_flag = 1;
- adj.param_prefix_index = IPA_PARAM_PREFIX_SIMD;
- adj.base_index = i;
- adj.prev_clone_index = i;
- adj.type = sc->args[i].vector_type;
- }
- }
+ vtype = build_vector_type (parm_type, veclen);
+ sc->args[i].vector_type = vtype;
if (node->definition)
sc->args[i].simd_array
@@ -880,20 +835,12 @@ simd_clone_adjust_argument_types (struct cgraph_node *node)
? IDENTIFIER_POINTER (DECL_NAME (parm))
: NULL, parm_type, sc->simdlen);
}
- vec_safe_push (new_params, adj);
}
if (sc->inbranch)
{
tree base_type = simd_clone_compute_base_data_type (sc->origin, sc);
- ipa_adjusted_param adj;
- memset (&adj, 0, sizeof (adj));
- adj.op = IPA_PARAM_OP_NEW;
- adj.user_flag = 1;
- adj.param_prefix_index = IPA_PARAM_PREFIX_MASK;
-
- adj.base_index = i;
- adj.prev_clone_index = i;
+ tree mask_type;
if (INTEGRAL_TYPE_P (base_type) || POINTER_TYPE_P (base_type))
veclen = sc->vecsize_int;
else
@@ -906,17 +853,14 @@ simd_clone_adjust_argument_types (struct cgraph_node *node)
if (multiple_p (veclen, sc->simdlen))
veclen = sc->simdlen;
if (sc->mask_mode != VOIDmode)
- adj.type
+ mask_type
= lang_hooks.types.type_for_mode (sc->mask_mode, 1);
else if (POINTER_TYPE_P (base_type))
- adj.type = build_vector_type (pointer_sized_int_node, veclen);
+ mask_type = build_vector_type (pointer_sized_int_node, veclen);
else
- adj.type = build_vector_type (base_type, veclen);
- vec_safe_push (new_params, adj);
+ mask_type = build_vector_type (base_type, veclen);
k = vector_unroll_factor (sc->simdlen, veclen);
- for (j = 1; j < k; j++)
- vec_safe_push (new_params, adj);
/* We have previously allocated one extra entry for the mask. Use
it and fill it. */
@@ -932,24 +876,16 @@ simd_clone_adjust_argument_types (struct cgraph_node *node)
= create_tmp_simd_array ("mask", base_type, sc->simdlen);
else if (k > 1)
sc->args[i].simd_array
- = create_tmp_simd_array ("mask", adj.type, k);
+ = create_tmp_simd_array ("mask", mask_type, k);
else
sc->args[i].simd_array = NULL_TREE;
}
sc->args[i].orig_type = base_type;
sc->args[i].arg_type = SIMD_CLONE_ARG_TYPE_MASK;
- sc->args[i].vector_type = adj.type;
+ sc->args[i].vector_type = mask_type;
}
- if (node->definition)
- {
- ipa_param_body_adjustments *adjustments
- = new ipa_param_body_adjustments (new_params, node->decl);
-
- adjustments->modify_formal_parameters ();
- return adjustments;
- }
- else
+ if (!node->definition)
{
tree new_arg_types = NULL_TREE, new_reversed;
bool last_parm_void = false;
@@ -957,15 +893,20 @@ simd_clone_adjust_argument_types (struct cgraph_node *node)
last_parm_void = true;
gcc_assert (TYPE_ARG_TYPES (TREE_TYPE (node->decl)));
- j = vec_safe_length (new_params);
- for (i = 0; i < j; i++)
+ for (i = 0; i < sc->nargs; i++)
{
- struct ipa_adjusted_param *adj = &(*new_params)[i];
tree ptype;
- if (adj->op == IPA_PARAM_OP_COPY)
- ptype = args[adj->base_index];
- else
- ptype = adj->type;
+ switch (sc->args[i].arg_type)
+ {
+ default:
+ ptype = sc->args[i].orig_type;
+ break;
+ case SIMD_CLONE_ARG_TYPE_LINEAR_VAL_CONSTANT_STEP:
+ case SIMD_CLONE_ARG_TYPE_LINEAR_VAL_VARIABLE_STEP:
+ case SIMD_CLONE_ARG_TYPE_VECTOR:
+ ptype = sc->args[i].vector_type;
+ break;
+ }
new_arg_types = tree_cons (NULL_TREE, ptype, new_arg_types);
}
new_reversed = nreverse (new_arg_types);
@@ -977,7 +918,6 @@ simd_clone_adjust_argument_types (struct cgraph_node *node)
new_reversed = void_list_node;
}
TYPE_ARG_TYPES (TREE_TYPE (node->decl)) = new_reversed;
- return NULL;
}
}
@@ -996,7 +936,8 @@ simd_clone_init_simd_arrays (struct cgraph_node *node,
arg;
arg = DECL_CHAIN (arg), i++, j++)
{
- if ((*adjustments->m_adj_params)[j].op == IPA_PARAM_OP_COPY
+ ipa_adjusted_param adj = (*adjustments->m_adj_params)[j];
+ if (adj.op == IPA_PARAM_OP_COPY
|| POINTER_TYPE_P (TREE_TYPE (arg)))
continue;
@@ -1004,7 +945,7 @@ simd_clone_init_simd_arrays (struct cgraph_node *node,
tree array = node->simdclone->args[i].simd_array;
if (node->simdclone->mask_mode != VOIDmode
- && node->simdclone->args[i].arg_type == SIMD_CLONE_ARG_TYPE_MASK)
+ && adj.param_prefix_index == IPA_PARAM_PREFIX_MASK)
{
if (array == NULL_TREE)
continue;
@@ -1024,8 +965,9 @@ simd_clone_init_simd_arrays (struct cgraph_node *node,
}
continue;
}
- if (known_eq (TYPE_VECTOR_SUBPARTS (TREE_TYPE (arg)),
- node->simdclone->simdlen))
+ if (!VECTOR_TYPE_P (TREE_TYPE (arg))
+ || known_eq (TYPE_VECTOR_SUBPARTS (TREE_TYPE (arg)),
+ node->simdclone->simdlen))
{
tree ptype = build_pointer_type (TREE_TYPE (TREE_TYPE (array)));
tree ptr = build_fold_addr_expr (array);
@@ -1423,13 +1365,120 @@ simd_clone_adjust (struct cgraph_node *node)
{
push_cfun (DECL_STRUCT_FUNCTION (node->decl));
+ tree orig_rettype = TREE_TYPE (TREE_TYPE (node->decl));
TREE_TYPE (node->decl) = build_distinct_type_copy (TREE_TYPE (node->decl));
+ simd_clone_adjust_return_type (node);
+ simd_clone_adjust_argument_types (node);
targetm.simd_clone.adjust (node);
+ tree retval = NULL_TREE;
+ if (orig_rettype != void_type_node)
+ {
+ poly_uint64 veclen;
+ if (INTEGRAL_TYPE_P (orig_rettype) || POINTER_TYPE_P (orig_rettype))
+ veclen = node->simdclone->vecsize_int;
+ else
+ veclen = node->simdclone->vecsize_float;
+ if (known_eq (veclen, 0U))
+ veclen = node->simdclone->simdlen;
+ else
+ veclen = exact_div (veclen,
+ GET_MODE_BITSIZE (SCALAR_TYPE_MODE (orig_rettype)));
+ if (multiple_p (veclen, node->simdclone->simdlen))
+ veclen = node->simdclone->simdlen;
+
+ retval = DECL_RESULT (node->decl);
+ /* Adjust the DECL_RESULT. */
+ TREE_TYPE (retval) = TREE_TYPE (TREE_TYPE (node->decl));
+ relayout_decl (retval);
+
+ tree atype = build_array_type_nelts (orig_rettype,
+ node->simdclone->simdlen);
+ if (maybe_ne (veclen, node->simdclone->simdlen))
+ retval = build1 (VIEW_CONVERT_EXPR, atype, retval);
+ else
+ {
+ /* Set up a SIMD array to use as the return value. */
+ retval = create_tmp_var_raw (atype, "retval");
+ gimple_add_tmp_var (retval);
+ }
+ }
- tree retval = simd_clone_adjust_return_type (node);
+ struct cgraph_simd_clone *sc = node->simdclone;
+ vec<ipa_adjusted_param, va_gc> *new_params = NULL;
+ vec_safe_reserve (new_params, sc->nargs);
+ unsigned i, j, k;
+ for (i = 0; i < sc->nargs; ++i)
+ {
+ ipa_adjusted_param adj;
+ memset (&adj, 0, sizeof (adj));
+ poly_uint64 veclen;
+ tree elem_type;
+
+ adj.base_index = i;
+ adj.prev_clone_index = i;
+ switch (sc->args[i].arg_type)
+ {
+ default:
+ /* No adjustment necessary for scalar arguments. */
+ adj.op = IPA_PARAM_OP_COPY;
+ break;
+ case SIMD_CLONE_ARG_TYPE_LINEAR_UVAL_VARIABLE_STEP:
+ adj.op = IPA_PARAM_OP_COPY;
+ break;
+ case SIMD_CLONE_ARG_TYPE_MASK:
+ case SIMD_CLONE_ARG_TYPE_LINEAR_VAL_CONSTANT_STEP:
+ case SIMD_CLONE_ARG_TYPE_LINEAR_VAL_VARIABLE_STEP:
+ case SIMD_CLONE_ARG_TYPE_VECTOR:
+ if (sc->args[i].arg_type == SIMD_CLONE_ARG_TYPE_MASK
+ && sc->mask_mode != VOIDmode)
+ elem_type = boolean_type_node;
+ else
+ elem_type = TREE_TYPE (sc->args[i].vector_type);
+ if (INTEGRAL_TYPE_P (elem_type) || POINTER_TYPE_P (elem_type))
+ veclen = sc->vecsize_int;
+ else
+ veclen = sc->vecsize_float;
+ if (known_eq (veclen, 0U))
+ veclen = sc->simdlen;
+ else
+ veclen
+ = exact_div (veclen,
+ GET_MODE_BITSIZE (SCALAR_TYPE_MODE (elem_type)));
+ if (multiple_p (veclen, sc->simdlen))
+ veclen = sc->simdlen;
+ if (sc->args[i].arg_type == SIMD_CLONE_ARG_TYPE_MASK)
+ {
+ adj.user_flag = 1;
+ adj.param_prefix_index = IPA_PARAM_PREFIX_MASK;
+ }
+ else
+ adj.param_prefix_index = IPA_PARAM_PREFIX_SIMD;
+ adj.op = IPA_PARAM_OP_NEW;
+ adj.type = sc->args[i].vector_type;
+ k = vector_unroll_factor (sc->simdlen, veclen);
+ for (j = 1; j < k; j++)
+ {
+ vec_safe_push (new_params, adj);
+ if (j == 1)
+ {
+ memset (&adj, 0, sizeof (adj));
+ adj.op = IPA_PARAM_OP_NEW;
+ adj.user_flag = 1;
+ if (sc->args[i].arg_type == SIMD_CLONE_ARG_TYPE_MASK)
+ adj.param_prefix_index = IPA_PARAM_PREFIX_MASK;
+ else
+ adj.param_prefix_index = IPA_PARAM_PREFIX_SIMD;
+ adj.base_index = i;
+ adj.prev_clone_index = i;
+ adj.type = sc->args[i].vector_type;
+ }
+ }
+ }
+ vec_safe_push (new_params, adj);
+ }
ipa_param_body_adjustments *adjustments
- = simd_clone_adjust_argument_types (node);
- gcc_assert (adjustments);
+ = new ipa_param_body_adjustments (new_params, node->decl);
+ adjustments->modify_formal_parameters ();
push_gimplify_context ();
@@ -2050,9 +2099,9 @@ expand_simd_clones (struct cgraph_node *node)
{
TREE_TYPE (n->decl)
= build_distinct_type_copy (TREE_TYPE (n->decl));
- targetm.simd_clone.adjust (n);
simd_clone_adjust_return_type (n);
simd_clone_adjust_argument_types (n);
+ targetm.simd_clone.adjust (n);
}
if (dump_file)
fprintf (dump_file, "\nGenerated %s clone %s\n",