[PATCH v5 5/6] net/mlx5: fix LTO stringop-overflow warning
Dariusz Sosnowski
dsosnowski at nvidia.com
Thu Feb 5 14:43:35 CET 2026
On Tue, Jan 20, 2026 at 11:52:10AM -0800, Stephen Hemminger wrote:
> When compiling with LTO (Link Time Optimization) enabled, GCC's
> interprocedural analysis produces false positive warnings about
> potential buffer overflow in mlx5dr_action_prepare_decap_l3_data():
>
> In function 'mlx5dr_action_prepare_decap_l3_data',
> inlined from 'mlx5dr_action_handle_tunnel_l3_to_l2',
> inlined from 'mlx5dr_action_create_reformat_hws':
> warning: writing 4 bytes into a region of size 0 [-Wstringop-overflow=]
> memcpy(dst, e_src, MLX5DR_ACTION_INLINE_DATA_SIZE);
> note: at offset [140, 524248] into destination object 'mh_data' of size 64
>
> With LTO, the function chain is fully inlined, giving GCC visibility
> into the 64-byte stack buffer 'mh_data'. However, GCC's static analysis
> cannot determine that num_of_actions is constrained to either
> DECAP_L3_NUM_ACTIONS_W_NO_VLAN (6) or DECAP_L3_NUM_ACTIONS_W_VLAN (7)
> by the callers. It assumes worst-case bounds that greatly exceed the
> buffer size.
>
> Fix this by adding an explicit bounds check at function entry. The
> valid values for num_of_actions are 6 (no VLAN) or 7 (with VLAN),
> which produce maximum buffer usage well under 64 bytes:
> - offset 12 + (num_of_actions-3) * 8 + 2 = max 46 bytes for 7 actions
>
> This provides GCC with the proof it needs that subsequent memcpy
> operations are safe.
>
> This is not a data path function - it executes only during flow rule
> creation, so the additional check has no performance impact.
>
> Bugzilla ID: 1710
> Fixes: f8c8a6d8440d ("net/mlx5/hws: add action object")
> Cc: stable at dpdk.org
>
> Signed-off-by: Stephen Hemminger <stephen at networkplumber.org>
> ---
> drivers/net/mlx5/hws/mlx5dr_action.c | 14 ++++++++++++++
> 1 file changed, 14 insertions(+)
>
> diff --git a/drivers/net/mlx5/hws/mlx5dr_action.c b/drivers/net/mlx5/hws/mlx5dr_action.c
> index b35bf07c3c..3b12506577 100644
> --- a/drivers/net/mlx5/hws/mlx5dr_action.c
> +++ b/drivers/net/mlx5/hws/mlx5dr_action.c
> @@ -3620,6 +3620,20 @@ mlx5dr_action_prepare_decap_l3_data(uint8_t *src, uint8_t *dst,
> uint8_t *e_src;
> int i;
>
> + /*
> + * Bounds check to help GCC LTO static analysis.
> + *
> + * When LTO inlines this into mlx5dr_action_handle_tunnel_l3_to_l2(),
> + * GCC sees the 64-byte mh_data buffer but cannot prove num_of_actions
> + * is bounded, causing false -Wstringop-overflow warnings.
> + *
> + * Valid num_of_actions values are DECAP_L3_NUM_ACTIONS_W_NO_VLAN (6)
> + * or DECAP_L3_NUM_ACTIONS_W_VLAN (7). This check gives GCC the proof
> + * it needs that the loop iterations stay within buffer bounds.
> + */
> + if (unlikely(num_of_actions > DECAP_L3_NUM_ACTIONS_W_VLAN))
> + return;

This function can run on the fast path during async flow creation,
so I would prefer to avoid adding a branch here if possible.

I tested locally with GCC 14.2.0: when this check is replaced with
an equivalent __rte_assume(), the condition is dropped from the
generated code (see https://godbolt.org/z/afx8jjr6Y for an example;
code generated with LTO optimizes the relevant code as well).
__rte_assume() also fixes the LTO warning.

Could you please change the condition to an equivalent __rte_assume()?
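
To illustrate the idea, here is a minimal standalone sketch, not the driver code. ASSUME() is a hypothetical stand-in for __rte_assume() (its real definition lives in DPDK and may differ), and fill_chunks(), MAX_ACTIONS and BUF_SIZE are made-up names that only mirror the arithmetic from the commit message (offset 12, 8-byte stride, 2 trailing bytes). The hint states the bound to the compiler instead of testing it at run time.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical stand-in for __rte_assume(); NOT DPDK's actual definition. */
#if defined(__GNUC__) || defined(__clang__)
#define ASSUME(cond) do { if (!(cond)) __builtin_unreachable(); } while (0)
#else
#define ASSUME(cond) ((void)0)
#endif

#define MAX_ACTIONS 7u	/* mirrors DECAP_L3_NUM_ACTIONS_W_VLAN */
#define BUF_SIZE 64u	/* mirrors the 64-byte mh_data buffer */

/*
 * Simplified copy loop: 4-byte chunks at an 8-byte stride starting at
 * offset 12, followed by 2 trailing bytes. Returns the number of bytes
 * of dst that were spanned. Callers must pass num_of_actions <= MAX_ACTIONS;
 * violating the assumption is undefined behavior.
 */
static size_t
fill_chunks(uint8_t *dst, const uint8_t *src, uint16_t num_of_actions)
{
	size_t off = 12;
	int i;

	/* Compiler hint only: with GCC/Clang this emits no run-time check. */
	ASSUME(num_of_actions <= MAX_ACTIONS);

	for (i = 0; i < num_of_actions - 3; i++) {
		memcpy(dst + off, src + 4 * i, 4);
		off += 8;
	}
	memcpy(dst + off, src + 4 * i, 2);

	return off + 2;
}
```

With num_of_actions equal to 7 this spans 12 + 4 * 8 + 2 = 46 bytes, matching the bound given in the commit message. Unlike the early return, a hint like this offers no protection if a caller ever passes a larger value, so it relies on the callers keeping the 6 or 7 invariant.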
> +
> /* num_of_actions = remove l3l2 + 4/5 inserts + remove extra 2 bytes
> * copy from end of src to the start of dst.
> * move to the end, 2 is the leftover from 14B or 18B
> --
> 2.51.0
>