[EXTERNAL] [RFC PATCH dpdk 0/3] graph: deferred enqueue API for simplified node processing
Pavan Nikhilesh Bhagavatula
pbhagavatula at marvell.com
Tue Mar 10 06:31:49 CET 2026
>
>Performance was measured with l3fwd forwarding between two ports of an
>Intel E810-XXV 2x25G NIC (1 RX queue per port). Two graph worker threads
>ran on hyper threads of the same physical core on an Intel Xeon Silver
>4316 CPU @ 2.30GHz.
>
>Results:
>- Baseline (manual speculation): 37.0 Mpps
>- Deferred API: 36.2 Mpps (-2.2%)
>
On the Octeon (Neoverse N2) platform we see a slight performance increase of ~1.5%.
>The slight overhead comes from per-packet edge comparisons. However,
>this is offset by:
>- 826 fewer lines of code across 13 node implementations
>- Reduced instruction cache pressure from simpler code paths
>- Elimination of per-node speculation boilerplate
>- Easier development of new nodes
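
For readers skimming the thread, here is a rough sketch of where a "per-packet edge comparison" cost could come from. It is not the code from this series: `edge_stream`, `flush_run`, `deferred_enqueue`, `MAX_EDGES` and `BURST` are hypothetical names for illustration, and the real rte_graph streams are managed differently. The assumption is that the node records a next-edge per packet, and a helper then moves runs of packets sharing the same edge in bulk.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

#define MAX_EDGES 4  /* hypothetical number of next-node edges */
#define BURST 32     /* hypothetical per-edge stream capacity */

/* Hypothetical stand-in for a per-edge output stream; not a DPDK type. */
struct edge_stream {
	void *objs[BURST];
	uint16_t count;
};

/* Append a run of n objects to the stream of the given edge. */
static void
flush_run(struct edge_stream *streams, uint16_t edge, void **objs, uint16_t n)
{
	assert(edge < MAX_EDGES);
	struct edge_stream *s = &streams[edge];

	assert(s->count + n <= BURST);
	memcpy(&s->objs[s->count], objs, n * sizeof(void *));
	s->count += n;
}

/*
 * Walk the burst once. Each packet costs one comparison against the edge
 * of the current run; a bulk copy happens only when the edge changes.
 * Returns the number of bulk copies performed.
 */
static uint16_t
deferred_enqueue(struct edge_stream *streams, void **objs,
		 const uint16_t *edges, uint16_t n)
{
	uint16_t start = 0, flushes = 0, i;

	if (n == 0)
		return 0;

	for (i = 1; i < n; i++) {
		if (edges[i] != edges[start]) { /* per-packet edge comparison */
			flush_run(streams, edges[start], &objs[start], i - start);
			start = i;
			flushes++;
		}
	}
	/* Flush the final run. */
	flush_run(streams, edges[start], &objs[start], n - start);
	return flushes + 1;
}
```

In this sketch, when all packets of a burst take the same edge, the loop does n - 1 comparisons and a single copy. That is the case manual speculation optimizes for without needing the comparisons, which would be consistent with a small overhead on some platforms.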