[dpdk-dev] [PATCH v2 00/12] acl: introduce AVX512 classify method
Konstantin Ananyev
konstantin.ananyev at intel.com
Tue Sep 15 18:50:13 CEST 2020
This patch series introduces support for an AVX512-specific classify
implementation in the ACL library.
Internally it contains two code paths:
- the first uses mostly 256-bit instructions/registers and can
  process up to 16 flows in parallel.
- the second uses 512-bit instructions/registers in most
  places and can process up to 32 flows in parallel.
The code path is selected internally at run time, based
on input burst size, and is completely opaque to the user.
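
As a rough illustration of burst-size-based selection, here is a minimal
sketch. It is not the code from this series: the names (select_path,
PATH_256BIT, PATH_512BIT) are hypothetical, and the threshold of 32 flows
is an assumption taken only from the "up to 32 flows in parallel" figure
above.

```c
#include <stdint.h>

/* Hypothetical threshold: the number of flows the 512-bit path is
 * described as handling in parallel. The real cut-over point used by
 * the library may differ. */
#define FLOWS_512BIT 32u

/* Hypothetical identifiers for the two internal code paths. */
enum classify_path {
	PATH_256BIT, /* mostly 256-bit instructions/registers */
	PATH_512BIT, /* mostly 512-bit instructions/registers */
};

/* Choose a code path from the number of flows in the input burst.
 * Assumption: bursts large enough to fill the wide path use it,
 * everything smaller goes to the 256-bit path. */
static enum classify_path
select_path(uint32_t num)
{
	if (num >= FLOWS_512BIT)
		return PATH_512BIT;
	return PATH_256BIT;
}
```

With these assumed values, a burst of 8 or 31 flows would take the 256-bit
path and a burst of 32 or more would take the 512-bit path. The caller never
sees this choice; it only selects the AVX512 classify method.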
On my SKX box test-acl shows ~20-65% improvement
(depending on rule-set and input burst size)
when switching from AVX2 to AVX512 classify algorithms.
ICX and CLX testing showed a similar level of speedup: up to ~50-60%.
The current AVX512 classify implementation is supported only on x86_64.
Note that this series introduces a formal ABI incompatibility
with previous versions of the ACL library.
v1 -> v2:
Deduplicated 8/16 code paths as much as possible
Updated default algorithm selection
Removed library constructor to make it easier to integrate with
https://patches.dpdk.org/project/dpdk/list/?series=11831
Updated docs
This patch series depends on:
https://patches.dpdk.org/patch/73922/mbox/
which must be applied first.
Konstantin Ananyev (12):
acl: fix x86 build when compiler doesn't support AVX2
doc: fix mixing classify methods in ACL guide
acl: remove of unused enum value
acl: remove library constructor
app/acl: few small improvements
test/acl: expand classify test coverage
acl: add infrastructure to support AVX512 classify
acl: introduce AVX512 classify implementation
acl: enhance AVX512 classify implementation
acl: for AVX512 classify use 4B load whenever possible
test/acl: add AVX512 classify support
app/acl: add AVX512 classify support
app/test-acl/main.c | 19 +-
app/test/test_acl.c | 104 ++--
config/x86/meson.build | 3 +-
.../prog_guide/packet_classif_access_ctrl.rst | 15 +
doc/guides/rel_notes/deprecation.rst | 4 -
doc/guides/rel_notes/release_20_11.rst | 9 +
lib/librte_acl/acl.h | 12 +
lib/librte_acl/acl_bld.c | 34 ++
lib/librte_acl/acl_gen.c | 2 +-
lib/librte_acl/acl_run_avx512.c | 331 +++++++++++
lib/librte_acl/acl_run_avx512x16.h | 526 ++++++++++++++++++
lib/librte_acl/acl_run_avx512x8.h | 439 +++++++++++++++
lib/librte_acl/meson.build | 39 ++
lib/librte_acl/rte_acl.c | 198 +++++--
lib/librte_acl/rte_acl.h | 3 +-
15 files changed, 1638 insertions(+), 100 deletions(-)
create mode 100644 lib/librte_acl/acl_run_avx512.c
create mode 100644 lib/librte_acl/acl_run_avx512x16.h
create mode 100644 lib/librte_acl/acl_run_avx512x8.h
--
2.17.1