[PATCH v1 0/4] ring: some fixes and improvements
Stephen Hemminger
stephen at networkplumber.org
Wed Jan 14 20:42:01 CET 2026
On Wed, 21 May 2025 12:14:28 +0100
Konstantin Ananyev <konstantin.ananyev at huawei.com> wrote:
> First two patches are ‘low risk’ ones.
> Third one touches some core functions within rte_ring library and
> would probably requires extra reviews/testing from different vendors.
> 4th one enables C11 based code on all x86 platforms by default.
> The stretch goal for it – make all supported platforms to use C11
> based code and get rid of legacy code in rte_ring_generic_pvt.h.
> If there would be some issues with latest two patches – we can limit
> ourselves with just first two to apply.
>
> Konstantin Ananyev (4):
> ring: introduce extra run-time checks
> ring/soring: fix head-tail synchronization issue
> ring: fix potential sync issue between head and tail values
> config/x86: enable RTE_USE_C11_MEM_MODEL by default
>
> config/x86/meson.build | 1 +
> lib/ring/rte_ring_c11_pvt.h | 29 ++++++++++++++++++-----------
> lib/ring/rte_ring_elem_pvt.h | 8 ++++++--
> lib/ring/rte_ring_hts_elem_pvt.h | 14 ++++++++++----
> lib/ring/rte_ring_rts_elem_pvt.h | 14 ++++++++++----
> lib/ring/soring.c | 2 ++
> 6 files changed, 47 insertions(+), 21 deletions(-)
>
> --
This patch seems to have gotten stuck and never reviewed thoroughly or merged.
Turned to AI for insight. It found minor things that could be fixed up
when merged. But would be good to have a release note.
Also concerning that the test did not pass CI cleanly.
## DPDK Ring Patchset Review
**Series:** ring head/tail synchronization fixes and C11 memory model enablement
**Author:** Konstantin Ananyev (Huawei)
**Patches:** 4
---
### Patch 1/4: `ring: introduce extra run-time checks`
**Commit Message:**
| Check | Status | Notes |
|-------|--------|-------|
| Subject ≤60 chars | ✓ | 44 characters |
| Lowercase after colon | ✓ | |
| Component prefix | ✓ | `ring:` is correct |
| Imperative mood | ✓ | |
| No trailing period | ✓ | |
| Signed-off-by | ✓ | |
**Code Review:**
The patch adds `RTE_ASSERT()` calls to verify `*entries` and `*free_entries` remain ≤ `r->capacity` after `move_head()` calls. This is a sensible defensive check.
**Issue (Warning):** In `rte_ring_c11_pvt.h`, the code reordering moves `*new_head = *old_head + n;` before the `n == 0` check:
```c
+ *new_head = *old_head + n;
if (n == 0)
return 0;
- *new_head = *old_head + n;
```
This is functionally correct (setting `*new_head` even when returning 0 is harmless), but the change seems unrelated to adding asserts. If intentional, the commit message should explain why.
**Verdict:** ✓ Acceptable with minor observation
---
### Patch 2/4: `ring/soring: fix head-tail synchronization issue`
**Commit Message:**
| Check | Status | Notes |
|-------|--------|-------|
| Subject ≤60 chars | ✓ | 47 characters |
| Lowercase after colon | ✓ | |
| Component prefix | ✓ | `ring/soring:` correct |
| Fixes tag | ✓ | `Fixes: b5458e2cc483` present |
| Signed-off-by | ✓ | |
| Body ≤75 chars | ✓ | Lines properly wrapped |
**Technical Analysis:**
Excellent, detailed commit message explaining the race condition discovered on Ampere Altra Max. The fix adds `rte_atomic_thread_fence(rte_memory_order_release)` after `store(&tail)` operations.
**Code Review:**
```c
+ /* make sure that new tail value is visible */
+ rte_atomic_thread_fence(rte_memory_order_release);
```
This ensures the tail update is visible before any subsequent head updates. However, I note this is described as a workaround—patch 3/4 provides the "better" fix at the CAS level.
**Question:** If patch 3/4 provides the comprehensive fix, why is patch 2/4 kept separate? The comment mentions observing on Ampere Altra Max—should `Cc: stable at dpdk.org` be added since this is a real bug fix?
**Verdict:** ✓ Acceptable (consider adding `Cc: stable at dpdk.org`)
---
### Patch 3/4: `ring: fix potential sync issue between head and tail values`
**Commit Message:**
| Check | Status | Notes |
|-------|--------|-------|
| Subject ≤60 chars | ✗ **ERROR** | **63 characters** (limit is 60) |
| Fixes tags | ✓ | Multiple Fixes present |
| Signed-off-by | ✓ | |
**Error:** Subject line exceeds 60-character limit:
```
ring: fix potential sync issue between head and tail values
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
63 characters
```
**Suggested fix:**
```
ring: fix head-tail sync in C11 memory model
```
**Technical Analysis:**
This is the core fix—changing CAS memory ordering from RELAXED to ACQ_REL:
```c
- rte_memory_order_relaxed,
- rte_memory_order_relaxed);
+ rte_memory_order_acq_rel,
+ rte_memory_order_acquire);
```
The comments explaining the change are well-written:
```c
+ * this CAS(ACQ_REL, ACQUIRE) serves as a hoist
+ * barrier to prevent:
+ * - OOO reads of cons tail value
+ * - OOO copy of elems from the ring
+ * Also RELEASE guarantees that latest tail value
+ * will become visible before the new head value.
```
**Code Review Issues:**
1. **Redundant fences in soring.c removed:** The patch removes the fences added in patch 2/4. This creates a forward dependency—patch 2/4 adds them, patch 3/4 removes them. This violates the "each commit must compile independently" guideline. Consider squashing patches 2 and 3, or reordering so patch 3's ACQ_REL fix comes first.
2. **Missing `Cc: stable at dpdk.org`:** This fixes real bugs that could affect production systems.
3. The d->head load changes from RELAXED to ACQUIRE:
```c
*old_head = rte_atomic_load_explicit(&d->head,
- rte_memory_order_relaxed);
+ rte_memory_order_acquire);
```
Combined with removing the explicit thread fence, this is cleaner but changes the synchronization model significantly. The comment update is appropriate.
**Verdict:** ✗ Needs revision (subject too long, patch ordering concern)
---
### Patch 4/4: `config/x86: enable RTE_USE_C11_MEM_MODEL by default`
**Commit Message:**
| Check | Status | Notes |
|-------|--------|-------|
| Subject ≤60 chars | ✓ | 52 characters |
| Component prefix | ✓ | `config/x86:` correct |
| Signed-off-by | ✓ | |
**Code Review:**
Simple config change:
```c
+dpdk_conf.set('RTE_USE_C11_MEM_MODEL', true)
```
**Concerns:**
1. **Missing release notes:** Enabling C11 memory model by default on x86 is a significant behavioral change. Per AGENTS.md, API/config changes warrant release notes updates.
2. **Documentation:** Should document this change in programmer's guide or platform notes.
3. **Dependency on patch 3:** This change depends on the memory ordering fixes in patch 3/4 being correct. The commit message acknowledges this ("step towards unification...removing rte_ring_generic_pvt.h") but the series should ensure the fixes are validated before enabling C11 model by default.
**Verdict:** ⚠ Acceptable with warning (needs release notes)
---
## Summary
| Patch | Status | Issues |
|-------|--------|--------|
| 1/4 | ✓ OK | Minor: unexplained code reorder |
| 2/4 | ⚠ Warning | Missing `Cc: stable at dpdk.org` |
| 3/4 | ✗ **ERROR** | Subject 63 chars (max 60), patch ordering issue |
| 4/4 | ⚠ Warning | Missing release notes |
## Recommendations
1. **Fix patch 3/4 subject line** — shorten to under 60 characters
2. **Consider squashing patches 2+3** — patch 2 adds fences, patch 3 removes them; this creates odd bisect behavior
3. **Add `Cc: stable at dpdk.org`** to patches 2 and 3 as they fix real bugs
4. **Add release notes** for patch 4/4 documenting the C11 memory model enablement on x86
5. The technical analysis of the memory ordering issue is thorough and the fix approach (ACQ_REL on CAS) is sound for weakly-ordered architectures
More information about the dev
mailing list