Large interruptions for EAL thread running on isol core

Carsten Andrich carsten.andrich at tu-ilmenau.de
Tue Jun 28 09:25:50 CEST 2022


On 24.06.22 17:01, Stephen Hemminger wrote:
> On Thu, 23 Jun 2022 21:03:49 +0200
> Carsten Andrich <carsten.andrich at tu-ilmenau.de> wrote:
>
>>   2. Use real-time priority (SCHED_FIFO w/ priority 99) for the DPDK
>>      threads and
>>      echo -1 > /proc/sys/kernel/sched_rt_runtime_us
>>      to disable the runtime limit. With the runtime limit in place, the
>>      SCHED_FIFO performance will be significantly worse than SCHED_OTHER.
> This can cause major issues if application is normal DPDK application (never does system calls).
> If an interrupt or other event happens on your isolated CPU, the work that it would
> do in soft irq is never performed. FIFO has higher priority than kernel threads.
> This can lead to mystery lockups from other applications (reads not completing, network timeouts, etc).

Thanks for pointing that out. Do you know of any official kernel 
documentation that could shed some light on that? I haven't had any 
serious issues like the ones you list, but maybe I've been lucky. My 
DPDK applications typically run on fairly minimal systems used 
exclusively for DPDK tasks, which require minimal latency/jitter. Minor 
side-effects from using SCHED_FIFO are tolerable in my case, if it 
improves performance.


> In any semi-recent kernel using SCHED_NO_HZ_FULL will keep clock ticks from happening
> on the isolated cores.
On 24.06.22 18:42, Stephen Hemminger wrote:
> Sorry, wrong option.
>
> Full docs here:
> https://docs.kernel.org/timers/no_hz.html

Thanks. nohz_full is already in the list of kernel cmdline options I use.

Best regards,
Carsten



More information about the users mailing list