[dpdk-dev] [PATCH] net/ixgbe: fix busy polling while fiber link update

Zhao1, Wei wei.zhao1 at intel.com
Tue Oct 9 12:22:43 CEST 2018


HI,

> -----Original Message-----
> From: dev [mailto:dev-bounces at dpdk.org] On Behalf Of Ilya Maximets
> Sent: Wednesday, October 3, 2018 3:43 PM
> To: Zhang, Qi Z <qi.z.zhang at intel.com>; dev at dpdk.org
> Cc: Lu, Wenzhuo <wenzhuo.lu at intel.com>; Ananyev, Konstantin
> <konstantin.ananyev at intel.com>; Laurent Hardy
> <laurent.hardy at 6wind.com>; Dai, Wei <wei.dai at intel.com>;
> stable at dpdk.org; Thomas Monjalon <thomas at monjalon.net>; Yigit, Ferruh
> <ferruh.yigit at intel.com>
> Subject: Re: [dpdk-dev] [PATCH] net/ixgbe: fix busy polling while fiber link
> update
> 
> On 21.09.2018 17:25, Zhang, Qi Z wrote:
> >
> >
> >> -----Original Message-----
> >> From: dev [mailto:dev-bounces at dpdk.org] On Behalf Of Zhang, Qi Z
> >> Sent: Wednesday, September 12, 2018 4:29 PM
> >> To: Ilya Maximets <i.maximets at samsung.com>; dev at dpdk.org
> >> Cc: Lu, Wenzhuo <wenzhuo.lu at intel.com>; Ananyev, Konstantin
> >> <konstantin.ananyev at intel.com>; Laurent Hardy
> >> <laurent.hardy at 6wind.com>; Dai, Wei <wei.dai at intel.com>;
> >> stable at dpdk.org
> >> Subject: Re: [dpdk-dev] [PATCH] net/ixgbe: fix busy polling while
> >> fiber link update
> >>
> >>
> >>
> >>> -----Original Message-----
> >>> From: Ilya Maximets [mailto:i.maximets at samsung.com]
> >>> Sent: Wednesday, September 12, 2018 4:05 PM
> >>> To: Zhang, Qi Z <qi.z.zhang at intel.com>; dev at dpdk.org
> >>> Cc: Lu, Wenzhuo <wenzhuo.lu at intel.com>; Ananyev, Konstantin
> >>> <konstantin.ananyev at intel.com>; Laurent Hardy
> >>> <laurent.hardy at 6wind.com>; Dai, Wei <wei.dai at intel.com>;
> >>> stable at dpdk.org
> >>> Subject: Re: [dpdk-dev] [PATCH] net/ixgbe: fix busy polling while
> >>> fiber link update
> >>>
> >>> On 12.09.2018 09:49, Zhang, Qi Z wrote:
> >>>>
> >>>>
> >>>>> -----Original Message-----
> >>>>> From: Ilya Maximets [mailto:i.maximets at samsung.com]
> >>>>> Sent: Monday, September 10, 2018 11:09 PM
> >>>>> To: Zhang, Qi Z <qi.z.zhang at intel.com>; dev at dpdk.org
> >>>>> Cc: Lu, Wenzhuo <wenzhuo.lu at intel.com>; Ananyev, Konstantin
> >>>>> <konstantin.ananyev at intel.com>; Laurent Hardy
> >>>>> <laurent.hardy at 6wind.com>; Dai, Wei <wei.dai at intel.com>;
> >>>>> stable at dpdk.org
> >>>>> Subject: Re: [dpdk-dev] [PATCH] net/ixgbe: fix busy polling while
> >>>>> fiber link update
> >>>>>
> >>>>> On 04.09.2018 09:08, Zhang, Qi Z wrote:
> >>>>>> Hi Ilya:
> >>>>>>
> >>>>>>> -----Original Message-----
> >>>>>>> From: dev [mailto:dev-bounces at dpdk.org] On Behalf Of Ilya
> >>>>>>> Maximets
> >>>>>>> Sent: Friday, August 31, 2018 8:40 PM
> >>>>>>> To: dev at dpdk.org
> >>>>>>> Cc: Lu, Wenzhuo <wenzhuo.lu at intel.com>; Ananyev, Konstantin
> >>>>>>> <konstantin.ananyev at intel.com>; Laurent Hardy
> >>>>>>> <laurent.hardy at 6wind.com>; Dai, Wei <wei.dai at intel.com>; Ilya
> >>>>>>> Maximets <i.maximets at samsung.com>; stable at dpdk.org
> >>>>>>> Subject: [dpdk-dev] [PATCH] net/ixgbe: fix busy polling while
> >>>>>>> fiber link update
> >>>>>>>
> >>>>>>> If the multispeed fiber link is in DOWN state, ixgbe_setup_link
> >>>>>>> could take around a second of busy polling. This is highly
> >>>>>>> inconvenient for the case where single thread periodically
> >>>>>>> checks the link
> >>> statuses.
> >>>>>>> For example, OVS main thread periodically updates the link
> >>>>>>> statuses and hangs for a really long time busy waiting on
> >>>>>>> ixgbe_setup_link() for a DOWN fiber ports. For case with 3 down
> >>>>>>> ports it hangs for a 3 seconds and unable to do anything
> >>>>>>> including
> >> packet processing.
> >>>>>>> Fix that by shifting that workaround to a separate thread by
> >>>>>>> alarm handler that will try to set up link if it is DOWN.
> >>>>>>
> >>>>>> Does that mean we will block the interrupt thread for 3 seconds?
> >>>>>
> >>>>> Three times for one second. Other work could be scheduled between.
> >>>>> IMHO, it's much better than blocking usual caller for 3 seconds.
> >>>>>
> >>>>>> Also, can we guarantee there will not be any race condition if we
> >>>>>> call
> >>>>> ixgbe_setup_link at another thread, the base code API is not
> >>>>> assumed to be thread-safe as I know.
> >>>>>
> >>>>> The only user of 'ixgbe_setup_link' is 'ixgbe_dev_start', but it
> >>>>> could be called only if device stopped. 'ixgbe_dev_stop' cancels
> >>>>> the
> >> alarm.
> >>>>> Race with 'link_update' avoided by
> 'IXGBE_FLAG_NEED_LINK_CONFIG'
> >> flag.
> >>>>
> >>>> I guess, it' not only about when ixgb_setup_link race with itself,
> >>>> but also
> >>> when it race with other APIs.
> >>>> Also the concern is, even in current version, we can prove there is
> >>>> no issue,
> >>> how can we guarantee we are safe for future base code update? It's
> >>> not designed as thread-safe.
> >>>> For my option, the change is risky.
> >>>
> >>> In current implementation interrupt handler already calls the
> >>> 'ixgbe_dev_link_update' which subsequently calls 'ixgbe_setup_link'
> >>> in our case if LSC interrupts enabled. So, my change makes the
> >>> driver even safer by moving 'ixgbe_setup_link' to the same interrupt
> thread.
> >>> Otherwise two threads (interrupts handler and the link status
> >>> checking
> >>> thread) could call 'ixgbe_setup_link' simultaneously.
> >>
> >> Ok, you are right, seems the concern I have is already exist , your
> >> patch does not introduce new issue.
> >> So I have no objection if this will fix some issue.
> >> But let's check if any ixgbe experts will comment.
> >>
> >> Regards
> >> Qi
> >>
> >>>
> >>>>
> >>>> Btw, since ixgbe support LSC, it is not necessary for "single
> >>>> thread
> >>> periodically checks the link statuses", right?
> >>>
> >>> In current implementation it will take at least 5 seconds (4 + 1)
> >>> for the interrupt handler to detect DOWN link state for ixgbe
> >>> multispeed fiber. This is too much for many real world cases.
> >>>
> >>>>
> >>>>>
> >>>>>>
> >>>>>> Regards
> >>>>>> Qi
> >>>>>>
> >>>>>>>
> >>>>>>> Fixes: c12d22f65b13 ("net/ixgbe: ensure link status is updated")
> >>>>>>> CC: stable at dpdk.org
> >>>>>>>
> >>>>>>> Signed-off-by: Ilya Maximets <i.maximets at samsung.com>
> >
> > Reviewed-by: Qi Zhang <qi.z.zhang at intel.com>
> 
> Hi.
> Is there any chance for this to be merged in a near future?
> 
> Situation becomes even worse. Recently accepted commit b2815c41bd0b
> ("net/ixgbe: wait longer for link after fiber MAC setup") increases the time of
> the thread hang busy waiting up to
> 1.5 seconds per port on each link state check.
> 
> Best regards, Ilya Maximets.

We are review this patch now, and feedback to you ASAP, SORRY.



More information about the dev mailing list