[dpdk-dev] [PATCH] net/ixgbe: fix busy polling while fiber link update

Zhang, Qi Z qi.z.zhang at intel.com
Fri Sep 21 16:25:04 CEST 2018



> -----Original Message-----
> From: dev [mailto:dev-bounces at dpdk.org] On Behalf Of Zhang, Qi Z
> Sent: Wednesday, September 12, 2018 4:29 PM
> To: Ilya Maximets <i.maximets at samsung.com>; dev at dpdk.org
> Cc: Lu, Wenzhuo <wenzhuo.lu at intel.com>; Ananyev, Konstantin
> <konstantin.ananyev at intel.com>; Laurent Hardy
> <laurent.hardy at 6wind.com>; Dai, Wei <wei.dai at intel.com>;
> stable at dpdk.org
> Subject: Re: [dpdk-dev] [PATCH] net/ixgbe: fix busy polling while fiber link
> update
> 
> 
> 
> > -----Original Message-----
> > From: Ilya Maximets [mailto:i.maximets at samsung.com]
> > Sent: Wednesday, September 12, 2018 4:05 PM
> > To: Zhang, Qi Z <qi.z.zhang at intel.com>; dev at dpdk.org
> > Cc: Lu, Wenzhuo <wenzhuo.lu at intel.com>; Ananyev, Konstantin
> > <konstantin.ananyev at intel.com>; Laurent Hardy
> > <laurent.hardy at 6wind.com>; Dai, Wei <wei.dai at intel.com>;
> > stable at dpdk.org
> > Subject: Re: [dpdk-dev] [PATCH] net/ixgbe: fix busy polling while
> > fiber link update
> >
> > On 12.09.2018 09:49, Zhang, Qi Z wrote:
> > >
> > >
> > >> -----Original Message-----
> > >> From: Ilya Maximets [mailto:i.maximets at samsung.com]
> > >> Sent: Monday, September 10, 2018 11:09 PM
> > >> To: Zhang, Qi Z <qi.z.zhang at intel.com>; dev at dpdk.org
> > >> Cc: Lu, Wenzhuo <wenzhuo.lu at intel.com>; Ananyev, Konstantin
> > >> <konstantin.ananyev at intel.com>; Laurent Hardy
> > >> <laurent.hardy at 6wind.com>; Dai, Wei <wei.dai at intel.com>;
> > >> stable at dpdk.org
> > >> Subject: Re: [dpdk-dev] [PATCH] net/ixgbe: fix busy polling while
> > >> fiber link update
> > >>
> > >> On 04.09.2018 09:08, Zhang, Qi Z wrote:
> > >>> Hi Ilya:
> > >>>
> > >>>> -----Original Message-----
> > >>>> From: dev [mailto:dev-bounces at dpdk.org] On Behalf Of Ilya
> > >>>> Maximets
> > >>>> Sent: Friday, August 31, 2018 8:40 PM
> > >>>> To: dev at dpdk.org
> > >>>> Cc: Lu, Wenzhuo <wenzhuo.lu at intel.com>; Ananyev, Konstantin
> > >>>> <konstantin.ananyev at intel.com>; Laurent Hardy
> > >>>> <laurent.hardy at 6wind.com>; Dai, Wei <wei.dai at intel.com>; Ilya
> > >>>> Maximets <i.maximets at samsung.com>; stable at dpdk.org
> > >>>> Subject: [dpdk-dev] [PATCH] net/ixgbe: fix busy polling while
> > >>>> fiber link update
> > >>>>
> > >>>> If the multispeed fiber link is in DOWN state, ixgbe_setup_link
> > >>>> could take around a second of busy polling. This is highly
> > >>>> inconvenient for the case where single thread periodically checks
> > >>>> the link
> > statuses.
> > >>>> For example, OVS main thread periodically updates the link
> > >>>> statuses and hangs for a really long time busy waiting on
> > >>>> ixgbe_setup_link() for a DOWN fiber ports. For case with 3 down
> > >>>> ports it hangs for a 3 seconds and unable to do anything including
> packet processing.
> > >>>> Fix that by shifting that workaround to a separate thread by
> > >>>> alarm handler that will try to set up link if it is DOWN.
> > >>>
> > >>> Does that mean we will block the interrupt thread for 3 seconds?
> > >>
> > >> Three times for one second. Other work could be scheduled between.
> > >> IMHO, it's much better than blocking usual caller for 3 seconds.
> > >>
> > >>> Also, can we guarantee there will not be any race condition if we
> > >>> call
> > >> ixgbe_setup_link at another thread, the base code API is not
> > >> assumed to be thread-safe as I know.
> > >>
> > >> The only user of 'ixgbe_setup_link' is 'ixgbe_dev_start', but it
> > >> could be called only if device stopped. 'ixgbe_dev_stop' cancels the
> alarm.
> > >> Race with 'link_update' avoided by 'IXGBE_FLAG_NEED_LINK_CONFIG'
> flag.
> > >
> > > I guess, it' not only about when ixgb_setup_link race with itself,
> > > but also
> > when it race with other APIs.
> > > Also the concern is, even in current version, we can prove there is
> > > no issue,
> > how can we guarantee we are safe for future base code update? It's not
> > designed as thread-safe.
> > > For my option, the change is risky.
> >
> > In current implementation interrupt handler already calls the
> > 'ixgbe_dev_link_update' which subsequently calls 'ixgbe_setup_link'
> > in our case if LSC interrupts enabled. So, my change makes the driver
> > even safer by moving 'ixgbe_setup_link' to the same interrupt thread.
> > Otherwise two threads (interrupts handler and the link status checking
> > thread) could call 'ixgbe_setup_link' simultaneously.
> 
> Ok, you are right, seems the concern I have is already exist , your patch does
> not introduce new issue.
> So I have no objection if this will fix some issue.
> But let's check if any ixgbe experts will comment.
> 
> Regards
> Qi
> 
> >
> > >
> > > Btw, since ixgbe support LSC, it is not necessary for "single thread
> > periodically checks the link statuses", right?
> >
> > In current implementation it will take at least 5 seconds (4 + 1) for
> > the interrupt handler to detect DOWN link state for ixgbe multispeed
> > fiber. This is too much for many real world cases.
> >
> > >
> > >>
> > >>>
> > >>> Regards
> > >>> Qi
> > >>>
> > >>>>
> > >>>> Fixes: c12d22f65b13 ("net/ixgbe: ensure link status is updated")
> > >>>> CC: stable at dpdk.org
> > >>>>
> > >>>> Signed-off-by: Ilya Maximets <i.maximets at samsung.com>

Reviewed-by: Qi Zhang <qi.z.zhang at intel.com>

> > >>>> ---
> > >>>>  drivers/net/ixgbe/ixgbe_ethdev.c | 43
> > >>>> ++++++++++++++++++++++++--------
> > >>>>  1 file changed, 32 insertions(+), 11 deletions(-)
> > >>>>
> > >>>> diff --git a/drivers/net/ixgbe/ixgbe_ethdev.c
> > >>>> b/drivers/net/ixgbe/ixgbe_ethdev.c
> > >>>> index 26b192737..a33b9a6e8 100644
> > >>>> --- a/drivers/net/ixgbe/ixgbe_ethdev.c
> > >>>> +++ b/drivers/net/ixgbe/ixgbe_ethdev.c
> > >>>> @@ -221,6 +221,8 @@ static int ixgbe_dev_interrupt_action(struct
> > >>>> rte_eth_dev *dev,
> > >>>>  				      struct rte_intr_handle *handle);  static void
> > >>>> ixgbe_dev_interrupt_handler(void *param);  static void
> > >>>> ixgbe_dev_interrupt_delayed_handler(void *param);
> > >>>> +static void ixgbe_dev_setup_link_alarm_handler(void *param);
> > >>>> +
> > >>>>  static int ixgbe_add_rar(struct rte_eth_dev *dev, struct
> > >>>> ether_addr *mac_addr,
> > >>>>  			 uint32_t index, uint32_t pool);  static void
> > >>>> ixgbe_remove_rar(struct rte_eth_dev *dev, uint32_t index); @@
> > >>>> -2791,6 +2793,8 @@ ixgbe_dev_stop(struct rte_eth_dev *dev)
> > >>>>
> > >>>>  	PMD_INIT_FUNC_TRACE();
> > >>>>
> > >>>> +	rte_eal_alarm_cancel(ixgbe_dev_setup_link_alarm_handler, dev);
> > >>>> +
> > >>>>  	/* disable interrupts */
> > >>>>  	ixgbe_disable_intr(hw);
> > >>>>
> > >>>> @@ -3969,6 +3973,25 @@ ixgbevf_check_link(struct ixgbe_hw *hw,
> > >>>> ixgbe_link_speed *speed,
> > >>>>  	return ret_val;
> > >>>>  }
> > >>>>
> > >>>> +static void
> > >>>> +ixgbe_dev_setup_link_alarm_handler(void *param) {
> > >>>> +	struct rte_eth_dev *dev = (struct rte_eth_dev *)param;
> > >>>> +	struct ixgbe_hw *hw =
> > >>>> IXGBE_DEV_PRIVATE_TO_HW(dev->data->dev_private);
> > >>>> +	struct ixgbe_interrupt *intr =
> > >>>> +		IXGBE_DEV_PRIVATE_TO_INTR(dev->data->dev_private);
> > >>>> +	u32 speed;
> > >>>> +	bool autoneg = false;
> > >>>> +
> > >>>> +	speed = hw->phy.autoneg_advertised;
> > >>>> +	if (!speed)
> > >>>> +		ixgbe_get_link_capabilities(hw, &speed, &autoneg);
> > >>>> +
> > >>>> +	ixgbe_setup_link(hw, speed, true);
> > >>>> +
> > >>>> +	intr->flags &= ~IXGBE_FLAG_NEED_LINK_CONFIG; }
> > >>>> +
> > >>>>  /* return 0 means link status changed, -1 means not changed */
> > >>>> int ixgbe_dev_link_update_share(struct rte_eth_dev *dev, @@
> > >>>> -3981,9
> > >>>> +4004,7 @@ ixgbe_dev_link_update_share(struct rte_eth_dev *dev,
> > >>>>  		IXGBE_DEV_PRIVATE_TO_INTR(dev->data->dev_private);
> > >>>>  	int link_up;
> > >>>>  	int diag;
> > >>>> -	u32 speed = 0;
> > >>>>  	int wait = 1;
> > >>>> -	bool autoneg = false;
> > >>>>
> > >>>>  	memset(&link, 0, sizeof(link));
> > >>>>  	link.link_status = ETH_LINK_DOWN; @@ -3993,13 +4014,8 @@
> > >>>> ixgbe_dev_link_update_share(struct
> > >> rte_eth_dev
> > >>>> *dev,
> > >>>>
> > >>>>  	hw->mac.get_link_status = true;
> > >>>>
> > >>>> -	if ((intr->flags & IXGBE_FLAG_NEED_LINK_CONFIG) &&
> > >>>> -		ixgbe_get_media_type(hw) == ixgbe_media_type_fiber) {
> > >>>> -		speed = hw->phy.autoneg_advertised;
> > >>>> -		if (!speed)
> > >>>> -			ixgbe_get_link_capabilities(hw, &speed, &autoneg);
> > >>>> -		ixgbe_setup_link(hw, speed, true);
> > >>>> -	}
> > >>>> +	if (intr->flags & IXGBE_FLAG_NEED_LINK_CONFIG)
> > >>>> +		return rte_eth_linkstatus_set(dev, &link);
> > >>>>
> > >>>>  	/* check if it needs to wait to complete, if lsc interrupt is enabled */
> > >>>>  	if (wait_to_complete == 0 || dev->data->dev_conf.intr_conf.lsc
> > >>>> !=
> > >>>> 0) @@
> > >>>> -4017,11 +4033,14 @@ ixgbe_dev_link_update_share(struct
> > >>>> rte_eth_dev
> > >> *dev,
> > >>>>  	}
> > >>>>
> > >>>>  	if (link_up == 0) {
> > >>>> -		intr->flags |= IXGBE_FLAG_NEED_LINK_CONFIG;
> > >>>> +		if (ixgbe_get_media_type(hw) == ixgbe_media_type_fiber) {
> > >>>> +			intr->flags |= IXGBE_FLAG_NEED_LINK_CONFIG;
> > >>>> +			rte_eal_alarm_set(10,
> > >>>> +				ixgbe_dev_setup_link_alarm_handler, dev);
> > >>>> +		}
> > >>>>  		return rte_eth_linkstatus_set(dev, &link);
> > >>>>  	}
> > >>>>
> > >>>> -	intr->flags &= ~IXGBE_FLAG_NEED_LINK_CONFIG;
> > >>>>  	link.link_status = ETH_LINK_UP;
> > >>>>  	link.link_duplex = ETH_LINK_FULL_DUPLEX;
> > >>>>
> > >>>> @@ -5128,6 +5147,8 @@ ixgbevf_dev_stop(struct rte_eth_dev *dev)
> > >>>>
> > >>>>  	PMD_INIT_FUNC_TRACE();
> > >>>>
> > >>>> +	rte_eal_alarm_cancel(ixgbe_dev_setup_link_alarm_handler, dev);
> > >>>> +
> > >>>>  	ixgbevf_intr_disable(dev);
> > >>>>
> > >>>>  	hw->adapter_stopped = 1;
> > >>>> --
> > >>>> 2.17.1
> > >>>


More information about the dev mailing list