[dpdk-dev] [PATCH] net/ixgbe: fix busy polling while fiber link update

Zhang, Qi Z qi.z.zhang at intel.com
Wed Sep 12 10:28:49 CEST 2018



> -----Original Message-----
> From: Ilya Maximets [mailto:i.maximets at samsung.com]
> Sent: Wednesday, September 12, 2018 4:05 PM
> To: Zhang, Qi Z <qi.z.zhang at intel.com>; dev at dpdk.org
> Cc: Lu, Wenzhuo <wenzhuo.lu at intel.com>; Ananyev, Konstantin
> <konstantin.ananyev at intel.com>; Laurent Hardy
> <laurent.hardy at 6wind.com>; Dai, Wei <wei.dai at intel.com>;
> stable at dpdk.org
> Subject: Re: [dpdk-dev] [PATCH] net/ixgbe: fix busy polling while fiber link
> update
> 
> On 12.09.2018 09:49, Zhang, Qi Z wrote:
> >
> >
> >> -----Original Message-----
> >> From: Ilya Maximets [mailto:i.maximets at samsung.com]
> >> Sent: Monday, September 10, 2018 11:09 PM
> >> To: Zhang, Qi Z <qi.z.zhang at intel.com>; dev at dpdk.org
> >> Cc: Lu, Wenzhuo <wenzhuo.lu at intel.com>; Ananyev, Konstantin
> >> <konstantin.ananyev at intel.com>; Laurent Hardy
> >> <laurent.hardy at 6wind.com>; Dai, Wei <wei.dai at intel.com>;
> >> stable at dpdk.org
> >> Subject: Re: [dpdk-dev] [PATCH] net/ixgbe: fix busy polling while
> >> fiber link update
> >>
> >> On 04.09.2018 09:08, Zhang, Qi Z wrote:
> >>> Hi Ilya:
> >>>
> >>>> -----Original Message-----
> >>>> From: dev [mailto:dev-bounces at dpdk.org] On Behalf Of Ilya Maximets
> >>>> Sent: Friday, August 31, 2018 8:40 PM
> >>>> To: dev at dpdk.org
> >>>> Cc: Lu, Wenzhuo <wenzhuo.lu at intel.com>; Ananyev, Konstantin
> >>>> <konstantin.ananyev at intel.com>; Laurent Hardy
> >>>> <laurent.hardy at 6wind.com>; Dai, Wei <wei.dai at intel.com>; Ilya
> >>>> Maximets <i.maximets at samsung.com>; stable at dpdk.org
> >>>> Subject: [dpdk-dev] [PATCH] net/ixgbe: fix busy polling while fiber
> >>>> link update
> >>>>
> >>>> If the multispeed fiber link is in DOWN state, ixgbe_setup_link
> >>>> could take around a second of busy polling. This is highly
> >>>> inconvenient for the case where single thread periodically checks the link
> statuses.
> >>>> For example, OVS main thread periodically updates the link statuses
> >>>> and hangs for a really long time busy waiting on ixgbe_setup_link()
> >>>> for a DOWN fiber ports. For case with 3 down ports it hangs for a 3
> >>>> seconds and unable to do anything including packet processing.
> >>>> Fix that by shifting that workaround to a separate thread by alarm
> >>>> handler that will try to set up link if it is DOWN.
> >>>
> >>> Does that mean we will block the interrupt thread for 3 seconds?
> >>
> >> Three times for one second. Other work could be scheduled between.
> >> IMHO, it's much better than blocking usual caller for 3 seconds.
> >>
> >>> Also, can we guarantee there will not be any race condition if we
> >>> call
> >> ixgbe_setup_link at another thread, the base code API is not assumed
> >> to be thread-safe as I know.
> >>
> >> The only user of 'ixgbe_setup_link' is 'ixgbe_dev_start', but it
> >> could be called only if device stopped. 'ixgbe_dev_stop' cancels the alarm.
> >> Race with 'link_update' avoided by 'IXGBE_FLAG_NEED_LINK_CONFIG' flag.
> >
> > I guess, it' not only about when ixgb_setup_link race with itself, but also
> when it race with other APIs.
> > Also the concern is, even in current version, we can prove there is no issue,
> how can we guarantee we are safe for future base code update? It's not
> designed as thread-safe.
> > For my option, the change is risky.
> 
> In current implementation interrupt handler already calls the
> 'ixgbe_dev_link_update' which subsequently calls 'ixgbe_setup_link'
> in our case if LSC interrupts enabled. So, my change makes the driver even
> safer by moving 'ixgbe_setup_link' to the same interrupt thread.
> Otherwise two threads (interrupts handler and the link status checking thread)
> could call 'ixgbe_setup_link' simultaneously.

Ok, you are right, seems the concern I have is already exist , your patch does not introduce new issue.
So I have no objection if this will fix some issue.
But let's check if any ixgbe experts will comment.

Regards
Qi

> 
> >
> > Btw, since ixgbe support LSC, it is not necessary for "single thread
> periodically checks the link statuses", right?
> 
> In current implementation it will take at least 5 seconds (4 + 1) for the interrupt
> handler to detect DOWN link state for ixgbe multispeed fiber. This is too much
> for many real world cases.
> 
> >
> >>
> >>>
> >>> Regards
> >>> Qi
> >>>
> >>>>
> >>>> Fixes: c12d22f65b13 ("net/ixgbe: ensure link status is updated")
> >>>> CC: stable at dpdk.org
> >>>>
> >>>> Signed-off-by: Ilya Maximets <i.maximets at samsung.com>
> >>>> ---
> >>>>  drivers/net/ixgbe/ixgbe_ethdev.c | 43
> >>>> ++++++++++++++++++++++++--------
> >>>>  1 file changed, 32 insertions(+), 11 deletions(-)
> >>>>
> >>>> diff --git a/drivers/net/ixgbe/ixgbe_ethdev.c
> >>>> b/drivers/net/ixgbe/ixgbe_ethdev.c
> >>>> index 26b192737..a33b9a6e8 100644
> >>>> --- a/drivers/net/ixgbe/ixgbe_ethdev.c
> >>>> +++ b/drivers/net/ixgbe/ixgbe_ethdev.c
> >>>> @@ -221,6 +221,8 @@ static int ixgbe_dev_interrupt_action(struct
> >>>> rte_eth_dev *dev,
> >>>>  				      struct rte_intr_handle *handle);  static void
> >>>> ixgbe_dev_interrupt_handler(void *param);  static void
> >>>> ixgbe_dev_interrupt_delayed_handler(void *param);
> >>>> +static void ixgbe_dev_setup_link_alarm_handler(void *param);
> >>>> +
> >>>>  static int ixgbe_add_rar(struct rte_eth_dev *dev, struct
> >>>> ether_addr *mac_addr,
> >>>>  			 uint32_t index, uint32_t pool);  static void
> >>>> ixgbe_remove_rar(struct rte_eth_dev *dev, uint32_t index); @@
> >>>> -2791,6 +2793,8 @@ ixgbe_dev_stop(struct rte_eth_dev *dev)
> >>>>
> >>>>  	PMD_INIT_FUNC_TRACE();
> >>>>
> >>>> +	rte_eal_alarm_cancel(ixgbe_dev_setup_link_alarm_handler, dev);
> >>>> +
> >>>>  	/* disable interrupts */
> >>>>  	ixgbe_disable_intr(hw);
> >>>>
> >>>> @@ -3969,6 +3973,25 @@ ixgbevf_check_link(struct ixgbe_hw *hw,
> >>>> ixgbe_link_speed *speed,
> >>>>  	return ret_val;
> >>>>  }
> >>>>
> >>>> +static void
> >>>> +ixgbe_dev_setup_link_alarm_handler(void *param) {
> >>>> +	struct rte_eth_dev *dev = (struct rte_eth_dev *)param;
> >>>> +	struct ixgbe_hw *hw =
> >>>> IXGBE_DEV_PRIVATE_TO_HW(dev->data->dev_private);
> >>>> +	struct ixgbe_interrupt *intr =
> >>>> +		IXGBE_DEV_PRIVATE_TO_INTR(dev->data->dev_private);
> >>>> +	u32 speed;
> >>>> +	bool autoneg = false;
> >>>> +
> >>>> +	speed = hw->phy.autoneg_advertised;
> >>>> +	if (!speed)
> >>>> +		ixgbe_get_link_capabilities(hw, &speed, &autoneg);
> >>>> +
> >>>> +	ixgbe_setup_link(hw, speed, true);
> >>>> +
> >>>> +	intr->flags &= ~IXGBE_FLAG_NEED_LINK_CONFIG; }
> >>>> +
> >>>>  /* return 0 means link status changed, -1 means not changed */
> >>>> int ixgbe_dev_link_update_share(struct rte_eth_dev *dev, @@ -3981,9
> >>>> +4004,7 @@ ixgbe_dev_link_update_share(struct rte_eth_dev *dev,
> >>>>  		IXGBE_DEV_PRIVATE_TO_INTR(dev->data->dev_private);
> >>>>  	int link_up;
> >>>>  	int diag;
> >>>> -	u32 speed = 0;
> >>>>  	int wait = 1;
> >>>> -	bool autoneg = false;
> >>>>
> >>>>  	memset(&link, 0, sizeof(link));
> >>>>  	link.link_status = ETH_LINK_DOWN; @@ -3993,13 +4014,8 @@
> >>>> ixgbe_dev_link_update_share(struct
> >> rte_eth_dev
> >>>> *dev,
> >>>>
> >>>>  	hw->mac.get_link_status = true;
> >>>>
> >>>> -	if ((intr->flags & IXGBE_FLAG_NEED_LINK_CONFIG) &&
> >>>> -		ixgbe_get_media_type(hw) == ixgbe_media_type_fiber) {
> >>>> -		speed = hw->phy.autoneg_advertised;
> >>>> -		if (!speed)
> >>>> -			ixgbe_get_link_capabilities(hw, &speed, &autoneg);
> >>>> -		ixgbe_setup_link(hw, speed, true);
> >>>> -	}
> >>>> +	if (intr->flags & IXGBE_FLAG_NEED_LINK_CONFIG)
> >>>> +		return rte_eth_linkstatus_set(dev, &link);
> >>>>
> >>>>  	/* check if it needs to wait to complete, if lsc interrupt is enabled */
> >>>>  	if (wait_to_complete == 0 || dev->data->dev_conf.intr_conf.lsc !=
> >>>> 0) @@
> >>>> -4017,11 +4033,14 @@ ixgbe_dev_link_update_share(struct rte_eth_dev
> >> *dev,
> >>>>  	}
> >>>>
> >>>>  	if (link_up == 0) {
> >>>> -		intr->flags |= IXGBE_FLAG_NEED_LINK_CONFIG;
> >>>> +		if (ixgbe_get_media_type(hw) == ixgbe_media_type_fiber) {
> >>>> +			intr->flags |= IXGBE_FLAG_NEED_LINK_CONFIG;
> >>>> +			rte_eal_alarm_set(10,
> >>>> +				ixgbe_dev_setup_link_alarm_handler, dev);
> >>>> +		}
> >>>>  		return rte_eth_linkstatus_set(dev, &link);
> >>>>  	}
> >>>>
> >>>> -	intr->flags &= ~IXGBE_FLAG_NEED_LINK_CONFIG;
> >>>>  	link.link_status = ETH_LINK_UP;
> >>>>  	link.link_duplex = ETH_LINK_FULL_DUPLEX;
> >>>>
> >>>> @@ -5128,6 +5147,8 @@ ixgbevf_dev_stop(struct rte_eth_dev *dev)
> >>>>
> >>>>  	PMD_INIT_FUNC_TRACE();
> >>>>
> >>>> +	rte_eal_alarm_cancel(ixgbe_dev_setup_link_alarm_handler, dev);
> >>>> +
> >>>>  	ixgbevf_intr_disable(dev);
> >>>>
> >>>>  	hw->adapter_stopped = 1;
> >>>> --
> >>>> 2.17.1
> >>>


More information about the dev mailing list