[dpdk-dev] [PATCH v16 1/8] ethdev: add set ptype function

Ferruh Yigit ferruh.yigit at intel.com
Fri Nov 8 12:05:05 CET 2019


On 11/8/2019 10:24 AM, Andrew Rybchenko wrote:
> On 11/8/19 12:52 PM, Ferruh Yigit wrote:
>> On 11/8/2019 9:05 AM, Andrew Rybchenko wrote:
>>> On 11/7/19 9:28 PM, Ferruh Yigit wrote:
>>>> On 11/6/2019 7:17 PM, pbhagavatula at marvell.com wrote:
>>>>> From: Pavan Nikhilesh <pbhagavatula at marvell.com>
>>>>>
>>>>> Add `rte_eth_dev_set_ptypes` function that will allow the application
>>>>> to inform the PMD about reduced range of packet types to handle.
>>>>> Based on the ptypes set PMDs can optimize their Rx path.
>>>>>
>>>>> -If application doesn’t want any ptype information it can call
>>>>> `rte_eth_dev_set_ptypes(ethdev_id, RTE_PTYPE_UNKNOWN, NULL, 0)`
>>>>> and PMD may skip packet type processing and set rte_mbuf::packet_type to
>>>>> RTE_PTYPE_UNKNOWN.
>>>>>
>>>>> -If application doesn’t call `rte_eth_dev_set_ptypes` PMD can return
>>>>> `rte_mbuf::packet_type` with `rte_eth_dev_get_supported_ptypes`.
>>>>>
>>>>> -If application is interested only in L2/L3 layer, it can inform the PMD
>>>>> to update `rte_mbuf::packet_type` with L2/L3 ptype by calling
>>>>> `rte_eth_dev_set_ptypes(ethdev_id,
>>>>> 		RTE_PTYPE_L2_MASK | RTE_PTYPE_L3_MASK, NULL, 0)`.
>>>>>
>>>>> Suggested-by: Konstantin Ananyev <konstantin.ananyev at intel.com>
>>>>> Signed-off-by: Pavan Nikhilesh <pbhagavatula at marvell.com>
>>>>> Reviewed-by: Andrew Rybchenko <arybchenko at solarflare.com>
>>>>> ---
>>>>>  doc/guides/nics/features.rst             |  7 +-
>>>>>  doc/guides/rel_notes/release_19_11.rst   |  8 +++
>>>>>  lib/librte_ethdev/rte_ethdev.c           | 87 +++++++++++++++++++++++-
>>>>>  lib/librte_ethdev/rte_ethdev.h           | 36 ++++++++++
>>>>>  lib/librte_ethdev/rte_ethdev_core.h      | 18 +++++
>>>>>  lib/librte_ethdev/rte_ethdev_version.map |  1 +
>>>>>  6 files changed, 154 insertions(+), 3 deletions(-)
>>>>>
>>>>> diff --git a/doc/guides/nics/features.rst b/doc/guides/nics/features.rst
>>>>> index 7a31cf7c8..f33e8e6d4 100644
>>>>> --- a/doc/guides/nics/features.rst
>>>>> +++ b/doc/guides/nics/features.rst
>>>>> @@ -583,9 +583,12 @@ Packet type parsing
>>>>>  -------------------
>>>>>  
>>>>>  Supports packet type parsing and returns a list of supported types.
>>>>> +Allows application to set ptypes it is interested in.
>>>>>  
>>>>> -* **[implements] eth_dev_ops**: ``dev_supported_ptypes_get``.
>>>>> -* **[related]    API**: ``rte_eth_dev_get_supported_ptypes()``.
>>>>> +* **[implements] eth_dev_ops**: ``dev_supported_ptypes_get``,
>>>>> +* **[related]    API**: ``rte_eth_dev_get_supported_ptypes()``,
>>>>> +  ``rte_eth_dev_set_ptypes()``, ``dev_ptypes_set``.
>>>>> +* **[provides]   mbuf**: ``mbuf.packet_type``.
>>>>>  
>>>>>  
>>>>>  .. _nic_features_timesync:
>>>>> diff --git a/doc/guides/rel_notes/release_19_11.rst b/doc/guides/rel_notes/release_19_11.rst
>>>>> index 23182d187..2fa54aadc 100644
>>>>> --- a/doc/guides/rel_notes/release_19_11.rst
>>>>> +++ b/doc/guides/rel_notes/release_19_11.rst
>>>>> @@ -86,6 +86,13 @@ New Features
>>>>>    SET_TAG action and TAG item have been added to support transient flow
>>>>>    tag.
>>>>>  
>>>>> +* **Added ethdev API to set supported packet types**
>>>>> +
>>>>> +  * Added new API ``rte_eth_dev_set_ptypes`` that allows an application to
>>>>> +    inform PMD about reduced range of packet types to handle.
>>>>> +  * This scheme will allow PMDs to avoid lookup to internal ptype table on Rx
>>>>> +    and thereby improve Rx performance if application wishes do so.
>>>>> +
>>>>>  * **Updated the enic driver.**
>>>>>  
>>>>>    * Added support for Geneve with options offload.
>>>>> @@ -258,6 +265,7 @@ New Features
>>>>>    * Rx metadata is delivered to host via a dynamic field of ``rte_mbuf`` with
>>>>>      PKT_RX_DYNF_METADATA.
>>>>>  
>>>>> +
>>>>>  Removed Items
>>>>>  -------------
>>>>>  
>>>>> diff --git a/lib/librte_ethdev/rte_ethdev.c b/lib/librte_ethdev/rte_ethdev.c
>>>>> index 652c369e5..13e6daaf4 100644
>>>>> --- a/lib/librte_ethdev/rte_ethdev.c
>>>>> +++ b/lib/librte_ethdev/rte_ethdev.c
>>>>> @@ -2879,6 +2879,92 @@ rte_eth_dev_get_supported_ptypes(uint16_t port_id, uint32_t ptype_mask,
>>>>>  	return j;
>>>>>  }
>>>>>  
>>>>> +int
>>>>> +rte_eth_dev_set_ptypes(uint16_t port_id, uint32_t ptype_mask,
>>>>> +				 uint32_t *set_ptypes, unsigned int num)
>>>>> +{
>>>>> +	const uint32_t valid_ptype_masks[] = {
>>>>> +		RTE_PTYPE_L2_MASK,
>>>>> +		RTE_PTYPE_L3_MASK,
>>>>> +		RTE_PTYPE_L4_MASK,
>>>>> +		RTE_PTYPE_TUNNEL_MASK,
>>>>> +		RTE_PTYPE_INNER_L2_MASK,
>>>>> +		RTE_PTYPE_INNER_L3_MASK,
>>>>> +		RTE_PTYPE_INNER_L4_MASK,
>>>>> +	};
>>>>> +	const uint32_t *all_ptypes;
>>>>> +	struct rte_eth_dev *dev;
>>>>> +	uint32_t unused_mask;
>>>>> +	unsigned int i, j;
>>>>> +	int ret;
>>>>> +
>>>>> +	RTE_ETH_VALID_PORTID_OR_ERR_RET(port_id, -ENODEV);
>>>>> +	dev = &rte_eth_devices[port_id];
>>>>> +
>>>>> +	if (num > 0 && set_ptypes == NULL)
>>>>> +		return -EINVAL;
>>>>> +
>>>>> +	if (*dev->dev_ops->dev_supported_ptypes_get == NULL ||
>>>>> +			*dev->dev_ops->dev_ptypes_set == NULL) {
>>>>> +		ret = 0;
>>>>> +		goto ptype_unknown;
>>>>> +	}
>>>>
>>>> Hi Pavan, Andrew,
>>>>
>>>> Why is the API not returning "-ENOTSUP" if the dev_ops is not provided, as
>>>> is done in other APIs?
>>>
>>> As I understand it, the reason behind it is the optional nature
>>> of the callback. It is just an optimization and there is no
>>> point in complicating application code to check for -ENOTSUP
>>> and treat it as OK.
>>
>> I was also thinking this is just an optimization, but Jerin wants to call it
>> by default, first in all samples and now in testpmd.
> 
> Why should the optimization not be applied in all samples?
> If a sample does not use ptype information, the optimization may
> and should be applied, since we are all keen for the best possible
> performance.
> 
>> Although it makes it simple to ignore the error this way, I think it would be
>> good to differentiate the success, not-implemented and failed cases.
> 
> No, it simplifies error tracking. We can check return value and
> if it is not zero say that it goes wrong (instead of not zero and
> not -ENOTSUP).

Right now it is not clear whether the API ran successfully or is not supported
at all. Also, there is no API to get the list of ptypes the PMD reports after
this call, so there is no way to know which ptypes are currently reported after
this API returns success.

For example, for the testpmd call, I suggest printing a log saying that the
supported ptypes have been reduced if the API returns successfully, but with the
current API we can't really say whether it was successful or not.
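
To illustrate the distinction, here is a minimal sketch, assuming the API were
changed to return -ENOTSUP when the dev_ops are missing (it does not do so in
this patch). `ptypes_set_result_str()` is a hypothetical helper, not a DPDK or
testpmd function, and the message strings are made up for illustration:

```c
#include <assert.h>
#include <errno.h>
#include <string.h>

/*
 * Hypothetical helper: map the return value of a ptype-set call to a log
 * message. Assumes -ENOTSUP would mean "PMD has no dev_ptypes_set callback",
 * which is NOT what the patch under review returns.
 */
static const char *
ptypes_set_result_str(int ret)
{
	if (ret == 0)
		return "reported ptypes reduced";
	if (ret == -ENOTSUP)
		return "ptype reduction not supported by PMD";
	return "failed to set ptypes";
}

/* The three cases an application could then tell apart. */
static void
ptypes_set_result_examples(void)
{
	assert(strcmp(ptypes_set_result_str(0),
		      "reported ptypes reduced") == 0);
	assert(strcmp(ptypes_set_result_str(-ENOTSUP),
		      "ptype reduction not supported by PMD") == 0);
	assert(strcmp(ptypes_set_result_str(-EINVAL),
		      "failed to set ptypes") == 0);
}
```

With the return values in the patch as posted, the first two cases both come
back as 0, so a caller cannot choose between the first two messages.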

> 
>>>
>>>>> +
>>>>> +	if (ptype_mask == 0) {
>>>>> +		ret = (*dev->dev_ops->dev_ptypes_set)(dev,
>>>>> +				ptype_mask);
>>>>> +		goto ptype_unknown;
>>>>> +	}
>>>>> +
>>>>> +	unused_mask = ptype_mask;
>>>>> +	for (i = 0; i < RTE_DIM(valid_ptype_masks); i++) {
>>>>> +		uint32_t mask = ptype_mask & valid_ptype_masks[i];
>>>>> +		if (mask && mask != valid_ptype_masks[i]) {
>>>>> +			ret = -EINVAL;
>>>>> +			goto ptype_unknown;
>>>>> +		}
>>>>> +		unused_mask &= ~valid_ptype_masks[i];
>>>>> +	}
>>>>
>>>> What does the above block do?
>>>> Does it mean 'ptype_mask' can only be a combination of 'valid_ptype_masks' items?
>>>
>>> Yes.
>>>
>>>> Let's say can't I set only 'RTE_PTYPE_L2_ETHER' as requested ptype?
>>>
>>> Yes
>>>
>>>> Why this limitation and is it documented anywhere, like the API documentation?
>>>
>>> It is simply incorrect to have non-mask bits here.
>>> RTE_PTYPE_L2_ETHER & RTE_PTYPE_L2_ETHER_ARP != 0
>>
>> Right.
>>
>>> It is documented in the ptype_mask parameter description.
>>
>> OK, I missed it in ptype_mask description.
>>
>>>
>>>>> +
>>>>> +	if (unused_mask) {
>>>>> +		ret = -EINVAL;
>>>>> +		goto ptype_unknown;
>>>>> +	}
>>>>> +
>>>>> +	all_ptypes = (*dev->dev_ops->dev_supported_ptypes_get)(dev);
>>>>> +	if (all_ptypes == NULL) {
>>>>> +		ret = 0;
>>>>> +		goto ptype_unknown;
>>>>> +	}
>>>>> +
>>>>> +	/*
>>>>> +	 * Accodommodate as many set_ptypes as possible. If the supplied
>>>>
>>>> s/Accodommodate/Accommodate
>>>>
>>>>> +	 * set_ptypes array is insufficient fill it partially.
>>>>> +	 */
>>>>> +	for (i = 0, j = 0; set_ptypes != NULL &&
>>>>> +				(all_ptypes[i] != RTE_PTYPE_UNKNOWN); ++i) {
>>>>> +		if (ptype_mask & all_ptypes[i]) {
>>>>> +			if (j < num - 1) {
>>>>> +				set_ptypes[j] = all_ptypes[i];
>>>>> +				j++;
>>>>> +				continue;
>>>>> +			}
>>>>> +			break;
>>>>> +		}
>>>>> +	}
>>>>> +
>>>>> +	if (set_ptypes != NULL && j < num)
>>>>> +		set_ptypes[j] = RTE_PTYPE_UNKNOWN;
>>>>> +
>>>>> +	return (*dev->dev_ops->dev_ptypes_set)(dev, ptype_mask);
>>>>
>>>> Just a question: at this stage the API knows the ptypes supported by the
>>>> device, so does it make sense to check whether 'ptype_mask' has anything
>>>> unsupported here, instead of passing it to the driver and expecting the
>>>> driver to check it?
>>>
>>> It is also explained by the nature of the API. Application says:
>>> I'm interested in the following ptypes only to let PMD know that
>>> it could drop classification behind it. If PMD does not support
>>> some layers, it will not provide classification anyway.
>>
>> The "eth_dev_ptypes_set_t" documentation says:
>>  "- (-EINVAL) if *ptype_mask* is invalid."
>> I thought the PMD does a check on the received 'ptype_mask'. If the
>> expectation is that the PMD will do the check, why not do it once in the
>> API instead of in each PMD?
>> What is the documented case "*ptype_mask* is invalid."?
> 
> Not a combination of RTE_PTYPE_*_MASK.
> There are a few cases above where -EINVAL is returned, including failed
> validation of ptype_mask.

OK, let me try again.
According to the API, the *PMD* can return an error if 'ptype_mask' is not valid:
a) what is that validity check in the PMD?
b) if it is a common check, can we move it from the PMD to the API?
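
For reference, the common part of the check (the one the API already does in
this patch) can be sketched standalone as below. This is only an illustration:
`ptype_mask_validate()` is a hypothetical name, and the RTE_PTYPE_* values are
copied here from rte_mbuf_ptype.h so that the sketch builds without DPDK.

```c
#include <assert.h>
#include <errno.h>
#include <stddef.h>
#include <stdint.h>

/* Values mirrored from rte_mbuf_ptype.h (assumption: unchanged upstream). */
#define RTE_PTYPE_L2_ETHER       0x00000001
#define RTE_PTYPE_L2_ETHER_ARP   0x00000003
#define RTE_PTYPE_L2_MASK        0x0000000f
#define RTE_PTYPE_L3_MASK        0x000000f0
#define RTE_PTYPE_L4_MASK        0x00000f00
#define RTE_PTYPE_TUNNEL_MASK    0x0000f000
#define RTE_PTYPE_INNER_L2_MASK  0x000f0000
#define RTE_PTYPE_INNER_L3_MASK  0x00f00000
#define RTE_PTYPE_INNER_L4_MASK  0x0f000000

/*
 * Hypothetical helper: return 0 if ptype_mask is 0 or a bitwise OR of whole
 * RTE_PTYPE_*_MASK values, -EINVAL otherwise.
 */
static int
ptype_mask_validate(uint32_t ptype_mask)
{
	static const uint32_t valid[] = {
		RTE_PTYPE_L2_MASK, RTE_PTYPE_L3_MASK, RTE_PTYPE_L4_MASK,
		RTE_PTYPE_TUNNEL_MASK, RTE_PTYPE_INNER_L2_MASK,
		RTE_PTYPE_INNER_L3_MASK, RTE_PTYPE_INNER_L4_MASK,
	};
	uint32_t unused = ptype_mask;
	size_t i;

	for (i = 0; i < sizeof(valid) / sizeof(valid[0]); i++) {
		uint32_t part = ptype_mask & valid[i];

		/* A layer must be requested entirely or not at all. */
		if (part != 0 && part != valid[i])
			return -EINVAL;
		unused &= ~valid[i];
	}
	/* Bits outside every known layer mask are invalid too. */
	return unused != 0 ? -EINVAL : 0;
}

static void
ptype_mask_validate_examples(void)
{
	assert(ptype_mask_validate(0) == 0);
	assert(ptype_mask_validate(RTE_PTYPE_L2_MASK | RTE_PTYPE_L3_MASK) == 0);
	/* A single ptype is only part of its layer mask, so it is rejected. */
	assert(ptype_mask_validate(RTE_PTYPE_L2_ETHER) == -EINVAL);
	assert(ptype_mask_validate(RTE_PTYPE_L2_ETHER_ARP) == -EINVAL);
	assert(ptype_mask_validate(0xf0000000u) == -EINVAL);
}
```

If this is all "invalid" means, the driver callback would have nothing left to
validate, which is the point of question b).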

> 
>>>
>>>>> +
>>>>> +ptype_unknown:
>>>>> +	if (num > 0)
>>>>> +		set_ptypes[0] = RTE_PTYPE_UNKNOWN;
>>>>> +
>>>>> +	return ret;
>>>>> +}
>>>>> +
>>>>>  int
>>>>>  rte_eth_macaddr_get(uint16_t port_id, struct rte_ether_addr *mac_addr)
>>>>>  {
>>>>> @@ -2891,7 +2977,6 @@ rte_eth_macaddr_get(uint16_t port_id, struct rte_ether_addr *mac_addr)
>>>>>  	return 0;
>>>>>  }
>>>>>  
>>>>> -
>>>>>  int
>>>>>  rte_eth_dev_get_mtu(uint16_t port_id, uint16_t *mtu)
>>>>>  {
>>>>> diff --git a/lib/librte_ethdev/rte_ethdev.h b/lib/librte_ethdev/rte_ethdev.h
>>>>> index 44d77b332..a7872819d 100644
>>>>> --- a/lib/librte_ethdev/rte_ethdev.h
>>>>> +++ b/lib/librte_ethdev/rte_ethdev.h
>>>>> @@ -2604,6 +2604,42 @@ int rte_eth_dev_fw_version_get(uint16_t port_id,
>>>>>   */
>>>>>  int rte_eth_dev_get_supported_ptypes(uint16_t port_id, uint32_t ptype_mask,
>>>>>  				     uint32_t *ptypes, int num);
>>>>> +/**
>>>>> + * @warning
>>>>> + * @b EXPERIMENTAL: this API may change without prior notice.
>>>>> + *
>>>>> + * Inform Ethernet device about reduced range of packet types to handle.
>>>>> + *
>>>>> + * Application can use this function to set only specific ptypes that it's
>>>>> + * interested. This information can be used by the PMD to optimize Rx path.
>>>>> + *
>>>>> + * The function accepts an array `set_ptypes` allocated by the caller to
>>>>> + * store the packet types set by the driver, the last element of the array
>>>>> + * is set to RTE_PTYPE_UNKNOWN. The size of the `set_ptype` array should be
>>>>> + * `rte_eth_dev_get_supported_ptypes() + 1` else it might only be filled
>>>>> + * partially.
>>>>> + *
>>>>> + * @param port_id
>>>>> + *   The port identifier of the Ethernet device.
>>>>> + * @param ptype_mask
>>>>> + *   The ptype family that application is interested in should be bitwise OR of
>>>>> + *   RTE_PTYPE_*_MASK or 0.
>>>>
>>>> No strong opinion, but is it really a mask? It isn't used to mask any value;
>>>> it is a list of ptypes that the application requests, in bitwise ORed form,
>>>> so I think it can be 'ptypes'.
>>>
>>> See above.
>>
>> OK, it is a mask really.
>>
>>>
>>>>> + * @param set_ptypes
>>>>> + *   An array pointer to store set packet types, allocated by caller. The
>>>>> + *   function marks the end of array with RTE_PTYPE_UNKNOWN.
>>>>
>>>> Not for this patch, but is an API also required to get the currently supported
>>>> ptypes from the driver? Even if the application keeps this list, something can
>>>> reset in the device/driver, the supported ptype list can change, and it can
>>>> then differ from what the application knows.
>>>
>>> I would say that set_ptypes should not affect supported_ptypes.
>>
>> I expect the same. I was asking whether we should have an API that returns the
>> list of ptypes reported by the PMD after the 'rte_eth_dev_set_ptypes()' call.
>> Can we know that list without a new API?
> 
> As I understand, the only way right now is to get it when ptypes are set.

That is what I am saying: assuming the application stores this ptype list, a
driver reset or something similar can make the application's list invalid/wrong,
so a better way to get the list may be required.
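
For context, the list the application would be storing is the one filled in at
set time. A minimal standalone sketch of that fill step follows; `ptypes_fill()`
is a hypothetical name, the RTE_PTYPE_* values are copied from rte_mbuf_ptype.h,
and unlike the loop in the patch it returns early when num is 0, because
`num - 1` on an unsigned value would otherwise wrap around.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Values mirrored from rte_mbuf_ptype.h (assumption: unchanged upstream). */
#define RTE_PTYPE_UNKNOWN   0x00000000
#define RTE_PTYPE_L2_ETHER  0x00000001
#define RTE_PTYPE_L3_IPV4   0x00000010
#define RTE_PTYPE_L4_TCP    0x00000100
#define RTE_PTYPE_L2_MASK   0x0000000f
#define RTE_PTYPE_L3_MASK   0x000000f0

/*
 * Hypothetical helper: copy the entries of all_ptypes (terminated by
 * RTE_PTYPE_UNKNOWN) that match ptype_mask into set_ptypes, always keeping
 * the last slot for the RTE_PTYPE_UNKNOWN terminator. Returns the number of
 * ptypes stored, not counting the terminator.
 */
static unsigned int
ptypes_fill(const uint32_t *all_ptypes, uint32_t ptype_mask,
	    uint32_t *set_ptypes, unsigned int num)
{
	unsigned int i, j = 0;

	if (set_ptypes == NULL || num == 0)
		return 0;

	for (i = 0; all_ptypes[i] != RTE_PTYPE_UNKNOWN; i++) {
		if ((ptype_mask & all_ptypes[i]) == 0)
			continue;
		if (j == num - 1)
			break; /* array too small: fill it only partially */
		set_ptypes[j++] = all_ptypes[i];
	}
	set_ptypes[j] = RTE_PTYPE_UNKNOWN;
	return j;
}

static void
ptypes_fill_examples(void)
{
	const uint32_t all[] = { RTE_PTYPE_L2_ETHER, RTE_PTYPE_L3_IPV4,
				 RTE_PTYPE_L4_TCP, RTE_PTYPE_UNKNOWN };
	uint32_t out[4] = { 0xff, 0xff, 0xff, 0xff };
	uint32_t mask = RTE_PTYPE_L2_MASK | RTE_PTYPE_L3_MASK;

	/* Large enough array: both matching ptypes plus the terminator. */
	assert(ptypes_fill(all, mask, out, 4) == 2);
	assert(out[0] == RTE_PTYPE_L2_ETHER && out[1] == RTE_PTYPE_L3_IPV4);
	assert(out[2] == RTE_PTYPE_UNKNOWN);

	/* Too small: one ptype fits, the terminator takes the last slot. */
	assert(ptypes_fill(all, mask, out, 2) == 1);
	assert(out[0] == RTE_PTYPE_L2_ETHER && out[1] == RTE_PTYPE_UNKNOWN);
}
```

Nothing here refreshes the stored copy after a device reset, which is the gap
described above.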

> I would add a _get_ptypes() function only when it is really required.

OK, I don't know if application may need to know it.

> 
>>>
>>>>> + * @param num
>>>>> + *   Size of the array pointed by param ptypes.
>>>>> + *   Should be rte_eth_dev_get_supported_ptypes() + 1 to accommodate the
>>>>> + *   set ptypes.
>>>>> + * @return
>>>>> + *   - (0) if Success.
>>>>> + *   - (-ENODEV) if *port_id* invalid.
>>>>> + *   - (-EINVAL) if *ptype_mask* is invalid (or) set_ptypes is NULL and
>>>>> + *     num > 0.
>>>>> + */
>>>>> +__rte_experimental
>>>>> +int rte_eth_dev_set_ptypes(uint16_t port_id, uint32_t ptype_mask,
>>>>> +			   uint32_t *set_ptypes, unsigned int num);
>>>>>  
>>>>>  /**
>>>>>   * Retrieve the MTU of an Ethernet device.
>>>>> diff --git a/lib/librte_ethdev/rte_ethdev_core.h b/lib/librte_ethdev/rte_ethdev_core.h
>>>>> index f215af7c9..657c39b56 100644
>>>>> --- a/lib/librte_ethdev/rte_ethdev_core.h
>>>>> +++ b/lib/librte_ethdev/rte_ethdev_core.h
>>>>> @@ -234,6 +234,22 @@ typedef int (*eth_dev_infos_get_t)(struct rte_eth_dev *dev,
>>>>>  typedef const uint32_t *(*eth_dev_supported_ptypes_get_t)(struct rte_eth_dev *dev);
>>>>>  /**< @internal Get supported ptypes of an Ethernet device. */
>>>>>  
>>>>> +/**
>>>>> + * @internal
>>>>> + * Inform Ethernet device about reduced range of packet types to handle.
>>>>> + *
>>>>> + * @param dev
>>>>> + *   The Ethernet device identifier.
>>>>> + * @param ptype_mask
>>>>> + *   The ptype family that application is interested in should be bitwise OR of
>>>>> + *   RTE_PTYPE_*_MASK or 0.
>>>>> + * @return
>>>>> + *   - (0) if Success.
>>>>> + *   - (-EINVAL) if *ptype_mask* is invalid.
>>>>> + */
>>>>> +typedef int (*eth_dev_ptypes_set_t)(struct rte_eth_dev *dev,
>>>>> +				     uint32_t ptype_mask);
>>>>> +
>>>>>  typedef int (*eth_queue_start_t)(struct rte_eth_dev *dev,
>>>>>  				    uint16_t queue_id);
>>>>>  /**< @internal Start rx and tx of a queue of an Ethernet device. */
>>>>> @@ -630,6 +646,8 @@ struct eth_dev_ops {
>>>>>  	eth_fw_version_get_t       fw_version_get; /**< Get firmware version. */
>>>>>  	eth_dev_supported_ptypes_get_t dev_supported_ptypes_get;
>>>>>  	/**< Get packet types supported and identified by device. */
>>>>> +	eth_dev_ptypes_set_t dev_ptypes_set;
>>>>> +	/**< Inform Ethernet device about reduced range of packet types to handle. */
>>>>>  
>>>>>  	vlan_filter_set_t          vlan_filter_set; /**< Filter VLAN Setup. */
>>>>>  	vlan_tpid_set_t            vlan_tpid_set; /**< Outer/Inner VLAN TPID Setup. */
>>>>> diff --git a/lib/librte_ethdev/rte_ethdev_version.map b/lib/librte_ethdev/rte_ethdev_version.map
>>>>> index ee289ef13..ccfbeae23 100644
>>>>> --- a/lib/librte_ethdev/rte_ethdev_version.map
>>>>> +++ b/lib/librte_ethdev/rte_ethdev_version.map
>>>>> @@ -292,4 +292,5 @@ EXPERIMENTAL {
>>>>>  	rte_flow_dynf_metadata_offs;
>>>>>  	rte_flow_dynf_metadata_mask;
>>>>>  	rte_flow_dynf_metadata_register;
>>>>> +	rte_eth_dev_set_ptypes;
>>>>>  };
>>>>>
>>>
> 


