[dpdk-dev] [PATCH 1/4] add vector PMD RX for FVL
Bruce Richardson
bruce.richardson at intel.com
Tue Sep 29 16:27:41 CEST 2015
On Mon, Sep 28, 2015 at 01:05:24AM +0800, Zhe Tao wrote:
> The vPMD RX function uses the multi-buffer and SSE instructions to
> accelerate the RX speed, but now the pktype cannot be supported by the vPMD RX,
> because it will decrease the performance heavily.
>
> Signed-off-by: Zhe Tao <zhe.tao at intel.com>
> ---
> config/common_bsdapp | 2 +
> config/common_linuxapp | 2 +
> drivers/net/i40e/Makefile | 1 +
> drivers/net/i40e/base/i40e_type.h | 3 +
> drivers/net/i40e/i40e_rxtx.c | 28 ++-
> drivers/net/i40e/i40e_rxtx.h | 20 +-
> drivers/net/i40e/i40e_rxtx_vec.c | 484 ++++++++++++++++++++++++++++++++++++++
> 7 files changed, 535 insertions(+), 5 deletions(-)
> create mode 100644 drivers/net/i40e/i40e_rxtx_vec.c
>
<snip>
> +
> + /* vPMD receive routine, now only accept (nb_pkts == RTE_I40E_VPMD_RX_BURST)
> + * in one loop
> + *
> + * Notice:
> + * - nb_pkts < RTE_I40E_VPMD_RX_BURST, just return no packet
I don't think this comment matches the implementation below. I think you are
allowed to request bursts as small as RTE_I40E_DESCS_PER_LOOP.
> + * - nb_pkts > RTE_I40E_VPMD_RX_BURST, only scan RTE_I40E_VPMD_RX_BURST
> + * numbers of DD bits
> +
> + */
> +static inline uint16_t
> +_recv_raw_pkts_vec(struct i40e_rx_queue *rxq, struct rte_mbuf **rx_pkts,
> + uint16_t nb_pkts, uint8_t *split_packet)
> +{
> + volatile union i40e_rx_desc *rxdp;
> + struct i40e_rx_entry *sw_ring;
> + uint16_t nb_pkts_recd;
> + int pos;
> + uint64_t var;
> + __m128i shuf_msk;
> +
> + __m128i crc_adjust = _mm_set_epi16(
> + 0, 0, 0, /* ignore non-length fields */
> + -rxq->crc_len, /* sub crc on data_len */
> + 0, /* ignore high-16bits of pkt_len */
> + -rxq->crc_len, /* sub crc on pkt_len */
> + 0, 0 /* ignore pkt_type field */
> + );
> + __m128i dd_check, eop_check;
> +
> + /* nb_pkts shall be less equal than RTE_I40E_MAX_RX_BURST */
> + nb_pkts = RTE_MIN(nb_pkts, RTE_I40E_MAX_RX_BURST);
> +
> + /* nb_pkts has to be floor-aligned to RTE_I40E_DESCS_PER_LOOP */
> + nb_pkts = RTE_ALIGN_FLOOR(nb_pkts, RTE_I40E_DESCS_PER_LOOP);
/Bruce
More information about the dev
mailing list