[dpdk-dev] [PATCH v3 15/19] vhost: postpone rings addresses translation
Yao, Lei A
lei.a.yao at intel.com
Tue Oct 17 03:24:28 CEST 2017
Hi, Maxime
> -----Original Message-----
> From: Maxime Coquelin [mailto:maxime.coquelin at redhat.com]
> Sent: Monday, October 16, 2017 6:48 PM
> To: Yao, Lei A <lei.a.yao at intel.com>; 'dev at dpdk.org' <dev at dpdk.org>;
> Horton, Remy <remy.horton at intel.com>; Bie, Tiwei <tiwei.bie at intel.com>;
> 'yliu at fridaylinux.org' <yliu at fridaylinux.org>
> Cc: 'mst at redhat.com' <mst at redhat.com>; 'jfreiman at redhat.com'
> <jfreiman at redhat.com>; 'vkaplans at redhat.com' <vkaplans at redhat.com>;
> 'jasowang at redhat.com' <jasowang at redhat.com>
> Subject: Re: [dpdk-dev] [PATCH v3 15/19] vhost: postpone rings addresses
> translation
>
>
>
> On 10/16/2017 11:47 AM, Maxime Coquelin wrote:
> > Hi Yao,
> >
> > On 10/16/2017 08:23 AM, Yao, Lei A wrote:
> >> Hi, Maxime
> >>
> >> Add one comment:
> >> This issue with virtio-net only occur when I use CPU on socket 1.
> >
> > Thanks for the report.
> > I fail to reproduce for now.
> >
> > What is your qemu command line?
> > Is it reproducible systematically when there is a NUMA reallocation
> > (DPDK on socket 0, QEMU on socket 1)?
>
> Nevermind, I just reproduced the (an?) issue.
> The issue I reproduce is not linked to NUMA reallocation, but to
> messages sequencing differences between QEMU versions.
>
> So, I'm not 100% this is the same issue, as you mention it works fine
> when using CPU socket 0.
>
> The patch "vhost: postpone rings addresses translation" moves rings
> addresses translation at either vring kick or enable time, depending
> on whether protocol features are enabled or not. This has been done
> because we must not interpret ring information as long as the vring is
> not fully initialized.
>
> While my patch works fine with recent QEMU version, it breaks with older
> ones, like QEMU v2.5. The reason is that on these older versions,
> VHOST_USER_SET_VRING_ENABLE is called once and before
> VHOST_USER_SET_VRING_ADDR. At that time, the ring adresses aren't
> available so the translation is not done. On recent QEMU versions,
> we receive VHOST_USER_SET_VRING_ENABLE also after having received
> the rings addresses, so it works fine.
>
> The below fix consists in performing the rings addresses translation
> also when handling VHOST_USER_SET_VRING_ADDR if ring has already been
> enabled.
>
> I'll post a formal patch later today or tomorrow morning after having
> tested it more conscientiously. Let me know if it fixes your issue.
>
> Thanks,
> Maxime
>
Thanks for your quick fix. I try your patch and it can totally work at my side for virtio-net.
I tested it with Qemu 2.5~2.7, 2.10.
The previous info about it can work on numa 0 is a misleading info. Because
I use qemu 2.10 for some special test at that time. So it can work.
BRs
Lei
> diff --git a/lib/librte_vhost/vhost_user.c b/lib/librte_vhost/vhost_user.c
> index 76c4eeca5..1f6cba4b9 100644
> --- a/lib/librte_vhost/vhost_user.c
> +++ b/lib/librte_vhost/vhost_user.c
> @@ -372,33 +372,6 @@ ring_addr_to_vva(struct virtio_net *dev, struct
> vhost_virtqueue *vq,
> return qva_to_vva(dev, ra);
> }
>
> -/*
> - * The virtio device sends us the desc, used and avail ring addresses.
> - * This function then converts these to our address space.
> - */
> -static int
> -vhost_user_set_vring_addr(struct virtio_net *dev, VhostUserMsg *msg)
> -{
> - struct vhost_virtqueue *vq;
> - struct vhost_vring_addr *addr = &msg->payload.addr;
> -
> - if (dev->mem == NULL)
> - return -1;
> -
> - /* addr->index refers to the queue index. The txq 1, rxq is 0. */
> - vq = dev->virtqueue[msg->payload.addr.index];
> -
> - /*
> - * Rings addresses should not be interpreted as long as the ring
> is not
> - * started and enabled
> - */
> - memcpy(&vq->ring_addrs, addr, sizeof(*addr));
> -
> - vring_invalidate(dev, vq);
> -
> - return 0;
> -}
> -
> static struct virtio_net *
> translate_ring_addresses(struct virtio_net *dev, int vq_index)
> {
> @@ -464,6 +437,43 @@ translate_ring_addresses(struct virtio_net *dev,
> int vq_index)
> }
>
> /*
> + * The virtio device sends us the desc, used and avail ring addresses.
> + * This function then converts these to our address space.
> + */
> +static int
> +vhost_user_set_vring_addr(struct virtio_net **pdev, VhostUserMsg *msg)
> +{
> + struct vhost_virtqueue *vq;
> + struct vhost_vring_addr *addr = &msg->payload.addr;
> + struct virtio_net *dev = *pdev;
> +
> + if (dev->mem == NULL)
> + return -1;
> +
> + /* addr->index refers to the queue index. The txq 1, rxq is 0. */
> + vq = dev->virtqueue[msg->payload.addr.index];
> +
> + /*
> + * Rings addresses should not be interpreted as long as the ring
> is not
> + * started and enabled
> + */
> + memcpy(&vq->ring_addrs, addr, sizeof(*addr));
> +
> + vring_invalidate(dev, vq);
> +
> + if (vq->enabled && (dev->features &
> + (1ULL << VHOST_USER_F_PROTOCOL_FEATURES))) {
> + dev = translate_ring_addresses(dev,
> msg->payload.state.index);
> + if (!dev)
> + return -1;
> +
> + *pdev = dev;
> + }
> +
> + return 0;
> +}
> +
> +/*
> * The virtio device sends us the available ring last used index.
> */
> static int
> @@ -1273,7 +1283,7 @@ vhost_user_msg_handler(int vid, int fd)
> vhost_user_set_vring_num(dev, &msg);
> break;
> case VHOST_USER_SET_VRING_ADDR:
> - vhost_user_set_vring_addr(dev, &msg);
> + vhost_user_set_vring_addr(&dev, &msg);
> break;
> case VHOST_USER_SET_VRING_BASE:
> vhost_user_set_vring_base(dev, &msg);
More information about the dev
mailing list