[dpdk-dev] [PATCH] eal: fix floating device argument pointer
Gaëtan Rivet
gaetan.rivet at 6wind.com
Thu Oct 25 11:42:06 CEST 2018
On Thu, Oct 25, 2018 at 03:22:11AM +0000, Zhang, Qi Z wrote:
>
>
> > -----Original Message-----
> > From: Gaëtan Rivet [mailto:gaetan.rivet at 6wind.com]
> > Sent: Wednesday, October 24, 2018 10:34 AM
> > To: Thomas Monjalon <thomas at monjalon.net>
> > Cc: dev at dpdk.org; Zhang, Qi Z <qi.z.zhang at intel.com>
> > Subject: Re: [dpdk-dev] [PATCH] eal: fix floating device argument pointer
> >
> > On Wed, Oct 24, 2018 at 04:43:45PM +0200, Thomas Monjalon wrote:
> > > 24/10/2018 00:39, Gaëtan Rivet:
> > > > Hi,
> > > >
> > > > On Mon, Oct 22, 2018 at 09:25:22AM +0200, Thomas Monjalon wrote:
> > > > > 22/10/2018 07:49, Qi Zhang:
> > > > > > After we insert a devargs into devargs_list, following bus->scan
> > > > > > may destroy it due to another rte_devargs_insert. Its better not
> > > > > > to use a devargs pointer after it has been inserted.
> > > > >
> > > >
> > > > A bus scan calls rte_devargs_insert? Mapping devargs to device is
> > > > the responsibility of the bus scan, if it calls potentially
> > > > destructive functions, it must rebuild the map.
>
> This does happens when try to attach a vdev on secondary, and I think this is the real place need to fix.
> I will drop this patch and submit a new fix to prevent unnecessary rte_devargs_insert during the vdev bus scan.
>
The vdev_init function should call dev_probe instead of reimplementing it.
But looking at the big picture, maybe the real bug is secondary process.
> Thanks
> Qi
>
> > > >
> > > > > I think the problem is in:
> > > > >
> > > > > rte_devargs_insert(struct rte_devargs *da) {
> > > > > int ret;
> > > > >
> > > > > ret = rte_devargs_remove(da);
> > > > > if (ret < 0)
> > > > > return ret;
> > > > > TAILQ_INSERT_TAIL(&devargs_list, da, next);
> > > > > return 0;
> > > > > }
> > > > >
> > > > > We insert a structure which is freed!
> > > >
> > > > Not usually, I hope!
> > > >
> > > > >
> > > > > See http://git.dpdk.org/dpdk/commit/?id=55744d83d525
> > > > >
> > > > > Gaetan, what can be the fix?
> > > >
> > > > 1. rte_devargs_insert is misdefined.
> > > > It is designed as a function that can never fail.
> > > > The function should return void instead.
> > > >
> > > > 2. rte_devargs_remove(da), will not remove da itself.
> > > > It will remove whichever rte_devargs matches da within the internal
> > > > list. If da does not match any in the list, it does nothing.
> > > > As da is a newly-callocated structure, it is actually safe to
> > > > continue using it after having called rte_devargs_remove(), because
> > > > it cannot possibly have been inserted in the meantime (so would not
> > > > have been freed, even if another devargs matched it).
> > >
> > > If the devargs pointer passed in parameter is the same as the one in
> > > the list, it will be freed.
> > >
> >
> > This would only happen if one did:
> >
> > rte_devargs_insert(dev->devargs);
> >
> > > > The actual issue is that the matching rte_devargs within the list
> > > > would be referenced by a device after a successful scan, meaning that
> > > > this reference is not safe if someone attemps to insert the same
> > > > device after the bus->scan(). If my understanding is correct, the
> > above
> > > > fix is not necessary, but probing should be guarded against
> > > > re-entrancy.
> > >
> > > We may want to probe again with different parameters.
> > >
> >
> > Sure, but in this case the fix is to check whether the device is already probed,
> > and if so remove it before probing it again with the new devargs.
> >
> > >
> > > Nice rant :)
> >
> > :)
> >
> > --
> > Gaëtan Rivet
> > 6WIND
--
Gaëtan Rivet
6WIND
More information about the dev
mailing list