[dpdk-dev] [PATCH 2/5] mempool: reduce wasted space on mempool populate
Olivier Matz
olivier.matz at 6wind.com
Tue Oct 29 18:09:10 CET 2019
On Tue, Oct 29, 2019 at 01:09:01PM +0300, Andrew Rybchenko wrote:
> On 10/28/19 5:01 PM, Olivier Matz wrote:
> > The size returned by rte_mempool_op_calc_mem_size_default() is aligned
> > to the specified page size. Therefore, with big pages, the returned size
> > can be much more than what we really need to populate the mempool.
> >
> > For instance, populating a mempool that requires 1.1GB of memory with
> > 1GB hugepages can result in allocating 2GB of memory.
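> >
> > To put hypothetical numbers on it: with total_elt_sz = 2176 bytes and
> > pg_sz = 1GB, obj_per_page = 1073741824 / 2176 = 493447 objects per
> > page. Asking for obj_num = 524288 objects (~1.1GB of objects) gives
> > pg_num = ceil(524288 / 493447) = 2 pages, i.e. mem_size = 2GB.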
> >
> > This problem is hidden most of the time due to the allocation method of
> > rte_mempool_populate_default(): when try_iova_contig_mempool=true, it
> > first tries to allocate an iova contiguous area, without the alignment
> > constraint. If that fails, it falls back to an aligned allocation
> > that does not need to be iova-contiguous; this fallback may itself be
> > split into several smaller aligned allocations.
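> >
> > A simplified sketch of that strategy (hypothetical helper, not the
> > actual rte_mempool_populate_default() code, which carries more state):
> >
> > 	#include <rte_memzone.h>
> >
> > 	/* Try one iova-contiguous zone with only cache-line alignment;
> > 	 * if that reservation fails, retry page-aligned without
> > 	 * requiring iova contiguity. */
> > 	static const struct rte_memzone *
> > 	reserve_mz(const char *name, size_t size, int socket_id,
> > 		   size_t pg_sz)
> > 	{
> > 		const struct rte_memzone *mz;
> >
> > 		mz = rte_memzone_reserve_aligned(name, size, socket_id,
> > 				RTE_MEMZONE_IOVA_CONTIG,
> > 				RTE_CACHE_LINE_SIZE);
> > 		if (mz != NULL)
> > 			return mz;
> >
> > 		return rte_memzone_reserve_aligned(name, size, socket_id,
> > 				0, (unsigned int)pg_sz);
> > 	}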
> >
> > This commit changes rte_mempool_op_calc_mem_size_default() to relax the
> > alignment constraint to a cache line and to return a smaller size.
> >
> > Signed-off-by: Olivier Matz <olivier.matz at 6wind.com>
>
> One maybe-unrelated question below.
>
> Reviewed-by: Andrew Rybchenko <arybchenko at solarflare.com>
>
> [snip]
>
> > diff --git a/lib/librte_mempool/rte_mempool_ops_default.c b/lib/librte_mempool/rte_mempool_ops_default.c
> > index 4e2bfc82d..f6aea7662 100644
> > --- a/lib/librte_mempool/rte_mempool_ops_default.c
> > +++ b/lib/librte_mempool/rte_mempool_ops_default.c
> > @@ -12,7 +12,7 @@ rte_mempool_op_calc_mem_size_default(const struct rte_mempool *mp,
> >  				     size_t *min_chunk_size, size_t *align)
> >  {
> >  	size_t total_elt_sz;
> > -	size_t obj_per_page, pg_num, pg_sz;
> > +	size_t obj_per_page, pg_sz, objs_in_last_page;
> >  	size_t mem_size;
> >
> >  	total_elt_sz = mp->header_size + mp->elt_size + mp->trailer_size;
> > @@ -33,14 +33,30 @@ rte_mempool_op_calc_mem_size_default(const struct rte_mempool *mp,
> >  			mem_size =
> >  				RTE_ALIGN_CEIL(total_elt_sz, pg_sz) * obj_num;
> >  		} else {
> > -			pg_num = (obj_num + obj_per_page - 1) / obj_per_page;
> > -			mem_size = pg_num << pg_shift;
> > +			/* In the best case, the allocator will return a
> > +			 * page-aligned address. For example, with 5 objs,
> > +			 * the required space is as below:
> > +			 *  |     page0     |     page1     |  page2 (last) |
> > +			 *  |obj0 |obj1 |xxx|obj2 |obj3 |xxx|obj4|
> > +			 *  <------------- mem_size ------------->
> > +			 */
> > +			objs_in_last_page = ((obj_num - 1) % obj_per_page) + 1;
> > +			/* room required for the last page */
> > +			mem_size = objs_in_last_page * total_elt_sz;
> > +			/* room required for other pages */
> > +			mem_size += ((obj_num - objs_in_last_page) /
> > +				obj_per_page) << pg_shift;
> > +
> > +			/* In the worst case, the allocator returns a
> > +			 * non-aligned pointer, wasting up to
> > +			 * total_elt_sz. Add a margin for that.
> > +			 */
> > +			mem_size += total_elt_sz - 1;
> >  		}
> >  	}
> >
> > -	*min_chunk_size = RTE_MAX((size_t)1 << pg_shift, total_elt_sz);
> > -
> > -	*align = RTE_MAX((size_t)RTE_CACHE_LINE_SIZE, (size_t)1 << pg_shift);
> > +	*min_chunk_size = total_elt_sz;
> > +	*align = RTE_CACHE_LINE_SIZE;
>
> Not directly related to the patch, but may be RTE_MEMPOOL_ALIGN should be
> used?
Yes, and there is another one that could be changed in
rte_mempool_populate_iova().
I can add a patch for that.
>
> >  	return mem_size;
> >  }
>
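
For reference, plugging small, hypothetical numbers into the new
formula (total_elt_sz = 1536, pg_sz = 4096, hence obj_per_page = 2,
and obj_num = 5 as in the diagram above):

	objs_in_last_page = ((5 - 1) % 2) + 1 = 1
	mem_size = 1 * 1536			/* last page */
		 + ((5 - 1) / 2) * 4096		/* two full pages */
		 + 1536 - 1			/* misalignment margin */
		 = 11263 bytes

whereas the previous code returned pg_num << pg_shift = 3 * 4096 =
12288 bytes and additionally required a page-size alignment.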