Memory pool interface design
Orna Agmon Ben-Yehuda
ladypine at gmail.com
Sun May 17 22:08:58 IDT 2015
On Sat, May 16, 2015 at 10:27 PM, Elazar Leibovich <elazarl at gmail.com>
wrote:
> Thanks Orna,
>
> My understanding is, that in stock Linux kernel, a process that allocates
> too much memory is unlikely to receive NULL from malloc. The more likely
> scenario is, the whole system would swap out pages, and the OOM killer
> would, hopefully, kill the offending process. During that time it is
> unlikely that malloc would return NULL.
>
>
Oleg already mentioned that OOM kills an arbitrary process, not necessarily
the one causing problems=failing a malloc. But even if it did kill the
process which failed the malloc, this is not necessarily the correct
behavior. I am considering a world in which applications are expected to be
aware of memory pressure. In this case, process where the malloc failed is
not necessarily the one that needs to crash or even free memory. There
might be other measures that need to be taken before it happens. For
example, an elastic application should release memory. An example for such
a scenario is when you work with postgresql, which relies on storing data
in cached pages. It can release memory easily, and make room for the failed
malloc to succeed. Another option is for the balloon device driver to
deflate, and by that make room for malloc.
> Indeed, if you look at actual applications written for Linux, you'd see
> that many use variants of xmalloc, which simply aborts if malloc returns
> NULL. I think that the logic is similar to what I wrote before.
>
> I thought to give a list of such real world programs, but someone wrote it
> much better than I did:
>
>
> http://eli.thegreenplace.net/2009/10/30/handling-out-of-memory-conditions-in-c
>
> But feel free to correct me if I'm wrong.
>
> PS,
> I think we have miscommunication here, since from what I understand the
> paper linked is about memory allocation for a VM, while I'm talking about
> memory allocation for a process.
>
>
No miscommunication. Indeed our paper deals with allocating memory to VMs,
but to make things work we needed to handle swapping and memory allocation,
even returning memory from libc to the operating system, all to make the
memory allocation of one process behave elastically. See section 7 for the
changes we made to the operating system configuration. Our processes
certainly need to get the return value from malloc and live, even if we get
NULL, which we might get many times as a healthy progress of things.
> On Sat, May 16, 2015 at 10:10 PM, Orna Agmon Ben-Yehuda <
> ladypine at gmail.com> wrote:
>
>> Hi Elazar,
>>
>> I find that malloc failure checking is vital (within the user program)
>> even on a regular system with gigabytes of memory. For example, when the
>> program gets into a recursive loop which allocates memory and then digs
>> deeper. In other cases, It is useful to check the return value of malloc
>> when the program input size is unlimited, and it is better to inform the
>> user about the too-large element of the input rather than to crash.
>>
>> I do not understand, however, how memory overcommitment leads you to not
>> require malloc to fail. Maybe when you say overcommitment you do not mean
>> what I mean, but in our scenario[1] of memory overcommitment, where memory
>> balloon drivers are used, we configured the memory allocations to fail when
>> there was not enough physical memory, so that the guest application would
>> be able to tell when there is memory pressure. Otherwise, the burden of
>> handling memory pressure is laid purely on the operating system itself.
>>
>> [1] Ginseng: Market-Driven Memory Allocation
>> <http://www.cs.technion.ac.il/~ladypine/vee18-agmon-ben-yehuda.pdf>,
>> Orna Agmon Ben-Yehuda, Eyal Posener, Muli Ben-Yehuda, Assaf Schuster, Ahuva
>> Mu'alem. In proceedings of VEE 2014.
>>
>>
>> On Sat, May 16, 2015 at 9:14 PM, Elazar Leibovich <elazarl at gmail.com>
>> wrote:
>>
>>> The question of whether to use a global malloc function, or to use a
>>> function pointer is orthogonal to my question.
>>>
>>> My question is, should I support the case of malloc failure. On one
>>> hand, it complicates the API significantly, but on the other hand it might
>>> be useful for some use cases.
>>>
>>> It's pretty obvious to me that in a modern Linux userspace program,
>>> supporting malloc failure does not worth the trouble. But are there other
>>> use cases where it's vital?
>>>
>>> Another clarification, my code would never have abort. What I was
>>> saying, that the malloc could simply abort current task, if it does not
>>> have memory.
>>>
>>> As a side note, In my experience, it is sometimes useful to use
>>> preallocated "memory pools"[0]. Letting the user choose memory allocator is
>>> also useful when using it in the kernel, since otherwise the library simply
>>> won't compile. See for example protobuf-c which receives an allocator in
>>> its functions,
>>> https://github.com/protobuf-c/protobuf-c/blob/master/protobuf-c/protobuf-c.c#L2019
>>>
>>>
>>> [0]
>>> http://eli.thegreenplace.net/2008/10/17/memmgr-a-fixed-pool-memory-allocator
>>>
>>> On Fri, May 15, 2015 at 9:00 PM, Baruch Even <baruch at ev-en.org> wrote:
>>>
>>>> I would question the need to abstract away the memory allocations of
>>>> your library compared to everything else. If someone cares enough about it
>>>> he can replace malloc and free completely to use a different allocation
>>>> scheme.
>>>>
>>>> In most cases I've cared about memory allocations I just wanted none of
>>>> them at all and only wanted intrusive data structures and just running the
>>>> system with a fixed memory allocation from the start to the end. It's not
>>>> always possible in a generic library though..
>>>>
>>>> If you are writing a library you should never abort inside it, that
>>>> would be very annoying to the user. Give him a null and let him crash or
>>>> handle it as he sees fit.
>>>>
>>>> Baruch
>>>>
>>>> On Fri, May 15, 2015 at 5:47 PM, Elazar Leibovich <elazarl at gmail.com>
>>>> wrote:
>>>>
>>>>> I'm writing a small C library, that I want to open source.
>>>>>
>>>>> I want them to be usable for embedded environment, where memory
>>>>> allocation must be controlled.
>>>>>
>>>>> Hence, I abstracted away calls to malloc/realloc, and replaced them
>>>>> with
>>>>>
>>>>> struct mem_pool {
>>>>> void *(*allloc)(void *mem_pool, void *prev_ptr, int size);
>>>>> };
>>>>>
>>>>> User would implement
>>>>>
>>>>> struct my_mem_pool {
>>>>> struct mem_pool pool;
>>>>> ...
>>>>> };
>>>>>
>>>>> struct my_mem_pool pool = { { my_alloc_func }, ...);
>>>>>
>>>>> I've had to design question I'm interested with:
>>>>>
>>>>> 1) Should I support both malloc and realloc?
>>>>>
>>>>> I think the performance benefits of supporting malloc instead of
>>>>> realloc(NULL) are negligible, and not worth complicating the interface.
>>>>>
>>>>> 2) Should the memory pool be allowed to fail?
>>>>>
>>>>> In typical Linux system, where memory overcommit is allowed, checking
>>>>> malloc return value provides little benefit. But is it the same for
>>>>> embedded system?
>>>>>
>>>>> My feeling is, embedded system should predict the memory usage for
>>>>> each input size, and avoid processing input which is too large.
>>>>>
>>>>> For example, stack overflow error can never be handled, and one is
>>>>> expected to calculate the longest stack length for any input and make sure
>>>>> he wouldn't overflow.
>>>>>
>>>>> So I think it's still reasonable never to report allocation failure,
>>>>> and to expect the memory allocator to raise the relevant
>>>>> abort/panic/exception in such a case.
>>>>>
>>>>> But I'll be happy to hear other considerations I missed.
>>>>>
>>>>> Thanks,
>>>>>
>>>>> _______________________________________________
>>>>> Linux-il mailing list
>>>>> Linux-il at cs.huji.ac.il
>>>>> http://mailman.cs.huji.ac.il/mailman/listinfo/linux-il
>>>>>
>>>>>
>>>>
>>>
>>> _______________________________________________
>>> Linux-il mailing list
>>> Linux-il at cs.huji.ac.il
>>> http://mailman.cs.huji.ac.il/mailman/listinfo/linux-il
>>>
>>>
>>
>>
>> --
>> Orna Agmon Ben-Yehuda.
>> http://ladypine.org
>>
>
>
--
Orna Agmon Ben-Yehuda.
http://ladypine.org
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.cs.huji.ac.il/pipermail/linux-il/attachments/20150517/cbe43271/attachment-0001.html>
More information about the Linux-il
mailing list