[libvirt RFC PATCH 0/5] eliminating VIR_FREE in the *Clear() functions

Laine Stump laine at redhat.com
Fri Feb 12 18:55:36 UTC 2021


On 2/12/21 5:25 AM, Daniel P. Berrangé wrote:
> On Fri, Feb 12, 2021 at 11:07:21AM +0100, Erik Skultety wrote:
>> On Fri, Feb 12, 2021 at 10:43:56AM +0100, Martin Kletzander wrote:
>>> On Fri, Feb 12, 2021 at 12:54:02AM -0500, Laine Stump wrote:
>>>> I've looked at a few of these, and one thing I've found is that very
>>>> often we have a function called somethingSomethingClear(), and:
>>>>
>>>> 1) The only places it is ever called will immediately free the memory
>>>> of the object as soon as they clear it.
>>>>
>>>> and very possibly
>>>>
>>>> 2) It doesn't actually *clear* everything. Some items are cleared via VIR_FREE(), but then some of the other pointers call
>>>>
>>>>     bobLoblawFree(def->bobloblaw)
>>>>
>>>> and then don't actually set def->bobloblaw to NULL - so the functions
>>>> aren't actually "Clearing", they're "Freeing and then clearing a few
>>>> things, but not everything".
>>>>
>>>
>>> One thing I am wondering is whether this is really only used where it makes
>>> sense.  As far as I understand, and please correct me if I am way off, the
>>> purpose of the Clear functions is to:
>>>
>>>   a) provide a way to remove everything from a structure that the current
>>>      function cannot recreate (there is a pointer to it somewhere else which
>>>      would not be updated) and
>>>
>>>   b) provide a way to reset a structure so that it can be filled again without
>>>      needless reallocation.
>>>
>>> I think (b) is obviously pointless, especially lately, so the only reasonable
>>> usage would be for the scenario (a).  However, I think I remember this also
>>> being used in places where it would be perfectly fine to free the variable and
>>> recreate it.  Maybe it could ease up the decision, at least by eliminating some
>>> of the code, if my hunch is correct.
>>>
>>> In my quick search I only found virDomainVideoDefClear to be used in this manner
>>> and I am not convinced that it is worth having this extra function with extra
>>
>> You could always memset it explicitly, someone might find the code more
>> readable then. IMO I'd vote for an explicit memset just for the sake of better
>> security practice (since we'll have to wait a little while for something like
>> SGX to be convenient to deploy and develop with...). Anyhow, I'm not sure how
>> many cycles exactly would be wasted, but IIRC a recent discussion memset can be
>> optimized away (correct me if I don't remember it well!), so Dan P.B.
>> suggested to gradually convert to some platform-specific ways on how to
>> sanitize the memory safely - with that in mind, I'd say we use an explicit
>> memset in all the functions in question and convert them later?
> 
> I only suggest that for places where security is required. ie to scrub
> passwords.

Yeah, I'm not planning to touch anything that is clearing out passwords 
and such. Only the *Clear() functions that currently have the dual 
purposes of:

1) freeing memory pointed to by the object in question (and any sub-objects)

2) clearing out the object so that it can be re-used with no side 
effects (e.g., pointers NULLed so that subsequent uses believe 
(correctly) that nothing is being pointed at, setting counters to 0, 
types to ..._NONE, etc.

> If the compiler wants to cull memsets in places unrelated to security
> that's fine by me, or at least, not our problem to worry about.

I would hope that the compiler would be smart enough to not optimize it 
out if it can't determine 100% that it will never make a difference. 
This would mean that, for example, unless a *Clear() function is defined 
static, it couldn't optimize out a memset() at the end (because it can't 
know what would be done with the object after return).

But if it's going to optimize out a memset, it would likely also 
optimize out the "loblaw = NULL;" in the VIR_FREE invocation, so...

(My mind keeps going back to 1994, when I turned on the 80386 "invalid 
address faults" bit (forget the exact name) on our router product that 
was running 8086 realmode *BSD, and suddenly so many stupid pointer bugs 
were immediately revealed )by a segfault) instead of the code just 
silently going off into the weeds. And when we started NULLing out 
pointers as things were freed we found so many more; the sources of 
mysterious problems that customers had been reporting for months were 
suddenly obvious. So my subconscious tells me that NULLing out freed 
pointers (and the memory they point to) is just "safer", and we're 
spending all this time removing that safety; kind of like going through 
all the cars in the world to remove their seatbelts because they make 
driving less convenient, and airbags offer a similar type of protection...)




More information about the libvir-list mailing list