[libvirt] [PATCH v1] Loop through all resolved addresses in virNetSocketNewListenTCP

Olaf Hering olaf at aepfle.de
Thu May 17 06:14:23 UTC 2018


Am Wed, 16 May 2018 18:44:32 -0400
schrieb John Ferlan <jferlan at redhat.com>:

> On 05/15/2018 04:20 AM, Olaf Hering wrote:
> > Currently virNetSocketNewListenTCP bails out early under the following
> > conditions:
> > - the hostname resolves to at least one IPv4 and at least one IPv6
> >   address  
> which produces what error?
> > IPv6. Binding the IPv6 address will obviously fail. But this terminates
> > the entire loop, even if binding to IPv4 succeeded.

What happens if a hostname resolves to more than one address, but none of the
interfaces has some of the addresses assigned? bind() will fail.

Just try it yourself, remove one address with 'ip addr del ...' (whatever the
required syntax is) and do a migration to that host.

For me it happens with BOOTPROTO='dhcp4' instead of 'dhcp' in ifcfg-br0.

> > @@ -382,11 +382,8 @@ int virNetSocketNewListenTCP(const char *nodename,
> >  #endif
> >  
> >          if (bind(fd, runp->ai_addr, runp->ai_addrlen) < 0) {
> > -            if (errno != EADDRINUSE) {
> > -                virReportSystemError(errno, "%s", _("Unable to bind to port"));
> > -                goto error;
> > -            }
> > -            addrInUse = true;
> > +            if (errno == EADDRINUSE)
> > +                addrInUse = true;  
> 
> So this to me reads as if we're just ignoring all errors, but we do care
> if the address was in use already and we'll save that bit of information
> for later.

Why would any error matter here, beside the two that this function actually
handles in some way? It seems to me the intent of that function is to bind
to at least one of the addresses for the hostname passed to this function.
There is no point in failing the entire operation just because there are 
more addresses in the resolved list. A client that actually tries to connect
to that hostname will also cycle through all addresses until one succeeds.

> bind says this is returned when "The address argument is a null
> pointer." - so essentially we're making up a failure?

Yes. There was no success trying to bind() to any of the resolved addresses.
There are a few places that can lead to nsocks==0, not just failures of bind().


Olaf
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 195 bytes
Desc: Digitale Signatur von OpenPGP
URL: <http://listman.redhat.com/archives/libvir-list/attachments/20180517/1d31ca50/attachment-0001.sig>


More information about the libvir-list mailing list