[libvirt-users] Libvirt migration issues (0.9.4 and 0.9.9)

Daniel Espling espling at cs.umu.se
Tue Jan 31 09:32:05 UTC 2012


Hi again,

I spent some time trying to debug this:

added some printouts and noticed that the virNetSocketDupFD() function is called with cloexec = True, hence triggering the call:  fd = fcntl(sock->fd, F_DUPFD_CLOEXEC);

However, running on CentOS 5.5 our glibc version is glibc-2.5-49.el5_5.5, and it seems the F_DUPFD_CLOEXEC flag was not added until glibc2.7 [1]. I then tried to replace that code with:

if (cloexec) {
        fd = fcntl(sock->fd, F_DUPFD);
        if (fd >= 0)
            fcntl(fd, F_SETFD, FD_CLOEXEC);
    }

but the results are the same; Unable to copy socket file handle: Invalid argument

I also added some more printouts to find more about the fd:
2012-01-31 11:20:24.093+0000: 10445: error : virNetSocketDupFD:790 : sock->fd: 15, cloexec: 1: Invalid argument

and can at least see that fd 15 is the one complaining. Looking at lsof for the KVM process:

kvm        7456      root   13u     0000                0,8           0        852 anon_inode
kvm        7456      root   14u     IPv4             167375                    TCP localhost.localdomain:5900 (LISTEN)
kvm        7456      root   15r     FIFO                0,7                 167376 pipe
kvm        7456      root   16w     FIFO                0,7                 167376 pipe
kvm        7456      root   17u      CHR             10,200                   2986 /dev/net/tun

Seems like it fails duplicating a pipe leading back to the same process?

Regards,

Daniel

1) http://stackoverflow.com/questions/1643304/how-to-set-close-on-exec-by-default


On Jan 30, 2012, at 3:14 PM, Daniel Espling wrote:

> Dear all,
> 
> we're having two different problems with migrations in libvirt, running as root user on host machines with CentOS release 5.5 (Final), kernel: Linux 2.6.32.24 #3 SMP Fri Oct 29 16:22:02 BST 2010 x86_64 x86_64 x86_64 GNU/Linux
> 
> First case:
> 
> virsh version
> Compiled against library: libvir 0.9.4
> Using library: libvir 0.9.4
> Using API: QEMU 0.9.4
> Running hypervisor: QEMU 1.0.50
> 
> Migrations work well for a basic VM, but if we attach a disk to the usb bus migration is no longer possible and fails with the error message: "error: operation failed: migration job: is not active". This is regardless of if the device is mounted inside the VM or not (debian). Please find more information attached.
> 
> If we attach the same (.iso based) disk to the scsi bus instead, migrations work as normal.
> 
> ----
> 
> To mitigate this problem, we tried upgrading to a more recent libvirt version:
> 
> Compiled against library: libvir 0.9.9
> Using library: libvir 0.9.9
> Using API: QEMU 0.9.9
> Running hypervisor: QEMU 1.0.50
> 
> When trying to migrate a normal (debian) instance from one host to another using the same domain as in the previous successful case without any devices attached, migration fails with the error message "error: Unable to copy socket file handle: Invalid argument". The libvirt.log only has a similar single-line of information:  2012-01-30 15:44:46.772+0000: 7546: error : virNetSocketDupFD:787 : Unable to copy socket file handle: Invalid argument. 
> 
> The network configuration used here is the same as we successfully used in the 0.9.4 test case, using static ip's.
> 
> Thankful for assistance, not really sure what to try next. :)
> 
> Regards,
> Daniel Espling
> 
> 
> <libvirt_0.9.4.txt>_______________________________________________
> libvirt-users mailing list
> libvirt-users at redhat.com
> https://www.redhat.com/mailman/listinfo/libvirt-users





More information about the libvirt-users mailing list