[Ovirt-devel] Re: [PATCH] Use multiple processes to check host status
Ian Main
imain at redhat.com
Fri Jun 13 21:53:34 UTC 2008
On Fri, 13 Jun 2008 14:38:16 -0700
Ian Main <imain at redhat.com> wrote:
> This patch causes host-status to fork() up to node_count/5 times to
> connect out to hosts via libvirt. This guarantees that it takes at
> most 5 timeouts in a row to verify all nodes. This should help with the
> bottleneck we were seeing with libvirt connect timeouts. Testing with 105
> nodes, almost all of which were down, it took 27s to query all of them.
Hmm, I got to thinking: for all of the nodes on that system it was already established that there was 'no route to host', so the timeouts were quick. A freshly killed node would take longer to time out. We could set the process count higher to help alleviate this. However, I see that it can take up to 10 minutes for a connection to time out under certain circumstances.
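To put rough numbers on that trade-off, here is a small sketch of the worst case, assuming each process checks its share of hosts one after another and every one of them hits the full timeout. The function names and the 600-second figure (the "up to 10 minutes" above) are illustrative assumptions, not part of the patch.

```python
import math


def processes_needed(node_count, hosts_per_process=5):
    """Number of worker processes so that none checks more than hosts_per_process hosts."""
    return math.ceil(node_count / hosts_per_process)


def worst_case_seconds(node_count, process_count, timeout_s):
    """Upper bound on wall time if every host in the busiest process times out.

    Assumes hosts are spread evenly and each process checks them sequentially.
    """
    hosts_in_busiest_process = math.ceil(node_count / process_count)
    return hosts_in_busiest_process * timeout_s


if __name__ == "__main__":
    nodes = 105
    procs = processes_needed(nodes)                # 21 processes, 5 hosts each
    print(worst_case_seconds(nodes, procs, 600))   # 3000 seconds, i.e. 50 minutes
    print(worst_case_seconds(nodes, nodes, 600))   # 600 seconds with one process per host
```

Even with one process per host, the bound never drops below a single timeout, so raising the process count only goes so far.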
It's clear we should move to having the status pushed from the node to the wui; then timeouts will only be a problem for operations in taskomatic etc. We may want to add a timeout to the libvirt API to deal with this at some point.
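Until such a timeout exists in the API, one possible workaround is for the caller to enforce a deadline from outside. The sketch below runs a blocking check in a forked child and kills it if it overruns. It is POSIX-only, and `check` is a hypothetical stand-in for whatever call actually connects to the host; nothing here is taken from the patch or from libvirt.

```python
import os
import signal
import time


def run_with_deadline(check, timeout_s, poll_s=0.05):
    """Run check() in a forked child process.

    Returns True or False according to check()'s result, or None if the
    child had to be killed because it exceeded timeout_s.
    """
    pid = os.fork()
    if pid == 0:
        # Child: report the result through the exit status, then exit
        # immediately without running the parent's cleanup handlers.
        try:
            ok = bool(check())
        except Exception:
            ok = False
        os._exit(0 if ok else 1)

    # Parent: poll for the child until it exits or the deadline passes.
    deadline = time.monotonic() + timeout_s
    while True:
        done, status = os.waitpid(pid, os.WNOHANG)
        if done == pid:
            return os.WIFEXITED(status) and os.WEXITSTATUS(status) == 0
        if time.monotonic() >= deadline:
            os.kill(pid, signal.SIGKILL)
            os.waitpid(pid, 0)  # reap the child so it does not linger as a zombie
            return None
        time.sleep(poll_s)
```

Returning None separately from False lets the caller tell "host answered and is down" apart from "gave up waiting", which matters if the two cases should be retried differently.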
Ian