[libvirt] libvirtd deadlock on shutdown
Jim Fehlig
jfehlig at suse.com
Fri Jun 22 18:21:02 UTC 2012
Hu Tao wrote:
>> >From 583be33213e922899b23f036494886397b2549dc Mon Sep 17 00:00:00 2001
>> From: Jim Fehlig <jfehlig at suse.com>
>> Date: Thu, 21 Jun 2012 09:21:44 -0600
>> Subject: [PATCH] Fix deadlock on libvirtd shutdown
>>
>> When shutting down libvirtd, the virNetServer shutdown can deadlock
>> if there are in-flight jobs being handled by virNetServerHandleJob().
>> virNetServerFree() will acquire the virNetServer lock and call
>> virThreadPoolFree() to terminate the workers, waiting for the workers
>> to finish. But in-flight workers will attempt to acquire the
>> virNetServer lock, resulting in deadlock.
>>
>> Fix the deadlock by unlocking the virNetServer lock before calling
>> virThreadPoolFree(). This is safe since the virNetServerPtr object
>> is ref-counted and only decrementing the ref count needs to be
>> protected. Additionally, there is no need to re-acquire the lock
>> after virThreadPoolFree() completes as all the workers have
>> terminated.
>> ---
>> src/rpc/virnetserver.c | 6 ++----
>> 1 file changed, 2 insertions(+), 4 deletions(-)
>>
>> diff --git a/src/rpc/virnetserver.c b/src/rpc/virnetserver.c
>> index ae19e84..9d71e53 100644
>> --- a/src/rpc/virnetserver.c
>> +++ b/src/rpc/virnetserver.c
>> @@ -766,10 +766,9 @@ void virNetServerFree(virNetServerPtr srv)
>> virNetServerLock(srv);
>> VIR_DEBUG("srv=%p refs=%d", srv, srv->refs);
>> srv->refs--;
>> - if (srv->refs > 0) {
>> - virNetServerUnlock(srv);
>> + virNetServerUnlock(srv);
>>
>
> At this point other threads may have changed srv->refs...
>
>
>> + if (srv->refs > 0)
>>
>
> ...so it's unsafe to test srv->refs here without locking.
>
> For example, assume srv->refs is 2 at the beginning of virNetServerFree,
>
> thread A thread B
>
> lock srv lock srv (blocked)
>
> dec srv->refs
>
> (srv->refs is 1)
>
> unlock srv
>
> lock srv
>
> dec srv->refs
>
> (srv->refs is 0)
>
> unlock src
>
> test srv->refs test srv->refs
>
>
> In this case both threads have found srv->refs is 0 and are going to
> free srv...
>
Hmm, I thought it was dangerous to read srv->refs after unlocking, and
then wrote the code that way anyhow :-/.
> Following patch fixes problem.
>
> >From 25aa97e05aa76054b781a4e5e83781ee16d5afee Mon Sep 17 00:00:00 2001
> From: Hu Tao <hutao at cn.fujitsu.com>
> Date: Fri, 22 Jun 2012 11:15:01 +0800
> Subject: [PATCH] fix a bug of ref count in virnetserver.c
>
> The test of ref count is not protected by lock, which is unsafe because
> the ref count may have been changed by other threads during the test.
>
> This patch fixes this.
>
ACK, I've pushed this.
Thanks!
Jim
> ---
> src/rpc/virnetserver.c | 5 +++--
> 1 files changed, 3 insertions(+), 2 deletions(-)
>
> diff --git a/src/rpc/virnetserver.c b/src/rpc/virnetserver.c
> index 9d71e53..247ddd7 100644
> --- a/src/rpc/virnetserver.c
> +++ b/src/rpc/virnetserver.c
> @@ -759,15 +759,16 @@ void virNetServerQuit(virNetServerPtr srv)
> void virNetServerFree(virNetServerPtr srv)
> {
> int i;
> + int refs;
>
> if (!srv)
> return;
>
> virNetServerLock(srv);
> VIR_DEBUG("srv=%p refs=%d", srv, srv->refs);
> - srv->refs--;
> + refs = --srv->refs;
> virNetServerUnlock(srv);
> - if (srv->refs > 0)
> + if (refs > 0)
> return;
>
> for (i = 0 ; i < srv->nservices ; i++)
>
More information about the libvir-list
mailing list