[Pulp-list] workers keep dissapearing

Brian Bouterse bbouters at redhat.com
Tue Mar 17 21:32:14 UTC 2015


-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Hi Christian,

I put some responses to the corresponding issue you opened [0]. We can
continue the technical discussion there.

To clarify something not related to that issue. Each pulp node
requires its own, independent AMQP broker and that broker cannot be
shared between nodes or parent Pulp installations. RabbitMQ has a
vhost option to provide isolation within a single installation, but
Qpid does not provide that feature so you'll need two, independent
Qpid brokers for two node installations. You'll need a third if those
two nodes sync from a third parent.

- -Brian

On 03/16/2015 05:59 PM, Cristian Falcas wrote:
> Hello,
> 
> I'm trying to install 2 pulp nodes in 2 different regions: one in
> US, the other in Romania. The one from US is the "master" one: has
> qpid and mongodb (both with ssl enabled). Also I want to use this
> one one to sync the server from Romania (it will be a child node).
> 
> Unfortunately, I can't get the workers from Romania to stay up, so
> any sync request remains in "Waiting to begin...". I see those
> messages in the logs from both pulp servers:
> 
> Mar 16 23:51:57 host_dc1 pulp[19906]: 
> pulp.server.async.scheduler:ERROR: Workers 
> 'reserved_resource_worker-1 at host_dc1.company.net' has gone
> missing, removing from list of workers Mar 16 23:51:57 host_dc1
> pulp[19906]: pulp.server.async.tasks:ERROR: The worker named
> reserved_resource_worker-1 at host_dc1.company.net is missing.
> Canceling the tasks in its queue. Mar 16 23:51:58 host_dc1
> pulp[19906]: pulp.server.async.scheduler:ERROR: Workers 
> 'reserved_resource_worker-3 at host_dc1.company.net' has gone
> missing, removing from list of workers Mar 16 23:51:58 host_dc1
> pulp[19906]: pulp.server.async.tasks:ERROR: The worker named
> reserved_resource_worker-3 at host_dc1.company.net is missing.
> Canceling the tasks in its queue. Mar 16 23:51:58 host_dc1
> pulp[19906]: pulp.server.async.worker_watcher:INFO: New worker 
> 'reserved_resource_worker-1 at host_dc1.company.net' discovered Mar 16
> 23:51:58 host_dc1 pulp[19906]: pulp.server.async.scheduler:ERROR:
> Workers 'reserved_resource_worker-2 at host_dc1.company.net' has gone
> missing, removing from list of workers Mar 16 23:51:58 host_dc1
> pulp[19906]: pulp.server.async.tasks:ERROR: The worker named
> reserved_resource_worker-2 at host_dc1.company.net is missing.
> Canceling the tasks in its queue. Mar 16 23:51:58 host_dc1
> pulp[19906]: pulp.server.async.scheduler:ERROR: Workers 
> 'resource_manager at host_dc1.company.net' has gone missing, removing 
> from list of workers Mar 16 23:51:58 host_dc1 pulp[19906]:
> pulp.server.async.tasks:ERROR: The worker named
> resource_manager at host_dc1.company.net is missing. Canceling the
> tasks in its queue. Mar 16 23:51:58 host_dc1 pulp[19906]: 
> pulp.server.async.worker_watcher:INFO: New worker 
> 'reserved_resource_worker-2 at host_dc1.company.net' discovered Mar 16
> 23:51:59 host_dc1 pulp[19906]: 
> pulp.server.async.worker_watcher:INFO: New worker 
> 'resource_manager at host_dc1.company.net' discovered
> 
> I presume it's some kind of timeout at play that sees the second
> node disconnecting. But where to look and what to change?
> 
> Thank you, Cristian Falcas
> 
> _______________________________________________ Pulp-list mailing
> list Pulp-list at redhat.com 
> https://www.redhat.com/mailman/listinfo/pulp-list
> 
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1

iQEcBAEBAgAGBQJVCJ1dAAoJEK48cdELyEfyIK8H/24MaeMW8X78bsuG1HpoiUPL
P/5775Z4g56RqnfIwD7FYXXO0KVtmeHNi/zWGr/wQQlcbOP77UiozVYHvdsVTkSm
2o+u7iyVfeDkjMG1JoTGZ5QGMwlpEfyxyB0eYvfF0ApM6sFKW3UzkBSKKBCdjhH4
+6sRCLhynTT7Cxi2Tqho9cMnjwebCkilgtDGsqlWG94tcMZymyQiVfyEEbADdCwf
10nYBIhOgmEAmtSj1b4xuWaPriocgWJx9PYwjcxmYia0NZuhLEzqL+Suoagh7yXc
4OcRQakqS0bX8Otuv7S9DG6D96pFPUaPObkKv/uIqbxbEoKZRBH9imTmsDHzj0c=
=CzEn
-----END PGP SIGNATURE-----




More information about the Pulp-list mailing list