[Openais] cpgx stuck

David Teigland teigland at redhat.com
Wed Jun 3 14:35:34 PDT 2009


On Wed, Jun 03, 2009 at 04:28:27PM -0500, David Teigland wrote:
> Running "cpgx -d1" on four nodes, where -d1 causes the test to periodically
> kill and restart corosync.  When this kill/restart happens on one node, others
> are typically exiting/joining the cpg during at the same time.  The result is
> that cpgx stops receiving any cpg callbacks, and it just sits there forever.

More specifically, it appears that any cpg join gets stuck if the join occurs
during the failure/recovery period of another node that was killed.

Dave



More information about the Openais mailing list