[Openais] Corosync cluster stack won't start

Tom Pride tom.pride at gmail.com
Fri Mar 19 09:31:00 PDT 2010


Hi All,

Whenever I try to start up my corosync cluster stack I get the following
errors:

Mar 19 16:15:45 mq005 corosync[32649]:   [pcmk  ] ERROR: pcmk_wait_dispatch:
Child process mgmtd exited (pid=32661, rc=100)
Mar 19 16:15:45 mq005 corosync[32649]:   [pcmk  ] notice:
pcmk_wait_dispatch: Child process mgmtd no longer wishes to be respawned
Mar 19 16:15:45 mq005 corosync[32649]:   [pcmk  ] info: update_member: Node
mq005.back.int.cwwtf.local now has process list:
00000000000000000000000000013312 (78610)
Mar 19 16:15:45 mq005 attrd: [32658]: info: init_ais_connection: Connection
to our AIS plugin (9) failed: Library error (2)
Mar 19 16:15:45 mq005 cib: [32656]: info: init_ais_connection: Connection to
our AIS plugin (9) failed: Library error (2)
Mar 19 16:15:45 mq005 stonithd: [32655]: info: init_ais_connection:
Connection to our AIS plugin (9) failed: Library error (2)
Mar 19 16:15:45 mq005 cib: [32656]: CRIT: cib_init: Cannot sign in to the
cluster... terminating
Mar 19 16:15:45 mq005 attrd: [32658]: ERROR: main: HA Signon failed
Mar 19 16:15:45 mq005 stonithd: [32655]: ERROR: failed to connect to cluster
Mar 19 16:15:45 mq005 attrd: [32658]: info: main: Cluster connection active
Mar 19 16:15:45 mq005 stonithd: [32655]: ERROR:
/usr/lib64/heartbeat/stonithd abnormally abort.
Mar 19 16:15:45 mq005 attrd: [32658]: info: main: Accepting attribute
updates
Mar 19 16:15:45 mq005 attrd: [32658]: ERROR: main: Aborting startup

Can someone please provide me with some help in getting to the bottom this?

I have 2 servers running RedHat Enterprise 5.4 64bit and I have installed
the following latest RPMs from http://www.clusterlabs.org/rpm/epel-5/x86_64/

corosync-1.2.0-1.el5.x86_64.rpm
corosynclib-1.2.0-1.el5.x86_64.rpm
cluster-glue-1.0.3-1.el5.x86_64.rpm
cluster-glue-libs-1.0.3-1.el5.x86_64.rpm
heartbeat-libs-3.0.2-2.el5.x86_64.rpm
heartbeat-3.0.2-2.el5.x86_64.rpm
openais-1.1.0-1.el5.x86_64.rpm
openaislib-1.1.0-1.el5.x86_64.rpm
pacemaker-1.0.7-4.el5.x86_64.rpm
pacemaker-libs-1.0.7-4.el5.x86_64.rpm

This is the contents of my /etc/corosync/corosync.conf file:


totem {
    version: 2

     # How long before declaring a token lost (ms)
     token:          5000

     # How many token retransmits before forming a new configuration
     token_retransmits_before_loss_const: 20

     # How long to wait for join messages in the membership protocol (ms)
     join:           1000

     # How long to wait for consensus to be achieved before starting a new
round of membership configuration (ms)
     consensus:      7500

     # Turn off the virtual synchrony filter
     vsftype:        none

     # Number of messages that may be sent by one processor on receipt of
the token
     max_messages:   20

     # Disable encryption
     secauth:    off

     # How many threads to use for encryption/decryption
     threads:       0

     # Limit generated nodeids to 31-bits (positive signed integers)
     clear_node_high_bit: yes

     # Optionally assign a fixed node id (integer)
     # nodeid:         1234

    rrp_mode: passive

    interface {
            ringnumber: 0
            bindnetaddr: 172.59.60.0
            mcastaddr: 226.94.1.1
            mcastport: 4000
      }
      interface {
            ringnumber: 1
            bindnetaddr: 172.23.40.0
            mcastaddr: 226.94.2.1
            mcastport: 4001
    }
}

logging {
     fileline: off
     to_syslog: yes
     to_stderr: no
     syslog_facility: daemon
     debug: on
     timestamp: on
}

amf {
    mode: disabled
}

service {
     # Load the Pacemaker Cluster Resource Manager
     name: pacemaker
     ver:  0
    use_mgmtd: yes
}

aisexec {
     user:    root
     group:    root
}

Many thanks,
Tom
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.linux-foundation.org/pipermail/openais/attachments/20100319/a09aa2f2/attachment.htm 


More information about the Openais mailing list