[Openais] trouble getting started with corosync API
Angus Salkeld
asalkeld at redhat.com
Mon Mar 29 14:34:29 PDT 2010
On Mon, 2010-03-29 at 16:40 -0400, Dan Davis wrote:
> Hi,
>
> I'm a newcomer to working with corosync, and I apologize in advance if
> this is a noob problem. I'm trying the API to see whether it would be
> appropriate to solving some technical problems with getting multiple
> nodes of our software to cooperate. Initially to bootstrap a very
> non-cluster like cooperation mode, but later I'll use more
> capabilities.
>
> When I start the corosync executive, it appears to work properly (now
> that I've figured out consensus 1201), but when I start another
> process that uses the CPG protocol, I get CS_ERR_TRY_AGAIN. I see no
> correlated activity in the corosync log when I do this. Here's the
> output of the executive:
>
> [dan at ohio corosync-1.2.0]$ sudo corosync -f
> Mar 29 16:22:42 corosync [MAIN ] Corosync Cluster Engine ('1.2.0'):
> started and ready to provide service.
> Mar 29 16:22:42 corosync [MAIN ] Corosync built-in features:
> Mar 29 16:22:42 corosync [MAIN ] Successfully read main configuration
> file '/etc/corosync/corosync.conf'.
> Mar 29 16:22:42 corosync [TOTEM ] Initializing transport (UDP/IP).
> Mar 29 16:22:42 corosync [TOTEM ] Initializing transmit/receive
> security: libtomcrypt SOBER128/SHA1HMAC (mode 0).
> Mar 29 16:22:42 corosync [MAIN ] Compatibility mode set to whitetank.
> Using V1 and V2 of the synchronization engine.
> Mar 29 16:22:42 corosync [TOTEM ] The network interface
> [192.168.192.113] is now up.
> Mar 29 16:22:42 corosync [SERV ] Service engine loaded: corosync
> extended virtual synchrony service
> Mar 29 16:22:42 corosync [SERV ] Service engine loaded: corosync
> configuration service
> Mar 29 16:22:42 corosync [SERV ] Service engine loaded: corosync
> cluster closed process group service v1.01
> Mar 29 16:22:42 corosync [SERV ] Service engine loaded: corosync
> cluster config database access v1.01
> Mar 29 16:22:42 corosync [SERV ] Service engine loaded: corosync
> profile loading service
> Mar 29 16:22:42 corosync [SERV ] Service engine loaded: corosync
> cluster quorum service v0.1
>
> It reaches poll_run() in main - I checked in gdb.
>
> Here's the output of testcpg:
>
> [dan at ohio sando]$ cd corosync-1.2.0/test
> [dan at ohio test]$ sudo ./testcpg
> Local node id is 71c0a8c0
> Could not join process group, error 6
> [dan at ohio test]$
>
> A major clue is that corosync will not exit on a SIGINT or SIGTERM,
> even though it ought to do so. I need to kill -9 to get the executive
> to go down and try again. Is this a known issue in 1.2.0 or something
> likely correlated with my problem?
>
> I've had the same behavior on with my hand-compiled corosync 1.2.0 on
> a Fedora 7 box and with an rpm on Fedora 12 with cluster-glue
> installed and started. Thanks to anyone who can help me get started,
>
Hi
I had problems like this with a firewall enabled. Try
disabling your firewall and re-testing.
iptables -F
-Angus
> Dan Davis
> www.indexengines.com
>
>
> _______________________________________________
> Openais mailing list
> Openais at lists.linux-foundation.org
> https://lists.linux-foundation.org/mailman/listinfo/openais
More information about the Openais
mailing list