[Openais] Re: strange behaviour with testclm

Chris Friesen cfriesen at nortelnetworks.com
Tue Jun 29 14:10:20 PDT 2004


Steven Dake wrote:

>  > If I stop, then start the executive on a, testclm is stuck in an 
> infinite loop
>  > on select() and is unable to reconnect with the executive.
>  >
> 
> This is an issue that I'm not sure how to solve (and isn't supposed to
> happen except in a failure case in which case the components will
> failover).  One possibility is to reconnect all of the API connections
> but this is pretty complicated and I'm not sure you would want to
> proceed on that processor under such an error.

I'm just thinking of the case where the executive happens to get a segfault, or 
someone kills it, or whatever.  Ideally we would be able to reconnect.  One way 
that I've done that in the past is to mmap() a file in the filesystem and store 
all state information there with checks as to whether its consistant or not.  On 
restart, I read the data in and if it is consistant I just start where I left off.

> The API should return SA_ERR_LIBRARY in this case.  What was the error
> code you received?

SA_ERR_SYSTEM.  I suspect its from the poll() timeout in saPollRetry().

Chris




More information about the Openais mailing list