I've been holding off on cutting a new release of trinity until I've
nailed this one last bug[1].

When it happens, the watchdog process is in Z state, and the child
processes are all blocked on sockets (and no progress is made because
the watchdog died).

In the one case where I've managed to catch a core from the watchdog,
it makes no damn sense:

    Program terminated with signal 8, Arithmetic exception.
    #0  check_shm_sanity () at watchdog.c:47
            if (shm->running_childs == 0)

What the hell does that even mean? 'shm' is valid, and
shm->running_childs is '4'.

Any ideas?

	Dave

[1] Until the next bug.