As per recent email thread, this patch adds a log message if a JOIN or LEAVE message is discarded while corosync is flushing the receive queue. While ignoring a JOIN message is harmless (it will be resent), ignoring a LEAVE message can cause a longer state transition as it is treated as a node crashing rather than leaving gracefully, so the system admin might be confused as to the cause. Unfortunately, we can't (at the totemudp level) distinguish between JOIN or LEAVE messages without a lot more protocol-specific code creeping in the lower layer so the message is left ambiguous. Signed-Off-By: Christine Caulfield <ccaulfie@xxxxxxxxxx>
diff --git a/exec/totemudp.c b/exec/totemudp.c index 86059af..8ed87ce 100644 --- a/exec/totemudp.c +++ b/exec/totemudp.c @@ -511,6 +511,7 @@ static int net_deliver_fn ( */ message_type = (char *)iovec->iov_base; if (instance->flushing == 1 && *message_type == MESSAGE_TYPE_MEMB_JOIN) { + log_printf(instance->totemudp_log_level_warning, "JOIN or LEAVE message was thrown away during flush operation."); iovec->iov_len = FRAME_SIZE_MAX; return (0); }
_______________________________________________ discuss mailing list discuss@xxxxxxxxxxxx http://lists.corosync.org/mailman/listinfo/discuss