Re: fedora-messaging/rabbitmq in staging cluster rebuild tonight

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Fri, Feb 14, 2020 at 03:54:48PM +0100, Aurelien Bompard wrote:
> Hey folks,
> 
> I thought I'd make a summary of where I'm at. Here are the issues I found
> and what I did about it:
> 
> - We ran into an Ansible issue that the PR
> https://github.com/ansible/ansible/pull/50381 fixes. I've asked pingou to
> patch batcave since it's basically a one-liner that will keep working with
> the older prod version.
> 
> - When starting a RabbitMQ cluster from scratch, there is a race condition
> that is documented here:
> https://www.rabbitmq.com/cluster-formation.html#initial-formation-race-condition
>   On nodes 02 and 03, I've just destroyed the database and let it
> auto-detect the cluster again
>   # systemctl stop rabbitmq-server && rm -rf /var/lib/rabbitmq/mnesia/ &&
> systemctl start rabbitmq-server
>   It worked fine. I checked with "rabbitmqctl list_users" that all nodes
> had the same users declared.
> 
> - I've also fixed a couple things in the playbooks that assumed the cluster
> to be up and setup already.
> 
> - I've rebuilt collectd-rabbitmq for EPEL8 but we currently only install it
> on production apparently (not sure why, I think it could be useful in
> staging.

I think that was me disabling it in stg because it wasn't working?

> - The nagios-plugins-rabbitmq RPM still fails to install because of a
> dependency bug in perl-Monitoring-Plugin, I've opened a ticket about it:
> https://bugzilla.redhat.com/show_bug.cgi?id=1803121
> 
> Now, we need to recreate the queues, users and bindings, and I don't have
> the permissions to run all the playbooks. If someone could run the master
> playbook limited on staging and on the rabbitmq_cluster tag, I think it
> should recreate all users and queues and we should be all set.

on it. 

> I'm around and on IRC if you need me.

Thanks much for working on this. :) 

I guess the next step is to add the stuff that we needed this new
version for and confirm it works? Then on to production?

kevin

Attachment: signature.asc
Description: PGP signature

_______________________________________________
infrastructure mailing list -- infrastructure@xxxxxxxxxxxxxxxxxxxxxxx
To unsubscribe send an email to infrastructure-leave@xxxxxxxxxxxxxxxxxxxxxxx
Fedora Code of Conduct: https://docs.fedoraproject.org/en-US/project/code-of-conduct/
List Guidelines: https://fedoraproject.org/wiki/Mailing_list_guidelines
List Archives: https://lists.fedoraproject.org/archives/list/infrastructure@xxxxxxxxxxxxxxxxxxxxxxx

[Index of Archives]     [Fedora Development]     [Fedora Users]     [Fedora Desktop]     [Fedora SELinux]     [Yosemite News]     [KDE Users]

  Powered by Linux