Re: 1x port from bond down causes all osd down in a single machine

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]


On March 26, 2024 5:02:16 PM GMT+01:00, "Szabo, Istvan (Agoda)" <Istvan.Szabo@xxxxxxxxx> wrote:
>Wonder what we are missing from the netplan configuration on ubuntu which ceph needs to tolerate properly.
>We are using this bond configuration on ubuntu 20.04 with octopus ceph:
>    bond1:
>      macaddress: x.x.x.x.x.50
>      dhcp4: no
>      dhcp6: no
>      addresses:
>        -
>      interfaces:
>        - ens2f0np0
>        - ens2f1np1
>      mtu: 9000
>      parameters:
>        mii-monitor-interval: 100
>        mode: 802.3ad
>        lacp-rate: fast
>        transmit-hash-policy: layer3+4
>ens2f1np1 failed and caused slow ops, all osd down ... = disaster
>Any idea what is wrong with this bond config?
Two things come to my mind. Is LACP correctly configured on the switch side? And maybe some STP type problem, hence the switch again. Or is only one interface up/connected?

How does the current state of the bond look?
cat /proc/net/bonding/bond1

Hi Szabo,
ceph-users mailing list -- ceph-users@xxxxxxx
To unsubscribe send an email to ceph-users-leave@xxxxxxx

[Index of Archives]     [Information on CEPH]     [Linux Filesystem Development]     [Ceph Development]     [Ceph Large]     [Ceph Dev]     [Linux USB Development]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]     [xfs]

  Powered by Linux