There is a config option to have a public and private network configured on your storage nodes. The private is what they would use to talk to each other to do backfilling and recovery
while the public is where clients access the cluster. If your servers were able to communicate with each other, but the backfilling wasn't happening... my guess is that you are using this feature and that your private network is what was being blocked across
the fiber. Double check that vlan's connectivity between the servers.
From: ceph-users [ceph-users-bounces@xxxxxxxxxxxxxx] on behalf of Mike Jacobacci [mikej@xxxxxxxxxx]
Sent: Tuesday, October 11, 2016 3:30 PM Cc: ceph-users@xxxxxxxx Subject: Re: [ceph-users] New OSD Nodes, pgs haven't changed state Hi Goncalo,
Thanks for your reply! I finally figured out that our issue was with the physical setup of the nodes. Se had one OSD and MON node in our office and the others are co-located at our ISP. We have
an almost dark fiber going between our two buildings connected via HP 5400's, but it really isn't since there are some switches in between doing VLAN rewriting (ISP managed).
Even though all the interfaces were communicating without issue, no data would move across the nodes. I ended up moving all nodes into the same rack and data immediately started moving and the cluster
is now working! So it seems the storage traffic was being dropped/blocked by something on our ISP side.
Cheers,
Mike
On Mon, Oct 10, 2016 at 5:22 PM, Goncalo Borges
<goncalo.borges@xxxxxxxxxxxxx> wrote:
Hi Mike... |
_______________________________________________ ceph-users mailing list ceph-users@xxxxxxxxxxxxxx http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com