On 10/10/2018 12:52 PM, Mike Christie wrote: > On 10/10/2018 08:21 AM, Steven Vacaroaia wrote: >> Hi Jason, >> Thanks for your prompt responses >> >> I have used same iscsi-gateway.cfg file - no security changes - just >> added prometheus entry >> There is no iscsi-gateway.conf but the gateway.conf object is created >> and has correct entries >> >> iscsi-gateway.cfg is identical and contains the following >> >> [config] >> cluster_name = ceph >> gateway_keyring = ceph.client.admin.keyring >> api_secure = false >> trusted_ip_list = >> 10.10.30.181,10.10.30.182,10.10.30.183,10.10.30.184,10.10.30.185,10.10.30.186 >> prometheus_host = 0.0.0.0 >> >> >> >> I am running the disks commands from OSD01 and they fail with the following >> >> INFO [gateway.py:344:load_config()] - (Gateway.load_config) successfully >> loaded existing target definition >> 2018-10-10 09:04:48,956 DEBUG [gateway.py:423:map_luns()] - >> processing tpg2 >> 2018-10-10 09:04:48,956 DEBUG [gateway.py:428:map_luns()] - >> rbd.dstest needed mapping to tpg2 >> 2018-10-10 09:04:48,958 INFO >> [gateway.py:403:bind_alua_group_to_lun()] - Setup group ao for >> rbd.dstest on tpg 2 (state 0, owner True, failover type 1) >> 2018-10-10 09:04:48,958 DEBUG >> [gateway.py:405:bind_alua_group_to_lun()] - Setting Luns tg_pt_gp to ao >> 2018-10-10 09:04:48,959 DEBUG >> [gateway.py:409:bind_alua_group_to_lun()] - Bound rbd.dstest on tpg2 to ao >> 2018-10-10 09:04:48,959 DEBUG [gateway.py:423:map_luns()] - >> processing tpg1 >> 2018-10-10 09:04:48,959 DEBUG [gateway.py:428:map_luns()] - >> rbd.dstest needed mapping to tpg1 >> 2018-10-10 09:04:48,960 INFO >> [gateway.py:403:bind_alua_group_to_lun()] - Setup group ano1 for >> rbd.dstest on tpg 1 (state 1, owner False, failover type 1) >> 2018-10-10 09:04:48,960 DEBUG >> [gateway.py:405:bind_alua_group_to_lun()] - Setting Luns tg_pt_gp to ano1 >> 2018-10-10 09:04:48,961 DEBUG >> [gateway.py:409:bind_alua_group_to_lun()] - Bound rbd.dstest on tpg1 to ano1 >> 2018-10-10 09:04:48,963 INFO [_internal.py:87:_log()] - 127.0.0.1 - >> - [10/Oct/2018 09:04:48] "PUT /api/_disk/rbd.dstest HTTP/1.1" 200 - >> 2018-10-10 09:04:48,965 INFO [rbd-target-api:1804:call_api()] - >> _disk update on 127.0.0.1, successful >> 2018-10-10 09:04:48,965 DEBUG [rbd-target-api:1789:call_api()] - >> processing GW 'osd03' >> 2018-10-10 09:04:49,039 ERROR [rbd-target-api:1810:call_api()] - >> _disk change on osd03 failed with 500 >> 2018-10-10 09:04:49,041 INFO [_internal.py:87:_log()] - 127.0.0.1 - >> - [10/Oct/2018 09:04:49] "PUT /api/disk/rbd.dstest HTTP/1.1" 500 - >> >> >> on OSD03 there is the folowing "error" >> >> INFO [lun.py:656:add_dev_to_lio()] - (LUN.add_dev_to_lio) Adding image >> 'rbd.dstest' to LIO >> 2018-10-10 09:04:49,037 DEBUG [lun.py:666:add_dev_to_lio()] - >> control="max_data_area_mb=8" >> >> Amazingly enough, gwcli on OSD03 show the disk created but on OSD01 it >> does not >> If I restart gwcli on OSD01 , disk is there but it cannot be added to >> the host because it image does not exist ??? > > What is the output of > > systemctl status rbd-target-api > systemctl status rbd-target-gw > > Is api in a failed state or does it indicate it has been crashing and > restarting? > > Does /var/log/messages show that rbd-target-api is crashing and > restarting and could you attach the stack trace? The > /var/log/rbd-target-api log will show > > Does > > gwcli ls > > show it cannot reach the remote gateways? > Could you also give me the output of targetcli ls on both gateways? _______________________________________________ ceph-users mailing list ceph-users@xxxxxxxxxxxxxx http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com