oh and my config is: volume posix0 type storage/posix option directory /export/brick-newgluster end-volume volume locks0 type features/locks subvolumes posix0 end-volume volume brick0 type performance/io-threads subvolumes locks0 end-volume volume server type protocol/server option transport-type tcp option listen-port 16996 option auth.addr.brick0.allow * subvolumes brick0 end-volume volume tur-awc1-0 type protocol/client option transport-type tcp option remote-port 16996 option remote-host tur-awc1 option remote-subvolume brick0 end-volume volume tur-awc2-0 type protocol/client option transport-type tcp option remote-port 16996 option remote-host tur-awc2 option remote-subvolume brick0 end-volume volume tur-awc3-0 type protocol/client option transport-type tcp option remote-port 16996 option remote-host tur-awc3 option remote-subvolume brick0 end-volume volume nufa type cluster/nufa option local-volume-name `hostname`-0 subvolumes tur-awc1-0 tur-awc2-0 tur-awc3-0 end-volume Matt On Tue, Apr 21, 2009 at 08:33:03AM +1200, m.c.wilkins at massey.ac.nz wrote: > > hi, > > i'm running 2.0.0rc7 (config below) in a nufa setup. please help me > out, i'm getting some errors in my logs: > > on tur-awc1 i have: > > 2009-04-21 01:24:43 E [client-protocol.c:533:client_ping_timer_expired] tur-awc3-0: ping timer expired! bailing transport > 2009-04-21 01:24:43 E [saved-frames.c:169:saved_frames_unwind] tur-awc3-0: forced unwinding frame type(1) op(MKNOD) > 2009-04-21 01:24:43 E [fuse-bridge.c:1274:fuse_rename_cbk] glusterfs-fuse: 102353757: /090417_HWI-EAS209_0011_FC30K/Data/IPAR_1.3/Bustard1.3.2_20-04-2009_mjscolli/GERALD_20-04-2009_mjscolli/s_5_0009_realign.txt.tmp -> /090417_HWI-EAS209_0011_FC30K/Data/IPAR_1.3/Bustard1.3.2_20-04-2009_mjscolli/GERALD_20-04-2009_mjscolli/s_5_0009_realign.txt => -1 (Transport endpoint is not connected) > 2009-04-21 01:24:43 E [saved-frames.c:169:saved_frames_unwind] tur-awc3-0: forced unwinding frame type(1) op(STAT) > 2009-04-21 01:24:43 E [dht-common.c:762:dht_attr_cbk] nufa: subvolume tur-awc3-0 returned -1 (Transport endpoint is not connected) > 2009-04-21 01:24:43 E [saved-frames.c:169:saved_frames_unwind] tur-awc3-0: forced unwinding frame type(1) op(STAT) > 2009-04-21 01:24:43 E [dht-common.c:762:dht_attr_cbk] nufa: subvolume tur-awc3-0 returned -1 (Transport endpoint is not connected) > 2009-04-21 01:24:43 E [saved-frames.c:169:saved_frames_unwind] tur-awc3-0: forced unwinding frame type(1) op(STAT) > 2009-04-21 01:24:43 E [dht-common.c:762:dht_attr_cbk] nufa: subvolume tur-awc3-0 returned -1 (Transport endpoint is not connected) > 2009-04-21 01:24:43 E [saved-frames.c:169:saved_frames_unwind] tur-awc3-0: forced unwinding frame type(2) op((null)) > 2009-04-21 01:24:43 E [client-protocol.c:630:client_ping_cbk] tur-awc3-0: timer must have expired > 2009-04-21 01:24:45 N [client-protocol.c:6159:client_setvolume_cbk] tur-awc3-0: connection and handshake succeeded > > seems like it is having problems communicating with tur-awc3, and in > that machines log i have: > > 2009-04-21 01:24:45 N [server-protocol.c:7513:mop_setvolume] server: accepted client from 130.123.129.121:1013 > 2009-04-21 01:24:45 E [socket.c:102:__socket_rwv] server: writev failed (Broken pipe) > 2009-04-21 01:24:45 N [server-protocol.c:8268:notify] server: 130.123.129.121:1011 disconnected > 2009-04-21 04:03:12 W [nufa.c:219:nufa_lookup] nufa: incomplete layout failure for path=/ > 2009-04-21 04:03:12 W [fuse-bridge.c:301:need_fresh_lookup] fuse-bridge: revalidate of / failed (Resource temporarily unavailable) > > any idea what is happening? i can confirm that both machines are up > and under almost no load, connected on the same gigabit switch. > 130.123.129.121 is the IP address of tur-awc1. > > any help much appreciated > > Matt > > On Mon, Apr 20, 2009 at 11:48:16AM +1200, m.c.wilkins at massey.ac.nz wrote: > > > > hi, > > > > how do a submit a bug report? > > > > with 2.0.0rc7 a > > ./configure --prefix=/usr/local/glusterfs-2.0.0rc7 > > make ; make install > > fails with > > > > make[5]: Entering directory `/var/tmp/glusterfs-2.0.0rc7/xlators/mount/fuse/utils' > > make[5]: Nothing to be done for `install-exec-am'. > > test -z "/sbin" || mkdir -p -- "/sbin" > > /usr/bin/install -c 'mount.glusterfs' '/sbin/mount.glusterfs' > > /usr/bin/install: cannot remove `/sbin/mount.glusterfs': Permission denied > > make[5]: *** [install-utilSCRIPTS] Error 1 > > make[5]: Leaving directory `/var/tmp/glusterfs-2.0.0rc7/xlators/mount/fuse/utils' > > make[4]: *** [install-am] Error 2 > > make[4]: Leaving directory `/var/tmp/glusterfs-2.0.0rc7/xlators/mount/fuse/utils' > > make[3]: *** [install-recursive] Error 1 > > make[3]: Leaving directory `/var/tmp/glusterfs-2.0.0rc7/xlators/mount/fuse' > > make[2]: *** [install-recursive] Error 1 > > make[2]: Leaving directory `/var/tmp/glusterfs-2.0.0rc7/xlators/mount' > > make[1]: *** [install-recursive] Error 1 > > make[1]: Leaving directory `/var/tmp/glusterfs-2.0.0rc7/xlators' > > make: *** [install-recursive] Error 1 > > > > since i don't have permission to write to /sbin, and > > xlators/mount/fuse/utils/Makefile sets > > > > utildir = $(destdir)/sbin > > > > but destdir is not set to anything. destdir should be set to the > > eprefix specified by the configure script (which defaults to prefix). > > > > thanks > > > > Matt > > > > > > _______________________________________________ > > Gluster-users mailing list > > Gluster-users at gluster.org > > http://zresearch.com/cgi-bin/mailman/listinfo/gluster-users > > _______________________________________________ > Gluster-users mailing list > Gluster-users at gluster.org > http://zresearch.com/cgi-bin/mailman/listinfo/gluster-users