Transport endpoint

robinr at muohio.edu (Robin, Robin) · Mon, 17 Dec 2012 10:58:19 -0500

Hi,

Joe, there's nothing around the time stamp.

On the client, /var/log/glusterfs/mnt-gluster-RedhawkShared.log:
[2012-12-13 17:25:21.014040] I [fuse-bridge.c:3376:fuse_init]
0-glusterfs-fuse: FUSE inited with protocol versions: glusterfs 7.13
kernel 7.10
[2012-12-13 17:25:21.014823] I
[afr-common.c:1964:afr_set_root_inode_on_first_lookup]
0-RedhawkShared-replicate-0: added root inode
[2012-12-15 03:21:37.566345] I [glusterfsd.c:1666:main]
0-/usr/sbin/glusterfs: Started running /usr/sbin/glusterfs version 3.3.0
[2012-12-15 03:21:37.687280] I [io-cache.c:1549:check_cache_size_ok]
0-RedhawkShared-quick-read: Max cache size is 25133539328
[2012-12-15 03:21:37.692566] I [io-cache.c:1549:check_cache_size_ok]
0-RedhawkShared-io-cache: Max cache size is 25133539328
[2012-12-15 03:21:37.694761] I [client.c:2142:notify]
0-RedhawkShared-client-0: parent translators are ready, attempting connect
on transport
[2012-12-15 03:21:37.698557] I [client.c:2142:notify]
0-RedhawkShared-client-1: parent translators are ready, attempting connect
on transport

There's nothing on 2012-12-15, other than the fact that at 03:21 AM, I
manually remount the gluster partition again.

John, I've not filed a bug yet.

Let me do another test on the Gluster protocol.

I'm running keepalived for an NFS floating IP. That could be my
work-around for now.

The downside of using NFS is the fact that I need to use an additional
software like keepalived. What are the other downsides ?

Thanks..
Robin

On 12/17/12 10:29 AM, "Joe Julian" <joe at julianfamily.org> wrote:

>On 12/17/2012 06:56 AM, Robin, Robin wrote:
>> Hi,
>>
>> I've got Gluster error: Transport endpoint not connected.
>>
>> It came up twice after trying to rsync 2 TB filesystem over; it
>> reached about 1.8 TB and got the error.
>>
>> Logs on the server side (on reverse time order):
>> [2012-12-15 00:53:24.747934] I
>> [server-helpers.c:629:server_connection_destroy]
>> 0-RedhawkShared-server: destroyed connection of
>> 
>>mualhpcp01.hpc.muohio.edu-17684-2012/12/13-17:25:16:994209-RedhawkShared-
>>client-0-0
>> [2012-12-15 00:53:24.743459] I [server-helpers.c:474:do_fd_cleanup]
>> 0-RedhawkShared-server: fd cleanup on
>> 
>>/mkennedy/tramelot_nwfs/rpr3/rpr3/rpr3_sparky/matrix/.4d_ccnoesy.ucsf.QTQ
>>swL
>> [2012-12-15 00:53:24.743430] I
>> [server-helpers.c:330:do_lock_table_cleanup] 0-RedhawkShared-server:
>> finodelk released on
>> 
>>/mkennedy/tramelot_nwfs/rpr3/rpr3/rpr3_sparky/matrix/.4d_ccnoesy.ucsf.QTQ
>>swL
>> [2012-12-15 00:53:24.743400] I
>> [server-helpers.c:741:server_connection_put] 0-RedhawkShared-server:
>> Shutting down connection
>> 
>>mualhpcp01.hpc.muohio.edu-17684-2012/12/13-17:25:16:994209-RedhawkShared-
>>client-0-0
>> [2012-12-15 00:53:24.743368] I [server.c:685:server_rpc_notify]
>> 0-RedhawkShared-server: disconnecting connectionfrom
>> 
>>mualhpcp01.hpc.muohio.edu-17684-2012/12/13-17:25:16:994209-RedhawkShared-
>>client-0-0
>> [2012-12-15 00:53:24.740055] W [socket.c:195:__socket_rwv]
>> 0-tcp.RedhawkShared-server: readv failed (Connection reset by peer)
>>
>> I can't find relevant logs on the client side.
>>
>> From the logs, can we judge for sure that this is a network reset
>> problem ?
>>
>When you say, "I can't find relevant logs on the client side," do you
>mean that you can't find the log, or that there's nothing in there from
>around the same timestamp? The client log will be in /var/log/glusterfs
>and will be named based on the mountpoint.
>_______________________________________________
>Gluster-users mailing list
>Gluster-users at gluster.org
>http://supercolony.gluster.org/mailman/listinfo/gluster-users