Re: HELP! Diapered block device

AJ Lewis <alewis@xxxxxxxxxx> · Mon, 11 Jul 2005 14:00:29 -0500

Just for anyone following this, pls refer to
https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=162709 for more
information and updates.

On Thu, Jul 07, 2005 at 10:17:46AM -0600, Jon Scottorn wrote:
> Ok, so I am rerunning gfs_fsck again.  I have everything unmounted and
> the storage server not even in the cluster.  It has been running now for
> 30 mins again and it gets to this point and just doesn't look like it is
> doing anything.  This is where is stayed after running it for 24 hours
> yesterday:  Here is what the verbose output from gfs_fsck:
> 
> Initializing fsck
> Initializing lists...
> Initializing special inodes...
> Setting block ranges...
> Creating a block list of size 183146926...
> Clearing journals (this may take a while)
> Clearing journal 0
> Clearing journal 1
> Clearing journal 2
> Clearing journal 3
> Cleared journals
> Starting pass1
> Checking metadata in Resource Group 0
> Checking metadata in Resource Group 1
> Checking metadata in Resource Group 2
> Checking metadata in Resource Group 3
> 
> ........Omitted lines for space.........................
> 
> Checking metadata in Resource Group 2790
> Checking metadata in Resource Group 2791
> Checking metadata in Resource Group 2792
> Checking metadata in Resource Group 2793
> Pass1 complete
> Starting pass1b
> Looking for duplicate blocks...
> Found dup block at 61573000
> Found dup block at 61573014
> Found dup block at 61573015
> Found dup block at 61573016
> Found dup block at 61573017
> Found dup block at 61573018
> Found dup block at 61573019
> Found dup block at 61573020
> Found dup block at 61573021
> Found dup block at 61573022
> Found dup block at 61573024
> Found dup block at 61573047
> Found dup block at 61573048
> Found dup block at 61573052
> Found dup block at 61623032
> Found dup block at 61623033
> Found dup block at 61623034
> Found dup block at 61623035
> Scanning filesystem for inodes containing duplicate blocks...
> 
> Once it gets to this point it just sits there.  gfs_fsck is using 99% of
> the CPU for the whole time it runs.  What else can I do to get this fixed?
> 
> Thanks,
> 
> Jon
> 
> Jon Scottorn wrote:
> 
> >When I ran the fsck, i had everything unmounted as well as the gnbd serv
> >stopped.  I let it run for almost 24 hours and it was still running. 
> >That seems a little long for me.  Should I let it run again and see what
> >happens.  My main problem is I can't have the FS down for that long. 
> >
> >Thanks,
> >
> >Jon
> >
> >AJ Lewis wrote:
> >
> >  
> >
> >>On Thu, Jul 07, 2005 at 09:16:45AM -0600, Jon Scottorn wrote:
> >> 
> >>
> >>    
> >>
> >>>Thanks,
> >>>
> >>>   That made it so I can mount it from the other nodes, but now I can't
> >>>mount it on the storage server.
> >>>   
> >>>
> >>>      
> >>>
> >>Gah!  Is the fsck still running?  You *CANNOT* run the fsck while other nodes
> >>have the fs mounted.  The fsck changes the lock protocol to prevent others
> >>    
> >>
> >>from mounting after the fsck starts.  It will be changed back after
> >  
> >
> >>completion.
> >>
> >>The fsck can take a while in the duplicate block code - could you tell if it
> >>was still accessing storage?  If you have lots of inodes in the system, it's
> >>gonna take a while to work through them in the dup block handling code.
> >>
> >>Regards,
> >> 
> >>
> >>------------------------------------------------------------------------
> >>
> >>--
> >>
> >>Linux-cluster@xxxxxxxxxx
> >>http://www.redhat.com/mailman/listinfo/linux-cluster
> >>
> >>    
> >>
> >
> >--
> >
> >Linux-cluster@xxxxxxxxxx
> >http://www.redhat.com/mailman/listinfo/linux-cluster
> >
> >  
> >
> 
> --
> 
> Linux-cluster@xxxxxxxxxx
> http://www.redhat.com/mailman/listinfo/linux-cluster
> 

-- 
AJ Lewis                                   Voice:  612-638-0500
Red Hat                                    E-Mail: alewis@xxxxxxxxxx
One Main Street SE, Suite 209
Minneapolis, MN 55414

Current GPG fingerprint = D9F8 EDCE 4242 855F A03D  9B63 F50C 54A8 578C 8715
Grab the key at: http://people.redhat.com/alewis/gpg.html or one of the
many keyservers out there...

Attachment:
pgp7tqOdnhOGk.pgp

Description: PGP signature
--

Linux-cluster@xxxxxxxxxx
http://www.redhat.com/mailman/listinfo/linux-cluster