On 02:53, tytso@xxxxxxx wrote: > On Wed, Jan 27, 2010 at 02:06:25PM +0100, Andre Noll wrote: > > On 11:19, bugzilla-daemon@xxxxxxxxxxxxxxxxxxx wrote: > > > After kill -9 of the sync run it took about 20 minutes before > > > it died. > > > > I was seeing similar behaviour on one of our servers, and changing > > the io scheduler to noop fixed things for me. So it seems to be an > > issue with cfq which is somehow triggered by ext4 but not by ext3. > > > > To change the IO scheduler, just execute > > > > echo noop > /sys/block/sda/queue/scheduler > > > > (replace sda if necessary). > > Andre or Michael. If switching away from cfq helps, that's > definitely... interesting. Given that cfq is the default scheduler, I > definitely want to understand what might be going on here. Are either > if you able to run blktrace so we can get a sense of what is going on > under the cfq and deadline/noop I/O schedulers? Yes, I can use that machine freely for testing purposes, including reboots. It is just our fallback server which creates hardlink-based snapshots using rsync. However, I have to recompile the kernel to include debugfs which is needed by blktrace and I'd like to wait until the currently running rsync completes before rebooting. Would you like to see the output of btrace /dev/mapper/... or should I use more sophisticated command line options? > And in both of your cases, were you using a new file system freshly > created using mke2fs -t ext4, or was this a ext2/ext3 filesystem that > was converted for use under ext4? The ext4 file system was created from scratch using -O dir_index,uninit_bg,extent, a block size of 4096 and 32768 bytes per inode. Thanks Andre -- The only person who always got his work done by Friday was Robinson Crusoe
Attachment:
signature.asc
Description: Digital signature