On Wed, Aug 6, 2008 at 10:07 AM, Theodore Tso <tytso@xxxxxxx> wrote: > On Tue, Aug 05, 2008 at 12:53:51PM +0200, Thomas Trauner wrote: >> On Fri, 2008-08-01 at 08:16 -0400, Theodore Tso wrote: >> > On Fri, Aug 01, 2008 at 11:43:40AM +0200, Thomas Trauner wrote: >> > > >> > > I have a problem with directories that contain more than 10000 entries >> > > (Ubuntu 8.04.1) or with more than 70000 entries (RHEL 5.2). If you use >> > > readdir(3) or readdir64(3) you get one entry twice, with same name and >> > > inode. >> > > >> I made new tests with the code under >> <http://www.unet.univie.ac.at/~a9100884/readdir.c> on a bunch of freshly >> generated and empty filesystems, every about 38GB large, of type fat >> (aborted after about 22000 entries because it took to long), ext2, xfs, >> jfs and again ext3.... > > OK, I have a workaroud for you. It appears there's a kernel bug > hiding here, since there shouldn't be duplicates returned by readdir() > even if we have hash collisions. Ted, The attached patch has served my employer (IBRIX) well for 2.5 years. It was only recently, when I re-raised this issue internally based on this thread, that a co-worker recalled the fix. regards, Mike
diff --git a/fs/ext3/dir.c b/fs/ext3/dir.c index 2eea96e..42c5391 100644 --- a/fs/ext3/dir.c +++ b/fs/ext3/dir.c @@ -410,7 +410,7 @@ static int call_filldir(struct file * filp, void * dirent, get_dtype(sb, fname->file_type)); if (error) { filp->f_pos = curr_pos; - info->extra_fname = fname->next; + info->extra_fname = fname; return error; } fname = fname->next; @@ -449,11 +449,21 @@ static int ext3_dx_readdir(struct file * filp, * If there are any leftover names on the hash collision * chain, return them first. */ - if (info->extra_fname && - call_filldir(filp, dirent, filldir, info->extra_fname)) - goto finished; - - if (!info->curr_node) + if (info->extra_fname) { + if (call_filldir(filp, dirent, filldir, info->extra_fname)) + goto finished; + + info->extra_fname = NULL; + info->curr_node = rb_next(info->curr_node); + if (!info->curr_node) { + if (info->next_hash == ~0) { + filp->f_pos = EXT3_HTREE_EOF; + goto finished; + } + info->curr_hash = info->next_hash; + info->curr_minor_hash = 0; + } + } else if (!info->curr_node) info->curr_node = rb_first(&info->root); while (1) {