On Mon, 2014-01-20 at 14:42 -0200, Leonardo Chiquitto wrote:
> On Thu, Dec 1, 2011 at 10:40 PM, Ian Kent <raven@xxxxxxxxxx> wrote:
> > On Thu, 2011-12-01 at 13:00 -0300, Leonardo Chiquitto wrote:
> >> On Thu, Nov 24, 2011 at 3:06 PM, Leonardo Chiquitto
> >> <leonardo.lists@xxxxxxxxx> wrote:
> >> > On Wed, Nov 23, 2011 at 7:14 AM, Leonardo Chiquitto
> >> > <leonardo.lists@xxxxxxxxx> wrote:
> >> >> On Tue, Nov 22, 2011 at 9:18 PM, Ian Kent <raven@xxxxxxxxxx> wrote:
> >> >>> On Tue, 2011-11-22 at 16:34 -0300, Leonardo Chiquitto wrote:
> >> >>>> Hello,
> >> >>>>
> >> >>>> We received a bug report describing a segfault in automount, more
> >> >>>> specifically at st_queue_handler():
> >> >>>>
> >> >>>> 1140 remove:
> >> >>>> 1141         /* No more tasks for this queue */
> >> >>>> 1142         if (list_empty(&task->pending)) {
> >> >>>> 1143                 list_del(&task->list);
> >> >>>> 1144                 free(task);
> >> >>>> 1145                 continue;
> >> >>>> 1146         }
> >> >>>> 1147
> >> >>>> 1148         /* Next task */
> >> >>>> 1149         next = list_entry((&task->pending)->next,
> >> >>>> 1150                           struct state_queue, pending);
> >> >>>> 1151
> >> >>>> 1152         list_del(&task->list);
> >> >>>> 1153         free(task);
> >> >>>> 1154
> >> >>>> 1155         list_del_init(&next->pending);   <== SEGV
> >> >>>> 1156         list_add_tail(&next->list, head);
> >> >>>> 1157         if (p == head)
> >> >>>> 1158                 p = head->next;
> >> >>>> 1159 }
> >> >>>> 1160
> >> >>>> 1161 if (list_empty(head))
> >> >>>> 1162         break;
> >> >>>>
> >> >>>> The crash happens because we're trying to remove an element from an
> >> >>>> empty list (next->pending):
> >> >>>>
> >> >>>> (gdb) print next
> >> >>>> $38 = (struct state_queue *) 0x2aaaad1003b0
> >> >>>>
> >> >>>> (gdb) print *next
> >> >>>> $39 = {thid = 0, list = {next = 0x2aaaad1003b8, prev = 0x2aaaad1003b8},
> >> >>>>   pending = {next = 0x2aaaad500048, prev = 0x2aaaad500048},
> >> >>>>   ap = 0x55555569d1e0, state = ST_READMAP, busy = 0, done = 0, cancel = 0}
> >> >>>>
> >> >>>> (gdb) print next->pending
> >> >>>> $40 = {next = 0x2aaaad500048, prev = 0x2aaaad500048}   <== Empty
> >> >>>>
> >> >>>> (gdb) print *next->pending->next
> >> >>>> Cannot access memory at address 0x2aaaad500048
> >> >>>>
> >> >>>> Should it test whether the list is empty before trying to remove the
> >> >>>> element, or should this condition never happen?
> >> >>>
> >> >>> There's definitely something fishy going on.
> >> >>>
> >> >>> Above, at line 1142 the pending list is checked to see if it is empty,
> >> >>> so it can't be empty if we manage to get to the list_del_init() at line
> >> >>> 1155, since the list mutex is held during these operations.
> >> >>
> >> >> Ah, thanks for pointing this out.
> >> >>
> >> >>>> Unfortunately I don't have the debug logs, only verbose:
> >> >>>>
> >> >>>> Oct 27 16:15:56 host automount[4621]: 83 remaining in /-
> >> >>>> Oct 27 16:16:03 host automount[4621]: 15 remaining in /usr2
> >> >>>> Oct 27 16:16:51 host automount[4621]: 83 remaining in /-
> >> >>>> Oct 27 16:19:10 host automount[4621]: 15 remaining in /usr2
> >> >>>> Oct 27 16:20:03 host automount[4621]: 83 remaining in /-
> >> >>>> Oct 27 16:20:03 host automount[4621]: 15 remaining in /usr2
> >> >>>> Oct 27 16:20:04 host automount[4621]: re-reading master map auto_master
> >> >>>> Oct 27 16:20:04 host automount[4621]: 83 remaining in /-
> >> >>>> Oct 27 16:20:04 host kernel: automount[4623]: segfault at
> >> >>>> 00002aaaad500050 rip 000055555556c9fb rsp 0000000040021190 error 6
> >> >>>
> >> >>> I have had recent reports of overlapping tasks.
> >> >>>
> >> >>> For example, an expire not completing before continuing (only relevant
> >> >>> for submounts), and read map tasks starting before the current task is
> >> >>> done.
> >> >>>
> >> >>> I have a couple of patches, but they've seen little testing since I
> >> >>> can't reproduce the problems at will.
> >> >>>
> >> >>>> And here is the full call trace, for reference:
> >> >>>>
> >> >>>> Thread 8 (Thread 4621):
> >> >>>> #0  do_sigwait (set=0x7fffb4737710, sig=0x7fffb473782c)
> >> >>>>     at ../nptl/sysdeps/unix/sysv/linux/../../../../../sysdeps/unix/sysv/linux/sigwait.c:65
> >> >>>> #1  0x00002b1a096c9aed in __sigwait (set=0x7fffb4737710, sig=0x7fffb473782c)
> >> >>>>     at ../nptl/sysdeps/unix/sysv/linux/../../../../../sysdeps/unix/sysv/linux/sigwait.c:100
> >> >>>> #2  0x0000555555560b96 in statemachine (argc=<value optimized out>,
> >> >>>>     argv=<value optimized out>) at automount.c:1327
> >> >>>> #3  main (argc=<value optimized out>, argv=<value optimized out>)
> >> >>>>     at automount.c:2142
> >> >>>>
> >> >>>> Thread 7 (Thread 4622):
> >> >>>> #0  0x00002b1a096c6517 in pthread_cond_timedwait@@GLIBC_2.3.2 ()
> >> >>>>     from /lib64/libpthread.so.0
> >> >>>> #1  0x0000555555573812 in alarm_handler (arg=<value optimized out>)
> >> >>>>     at alarm.c:206
> >> >>>> #2  0x00002b1a096c2193 in start_thread (arg=<value optimized out>)
> >> >>>>     at pthread_create.c:306
> >> >>>> #3  0x00002b1a09e480dd in clone () from /lib64/libc.so.6
> >> >>>>
> >> >>>> Thread 6 (Thread 4626):
> >> >>>> #0  0x00002b1a09e3fdc6 in __poll (fds=0x40822150, nfds=3, timeout=-1)
> >> >>>>     at ../sysdeps/unix/sysv/linux/poll.c:87
> >> >>>> #1  0x000055555555f2a9 in get_pkt (arg=<value optimized out>)
> >> >>>>     at automount.c:882
> >> >>>> #2  handle_packet (arg=<value optimized out>) at automount.c:1019
> >> >>>> #3  handle_mounts (arg=<value optimized out>) at automount.c:1551
> >> >>>> #4  0x00002b1a096c2193 in start_thread (arg=<value optimized out>)
> >> >>>>     at pthread_create.c:306
> >> >>>> #5  0x00002b1a09e480dd in clone () from /lib64/libc.so.6
> >> >>>>
> >> >>>> Thread 5 (Thread 4629):
> >> >>>> #0  0x00002b1a09e3fdc6 in __poll (fds=0x41023150, nfds=3, timeout=-1)
> >> >>>>     at ../sysdeps/unix/sysv/linux/poll.c:87
> >> >>>> #1  0x000055555555f2a9 in get_pkt (arg=<value optimized out>)
> >> >>>>     at automount.c:882
> >> >>>> #2  handle_packet (arg=<value optimized out>) at automount.c:1019
> >> >>>> #3  handle_mounts (arg=<value optimized out>) at automount.c:1551
> >> >>>> #4  0x00002b1a096c2193 in start_thread (arg=<value optimized out>)
> >> >>>>     at pthread_create.c:306
> >> >>>> #5  0x00002b1a09e480dd in clone () from /lib64/libc.so.6
> >> >>>>
> >> >>>> Thread 4 (Thread 4630):
> >> >>>> #0  0x00002b1a09e3fdc6 in __poll (fds=0x41824150, nfds=3, timeout=-1)
> >> >>>>     at ../sysdeps/unix/sysv/linux/poll.c:87
> >> >>>> #1  0x000055555555f2a9 in get_pkt (arg=<value optimized out>)
> >> >>>>     at automount.c:882
> >> >>>> #2  handle_packet (arg=<value optimized out>) at automount.c:1019
> >> >>>> #3  handle_mounts (arg=<value optimized out>) at automount.c:1551
> >> >>>> #4  0x00002b1a096c2193 in start_thread (arg=<value optimized out>)
> >> >>>>     at pthread_create.c:306
> >> >>>> #5  0x00002b1a09e480dd in clone () from /lib64/libc.so.6
> >> >>>>
> >> >>>> Thread 3 (Thread 23927):
> >> >>>> #0  _IO_vfscanf_internal (s=0x41921820, format=<value optimized out>,
> >> >>>>     argptr=0x41921940, errp=0x0) at vfscanf.c:1534
> >> >>>> #1  0x00002b1a09de6bd5 in _IO_vsscanf (string=0x41921ef8 "0 0",
> >> >>>>     format=0x2b1a09e91adf " %d %d ", args=0x41921940) at iovsscanf.c:45
> >> >>>> #2  0x00002b1a09de1e98 in __sscanf (s=0xa <Address 0xa out of bounds>,
> >> >>>>     format=0x41921100 "0") at sscanf.c:34
> >> >>>> #3  0x00002b1a09e433e5 in __getmntent_r (stream=0x55555596aa20, mp=0x41924e80,
> >> >>>>     buffer=0x41921ef8 "0 0", bufsiz=12288) at mntent_r.c:170
> >> >>>> #4  0x0000555555571429 in get_mnt_list (table=<value optimized out>,
> >> >>>>     path=0x55555569d610 "/net", include=0) at mounts.c:227
> >> >>>> #5  0x0000555555561ed9 in expire_proc_indirect (arg=<value optimized out>)
> >> >>>>     at indirect.c:554
> >> >>>> #6  0x00002b1a096c2193 in start_thread (arg=<value optimized out>)
> >> >>>>     at pthread_create.c:306
> >> >>>> #7  0x00002b1a09e480dd in clone () from /lib64/libc.so.6
> >> >>>>
> >> >>>> Thread 2 (Thread 23928):
> >> >>>> #0  0x00002b1a09e3a91b in read () from /lib64/libc.so.6
> >> >>>> #1  0x00002b1a09defca2 in _IO_new_file_underflow (fp=0x555555989470)
> >> >>>>     at fileops.c:590
> >> >>>> #2  0x00002b1a09df042e in _IO_default_uflow (fp=0x67) at genops.c:435
> >> >>>> #3  0x00002b1a09de56e4 in _IO_getline_info (fp=0x555555989470,
> >> >>>>     buf=0x41a22e80 "auto.projects /a/b/c/d autofs rw,fd=7,pgrp=4621,timeout=86400,minproto=5,maxproto=5,direct=5,maxproto=5,direct",
> >> >>>>     n=12184, delim=10, extract_delim=1, eof=0x0) at iogetline.c:74
> >> >>>> #4  0x00002b1a09dedec9 in fgets_unlocked (
> >> >>>>     buf=0x41a22e80 "auto.projects /a/b/c/d autofs rw,fd=7,pgrp=4621,timeout=86400,minproto=5,maxproto=5,direct=5,maxproto=5,direct",
> >> >>>>     n=<value optimized out>, fp=0x555555989470) at iofgets_u.c:56
> >> >>>> #5  0x00002b1a09e43252 in __getmntent_r (stream=0x555555989470, mp=0x41a25e80,
> >> >>>>     buffer=0x41a22e80 "auto.projects /a/b/c/d autofs rw,fd=7,pgrp=4621,timeout=86400,minproto=5,maxproto=5,direct=5,maxproto=5,direct",
> >> >>>>     bufsiz=12288) at mntent_r.c:133
> >> >>>> #6  0x0000555555571429 in get_mnt_list (table=<value optimized out>,
> >> >>>>     path=0x55555569d960 "/usr2", include=0) at mounts.c:227
> >> >>>> #7  0x0000555555561ed9 in expire_proc_indirect (arg=<value optimized out>)
> >> >>>>     at indirect.c:554
> >> >>>> #8  0x00002b1a096c2193 in start_thread (arg=<value optimized out>)
> >> >>>>     at pthread_create.c:306
> >> >>>> #9  0x00002b1a09e480dd in clone () from /lib64/libc.so.6
> >> >>>>
> >> >>>> Thread 1 (Thread 4623):
> >> >>>> #0  st_queue_handler (arg=<value optimized out>) at ../include/list.h:79
> >> >>>> #1  0x00002b1a096c2193 in start_thread (arg=<value optimized out>)
> >> >>>>     at pthread_create.c:306
> >> >>>> #2  0x00002b1a09e480dd in clone () from /lib64/libc.so.6
> >> >>>
> >> >>> So the SEGV is in thread 1?
> >> >>
> >> >> Yes.
> >> >>
> >> >>> Now, did you work out the location of the SEGV from this backtrace?
> >> >>
> >> >> I cross-checked some facts. This is what gdb prints when I open the
> >> >> core dump:
> >> >>
> >> >> Core was generated by `/usr/sbin/automount -p /var/run/automount.pid'.
> >> >> Program terminated with signal 11, Segmentation fault.
> >> >> #0  st_queue_handler (arg=<value optimized out>) at ../include/list.h:79
> >> >> 79              next->prev = prev;
> >> >>
> >> >> And this is the segfault message printed by the kernel:
> >> >>
> >> >> Oct 27 16:20:04 host kernel: automount[4623]: segfault at
> >> >> 00002aaaad500050 rip 000055555556c9fb rsp 0000000040021190 error 6
> >> >>
> >> >> Disassembling at %RIP confirms the function st_queue_handler() was
> >> >> running:
> >> >>
> >> >> (gdb) disassemble 0x000055555556c9fb
> >> >> Dump of assembler code for function st_queue_handler:
> >> >>    0x000055555556c860 <+0>:     push %r13
> >> >>    (...)
> >> >> => 0x000055555556c9fb <+411>:   mov %rdx,0x8(%rcx)
> >> >>
> >> >> %RCX is 0x2aaaad500048, and the SEGV happened when it tried to write to
> >> >> 0x2aaaad500048+0x8 (0x2aaaad500050).
> >> >>
> >> >> Figuring out where exactly inside st_queue_handler() the crash happened
> >> >> was a bit trickier, because all these list functions are inlined by the
> >> >> compiler, so the assembly code gets more complicated.
> >> >>
> >> >>> We can't be sure this is really the source of the SEGV.
> >> >>> Does your customer see a different failure location on other
> >> >>> occurrences of the SEGV?
> >> >>
> >> >> So far it has occurred only once.
> >> >
> >> > Ian,
> >> >
> >> > Another idea:
> >> >
> >> > 1140 remove:
> >> > 1141         /* No more tasks for this queue */
> >> > 1142         if (list_empty(&task->pending)) {
> >> > 1143                 list_del(&task->list);
> >> > 1144                 free(task);
> >> > 1145                 continue;
> >> > 1146         }
> >> >
> >> > As you said, at this point &task->pending has at least one element.
> >> >
> >> > 1148         /* Next task */
> >> > 1149         next = list_entry((&task->pending)->next,
> >> > 1150                           struct state_queue, pending);
> >> >
> >> > Now what could be happening here is that (&task->pending)->next == task.
> >
> > Anything is possible, but if there is anything on the pending list it
> > should not have had a task started, since it is pending. We would need
> > to work out whether there is a case where a task is taken from the
> > pending list, started, and not moved to the task list, or just copied
> > and not deleted from the pending list. On the face of it, it doesn't
> > look like that can happen.
> >> >
> >> > 1152         list_del(&task->list);
> >> > 1153         free(task);
> >> >
> >> > Since we free task here, next may no longer be valid on the next line:
> >> >
> >> > 1155         list_del_init(&next->pending);
> >> > 1156         list_add_tail(&next->list, head);
> >> > 1157         if (p == head)
> >> > 1158                 p = head->next;
> >> >
> >> > Proposed fix:
> >> >
> >> > diff --git a/daemon/state.c b/daemon/state.c
> >> > index 51809a1..0015618 100644
> >> > --- a/daemon/state.c
> >> > +++ b/daemon/state.c
> >> > @@ -1150,12 +1150,12 @@ remove:
> >> >  			struct state_queue, pending);
> >> >  
> >> >  		list_del(&task->list);
> >> > -		free(task);
> >> >  
> >> >  		list_del_init(&next->pending);
> >> >  		list_add_tail(&next->list, head);
> >> >  		if (p == head)
> >> >  			p = head->next;
> >> > +		free(task);
> >> >  	}
> >> >  
> >> >  	if (list_empty(head))
> >> >
> >> > Before commit 6273ab63c, task was freed after the call to
> >> > list_del_init(). What do you think?
> >
> > We can try it, but since the problem is so infrequent it will be hard
> > to confirm.
> >
> > The real problem is that we don't really know the root cause.
> > It could be late reporting, or an indirect failure caused by something
> > elsewhere in the code. This sort of bug is really difficult to
> > diagnose.
> >
> >>
> >> Ian,
> >>
> >> Do you think this makes sense, or is it just masking a symptom of a
> >> more generic bug?
> >
> > It's probably a symptom of something else, *sigh*.
> >
> > btw, the git repo is now available at kernel.org.
> > Check http://git.kernel.org/ and search for autofs.
> > I haven't committed any updates yet and probably won't until some time
> > after I have posted a message to the new autofs mailing list telling
> > everyone about it.
> >
> >>
> >> Thanks,
> >> Leonardo
>
> Hello Ian,
>
> Sorry to resurrect such an old thread.
> We're still carrying this patch
> in our package and I'd like to get rid of it, preferably by getting it
> accepted upstream :-)
>
> As described above, the core dump analysis really pointed to a
> use-after-free condition when (&task->pending)->next == task. It's
> unfortunate that we received such a report only once and had no
> specific information about the circumstances that could trigger the
> problem.
>
> In any case, moving free(task) to the end of the block should be
> safe, and is actually how it was done before commit 6273ab63c. Do
> you think you can accept the patch as is?

I didn't see a patch in the previous thread. Presumably you have a patch
with a description you could post.

TBH, I don't think it's worth pursuing this further, since the change
looks safe enough that I can just add it to my commit queue for the next
time I commit changes.

Ian
--
To unsubscribe from this list: send the line "unsubscribe autofs" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html