Re: Segmentation fault in st_queue_handler()

Leonardo Chiquitto <leonardo.lists@xxxxxxxxx> · Mon, 20 Jan 2014 14:42:00 -0200

On Thu, Dec 1, 2011 at 10:40 PM, Ian Kent <raven@xxxxxxxxxx> wrote:
> On Thu, 2011-12-01 at 13:00 -0300, Leonardo Chiquitto wrote:
>> On Thu, Nov 24, 2011 at 3:06 PM, Leonardo Chiquitto
>> <leonardo.lists@xxxxxxxxx> wrote:
>> > On Wed, Nov 23, 2011 at 7:14 AM, Leonardo Chiquitto
>> > <leonardo.lists@xxxxxxxxx> wrote:
>> >> On Tue, Nov 22, 2011 at 9:18 PM, Ian Kent <raven@xxxxxxxxxx> wrote:
>> >>> On Tue, 2011-11-22 at 16:34 -0300, Leonardo Chiquitto wrote:
>> >>>> Hello,
>> >>>>
>> >>>> We received a bug report describing a segfault in automount, more
>> >>>> specifically at st_queue_handler():
>> >>>>
>> >>>> 1140 remove:
>> >>>> 1141         /* No more tasks for this queue */
>> >>>> 1142         if (list_empty(&task->pending)) {
>> >>>> 1143           list_del(&task->list);
>> >>>> 1144           free(task);
>> >>>> 1145           continue;
>> >>>> 1146         }
>> >>>> 1147
>> >>>> 1148         /* Next task */
>> >>>> 1149         next = list_entry((&task->pending)->next,
>> >>>> 1150               struct state_queue, pending);
>> >>>> 1151
>> >>>> 1152         list_del(&task->list);
>> >>>> 1153         free(task);
>> >>>> 1154
>> >>>> 1155         list_del_init(&next->pending);     <== SEGV
>> >>>> 1156         list_add_tail(&next->list, head);
>> >>>> 1157         if (p == head)
>> >>>> 1158           p = head->next;
>> >>>> 1159       }
>> >>>> 1160
>> >>>> 1161       if (list_empty(head))
>> >>>> 1162         break;
>> >>>>
>> >>>> The crash happens because we're trying to remove an element from an empty
>> >>>> list (next->pending):
>> >>>>
>> >>>> (gdb) print next
>> >>>> $38 = (struct state_queue *) 0x2aaaad1003b0
>> >>>>
>> >>>> (gdb) print *next
>> >>>> $39 = {thid = 0, list = {next = 0x2aaaad1003b8, prev = 0x2aaaad1003b8},
>> >>>>   pending = {next = 0x2aaaad500048, prev = 0x2aaaad500048},
>> >>>>   ap = 0x55555569d1e0, state = ST_READMAP, busy = 0, done = 0, cancel = 0}
>> >>>>
>> >>>> (gdb) print next->pending
>> >>>> $40 = {next = 0x2aaaad500048, prev = 0x2aaaad500048}  <== Empty
>> >>>>
>> >>>> (gdb) print *next->pending->next
>> >>>> Cannot access memory at address 0x2aaaad500048
>> >>>>
>> >>>> Should it be testing if the list is not empty before trying to remove the
>> >>>> element or this condition should never happen?
>> >>>
>> >>> There's definitely something fishy going on.
>> >>>
>> >>> Above, at line 1142 the pending list is checked to see if it is empty so
>> >>> it can't be empty if we manage to get to the list_del_init() at line
>> >>> 1155 since the list mutex is held during these operations.
>> >>
>> >> Ah, thanks for pointing this.
>> >>
>> >>>> Unfortunately I don't have the debug logs, only verbose:
>> >>>>
>> >>>> Oct 27 16:15:56 host automount[4621]: 83 remaining in /-
>> >>>> Oct 27 16:16:03 host automount[4621]: 15 remaining in /usr2
>> >>>> Oct 27 16:16:51 host automount[4621]: 83 remaining in /-
>> >>>> Oct 27 16:19:10 host automount[4621]: 15 remaining in /usr2
>> >>>> Oct 27 16:20:03 host automount[4621]: 83 remaining in /-
>> >>>> Oct 27 16:20:03 host automount[4621]: 15 remaining in /usr2
>> >>>> Oct 27 16:20:04 host automount[4621]: re-reading master map auto_master
>> >>>> Oct 27 16:20:04 host automount[4621]: 83 remaining in /-
>> >>>> Oct 27 16:20:04 host kernel: automount[4623]: segfault at
>> >>>> 00002aaaad500050 rip 000055555556c9fb rsp 0000000040021190 error 6
>> >>>
>> >>> I have had recent reports of overlap of tasks.
>> >>>
>> >>> For example expire not completing before continuing (but only relevant
>> >>> for submounts) and read map tasks starting before a current task is
>> >>> done.
>> >>>
>> >>> I have a couple of patches but they've seen little testing since I can't
>> >>> reproduce the problems at will.
>> >>>
>> >>>>
>> >>>> And here is the full call trace, for reference:
>> >>>>
>> >>>> Thread 8 (Thread 4621):
>> >>>> #0  do_sigwait (set=0x7fffb4737710, sig=0x7fffb473782c)
>> >>>>     at ../nptl/sysdeps/unix/sysv/linux/../../../../../sysdeps/unix/sysv/linux/sigwait.c:65
>> >>>> #1  0x00002b1a096c9aed in __sigwait (set=0x7fffb4737710, sig=0x7fffb473782c)
>> >>>>     at ../nptl/sysdeps/unix/sysv/linux/../../../../../sysdeps/unix/sysv/linux/sigwait.c:100
>> >>>> #2  0x0000555555560b96 in statemachine (argc=<value optimized out>,
>> >>>>     argv=<value optimized out>) at automount.c:1327
>> >>>> #3  main (argc=<value optimized out>, argv=<value optimized out>)
>> >>>>     at automount.c:2142
>> >>>>
>> >>>> Thread 7 (Thread 4622):
>> >>>> #0  0x00002b1a096c6517 in pthread_cond_timedwait@@GLIBC_2.3.2 ()
>> >>>>     from /lib64/libpthread.so.0
>> >>>> #1  0x0000555555573812 in alarm_handler (arg=<value optimized out>)
>> >>>>     at alarm.c:206
>> >>>> #2  0x00002b1a096c2193 in start_thread (arg=<value optimized out>)
>> >>>>     at pthread_create.c:306
>> >>>> #3  0x00002b1a09e480dd in clone () from /lib64/libc.so.6
>> >>>>
>> >>>> Thread 6 (Thread 4626):
>> >>>> #0  0x00002b1a09e3fdc6 in __poll (fds=0x40822150, nfds=3, timeout=-1)
>> >>>>     at ../sysdeps/unix/sysv/linux/poll.c:87
>> >>>> #1  0x000055555555f2a9 in get_pkt (arg=<value optimized out>)
>> >>>>     at automount.c:882
>> >>>> #2  handle_packet (arg=<value optimized out>) at automount.c:1019
>> >>>> #3  handle_mounts (arg=<value optimized out>) at automount.c:1551
>> >>>> #4  0x00002b1a096c2193 in start_thread (arg=<value optimized out>)
>> >>>>     at pthread_create.c:306
>> >>>> #5  0x00002b1a09e480dd in clone () from /lib64/libc.so.6
>> >>>>
>> >>>> Thread 5 (Thread 4629):
>> >>>> #0  0x00002b1a09e3fdc6 in __poll (fds=0x41023150, nfds=3, timeout=-1)
>> >>>>     at ../sysdeps/unix/sysv/linux/poll.c:87
>> >>>> #1  0x000055555555f2a9 in get_pkt (arg=<value optimized out>)
>> >>>>     at automount.c:882
>> >>>> #2  handle_packet (arg=<value optimized out>) at automount.c:1019
>> >>>> #3  handle_mounts (arg=<value optimized out>) at automount.c:1551
>> >>>> #4  0x00002b1a096c2193 in start_thread (arg=<value optimized out>)
>> >>>>     at pthread_create.c:306
>> >>>> #5  0x00002b1a09e480dd in clone () from /lib64/libc.so.6
>> >>>>
>> >>>> Thread 4 (Thread 4630):
>> >>>> #0  0x00002b1a09e3fdc6 in __poll (fds=0x41824150, nfds=3, timeout=-1)
>> >>>>     at ../sysdeps/unix/sysv/linux/poll.c:87
>> >>>> #1  0x000055555555f2a9 in get_pkt (arg=<value optimized out>)
>> >>>>     at automount.c:882
>> >>>> #2  handle_packet (arg=<value optimized out>) at automount.c:1019
>> >>>> #3  handle_mounts (arg=<value optimized out>) at automount.c:1551
>> >>>> #4  0x00002b1a096c2193 in start_thread (arg=<value optimized out>)
>> >>>>     at pthread_create.c:306
>> >>>> #5  0x00002b1a09e480dd in clone () from /lib64/libc.so.6
>> >>>>
>> >>>> Thread 3 (Thread 23927):
>> >>>> #0  _IO_vfscanf_internal (s=0x41921820, format=<value optimized out>,
>> >>>>     argptr=0x41921940, errp=0x0) at vfscanf.c:1534
>> >>>> #1  0x00002b1a09de6bd5 in _IO_vsscanf (string=0x41921ef8 "0 0",
>> >>>>     format=0x2b1a09e91adf " %d %d ", args=0x41921940) at iovsscanf.c:45
>> >>>> #2  0x00002b1a09de1e98 in __sscanf (s=0xa <Address 0xa out of bounds>,
>> >>>>     format=0x41921100 "0") at sscanf.c:34
>> >>>> #3  0x00002b1a09e433e5 in __getmntent_r (stream=0x55555596aa20, mp=0x41924e80,
>> >>>>     buffer=0x41921ef8 "0 0", bufsiz=12288) at mntent_r.c:170
>> >>>> #4  0x0000555555571429 in get_mnt_list (table=<value optimized out>,
>> >>>>     path=0x55555569d610 "/net", include=0) at mounts.c:227
>> >>>> #5  0x0000555555561ed9 in expire_proc_indirect (arg=<value optimized out>)
>> >>>>     at indirect.c:554
>> >>>> #6  0x00002b1a096c2193 in start_thread (arg=<value optimized out>)
>> >>>>     at pthread_create.c:306
>> >>>> #7  0x00002b1a09e480dd in clone () from /lib64/libc.so.6
>> >>>>
>> >>>> Thread 2 (Thread 23928):
>> >>>> #0  0x00002b1a09e3a91b in read () from /lib64/libc.so.6
>> >>>> #1  0x00002b1a09defca2 in _IO_new_file_underflow (fp=0x555555989470)
>> >>>>     at fileops.c:590
>> >>>> #2  0x00002b1a09df042e in _IO_default_uflow (fp=0x67) at genops.c:435
>> >>>> #3  0x00002b1a09de56e4 in _IO_getline_info (fp=0x555555989470,
>> >>>>     buf=0x41a22e80 "auto.projects /a/b/c/d autofs
>> >>>> rw,fd=7,pgrp=4621,timeout=86400,minproto=5,maxproto=5,direct=5,maxproto=5,direct",
>> >>>> n=12184,
>> >>>>     delim=10, extract_delim=1, eof=0x0) at iogetline.c:74
>> >>>> #4  0x00002b1a09dedec9 in fgets_unlocked (
>> >>>>     buf=0x41a22e80 "auto.projects /a/b/c/d autofs
>> >>>> rw,fd=7,pgrp=4621,timeout=86400,minproto=5,maxproto=5,direct=5,maxproto=5,direct",
>> >>>>     at fileops.c:590
>> >>>> #2  0x00002b1a09df042e in _IO_default_uflow (fp=0x67) at genops.c:435
>> >>>> #3  0x00002b1a09de56e4 in _IO_getline_info (fp=0x555555989470,
>> >>>>     buf=0x41a22e80 "auto.projects /a/b/c/d autofs
>> >>>> rw,fd=7,pgrp=4621,timeout=86400,minproto=5,maxproto=5,direct=5,maxproto=5,direct",
>> >>>> n=12184,
>> >>>>     delim=10, extract_delim=1, eof=0x0) at iogetline.c:74
>> >>>> #4  0x00002b1a09dedec9 in fgets_unlocked (
>> >>>>     buf=0x41a22e80 "auto.projects /a/b/c/d autofs
>> >>>> rw,fd=7,pgrp=4621,timeout=86400,minproto=5,maxproto=5,direct=5,maxproto=5,direct",
>> >>>>     n=<value optimized out>, fp=0x555555989470) at iofgets_u.c:56
>> >>>> #5  0x00002b1a09e43252 in __getmntent_r (stream=0x555555989470, mp=0x41a25e80,
>> >>>>     buffer=0x41a22e80 "auto.projects /a/b/c/d autofs
>> >>>> rw,fd=7,pgrp=4621,timeout=86400,minproto=5,maxproto=5,direct=5,maxproto=5,direct",
>> >>>>     bufsiz=12288) at mntent_r.c:133
>> >>>> #6  0x0000555555571429 in get_mnt_list (table=<value optimized out>,
>> >>>>     path=0x55555569d960 "/usr2", include=0) at mounts.c:227
>> >>>> #7  0x0000555555561ed9 in expire_proc_indirect (arg=<value optimized out>)
>> >>>>     at indirect.c:554
>> >>>> #8  0x00002b1a096c2193 in start_thread (arg=<value optimized out>)
>> >>>>     at pthread_create.c:306
>> >>>> #9  0x00002b1a09e480dd in clone () from /lib64/libc.so.6
>> >>>>
>> >>>> Thread 1 (Thread 4623):
>> >>>> #0  st_queue_handler (arg=<value optimized out>) at ../include/list.h:79
>> >>>> #1  0x00002b1a096c2193 in start_thread (arg=<value optimized out>)
>> >>>>     at pthread_create.c:306
>> >>>> #2  0x00002b1a09e480dd in clone () from /lib64/libc.so.6
>> >>>
>> >>> So the SEGV is in thread 1?
>> >>
>> >> Yes.
>> >>
>> >>> Now did you work out the location of the SEGV from this backtrace?
>> >>
>> >> I cross checked some facts. This is what gdb prints when I open the core
>> >> dump:
>> >>
>> >>  Core was generated by `/usr/sbin/automount -p /var/run/automount.pid'.
>> >>  Program terminated with signal 11, Segmentation fault.
>> >>  #0  st_queue_handler (arg=<value optimized out>) at ../include/list.h:79
>> >>  79            next->prev = prev;
>> >>
>> >> And this is the segfault message printed by the kernel:
>> >>
>> >>  Oct 27 16:20:04 host kernel: automount[4623]: segfault at
>> >>  00002aaaad500050 rip 000055555556c9fb rsp 0000000040021190 error 6
>> >>
>> >> Disassembling %RIP confirms the function st_queue_handler() was running:
>> >>
>> >>  (gdb) disassemble 0x000055555556c9fb
>> >>  Dump of assembler code for function st_queue_handler:
>> >>     0x000055555556c860 <+0>:   push   %r13
>> >> (...)
>> >>  => 0x000055555556c9fb <+411>: mov    %rdx,0x8(%rcx)
>> >>
>> >>  %RCX is 0x2aaaad500048 and the SEGV happened when it tried to deref
>> >>  0x2aaaad500048+0x8 (0x2aaaad500050).
>> >>
>> >> Figuring out where exactly inside st_queue_handler() the crash happened
>> >> was a bit trickier because all these list functions are inlined by the compiler,
>> >> so the assembly code gets more complicated.
>> >>
>> >>> We can't be sure this is really the source of the SEGV.
>> >>> Does your customer see a different failure location on other occurrences
>> >>> of the SEGV?
>> >>
>> >> So far it occurred only once.
>> >
>> > Ian,
>> >
>> > Another idea:
>> >
>> > 1140 remove:
>> > 1141         /* No more tasks for this queue */
>> > 1142         if (list_empty(&task->pending)) {
>> > 1143           list_del(&task->list);
>> > 1144           free(task);
>> > 1145           continue;
>> > 1146         }
>> >
>> > As you said, at this point &task->pending has at least one element.
>> >
>> > 1148         /* Next task */
>> > 1149         next = list_entry((&task->pending)->next,
>> > 1150               struct state_queue, pending);
>> >
>> > Now what could be happening here is that (&task->pending)->next == task.
>
> Anything is possible but if there is anything on the pending list it
> should not have had a task started, since it is pending. We would need
> to work out if there is a case where a task is taken from the pending
> list, started and not moved to the task list or just copied not deleted
> from the pending list. On the face of it, it doesn't look like that can
> happen.
>
>> >
>> > 1152         list_del(&task->list);
>> > 1153         free(task);
>> >
>> > Since we free task here, &next won't be valid anymore in the next line:
>> >
>> > 1155         list_del_init(&next->pending);
>> > 1156         list_add_tail(&next->list, head);
>> > 1157         if (p == head)
>> > 1158           p = head->next;
>> >
>> > Proposed fix:
>> >
>> > diff --git a/daemon/state.c b/daemon/state.c
>> > index 51809a1..0015618 100644
>> > --- a/daemon/state.c
>> > +++ b/daemon/state.c
>> > @@ -1150,12 +1150,12 @@ remove:
>> >                                                        struct state_queue, pending);
>> >
>> >                                list_del(&task->list);
>> > -                               free(task);
>> >
>> >                                list_del_init(&next->pending);
>> >                                list_add_tail(&next->list, head);
>> >                                if (p == head)
>> >                                        p = head->next;
>> > +                               free(task);
>> >                        }
>> >
>> >                        if (list_empty(head))
>> >
>> >
>> > Before commit 6273ab63c, task was being freed after the call to list_del_init().
>> > What do you think?
>
> We can try it but since the problem is so infrequent it will be hard to
> confirm.
>
> The real problem is that we don't really know the root cause.
> It could be late reporting or an indirect fail caused by something in
> another place in the code. This sort of bug is really difficult to
> diagnose.
>
>>
>> Ian,
>>
>> Do you think this makes sense or is it just masking a symptom of a
>> more generic bug?
>
> It's probably a symptom of something else, *sigh*.
>
> btw, the git repo is now available at kernel.org.
> Check http://git.kernel.org/ and search for autofs.
> I haven't committed any updates yet and probably won't until some time
> after I have posted a message to the new autofs mailing list telling
> everyone about it.
>
>>
>> Thanks,
>> Leonardo

Hello Ian,

Sorry to resurrect such an old thread. We're still carrying this patch
in our package and I'd like to get rid of it, preferably by getting it
accepted upstream :-)

As described above, the core dump analysis really pointed to an
use after free condition when (&task->pending)->next == task. It's
unfortunate that we received such report only once and had no
specific info about the circumstances that could trigger the problem.

In any case, moving free(task) to the end of the block should be
safe and is actually how it was done before commit 6273ab63c. Do
you think you can accept the patch as is?

Thanks,
Leonardo
--
To unsubscribe from this list: send the line "unsubscribe autofs" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html