Re: [PATCH 0/5] mm/kasan: advanced check

Dmitry Vyukov <dvyukov@xxxxxxxxxx> · Wed, 22 Nov 2017 09:51:11 +0100

On Wed, Nov 22, 2017 at 5:30 AM, Joonsoo Kim <iamjoonsoo.kim@xxxxxxx> wrote:
> On Mon, Nov 20, 2017 at 11:56:05AM -0800, Wengang wrote:
>>
>>
>> On 11/19/2017 05:50 PM, Joonsoo Kim wrote:
>> >On Fri, Nov 17, 2017 at 11:56:21PM +0100, Dmitry Vyukov wrote:
>> >>On Fri, Nov 17, 2017 at 11:30 PM, Wengang Wang <wen.gang.wang@xxxxxxxxxx> wrote:
>> >>>Kasan advanced check, I'm going to add this feature.
>> >>>Currently Kasan provide the detection of use-after-free and out-of-bounds
>> >>>problems. It is not able to find the overwrite-on-allocated-memory issue.
>> >>>We sometimes hit this kind of issue: We have a messed up structure
>> >>>(usually dynamially allocated), some of the fields in the structure were
>> >>>overwritten with unreasaonable values. And kernel may panic due to those
>> >>>overeritten values. We know those fields were overwritten somehow, but we
>> >>>have no easy way to find out which path did the overwritten. The advanced
>> >>>check wants to help in this scenario.
>> >>>
>> >>>The idea is to define the memory owner. When write accesses come from
>> >>>non-owner, error should be reported. Normally the write accesses on a given
>> >>>structure happen in only several or a dozen of functions if the structure
>> >>>is not that complicated. We call those functions "allowed functions".
>> >>>The work of defining the owner and binding memory to owner is expected to
>> >>>be done by the memory consumer. In the above case, memory consume register
>> >>>the owner as the functions which have write accesses to the structure then
>> >>>bind all the structures to the owner. Then kasan will do the "owner check"
>> >>>after the basic checks.
>> >>>
>> >>>As implementation, kasan provides a API to it's user to register their
>> >>>allowed functions. The API returns a token to users.  At run time, users
>> >>>bind the memory ranges they are interested in to the check they registered.
>> >>>Kasan then checks the bound memory ranges with the allowed functions.
>> >>>
>> >>>
>> >>>Signed-off-by: Wengang Wang <wen.gang.wang@xxxxxxxxxx>
>> >Hello, Wengang.
>> >
>> >Nice idea. I also think that we need this kind of debugging tool. It's very
>> >hard to detect overwritten bugs.
>> >
>> >In fact, I made a quite similar tool, valid access checker (A.K.A.
>> >vchecker). See the following link.
>> >
>> >https://github.com/JoonsooKim/linux/tree/vchecker-master-v0.3-next-20170106
>> >
>> >Vchecker has some advanced features compared to yours.
>> >
>> >1. Target object can be choosen at runtime by debugfs. It doesn't
>> >require re-compile to register the target object.
>> Hi Joonsoo, good to know you are also interested in this!
>>
>> Yes, if can be choosen via debugfs, it doesn't need re-compile.
>> Well, I wonder what do you expect to be chosen from use space?
>
> As you mentioned somewhere, this tool can be used when we find the
> overwritten happend on some particular victims. I assumes that most of
> the problem would happen on slab objects and userspace can choose the
> target slab cache via debugfs interface of the vchecker.

Most objects are allocated from kmalloc slabs. And this feature can't
work for all objects allocated from a kmalloc slab. I think checks
needs to be tied to allocation sites.

>
>>
>> >
>> >2. It has another feature that checks the value stored in the object.
>> >Usually, invalid writer stores odd value into the object and vchecker
>> >can detect this case.
>> It's good to do the check. Well, as I understand, it tells something
>> bad (overwitten) happened.
>> But it can't tell who did the overwritten, right?  (I didn't look at
>> your patch yet,) do you recall the last write somewhere?
>
> Yes, it stores the callstack of the last write and report it when
> the error is found.
>
>>
>> >
>> >3. It has a callstack checker (memory owner checker in yours). It
>> >checks all the callstack rather than just the caller. It's important
>> >since invalid writer could call the parent function of owner function
>> >and it would not be catched by checking just the caller.
>> >
>> >4. The callstack checker is more automated. vchecker collects the valid
>> >callstack by running the system.
>> I think we can merge the above two into one.
>> So you are doing full stack check.  Well, finding out the all the
>> paths which have the write access may be not a very easy thing.
>> Missing some paths may cause dmesg flooding, and those log won't
>> help at all. Finding out all the (owning) caller only is relatively
>> much easier.
>
> Vchecker can be easily modified to store only the caller. It just
> requires modifying callstack depth parameter so it's so easy.
> Moreover, it can be accomplished by adding debugfs interface.
>
> Anyway, I don't think that finding out all the (owning) caller only
> is much easier. Think about dentry or inode object. It is accessed by
> various code path and it's not easy to cover all the owning caller by
> manual approach.
>
>
>> There do is the case you pointed out here. In this case, the
>> debugger can make slight change to the calling path. And as I
>> understand,
>> most of the overwritten are happening in quite different call paths,
>> they are not calling the (owning) caller.
>
> Agreed.
>
>>
>> >
>> >FYI, I attach some commit descriptions of the vchecker.
>> >
>> >     vchecker: store/report callstack of value writer
>> >     The purpose of the value checker is finding invalid user writing
>> >     invalid value at the moment that the value is written. However, there is
>> >     a missing infrastructure that passes writing value to the checker
>> >     since we temporarilly piggyback on the KASAN. So, we cannot easily
>> >     detect this case in time.
>> >     However, by following way, we can emulate similar effect.
>> >     1. Store callstack when memory is written.
>>
>> Oh, seems you are storing the callstack for each write. -- I am not
>> sure if that would too heavy.
>
> Unlike KASAN that checks all type of the objects, this debugging
> feature is only enabled on the specific type of the objects so
> overhead would not be too heavy in terms of system overall
> performance.
>
>> Actually I was thinking to have a check on the new value. But seems
>> compiler doesn't provide that.
>
> Yes, look like we have a similar idea. I have some another ideas if
> ASAN hook provides the value to be written. However, it's not
> supported by compiler yet.
>
>> >     2. If check is failed in next access, report previous write-access
>> >     callstack
>> >     It will caught offending user properly.
>> >     Following output "Call trace: Invalid writer" part is the result
>> >     of this patch. We find the invalid value at workfn+0x71 but report
>> >     writer at workfn+0x61.
>> >     [  133.024076] ==================================================================
>> >     [  133.025576] BUG: VCHECKER: invalid access in workfn+0x71/0xc0 at addr ffff8800683dd6c8
>> >     [  133.027196] Read of size 8 by task kworker/1:1/48
>> >     [  133.028020] 0x8 0x10 value
>> >     [  133.028020] 0xffff 4
>> >     [  133.028020] Call trace: Invalid writer
>> >     [  133.028020]
>> >     [  133.028020] [<ffffffff81043b1b>] save_stack_trace+0x1b/0x20
>> >     [  133.028020]
>> >     [  133.028020] [<ffffffff812c0db9>] save_stack+0x39/0x70
>> >     [  133.028020]
>> >     [  133.028020] [<ffffffff812c0fe3>] check_value+0x43/0x80
>> >     [  133.028020]
>> >     [  133.028020] [<ffffffff812c1762>] vchecker_check+0x1c2/0x380
>> >     [  133.028020]
>> >     [  133.028020] [<ffffffff812be49d>] __asan_store8+0x8d/0xc0
>> >     [  133.028020]
>> >     [  133.028020] [<ffffffff815eadd1>] workfn+0x61/0xc0
>> >     [  133.028020]
>> >     [  133.028020] [<ffffffff810be3df>] process_one_work+0x28f/0x680
>> >     [  133.028020]
>> >     [  133.028020] [<ffffffff810bf272>] worker_thread+0xa2/0x870
>> >     [  133.028020]
>> >     [  133.028020] [<ffffffff810c86a5>] kthread+0x195/0x1e0
>> >     [  133.028020]
>> >     [  133.028020] [<ffffffff81b9d3d2>] ret_from_fork+0x22/0x30
>> >     [  133.028020] CPU: 1 PID: 48 Comm: kworker/1:1 Not tainted 4.10.0-rc2-next-20170106+ #1179
>> >     [  133.028020] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS Bochs 01/01/2011
>> >     [  133.028020] Workqueue: events workfn
>> >     [  133.028020] Call Trace:
>> >     [  133.028020]  dump_stack+0x4d/0x63
>> >     [  133.028020]  kasan_object_err+0x21/0x80
>> >     [  133.028020]  vchecker_check+0x2af/0x380
>> >     [  133.028020]  ? workfn+0x71/0xc0
>> >     [  133.028020]  ? workfn+0x71/0xc0
>> >     [  133.028020]  __asan_load8+0x87/0xb0
>> >     [  133.028020]  workfn+0x71/0xc0
>> >     [  133.028020]  process_one_work+0x28f/0x680
>> >     [  133.028020]  worker_thread+0xa2/0x870
>> >     [  133.028020]  kthread+0x195/0x1e0
>> >     [  133.028020]  ? put_pwq_unlocked+0xc0/0xc0
>> >     [  133.028020]  ? kthread_park+0xd0/0xd0
>> >     [  133.028020]  ret_from_fork+0x22/0x30
>> >     [  133.028020] Object at ffff8800683dd6c0, in cache vchecker_test size: 24
>> >     [  133.028020] Allocated:
>> >     [  133.028020] PID = 48
>> >
>> >
>> >     vchecker: Add 'callstack' checker
>> >     The callstack checker is to find invalid code paths accessing to a
>> >     certain field in an object.  Currently it only saves all stack traces at
>> >     the given offset.  Reporting will be added in the next patch.
>> >     The below example checks callstack of anon_vma:
>> >       # cd /sys/kernel/debug/vchecker
>> >       # echo 0 8 > anon_vma/callstack  # offset 0, size 8
>> >       # echo 1 > anon_vma/enable
>> an echo "anon_vma" > <something> first?
>> How do you define and path the valid (owning) full stack to kasan?
>
> This interface only enables to store all the callstacks. No validation
> check here. I think that this feature would also be helpful to debug.
> If error happens, we can check all the previous callstacks and track
> the buggy caller.
>
>> >       # cat anon_vma/callstack        # show saved callstacks
>> >       0x0 0x8 callstack
>> >       total: 42
>> >       callstack #0
>> >         anon_vma_fork+0x101/0x280
>> >         copy_process.part.10+0x15ff/0x2a40
>> >         _do_fork+0x155/0x7d0
>> >         SyS_clone+0x19/0x20
>> >         do_syscall_64+0xdf/0x460
>> >         return_from_SYSCALL_64+0x0/0x7a
>> >       ...
>> >
>> >
>> >     vchecker: Support toggle on/off of callstack check
>> >     By default, callstack checker only collects callchains.  When a user
>> >     writes 'on' to the callstack file in debugfs, it checks and reports new
>> >     callstacks.  Writing 'off' to disable it again.
>> >       # cd /sys/kernel/debug/vchecker
>> >       # echo 0 8 > anon_vma/callstack
>> >       # echo 1 > anon_vma/enable
>> >       ... (do some work to collect enough callstacks) ...
>> How to define "enough" here?
>
> The bug usually doesn't happen immediately since it usually happens on
> the corner case. When debugging, we run the workload that causes the
> bug and then wait for some time until the bug happens. "Enough" can
> be defined as the middle of this waiting time. After some warm-up
> time, all the common callstack would be collected. Then,
> switching on this feature that reports a new callstack. If the corner
> case that is on a new callstack happens, this new callstack will be
> reported and we can check whether it is a true bug or not.
>
>> >       # echo on > anon_vma/callstack
>> >
>> >The reason I didn't submit the vchecker to mainline is that I didn't find
>> >the case that this tool is useful in real life. Most of the system broken case
>> >can be debugged by other ways. Do you see the real case that this tool is
>> >helpful? If so, I think that vchecker is more appropriate to be upstreamed.
>> >Could you share your opinion?
>> Yes, people find other ways to solve overwritten issue (so did I) in
>> the past. If kasan doesn't provide this functionality, developers
>> have no way to choose it.
>> Though people have other ways to find the root cause, the other ways
>> maybe take (maybe much) longer. I didn't solve problems with the
>> owner check yet since I just make available recently.  But
>> considering the overwritten issues I have ever hit, the owner check
>> definitely helps and I definitely will try the owner check when I
>> have a new overwritten issue!
>>
>> Why not send your patch for review?
>
> Okay! I hope to find more people that have interest on this feature
> and it seems that you are the one of them. :)
>
> I will send my patches soon. I think that we can be cooperative to
> improve this feature.
>
> Thanks.

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@xxxxxxxxx.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@xxxxxxxxx";> email@xxxxxxxxx </a>