Re: Bcache still unstable for me (memory problems)

On Wed, Mar 09, 2016 at 03:59:30AM +0000, Eric Wheeler wrote:
> > gargamel:/mnt/mnt# cat /proc/sys/vm/min_free_kbytes
> > 19712
> 
> Ours is set to 256 MB (256*1024) and I've never had a problem.
> 
> > Should I change it?
> 
> Could try it, shrug.

Done :)
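
For the record, here is the unit conversion behind that value as a small
sketch. The helper name is made up for illustration, and it assumes "256mb"
means 256 MiB expressed in the kB units that min_free_kbytes uses:

```python
def mib_to_kbytes(mib: int) -> int:
    """Convert MiB to the kB units used by /proc/sys/vm/min_free_kbytes."""
    return mib * 1024


current_kb = 19712                  # value reported by cat above
suggested_kb = mib_to_kbytes(256)   # Eric's 256*1024

print(suggested_kb)          # 262144
print(current_kb / 1024)     # 19.25, i.e. the old reserve was about 19 MiB
```

So the new setting reserves roughly 13 times more free memory than before.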

> > So clearly on this boot too, it got registered late (20h-ish after boot)
> 
> I find it interesting that it re-registered md5 within 5 minutes of 24 
> hours after initial registration: (86242-102)/3600 = 23:55:40
 
Oh, I didn't do the math, but yeah, that looks very suspicious :)
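
Spelling out the arithmetic as a quick sketch (the function name is just for
illustration; the two numbers are the kernel uptime stamps, in seconds, from
the calculation quoted above):

```python
from datetime import timedelta


def uptime_gap(first_s: int, second_s: int) -> timedelta:
    """Time elapsed between two kernel uptime stamps given in seconds."""
    return timedelta(seconds=second_s - first_s)


gap = uptime_gap(102, 86242)
print(gap)                         # 23:55:40
print(timedelta(hours=24) - gap)   # 0:04:20 short of a full day
```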

> Is there some kind of cron.daily thing going on?  If you have timestamps 
> for that kernel log, maybe check cron for logs too.

I don't have any cronjobs that do anything with bcache, but I have
cronjobs that scan all drives and save all partitions plus other related
stuff.
That said, if I recall correctly, it died just when I restarted a copy
to that filesystem (but my memory of that event is getting hazy now).

I checked syslog around that time, though:
Mar  6 06:29:50 gargamel kernel: [204255.678548] bcache-register: page allocation failure: order:7, mode:0x24080c0
I found nothing there related to a cronjob, or to /dev/sdl disappearing and
coming back.

> Are there any intervening non-bcache lines indicating a disk was removed 
> (e.g., bad USB cable) and re-added?  

Good question, none that I can find.
 
> > > Do you have this patch?
> > > https://bitbucket.org/ewheelerinc/linux/commits/a7044848050ac60e178798d20ea8a3ef2be36bc7?at=master
> >  
> > I got the other patches you sent me last time, but didn't end up with
> > this one, sorry if you sent it to me and I dropped it.
> > I'll apply it now, thanks.
> 
> All of the patches related to troubleshooting with you are here:
> 
>   https://bitbucket.org/ewheelerinc/linux/branch/v4.5-rc6-bcache-fixes
> and here:
>   https://bitbucket.org/ewheelerinc/linux/branch/v4.5-rc7-bcache-fixes
> 
> so make sure all 3 are applied.  It might still OOM, but it shouldn't 
> crash if we got it all.

Thanks. I just checked that I have all those patches now.

I will report back if I get more interesting output :)

Thanks,
Marc
-- 
"A mouse is a device used to point at the xterm you want to type in" - A.S.R.
Microsoft is to operating systems ....
                                      .... what McDonalds is to gourmet cooking
Home page: http://marc.merlins.org/                         | PGP 1024R/763BE901