Re: freezing system for several second on high I/O [kernel 4.15]

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Fri, Feb 16, 2018 at 12:02:28AM +0500, Mikhail Gavrilov wrote:
> On 15 February 2018 at 10:44, Dave Chinner <david@xxxxxxxxxxxxx> wrote:
> > I've already explained that we can't annotate these memory
> > allocations to turn off the false positives because that will also
> > turning off all detection of real deadlock conditions.  Lockdep has
> > many, many limitations, and this happens to be one of them.
> >
> > FWIW, is there any specific reason you running lockdep on your
> > desktop system?
> 
> Because I wanna make open source better (help fixing all freezing)

lockdep isn't a user tool - most developers don't even understand
what it tries to tell them. Worse, it is likely contributing to your
problems as it has a significant runtime CPU and memory overhead....

> > I think I've already explained that, too. The graphics subsystem -
> > which is responsible for updating the cursor - requires memory
> > allocation. The machine is running low on memory, so it runs memory
> > reclaim, which recurses back into the filesystem and blocks waiting
> > for IO to be completed (either writing dirty data pages or flushing
> > dirty metadata) so it can free memory.
> 
> Which means machine is running low on memory?
> How many memory needed?
> 
> $ free -h
>               total        used        free      shared  buff/cache   available
> Mem:            30G         17G        2,1G        1,4G         10G         12G
> Swap:           59G          0B         59G
> 
> As can we see machine have 12G available memory. Is this means low memory?

No, you only have 2.1G free memory. You have 10GB of *reclaimable
memory* in the buffer/page cache, and that gives you 12GB of
"available memory". Memory reclaim happens all the time in a normal
system - it does not mean you are running low on memory, it just
means your system is busy.

And, FWIW, we know you have memory pressure because the lockdep
reports you are pasting are a result of memory reclaim operating.

> > IOWs, your problems all stem from long IO latencies caused by the
> > overloaded storage subsystem - they are propagate to all
> > aspects of the OS via direct memory reclaim blocking on IO....
> 
> I'm surprised that no QOS analog for disk I/O.

There is, but it's not like a network where overload situations are
mitigated by dropping packets to reduce load. We cannot do that with
IO (dropped IO == broken filesystem), so QoS doesn't help when you
drive the storage subsystem in extreme, long term overload
conditions as you seem to be doing.

> This is reminiscent of the situation in past where a torrent client
> clogs the entire channel on the cheap router and it causes problems
> with opening web pages. In nowadays it never happens with modern
> routers even with overloaded network channel are possible video calls

Storage != network.

> In 2018 my personaly expectation that user can run any set of
> applications on computer and this never shoudn't harm system.

There's no "harm" occurring on your system - it's just slow
because the load you've put on it means no task can execute quickly.

Cheers,

Dave.
-- 
Dave Chinner
david@xxxxxxxxxxxxx
--
To unsubscribe from this list: send the line "unsubscribe linux-xfs" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html



[Index of Archives]     [XFS Filesystem Development (older mail)]     [Linux Filesystem Development]     [Linux Audio Users]     [Yosemite Trails]     [Linux Kernel]     [Linux RAID]     [Linux SCSI]


  Powered by Linux