Re: scoreboard is full, not at MaxRequestWorkers

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Changing KeepAlive to Off does not help, the issue occurs at this
point where obviously due to MaxConnectionsPerChild a Process
should become killed.

As observed, the problem is that (in the example stated) both
PIDs 11454 and 16555 do not become killed because of these
remaining connections, which is ok, since these could be
remaining downloads or something to a client.
But then, Apache does neither fork new processes to accept
new connections nor does it kill the both PIDs (which is as just
statet, ok) - the scoreboard is full then in that case and new
connections can not be accepted.

As I see a solution to this problem is that the main httpd process
should be 1. able to fork new processes 2. I see no reason why
the scoreboard is full then, 1+2 connections is 3, not 2x64=128,
in that given case 125 free slots are wasted since they are blocked
by the 2 remaining processes which show connections which fill up
the scoreboard, those are shown as "G"s in the scoreboard for both
PIDs - that should not be the case.

Since 2.4.2 (the first I've used) I did not understand why a
finishing/killing process shows all "G"s - however, that issue did
not occur in 2.2 when I used MPM event in experimental stage, but
in 2.2 I did not have that problem, the connections were counted as
usual. Probably because lots of the code was taken from MPM worker,
where the calculations were still correct, until fully adopted to 2.4.

By this theory, also MPM worker might experience this, however, due to
the different design MPM worker also may never get to that point to
show this behavior and error messages in the log.

So my conclusion is also a question: why are finishing processes showing
all "G"s in the scoreboard, and/or why are these finishing threads all
counted in the scoreboard (here: 2x64 instead of 1+2, causing 125 threads
to be wasted).

I would expect this issue to become much more important when most people
upgrade to 2.4, also I clearly would say this is a bug.

On Thu, Sep 6, 2012 at 11:21 PM, Sam <lennsen@xxxxxxxxx> wrote:
Never, the server is not even under high load. There
are several thousand vhosts, it is about shared webhosting.

This is what server-status says:

PID    Connections     Threads    Async connections
total    accepting    busy    idle    writing    keep-alive    closing
21388    257    yes  44   20    0  171   44
11454    2      no    0    0    0    0    0
16555    1      no    0    0    0    0    0
Sum      260         44   20    0  171   44



How about those 171 in keep-alive? What are your KeepAlive and KeepAliveTimeout settings? Maybe try switching KeepAlive off? Whit only 192 threads and KeepAlive On you are imposing DoS on your self in case of long timeout.
 
That one PID 11454 has "2" connections, 16555 has "1", all other
values are 0, I guess it should become killed?  - but it does not,
it stays like this for a while (I don't know on what this "while" depends on).

At this point all 3 rows are filled with stuff, such as:

WWW_W_WWWWWW_WW_W_W_W_WW__WWW__WW__WWWWWW__WWW__W__W_RWWWWRWWWWW
GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG
GGGGGGGGGGGGGGGLGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG

But as you see, due to several _ this means that there are
not all 192 busy, in fact:

105 requests/sec - 2.5 MB/second - 24.8 kB/request
44 requests currently being processed, 20 idle workers

WHY can no others be used? With the statement above it means that
128 (which is 192-64) workers are unused because they should become
killed already, to spawn new ones, these are the ones in the 2. and
3. row with "G"s.

--

So the rest is occupied with "G". In Apache 2.2 I used MPM_event
as well, this obviously never occured because the both PIDs above
which are showing G only showed just dots in that case, now after
one PID should be finished, all slots become occupied with G, 
which obviously causes a full scoreboard.

I would consider this a bug, a serious one for my case, since it
often occurs (~1-3 times per hour) that the server stops accepting
new requests, just because of that full scoreboard, where the server
is neither busy, nor do I see what configuration directive should
be configured badly. Again:

KeepAlive                   On
ListenBacklog             4095
Timeout                     20
KeepAliveTimeout             8
MaxKeepAliveRequests     16192
MaxRanges                  200
MaxRangeOverlaps            20
MaxRangeReversals           20
GracefulShutdownTimeout      6
LimitInternalRecursion      10
LimitRequestFieldSize     4094
LimitRequestFields          40
LimitRequestLine          4094
LimitXMLRequestBody     786432

MaxRequestWorkers 192
ServerLimit         3
StartServers        2

MaxMemFree                2048
MinSpareThreads             64
MaxSpareThreads             64
ThreadLimit                128
ThreadsPerChild             64
MaxConnectionsPerChild   10240
AsyncRequestWorkerFactor    10

I assumed AsyncRequestWorkerFactor to have something to do 
with it, I raised it from 2 (default) to 10, no change.

By the way: after some time (in this case above both) PIDs
become killed finally, releasing 2x64 slots in the scoreboard,
the waring messages vanish and the server can respond again
well to requests. In the interest of finding a workaround until
this is fixed: what configuration directives control that timeout
or whatever, which cause these PIDs to become killed?


On Thu, Sep 6, 2012 at 7:06 PM, Sam <lennsen@xxxxxxxxx> wrote:

Yes, I still see such messages even after upgrading to 2.4.3


On Sun, Aug 19, 2012 at 8:25 AM, Sam <lennsen@xxxxxxxxx> wrote:
I upgraded to Apache 2.4 and use mod_event on a server
having average high load.

My issue is that I am getting pretty much often the error message
within the error log

AH00485: scoreboard is full, not at MaxRequestWorkers

What causes this? The server almost stops to handle new requests
when this occurs.

MaxRequestWorkers 192
ServerLimit         3
StartServers        2

MaxMemFree                2048
MinSpareThreads             64
MaxSpareThreads             64
ThreadLimit                 64
ThreadsPerChild             64
MaxConnectionsPerChild   10240
AsyncRequestWorkerFactor     2

So you never have more than 192 simultaneous requests at any given time ???





[Index of Archives]     [Open SSH Users]     [Linux ACPI]     [Linux Kernel]     [Linux Laptop]     [Kernel Newbies]     [Security]     [Netfilter]     [Bugtraq]     [Squid]     [Yosemite News]     [MIPS Linux]     [ARM Linux]     [Linux Security]     [Linux RAID]     [Samba]     [Video 4 Linux]     [Device Mapper]

  Powered by Linux