Re: Looking for job which is causing a large work load

On Feb 16, 2010, at 12:36 PM, Stainforth, Matthew (SD/DS) wrote:

It does look like you're being bitten by that bug.

You can kill off the cat processes easily enough with something like:

ps auwwx |grep "cat /sys/hypervisor/uuid" | awk '{ print $2 }' | xargs kill -9

I tried to use killall on the cat processes. Then I tried to kill just one of the cat processes. This did not work; the "cat came back."

I did restart the xend process. The cat programs disappeared. The load came down to less than 1. The used memory cleared out. And sendmail is working again.
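
In the ps auwwx listing quoted further down, the cat processes are in state D (uninterruptible sleep). Signals, including SIGKILL, are not acted on until a process leaves that state, which would be consistent with kill -9 appearing to do nothing until xend was restarted. A minimal sketch for counting the stuck processes by state follows; the function name count_by_state is hypothetical, and it expects "STAT ARGS" lines such as those printed by ps -eo stat=,args=. It matches only command lines that begin with the pattern, so the awk process doing the counting does not count itself.

```shell
# Count processes whose command line starts with a given string,
# grouped by the first letter of their state (D, S, Z, ...).
# Input: lines of "STAT ARGS..." on stdin.
count_by_state() {
    pattern=$1
    awk -v pat="$pattern" '
        {
            stat = $1
            # Strip the state column so $0 holds only the command line.
            sub(/^[ \t]*[^ \t]+[ \t]+/, "")
            if (index($0, pat) == 1)
                states[substr(stat, 1, 1)]++
        }
        END { for (s in states) print s, states[s] }
    ' | sort
}

# Usage on a live system (not run here):
#   ps -eo stat=,args= | count_by_state "cat /sys/hypervisor/uuid"
```

A large count under D rather than S suggests the processes are blocked inside the kernel and will not go away until whatever they are waiting on responds.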

I am not the main administrator on this system, so I will have to wait to find out whether mcelog and xend are needed on it.

Thanks for your help.

The cron.hourly script is calling mcelog.cron which is supplied by the mcelog package. If you don't need that package then either remove it (yum remove mcelog) or get rid of the script from cron.hourly (or make it non-executable).
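
As a sketch of the "make it non-executable" option: as far as I know, run-parts on Red Hat systems only runs files that have the execute bit set, so clearing that bit should keep the hourly job from starting without removing the package. The function name disable_cron_script is hypothetical, and the change needs root.

```shell
# Clear the execute bit on a cron script so run-parts skips it.
# The file itself is left in place and can be re-enabled with chmod +x.
disable_cron_script() {
    script=$1
    if [ ! -f "$script" ]; then
        echo "no such file: $script" >&2
        return 1
    fi
    chmod a-x "$script"
}

# Usage (as root, not run here):
#   disable_cron_script /etc/cron.hourly/mcelog.cron
```

Note that a later update of the mcelog package may restore the original permissions, so removing the package is the more durable choice if it really is not needed.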

You haven't mentioned that this server is acting as a Xen hypervisor, so getting rid of the Xen kernel if it's not needed might be a good idea, but that would definitely require a reboot.

-----Original Message-----
From: redhat-list-bounces@xxxxxxxxxx [mailto:redhat-list-bounces@xxxxxxxxxx ] On Behalf Of Margaret Doll
Sent: Tuesday, February 16, 2010 12:49 PM
To: General Red Hat Linux discussion list
Subject: Looking for job which is causing a large work load

I found the zombie to be

F S   UID   PID  PPID  C PRI  NI ADDR SZ WCHAN  TTY          TIME CMD
4 Z    99  5334  5209  0  75   0 -     0 exit   ?        00:00:00 dnsmasq <defunct>

which was started by

root      5209     1  0  2009 ?        00:00:00 libvirtd --daemon
nobody    5334  5209  0  2009 ?        00:00:00 [dnsmasq] <defunct>
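
A zombie like this has already exited and is only waiting for its parent (here libvirtd, PID 5209) to collect its exit status, so it uses no memory and cannot be killed directly. A small sketch for listing zombies together with their parents follows; the function name list_zombies is hypothetical, and it expects "PID PPID STAT COMMAND" lines such as those from ps -eo pid=,ppid=,stat=,comm=.

```shell
# Print every process whose state starts with Z, with its parent PID.
# Input: lines of "PID PPID STAT COMMAND" on stdin.
list_zombies() {
    awk '$3 ~ /^Z/ { print "zombie pid=" $1 " parent=" $2 " cmd=" $4 }'
}

# Usage on a live system (not run here):
#   ps -eo pid=,ppid=,stat=,comm= | list_zombies
```

A single zombie of this kind is harmless and is unlikely to explain a high load on its own.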

I also found 622 cat programs in a sleeping state

root 2213 2211 0 Feb02 ? 00:00:00 cat /sys/hypervisor/uuid
root 2244 2242 0 Feb11 ? 00:00:00 cat /sys/hypervisor/uuid
root 2297 2295 0 Feb14 ? 00:00:00 cat /sys/hypervisor/uuid
root 2314 2312 0 Feb07 ? 00:00:00 cat /sys/hypervisor/uuid
root 2350 2348 0 Jan29 ? 00:00:00 cat /sys/hypervisor/uuid


This is described in a bug report on

https://bugzilla.redhat.com/show_bug.cgi?id=525386

The history of the bug report shows that the problem was
resolved, but I have not been able to find the resolution.

free
             total       used       free     shared    buffers     cached
Mem:      16099528   15190480     909048          0     491696   11839316
-/+ buffers/cache:    2859468   13240060
Swap:      6127608     105888    6021720
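
For reading this output: most of the "used" figure on the Mem: line is buffers and page cache, which the kernel can reclaim. The "-/+ buffers/cache" line is derived from the Mem: line, and the arithmetic can be checked with the numbers above (values in kB, variable names chosen only for this illustration):

```shell
# Figures copied from the Mem: line of the free output above (kB).
used=15190480
free=909048
buffers=491696
cached=11839316

# Memory actually held by applications, and memory available to them.
app_used=$((used - buffers - cached))
app_free=$((free + buffers + cached))

echo "used by applications:      $app_used kB"
echo "available to applications: $app_free kB"
```

This gives 2859468 kB and 13240060 kB, matching the "-/+ buffers/cache" line, so roughly 2.7 GB of the 16 GB was in use by processes at that point.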


ps auwwx contains a lot of

???   print progname ":\n"?????   progname="";????       }???? { print; }
root       677  0.0  0.0   3796   400 ?        D    Feb02   0:00 cat /sys/hypervisor/uuid
root       698  0.0  0.0 106652  1608 ?        S    Feb10   0:00 crond
root       699  0.0  0.0   8700  1036 ?        Ss   Feb10   0:00 /bin/bash /usr/bin/run-parts /etc/cron.hourly
root       700  0.0  0.0   8700   972 ?        S    Feb10   0:00 /bin/bash /etc/cron.hourly/mcelog.cron
root       701  0.0  0.0   8768   668 ?        S    Feb10   0:00 awk -v progname=/etc/cron.hourly/mcelog.cron progname {?????   print progname ":\n"?????   progname="";????       }????       { print; }
root       702  0.0  0.0   3796   400 ?        D    Feb10   0:00 cat /sys/hypervisor/uuid
root       773  0.0  0.0 106652  1608 ?        S    Feb06   0:00 crond
root       774  0.0  0.0   8700  1040 ?        Ss   Feb06   0:00 /bin/bash /usr/bin/run-parts /etc/cron.hourly
root       775  0.0  0.0   8700   964 ?        S    Feb06   0:00 /bin/bash /etc/cron.hourly/mcelog.cron
root       776  0.0  0.0   8768   672 ?        S    Feb06   0:00 awk -v progname=/etc/cron.hourly/mcelog.cron progname {?????   print progname ":\n"?????   progname="";????       }????       { print; }
root       777  0.0  0.0   3796   400 ?        D    Feb06   0:00 cat /sys/hypervisor/uuid
root       803  0.0  0.0 106652  1608 ?        S    Feb02   0:00 crond
root       804  0.0  0.0   8700  1036 ?        Ss   Feb02   0:00 /bin/bash /usr/bin/run-parts /etc/cron.hourly
root       805  0.0  0.0   8700   972 ?        S    Feb02   0:00 /bin/bash /etc/cron.hourly/mcelog.cron
root       806  0.0  0.0   8768   668 ?        S    Feb02   0:00 awk -v progname=/etc/cron.hourly/mcelog.cron progname {?????   print progname ":\n"?????   progname="";????       }????       { print; }
root       807  0.0  0.0   3796   400 ?        D    Feb02   0:00 cat /sys/hypervisor/uuid
root       837  0.0  0.0 106652  1608 ?        S    Feb10   0:00 crond
root       838  0.0  0.0   8700  1040 ?        Ss   Feb10   0:00 /bin/bash /usr/bin/run-parts /etc/cron.hourly
root       839  0.0  0.0   8700   968 ?        S    Feb10   0:00 /bin/bash /etc/cron.hourly/mcelog.cron

--
redhat-list mailing list
unsubscribe mailto:redhat-list-request@xxxxxxxxxx?subject=unsubscribe
https://www.redhat.com/mailman/listinfo/redhat-list
