Re: System freeze / hang, ping responding, can connect but no service answers

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On 23 Feb 2003 at 9:01, Bart SCHELSTRAETE wrote:

> >I got a very weird problem with a RedHat-server. Occasionally (can't
> >even say that its under "high load" or something like that) the
> >system freezes / hangs. It's possible to ping the machine (responds
> >normal) and you can even connect to ports like pop3 (110) which were
> >open before. The connection is established but no service responds.
> >The same for mail in general, ssh, http etc. Nothing! Even console
> >doesn't work anymore until reboot.
> >
> >The PC consists of a ASUS-P4B533-V-board with 2 Maxtor-HDDs and a cd-
> >drive. Machine is running regular redhat 8.0 (kernel etc.), with all
> >RHN-updates applied, running latest apache 1.3.xx ...
> >
> >It's a server in a serverhousing-farm and it's there for production
> >use. So I can't simply take it off for a few days to play around with
> > it.
> >
> >Had the same problems with another server, different board, running
> >RedHat 7.3 with all updates applied. After looking for every possible
> > problem-solution I thought maybe it's a hardware defect. So I
> >changed from Athlon-cpu to a Intel P4 with different board, also
> >installed RedHat 8.0 (instead of 7.3) new from scratch. But problem
> >still exists.
> >
> >Only common thing: Both computers using same Maxtor-harddisks. But
> >could this be the problem? Unfortunately (as described above) I can't
> > simply take the computer down, change harddisks etc. ... not
> >possible.
> >
> >Did anybody experience such problem?
> >  
> >
> Hello,
> 
> It's difficult to tell you what the problem is.
> Is it possible to send me your /var/log/messages file?

There is nothing suspicious in the logfile, have looked it over 
completely many times. Even around the time of crash there is no 
entry for anything. Normally when the system hung I didn't even 
receive an error on the console so I doubt if the "Spuriour interrupt 
IRQ 7" I got yesterday on the console really belongs to the problem.

On the web and mailinglists I heard that some people believe the 
problem might have to do with a harddisk-lockup. This would, of 
course, justify why ping works and I also can connect but as soon as 
let's say a new ssh-session should be opened (or the console should 
be used) the PC stops working. But how to tell?!?

Are any harddisk-driver problems known? Both PCs were using 
normal/defensive timings - so I guess this should be the problem. And 
since both PCs had different mainboards I doubt that it has got to do 
with a specific IDE-controller. How about the harrdisks (Maxtor)? 
Have ONLY heard good things about those ...

Any ideas / solutions? Or at least an idea how to narrow the problems 
/ monitor?

> Also enable the SysRQ function, so you can see if that still works.

? What do you mean? How? What results do you expect?


Yours sincerely,
 Stefan Neufeind



-- 
Psyche-list mailing list
Psyche-list@redhat.com
https://listman.redhat.com/mailman/listinfo/psyche-list

[Index of Archives]     [Fedora General Discussion]     [Red Hat General Discussion]     [Centos]     [Kernel]     [Red Hat Install]     [Red Hat Watch]     [Red Hat Development]     [Red Hat 9]     [Gimp]     [Yosemite News]

  Powered by Linux