On 23 Feb 2003 at 2:35, Steve Madonna wrote: > I've used maxtor disks for a long time with no problems at all... What > happens on a "hang"? Is it the graphics? (meaning... what hangs? since > you can still connect to a pop3 port, I'm assuming it's not the > network) We get systems that hang all the time for video driver > problems... RH 7.2 + nVidia drivers. Since it's a server I don't run or even have a graphical surface installed. Hang means: Console (Textmode) doesn't respond anymore, can ping to machine okay, can connect to services but don't get a respond. Some people said it might be a problem with the harrdisk- layer. Assuming this would for sure make clear why the running kernel can work properly but neither can a service spawn new children (ssh, ftp, ...) nor could syslog write to disk (and maybe cause everything to stop) nor could the swap-space be accessed. I don't know what to believe ... I just know I have to do something to find out the cause. Waking up at night because of a bad dream that the server might be locked up again can't be okay :-( I'm running latest stable RedHat-kernel as well - so I assume there are no problems with the ide-drivers from RedHat ... otherwise I guess they would have released a fix-pack. Does anybody know about problems with harddisks? But remember: It's not said that it ARE the harddisks / the ide- layer. Might be something else. So I guess we need to keep open- minded. Yours sincerely, Stefan > On Sunday 23 February 2003 12:15 am, Stefan Neufeind wrote: > > On 23 Feb 2003 at 9:01, Bart SCHELSTRAETE wrote: > > > >I got a very weird problem with a RedHat-server. Occasionally > > > >(can't even say that its under "high load" or something like > > > >that) the system freezes / hangs. It's possible to ping the > > > >machine (responds normal) and you can even connect to ports like > > > >pop3 (110) which were open before. The connection is established > > > >but no service responds. The same for mail in general, ssh, http > > > >etc. Nothing! Even console doesn't work anymore until reboot. > > > > > > > >The PC consists of a ASUS-P4B533-V-board with 2 Maxtor-HDDs and a > > > >cd- drive. Machine is running regular redhat 8.0 (kernel etc.), > > > >with all RHN-updates applied, running latest apache 1.3.xx ... > > > > > > > >It's a server in a serverhousing-farm and it's there for > > > >production use. So I can't simply take it off for a few days to > > > >play around with > > > > it. > > > > > > > >Had the same problems with another server, different board, > > > >running RedHat 7.3 with all updates applied. After looking for > > > >every possible > > > > problem-solution I thought maybe it's a hardware defect. So I > > > >changed from Athlon-cpu to a Intel P4 with different board, also > > > >installed RedHat 8.0 (instead of 7.3) new from scratch. But > > > >problem still exists. > > > > > > > >Only common thing: Both computers using same Maxtor-harddisks. > > > >But could this be the problem? Unfortunately (as described above) > > > >I can't > > > > simply take the computer down, change harddisks etc. ... not > > > >possible. > > > > > > > >Did anybody experience such problem? > > > > > > Hello, > > > > > > It's difficult to tell you what the problem is. > > > Is it possible to send me your /var/log/messages file? > > > > There is nothing suspicious in the logfile, have looked it over > > completely many times. Even around the time of crash there is no > > entry for anything. Normally when the system hung I didn't even > > receive an error on the console so I doubt if the "Spuriour > > interrupt IRQ 7" I got yesterday on the console really belongs to > > the problem. > > > > On the web and mailinglists I heard that some people believe the > > problem might have to do with a harddisk-lockup. This would, of > > course, justify why ping works and I also can connect but as soon as > > let's say a new ssh-session should be opened (or the console should > > be used) the PC stops working. But how to tell?!? > > > > Are any harddisk-driver problems known? Both PCs were using > > normal/defensive timings - so I guess this should be the problem. > > And since both PCs had different mainboards I doubt that it has got > > to do with a specific IDE-controller. How about the harrdisks > > (Maxtor)? Have ONLY heard good things about those ... > > > > Any ideas / solutions? Or at least an idea how to narrow the > > problems / monitor? > > > > > Also enable the SysRQ function, so you can see if that still > > > works. > > > > ? What do you mean? How? What results do you expect? -- Psyche-list mailing list Psyche-list@redhat.com https://listman.redhat.com/mailman/listinfo/psyche-list