On 23 Feb 2003 at 8:42, Steve Madonna wrote: > I figured it probably wasn't a graphics problem as you said this > machine is a "server". You might want to search google for other > people having problems with your same motherboard. Have you tried that > yet? It's worth a shot... Been there, done that :-)) There are several messages with similar problems but all with different backgrounds and other symptoms. Since I have 2 servers running RedHat (one with latest 7.3, one latest 8.0), both Maxtor-harddisks and nothing more in common I doubt it's a motherboard-problem. All fans etc. are okay because the secons server is brandnew. I guess it must have something to do with system (eventhough the kernel is "stable") and MAYBE the harddisks. > Turning off UDMA is a good idea. If you suspect hard-disk probs. I lowered UDMA-settings (udma5 -> udma4) and turned off 32-bit- transfers. In my opinion it would be a bad idea to switch directly down to pio-mode or similar because this would have a big impact on the system. But maybe a lower udma works (worked for some people on the internet with somehow similar problems). > What about NIS or NFS? Are you running either of those? I've got > dailiy problems of NFS mounts going stale... if logins or shell > commands are authenticated via NIS or actually run from a different > system mounted via NFS systems will effectively hang. Usually it's AMD > (the automount daemon) causing the problem. All I can do when this > happens is telnet in as root (not very secure, I know) and reboot the > machine. (That works because root is a local account... doesn't rely > on NIS or the automounter). No network filesystems in use currently. But it's again one thing that should NEVER happen and shouldn't be able to hang the system :- (( > On Sunday 23 February 2003 2:45 am, Stefan Neufeind wrote: > > On 23 Feb 2003 at 2:35, Steve Madonna wrote: > > > I've used maxtor disks for a long time with no problems at all... > > > What happens on a "hang"? Is it the graphics? (meaning... what > > > hangs? since you can still connect to a pop3 port, I'm assuming > > > it's not the network) We get systems that hang all the time for > > > video driver problems... RH 7.2 + nVidia drivers. > > > > Since it's a server I don't run or even have a graphical surface > > installed. Hang means: Console (Textmode) doesn't respond anymore, > > can ping to machine okay, can connect to services but don't get a > > respond. Some people said it might be a problem with the harrdisk- > > layer. Assuming this would for sure make clear why the running > > kernel can work properly but neither can a service spawn new > > children (ssh, ftp, ...) nor could syslog write to disk (and maybe > > cause everything to stop) nor could the swap-space be accessed. > > > > I don't know what to believe ... I just know I have to do something > > to find out the cause. Waking up at night because of a bad dream > > that the server might be locked up again can't be okay :-( > > > > I'm running latest stable RedHat-kernel as well - so I assume there > > are no problems with the ide-drivers from RedHat ... otherwise I > > guess they would have released a fix-pack. > > > > Does anybody know about problems with harddisks? > > > > But remember: It's not said that it ARE the harddisks / the ide- > > layer. Might be something else. So I guess we need to keep open- > > minded. > > > > > > Yours sincerely, > > Stefan > > > > > On Sunday 23 February 2003 12:15 am, Stefan Neufeind wrote: > > > > On 23 Feb 2003 at 9:01, Bart SCHELSTRAETE wrote: > > > > > >I got a very weird problem with a RedHat-server. Occasionally > > > > > >(can't even say that its under "high load" or something like > > > > > >that) the system freezes / hangs. It's possible to ping the > > > > > >machine (responds normal) and you can even connect to ports > > > > > >like pop3 (110) which were open before. The connection is > > > > > >established but no service responds. The same for mail in > > > > > >general, ssh, http etc. Nothing! Even console doesn't work > > > > > >anymore until reboot. > > > > > > > > > > > >The PC consists of a ASUS-P4B533-V-board with 2 Maxtor-HDDs > > > > > >and a cd- drive. Machine is running regular redhat 8.0 > > > > > >(kernel etc.), with all RHN-updates applied, running latest > > > > > >apache 1.3.xx ... > > > > > > > > > > > >It's a server in a serverhousing-farm and it's there for > > > > > >production use. So I can't simply take it off for a few days > > > > > >to play around with > > > > > > it. > > > > > > > > > > > >Had the same problems with another server, different board, > > > > > >running RedHat 7.3 with all updates applied. After looking > > > > > >for every possible > > > > > > problem-solution I thought maybe it's a hardware defect. So > > > > > > I > > > > > >changed from Athlon-cpu to a Intel P4 with different board, > > > > > >also installed RedHat 8.0 (instead of 7.3) new from scratch. > > > > > >But problem still exists. > > > > > > > > > > > >Only common thing: Both computers using same > > > > > >Maxtor-harddisks. But could this be the problem? > > > > > >Unfortunately (as described above) I can't > > > > > > simply take the computer down, change harddisks etc. ... not > > > > > >possible. > > > > > > > > > > > >Did anybody experience such problem? > > > > > > > > > > Hello, > > > > > > > > > > It's difficult to tell you what the problem is. > > > > > Is it possible to send me your /var/log/messages file? > > > > > > > > There is nothing suspicious in the logfile, have looked it over > > > > completely many times. Even around the time of crash there is no > > > > entry for anything. Normally when the system hung I didn't even > > > > receive an error on the console so I doubt if the "Spuriour > > > > interrupt IRQ 7" I got yesterday on the console really belongs > > > > to the problem. > > > > > > > > On the web and mailinglists I heard that some people believe the > > > > problem might have to do with a harddisk-lockup. This would, of > > > > course, justify why ping works and I also can connect but as > > > > soon as let's say a new ssh-session should be opened (or the > > > > console should be used) the PC stops working. But how to tell?!? > > > > > > > > Are any harddisk-driver problems known? Both PCs were using > > > > normal/defensive timings - so I guess this should be the > > > > problem. And since both PCs had different mainboards I doubt > > > > that it has got to do with a specific IDE-controller. How about > > > > the harrdisks (Maxtor)? Have ONLY heard good things about those > > > > ... > > > > > > > > Any ideas / solutions? Or at least an idea how to narrow the > > > > problems / monitor? > > > > > > > > > Also enable the SysRQ function, so you can see if that still > > > > > works. > > > > > > > > ? What do you mean? How? What results do you expect? -- Psyche-list mailing list Psyche-list@redhat.com https://listman.redhat.com/mailman/listinfo/psyche-list