[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]

Re: System freeze / hang, ping responding, can connect but no service answers



I figured it probably wasn't a graphics problem as you said this machine is a 
"server". You might want to search google for other people having problems 
with your same motherboard. Have you tried that yet? It's worth a shot...

Turning off UDMA is a good idea. If you suspect hard-disk probs

What about NIS or NFS? Are you running either of those? I've got dailiy 
problems of NFS mounts going stale... if logins or shell commands are 
authenticated via NIS or actually run from a different system mounted via NFS 
systems will effectively hang. Usually it's AMD (the automount daemon) 
causing the problem. All I can do when this happens is telnet in as root (not 
very secure, I know) and reboot the machine. (That works because root is a 
local account... doesn't rely on NIS or the automounter).

On Sunday 23 February 2003 2:45 am, Stefan Neufeind wrote:
> On 23 Feb 2003 at 2:35, Steve Madonna wrote:
> > I've used maxtor disks for a long time with no problems at all... What
> > happens on a "hang"? Is it the graphics? (meaning... what hangs? since
> > you can still connect to a pop3 port, I'm assuming it's not the
> > network) We get systems that hang all the time for video driver
> > problems... RH 7.2 + nVidia drivers.
>
> Since it's a server I don't run or even have a graphical surface
> installed. Hang means: Console (Textmode) doesn't respond anymore,
> can ping to machine okay, can connect to services but don't get a
> respond. Some people said it might be a problem with the harrdisk-
> layer. Assuming this would for sure make clear why the running kernel
> can work properly but neither can a service spawn new children (ssh,
> ftp, ...) nor could syslog write to disk (and maybe cause everything
> to stop) nor could the swap-space be accessed.
>
> I don't know what to believe ... I just know I have to do something
> to find out the cause. Waking up at night because of a bad dream that
> the server might be locked up again can't be okay :-(
>
> I'm running latest stable RedHat-kernel as well - so I assume there
> are no problems with the ide-drivers from RedHat ... otherwise I
> guess they would have released a fix-pack.
>
> Does anybody know about problems with harddisks?
>
> But remember: It's not said that it ARE the harddisks / the ide-
> layer. Might be something else. So I guess we need to keep open-
> minded.
>
>
> Yours sincerely,
>  Stefan
>
> > On Sunday 23 February 2003 12:15 am, Stefan Neufeind wrote:
> > > On 23 Feb 2003 at 9:01, Bart SCHELSTRAETE wrote:
> > > > >I got a very weird problem with a RedHat-server. Occasionally
> > > > >(can't even say that its under "high load" or something like
> > > > >that) the system freezes / hangs. It's possible to ping the
> > > > >machine (responds normal) and you can even connect to ports like
> > > > >pop3 (110) which were open before. The connection is established
> > > > >but no service responds. The same for mail in general, ssh, http
> > > > >etc. Nothing! Even console doesn't work anymore until reboot.
> > > > >
> > > > >The PC consists of a ASUS-P4B533-V-board with 2 Maxtor-HDDs and a
> > > > >cd- drive. Machine is running regular redhat 8.0 (kernel etc.),
> > > > >with all RHN-updates applied, running latest apache 1.3.xx ...
> > > > >
> > > > >It's a server in a serverhousing-farm and it's there for
> > > > >production use. So I can't simply take it off for a few days to
> > > > >play around with
> > > > > it.
> > > > >
> > > > >Had the same problems with another server, different board,
> > > > >running RedHat 7.3 with all updates applied. After looking for
> > > > >every possible
> > > > > problem-solution I thought maybe it's a hardware defect. So I
> > > > >changed from Athlon-cpu to a Intel P4 with different board, also
> > > > >installed RedHat 8.0 (instead of 7.3) new from scratch. But
> > > > >problem still exists.
> > > > >
> > > > >Only common thing: Both computers using same Maxtor-harddisks.
> > > > >But could this be the problem? Unfortunately (as described above)
> > > > >I can't
> > > > > simply take the computer down, change harddisks etc. ... not
> > > > >possible.
> > > > >
> > > > >Did anybody experience such problem?
> > > >
> > > > Hello,
> > > >
> > > > It's difficult to tell you what the problem is.
> > > > Is it possible to send me your /var/log/messages file?
> > >
> > > There is nothing suspicious in the logfile, have looked it over
> > > completely many times. Even around the time of crash there is no
> > > entry for anything. Normally when the system hung I didn't even
> > > receive an error on the console so I doubt if the "Spuriour
> > > interrupt IRQ 7" I got yesterday on the console really belongs to
> > > the problem.
> > >
> > > On the web and mailinglists I heard that some people believe the
> > > problem might have to do with a harddisk-lockup. This would, of
> > > course, justify why ping works and I also can connect but as soon as
> > > let's say a new ssh-session should be opened (or the console should
> > > be used) the PC stops working. But how to tell?!?
> > >
> > > Are any harddisk-driver problems known? Both PCs were using
> > > normal/defensive timings - so I guess this should be the problem.
> > > And since both PCs had different mainboards I doubt that it has got
> > > to do with a specific IDE-controller. How about the harrdisks
> > > (Maxtor)? Have ONLY heard good things about those ...
> > >
> > > Any ideas / solutions? Or at least an idea how to narrow the
> > > problems / monitor?
> > >
> > > > Also enable the SysRQ function, so you can see if that still
> > > > works.
> > >
> > > ? What do you mean? How? What results do you expect?





[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]