scsi tape problem - goes and won't come back without reboot
Danny Yee
danny at anatomy.usyd.edu.au
Fri Aug 5 06:48:40 UTC 2005
I have a server which every so often "loses" its SCSI tape drive.
Dumps stop working a command like "mt rewind" returns an error
"/dev/tape: No such file or directory".
The message log shows
Aug 3 04:56:40 mail kernel: scsi1:0:5:0: Attempting to abort cmd c1bda980: 0x0 0x0 0x0 0x0 0x0 0x0
Aug 3 04:56:40 mail kernel: scsi1:0:5:0: Command found on device queue
Aug 3 04:56:40 mail kernel: scsi: Device offlined - not ready after error recovery: host 1 channel 0 id 5 lun 0
Aug 3 04:56:40 mail kernel: st0: Error 20000 (sugg. bt 0x0, driver bt 0x0, host bt 0x2).
If I unload and reload the st module, the message log shows
Aug 5 16:41:19 mail kernel: st: Version 20041025, fixed bufsize 32768, s/g segs 256
Aug 5 16:41:19 mail kernel: Attached scsi tape st0 at scsi1, channel 0, id 5, lun 0
Aug 5 16:41:19 mail kernel: st0: try direct i/o: yes (alignment 512 B), max page reachable by HBA 1048575
Which looks like it's finding the drive fine. But attempts to use
/dev/tape (or /dev/st0) still fail.
I have no other SCSI devices (my disks are on an i2o RAID card), but
I can't remove the aic79xx module for the SCSI bus the tape is on --
it reports "ERROR: Module aic79xx is in use" and I can't work out
what's using it.
I can fix the problem by rebooting, but it would be nicer if I didn't
have to.
Any suggestions?
Danny.
Network Administrator
School of Medical Sciences
University of Sydney
More information about the fedora-list
mailing list