[NCLUG] st driver problems

Michael Dwyer mdwyer at sixthdimension.com
Mon Dec 17 11:15:00 MST 2001


On a more mundane note, make sure you are talking to the right device.
mt and mtx talk to the SCSI Generic device (/dev/sg0), while tar talks
to the SCSI Tape device (/dev/st0).  Bad things(tm) happen if you send
mt commands to the st device (You overwrite your data!) and likely
unpredictable things happen if you send data to the sg device (your
changer tries to store your data using interpretive dance).

R P Herrold wrote:
> 
> Please trim clearly inapplicable cruft.  I've trimmed much and
> spaced it out to aid review below
> 
> There are two aix7xxx drivers -- and you may wish to manually
> use the other as a test matter.  It look as though you are
> booting from the IDE device, and so you may insert and remove
> those modules at will.
> 
> The shared interrupt 11 is troubling -- if possible make sure
> the SCSI controller is on its own IRQ.  (cat /proc/interrupts)
> -- the NIC vendor has given me problems in the past, and if it
> is a card (rather than onboard), I'd strip it out during
> testing.  ditto the soundcard.
> 
> disk swap is below the 2x memory recommendation, and a
> swapfile is probably in use from the way it loaded up.
> 
> dledford at redhat.com at Red Hat has worked this area heavily.
> See also: http://people.FreeBSD.org/~gibbs/linux/ and
> gibbs at FreeBSD.org who works at Adaptec -- together they are
> the most knowledgable pair on this topic.  A diagnostic FAQ is
> off a link at:  http://opensource.adaptec.com but it is a bit
> slender.
> 
> I'll file a bugzilla against the kernel component if you wish,
> or you can do so if you know the drill.  If the latter, please
> advise me of its number off list, and I'll see what I can do,
> as well.  http://bugzilla.redhat.com/
> 
> Append:  lspci -v -v   and   lsmod    output as well as a copy
> of this email
> 
> -- Russ Herrold
> 
> On Sun, 16 Dec 2001, David W. Graham   dgraham at riverrock.org
> wrote:
> 
> > I looked and /var/log/messages and there were several SCSI errors.  If
> > anyone has any ideas, I would like to here them.
> 
> > Dec 15 15:45:03 localhost kernel: Linux version 2.4.9-13
> > (bhcompile at stripples.devel.redhat.com) (gcc version 2.96 20000731 (Red Hat
> > Linux 7.1 2.96-98)) #1 Tue Oct 30 20:11:04 EST 2001
> 
> > Dec 15 15:45:04 localhost kernel: hda: QUANTUM FIREBALL CR4.3A, ATA DISK
> > drive
> 
> > Dec 15 15:45:04 localhost kernel:  hda: hda1 hda2 hda3 hda4 < hda5 hda6 hda7
> > >
> 
> > Dec 15 15:45:07 localhost kernel: PCI: Sharing IRQ 11 with 00:07.2
> 
> > Dec 15 15:45:08 localhost kernel: (scsi0) <Adaptec AHA-2944 Ultra SCSI host
> > adapter> found at PCI 0/18/0
> 
> > Dec 15 15:45:08 localhost kernel: (scsi0) Wide Channel, SCSI ID=7, 16/255
> > SCBs
> 
> > Dec 15 15:45:08 localhost kernel: (scsi0) Cables present (Int-50 NO, Int-68
> > NO, Ext-68 YES)
> 
> > Dec 15 15:45:08 localhost kernel: (scsi0) Downloading sequencer code... 436
> > instructions downloaded
> 
> > Dec 15 15:45:08 localhost kernel: scsi0 : Adaptec AHA274x/284x/294x
> > (EISA/VLB/PCI-Fast SCSI) 5.2.4/5.2.0
> 
> > Dec 15 15:44:45 localhost fsck: /usr: clean, 47341/299744 files,
> > 187975/599130 blocks
> 
> > Dec 15 15:45:08 localhost kernel:        <Adaptec AHA-2944 Ultra SCSI host
> > adapter>
> 
> > Dec 15 15:45:08 localhost kernel:   Vendor: ADIC      Model: FastStor DLT
> > Rev: 0110
> 
> > Dec 15 15:45:08 localhost kernel:   Type:   Medium Changer
> > ANSI SCSI revision: 02
> 
> > Dec 15 15:45:08 localhost kernel:   Vendor: QUANTUM   Model: DLT7000
> > Rev: 172D
> 
> > Dec 15 15:45:09 localhost kernel:   Type:   Sequential-Access
> > ANSI SCSI revision: 02
> 
> > Dec 15 15:45:09 localhost kernel: Adding Swap: 43000k swap-space
> > (priority -2)
> 
> > Dec 15 15:45:10 localhost kernel: st: Version 20010812, bufsize 32768, wrt
> > 30720, max init. bufs 4, s/g segs 16
> 
> > Dec 15 15:45:10 localhost kernel: Attached scsi tape st0 at scsi0, channel
> > 0, id 5, lun 0
> 
> > Dec 15 15:45:10 localhost kernel: pcnet32_probe_pci: found device
> > 0x001022.0x002000
> 
> > Dec 15 15:45:10 localhost kernel: PCI: Found IRQ 11 for device 00:11.0
> 
> > Dec 15 15:45:10 localhost kernel: PCI: Sharing IRQ 11 with 00:07.2
> 
> > Dec 15 15:45:10 localhost kernel: PCI: Sharing IRQ 11 with 00:12.0
> 
> > Dec 15 15:45:10 localhost kernel: pcnet32: pcnet32_private lp=c6b01000
> > lp_dma_addr=0x6b01000 assigned IRQ 11.
> 
> > Dec 15 15:46:06 localhost kernel: st0: Error 26030000 (sugg. bt 0x20, driver
> > bt 0x6, host bt 0x3).
> 
> > Dec 15 15:46:12 localhost last message repeated 2 times
> 
> > Dec 15 15:46:25 localhost kernel: (scsi0:0:5:0) Synchronous at 20.0
> > Mbyte/sec, offset 8.
> 
> > Dec 15 15:46:27 localhost kernel: (scsi0:0:5:-1) Unexpected busfree,
> > LASTPHASE = 0x0, SEQADDR = 0x58
> 
> > Dec 15 15:46:30 localhost kernel: (scsi0:0:5:-1) Unexpected busfree,
> > LASTPHASE = 0x0, SEQADDR = 0x57
> 
> > Dec 15 15:46:31 localhost kernel: st0: Block limits 19539 - 5062912 bytes.
> 
> > Dec 15 15:46:33 localhost kernel: (scsi0:0:5:-1) Unexpected busfree,
> > LASTPHASE = 0x0, SEQADDR = 0x59
> 
> > Dec 15 15:47:08 localhost last message repeated 4 times
> 
> > Dec 15 15:48:11 localhost last message repeated 4 times
> 
> > Dec 15 15:48:14 localhost kernel: (scsi0:0:5:-1) Unexpected busfree,
> > LASTPHASE = 0x0, SEQADDR = 0x59
> 
> > Dec 15 15:48:32 localhost kernel: (scsi0:0:5:0) Data overrun detected in
> > Data-Out phase, tag 0;
> 
> > Dec 15 15:48:32 localhost kernel:   Have seen Data Phase. Length=65536,
> > NumSGs=9.
> 
> > Dec 15 15:48:32 localhost kernel:   Raw SCSI Command: 0x0a 00 01 00 00 00
> 
> > Dec 15 15:48:32 localhost kernel: st0: Error 27070000 (sugg. bt 0x20, driver
> > bt 0x7, host bt 0x7).
> 
> > Dec 15 15:51:56 localhost kernel: (scsi0:0:5:-1) Unexpected busfree,
> > LASTPHASE = 0x0, SEQADDR = 0x58
> 
> > Dec 15 15:52:19 localhost last message repeated 2 times
> 
> > Dec 15 15:52:22 localhost kernel: (scsi0:0:5:-1) Unexpected busfree,
> > LASTPHASE = 0x0, SEQADDR = 0x59
> 
> > Dec 15 15:52:23 localhost kernel: (scsi0:0:5:0) Data overrun detected in
> > Data-Out phase, tag 0;
> 
> > Dec 15 15:52:23 localhost kernel:   Have seen Data Phase. Length=65536,
> > NumSGs=9.
> 
> > Dec 15 15:52:23 localhost kernel:   Raw SCSI Command: 0x0a 00 01 00 00 00
> 
> > Dec 15 15:52:23 localhost kernel: st0: Error 27070000 (sugg. bt 0x20, driver
> > bt 0x7, host bt 0x7).
> 
> _______________________________________________
> NCLUG mailing list
> NCLUG at nclug.org
> http://www.nclug.org/mailman/listinfo/nclug



More information about the NCLUG mailing list