[Bugme-new] [Bug 9102] New: sata_promise: (HSM violation) exceptions in combination with network load

bugme-daemon at bugzilla.kernel.org bugme-daemon at bugzilla.kernel.org
Sun Sep 30 07:39:15 PDT 2007


http://bugzilla.kernel.org/show_bug.cgi?id=9102

           Summary: sata_promise: (HSM violation) exceptions in combination
                    with network load
           Product: Drivers
           Version: 2.5
     KernelVersion: 2.6.23-rc8
          Platform: All
        OS/Version: Linux
              Tree: Mainline
            Status: NEW
          Severity: normal
          Priority: P1
         Component: PCI
        AssignedTo: greg at kroah.com
        ReportedBy: se.witt at gmx.net


Most recent kernel where this bug did not occur:
Distribution: Gentoo
Hardware Environment: Opteron 175, VIA K8T800Pro Host Bridge, VIA VT8237 PCI
bridge [K8T800/K8T890 South], Promise PDC40718 (SATA 300 TX4) (rev 02), VIA
VT6102 [Rhine-II], Intel Corporation 82541PI Gigabit Ethernet Controller,
SAMSUNG HD501LJ
Software Environment:
Problem Description:

When the SATA controller is under high load in combination with network I/O,
the kernel log shows the following exceptions:
ata1.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2
ata1.00: port_status 0x20080000
ata1.00: cmd 25/00:58:bf:d1:83/00:00:1a:00:00/e0 tag 0 cdb 0x0 data 45056 in
         res 50/00:00:16:d2:83/00:00:1a:00:00/e0 Emask 0x2 (HSM violation)
ata1: soft resetting port
ata1: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
ata1.00: configured for UDMA/133
ata1: EH complete
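
For readers decoding the cmd and res lines in the log above, here is a minimal Python sketch. It assumes the libata error-handler field order (command/feature:nsect:lbal:lbam:lbah/hob_feature:hob_nsect:hob_lbal:hob_lbam:hob_lbah/device, with status and error in the first two positions of a res line) and 512-byte sectors. The function name decode_taskfile is illustrative only and is not part of any kernel tool.

```python
def decode_taskfile(fields: str) -> dict:
    """Decode a libata 'cmd' or 'res' taskfile string into named values.

    Assumed layout (hex bytes):
      command/feature:nsect:lbal:lbam:lbah/hob_feature:hob_nsect:hob_lbal:hob_lbam:hob_lbah/device
    For a 'res' line the first two bytes are status and error instead.
    """
    first, low, hob, device = fields.split("/")
    feature, nsect, lbal, lbam, lbah = (int(b, 16) for b in low.split(":"))
    _hob_feature, hob_nsect, hob_lbal, hob_lbam, hob_lbah = (
        int(b, 16) for b in hob.split(":")
    )
    # 48-bit LBA: the "hob" (high order byte) registers hold bits 24..47.
    lba = (
        (hob_lbah << 40) | (hob_lbam << 32) | (hob_lbal << 24)
        | (lbah << 16) | (lbam << 8) | lbal
    )
    return {
        "command_or_status": int(first, 16),
        "feature_or_error": feature,
        "sectors": (hob_nsect << 8) | nsect,
        "lba": lba,
        "device": int(device, 16),
    }


# The two taskfile strings from the report above.
cmd = decode_taskfile("25/00:58:bf:d1:83/00:00:1a:00:00/e0")
res = decode_taskfile("50/00:00:16:d2:83/00:00:1a:00:00/e0")

SECTOR_SIZE = 512  # assumption: 512-byte logical sectors
print(hex(cmd["command_or_status"]), cmd["sectors"] * SECTOR_SIZE, hex(cmd["lba"]))
print(hex(res["command_or_status"]), hex(res["feature_or_error"]), hex(res["lba"]))
```

Applied to the cmd line, this gives opcode 0x25 (READ DMA EXT), 88 sectors and a start LBA of 0x1a83d1bf; 88 x 512 = 45056 bytes, which matches "data 45056 in". The res line carries status 0x50 (DRDY and DSC set, ERR clear), error 0x00, and an LBA 87 sectors past the start, which is the last sector of the request.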

This also occurs on all the other ports (ata1-ata4, 4 HD501LJ disks). The message
rate is about 4 per minute when transferring at maximum speed (disk and network).

This only happens when there is (heavy) network traffic. Reading more than
300 GB locally does not trigger any HSM violation messages. Starting network
traffic (ping -f ...) immediately triggers the messages.

Interestingly, switching the network interface from the e1000 to the onboard
VIA Rhine does not change this behaviour.

/proc/interrupts:

          CPU0       CPU1       
  0:         86          1   IO-APIC-edge      timer
  1:          0          8   IO-APIC-edge      i8042
  8:          0          2   IO-APIC-edge      rtc
  9:          0          0   IO-APIC-fasteoi   acpi
 14:          2       2676   IO-APIC-edge      ide0
 16:      57697         45   IO-APIC-fasteoi   eth0
 17:          0          0   IO-APIC-fasteoi   eth1
 18:          0        363   IO-APIC-fasteoi   ide2, ide3
 20:          0      93476   IO-APIC-fasteoi   sata_promise
 21:          0          0   IO-APIC-fasteoi   ehci_hcd:usb1, uhci_hcd:usb2, uhci_hcd:usb3, uhci_hcd:usb4, uhci_hcd:usb5
NMI:          0          0 
LOC:     537738     537580 
ERR:          0
MIS:          0
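
To check whether the network and SATA controllers share an interrupt line, the table above can be parsed with a short Python sketch. It assumes the 2.6-era /proc/interrupts layout shown in this report (one count column per CPU, then the controller type, then a comma-separated device list); parse_interrupts and irqs_for are hypothetical helper names, and SAMPLE holds only a few rows copied from the table.

```python
SAMPLE = """
          CPU0       CPU1
 14:          2       2676   IO-APIC-edge      ide0
 16:      57697         45   IO-APIC-fasteoi   eth0
 17:          0          0   IO-APIC-fasteoi   eth1
 18:          0        363   IO-APIC-fasteoi   ide2, ide3
 20:          0      93476   IO-APIC-fasteoi   sata_promise
NMI:          0          0
"""


def parse_interrupts(text: str) -> dict:
    """Map each numbered IRQ to (per-CPU counts, controller type, device names)."""
    lines = [line for line in text.splitlines() if line.strip()]
    ncpus = len(lines[0].split())  # header row: CPU0 CPU1 ...
    irqs = {}
    for line in lines[1:]:
        label, _, rest = line.partition(":")
        label = label.strip()
        if not label.isdigit():
            continue  # skip summary rows such as NMI, LOC, ERR, MIS
        parts = rest.split()
        counts = [int(value) for value in parts[:ncpus]]
        chip = parts[ncpus]
        names = " ".join(parts[ncpus + 1:]).split(",")
        devices = [name.strip() for name in names if name.strip()]
        irqs[int(label)] = (counts, chip, devices)
    return irqs


def irqs_for(irqs: dict, device: str) -> list:
    """Return the IRQ numbers whose device list contains the given name."""
    return [irq for irq, (_, _, devices) in irqs.items() if device in devices]


table = parse_interrupts(SAMPLE)
print("eth0:", irqs_for(table, "eth0"))
print("sata_promise:", irqs_for(table, "sata_promise"))
```

On the rows from this report, eth0 is on IRQ 16 and sata_promise is on IRQ 20, so the two devices do not share an interrupt line. The counts also show eth0 serviced almost entirely by CPU0 and sata_promise entirely by CPU1.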

Tickless and cpufreq are disabled. Disabling SMP does not change the behaviour.
The next thing I will try is moving the SATA controller to a different PCI slot.

Steps to reproduce:

1. Generate heavy disk I/O
2. Wait a while to confirm that no messages occur
3. Generate network traffic on some interface
4. HSM violation messages occur
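
Steps 2 and 4 can be checked mechanically by counting HSM violation messages per port in a captured kernel log, once before and once after the network traffic starts. The Python sketch below assumes the log format shown in this report, where the "(HSM violation)" text appears on a continuation line that follows an ataN.MM line; count_hsm_violations is a hypothetical helper and SAMPLE_LOG is copied from the report.

```python
import re
from collections import Counter

# Matches "ata1:" as well as "ata1.00:" and captures the port name "ata1".
PORT_RE = re.compile(r"\b(ata\d+)(?:\.\d+)?:")

SAMPLE_LOG = """\
ata1.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2
ata1.00: port_status 0x20080000
ata1.00: cmd 25/00:58:bf:d1:83/00:00:1a:00:00/e0 tag 0 cdb 0x0 data 45056 in
         res 50/00:00:16:d2:83/00:00:1a:00:00/e0 Emask 0x2 (HSM violation)
ata1: soft resetting port
ata1: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
ata1.00: configured for UDMA/133
ata1: EH complete
"""


def count_hsm_violations(log_text: str) -> Counter:
    """Count '(HSM violation)' lines, attributed to the most recent ataN port seen."""
    counts = Counter()
    current_port = None
    for line in log_text.splitlines():
        match = PORT_RE.search(line)
        if match:
            current_port = match.group(1)
        if "(HSM violation)" in line and current_port is not None:
            counts[current_port] += 1
    return counts


violations = count_hsm_violations(SAMPLE_LOG)
print(dict(violations))
```

Running it on log text saved during step 2 should give an empty result, and running it on text saved during step 4 should show a non-zero count for each affected port.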



