Hello,
I have the same problem.
HW is M610 with RH EL 5.5 x86_64 updated (kernel 2.6.18-194.8.1).
I'm connected with an EQL PS6010XV through 10Gbs adapters on the blade
sw iSCSI works with and without multipath.
Same problems with iSCSI offload.
Tried also installing version 6.2.0.871-0.18.el5 without success.
Latest steps:
- disable iscsi/iscsid at startup
- stop iscsid
- update iscsi-initiator-utils
- shutdown -r
- clean /var/lib/iscsi from previous configs:
cd /var/lib/iscsi
rm -rf nodes/*
rm -rf send_targets/*
# ifconfig eth0
eth0 Link encap:Ethernet HWaddr 00:10:18:58:E8:F8
inet addr:10.10.100.178 Bcast:10.10.100.255 Mask:
255.255.255.0
inet6 addr: fe80::210:18ff:fe58:e8f8/64 Scope:Link
UP BROADCAST RUNNING MULTICAST MTU:9000 Metric:1
RX packets:1025 errors:0 dropped:0 overruns:0 frame:0
TX packets:50 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:1000
RX bytes:104130 (101.6 KiB) TX bytes:4700 (4.5 KiB)
Interrupt:82 Memory:dc800000-dcffffff
# ifconfig eth1
eth1 Link encap:Ethernet HWaddr 00:10:18:58:E8:FA
inet addr:10.10.100.179 Bcast:10.10.100.255 Mask:
255.255.255.0
inet6 addr: fe80::210:18ff:fe58:e8fa/64 Scope:Link
UP BROADCAST RUNNING MULTICAST MTU:9000 Metric:1
RX packets:1014 errors:0 dropped:0 overruns:0 frame:0
TX packets:9 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:1000
RX bytes:83638 (81.6 KiB) TX bytes:630 (630.0 b)
Interrupt:90 Memory:dd800000-ddffffff
# cat ifaces/bnx2i.00:10:18:58:e8:f9
# BEGIN RECORD 2.0-871
iface.iscsi_ifacename = bnx2i.00:10:18:58:e8:f9
iface.ipaddress = 10.10.100.198
iface.hwaddress = 00:10:18:58:e8:f9
iface.transport_name = bnx2i
# END RECORD
# cat ifaces/bnx2i.00:10:18:58:e8:fb
# BEGIN RECORD 2.0-871
iface.iscsi_ifacename = bnx2i.00:10:18:58:e8:fb
iface.ipaddress = 10.10.100.199
iface.hwaddress = 00:10:18:58:e8:fb
iface.transport_name = bnx2i
# END RECORD
# service iscsid start
Starting iSCSI daemon: [ OK ]
[ OK ]
# iscsiadm -m discovery -t st -p
10.10.100.30:3260 -I bnx2i.
00:10:18:58:e8:fb -I bnx2i.00:10:18:58:e8:f9 -P 1
Target: iqn.2001-05.com.equallogic:0-8a0906-97d4b5e06-596000000264bc83-
blg9-vol3
Portal:
10.10.100.30:3260,1
Iface Name: bnx2i.00:10:18:58:e8:f9
Iface Name: bnx2i.00:10:18:58:e8:fb
Target: iqn.2001-05.com.equallogic:0-8a0906-8904b5e06-b66000000204bc83-
blg9-vol1
Portal:
10.10.100.30:3260,1
Iface Name: bnx2i.00:10:18:58:e8:f9
Iface Name: bnx2i.00:10:18:58:e8:fb
Target: iqn.2001-05.com.equallogic:0-8a0906-94c4b5e06-df9000000234bc83-
blg9-vol2
Portal:
10.10.100.30:3260,1
Iface Name: bnx2i.00:10:18:58:e8:f9
Iface Name: bnx2i.00:10:18:58:e8:fb
# iscsiadm -m node --loginall=all
the command show suddenly the first "successful" line, then stays
there for about three minutes and at the end shows the remaining
successfull one and the failed ones.
Complete output of the command:
Logging in to [iface: bnx2i.00:10:18:58:e8:f9, target: iqn.
2001-05.com.equallogic:0-8a0906-97d4b5e06-596000000264bc83-blg9-vol3,
portal: 10.10.100.30,3260]
Logging in to [iface: bnx2i.00:10:18:58:e8:fb, target: iqn.
2001-05.com.equallogic:0-8a0906-97d4b5e06-596000000264bc83-blg9-vol3,
portal: 10.10.100.30,3260]
Logging in to [iface: bnx2i.00:10:18:58:e8:f9, target: iqn.
2001-05.com.equallogic:0-8a0906-8904b5e06-b66000000204bc83-blg9-vol1,
portal: 10.10.100.30,3260]
Logging in to [iface: bnx2i.00:10:18:58:e8:fb, target: iqn.
2001-05.com.equallogic:0-8a0906-8904b5e06-b66000000204bc83-blg9-vol1,
portal: 10.10.100.30,3260]
Logging in to [iface: bnx2i.00:10:18:58:e8:f9, target: iqn.
2001-05.com.equallogic:0-8a0906-94c4b5e06-df9000000234bc83-blg9-vol2,
portal: 10.10.100.30,3260]
Logging in to [iface: bnx2i.00:10:18:58:e8:fb, target: iqn.
2001-05.com.equallogic:0-8a0906-94c4b5e06-df9000000234bc83-blg9-vol2,
portal: 10.10.100.30,3260]
Login to [iface: bnx2i.00:10:18:58:e8:f9, target: iqn.
2001-05.com.equallogic:0-8a0906-97d4b5e06-596000000264bc83-blg9-vol3,
portal: 10.10.100.30,3260]: successful
iscsiadm: Could not login to [iface: bnx2i.00:10:18:58:e8:fb, target:
iqn.2001-05.com.equallogic:0-8a0906-97d4b5e06-596000000264bc83-blg9-
vol3, portal: 10.10.100.30,3260]:
iscsiadm: initiator reported error (8 - connection timed out)
iscsiadm: Could not login to [iface: bnx2i.00:10:18:58:e8:f9, target:
iqn.2001-05.com.equallogic:0-8a0906-8904b5e06-b66000000204bc83-blg9-
vol1, portal: 10.10.100.30,3260]:
iscsiadm: initiator reported error (4 - encountered connection
failure)
iscsiadm: Could not login to [iface: bnx2i.00:10:18:58:e8:fb, target:
iqn.2001-05.com.equallogic:0-8a0906-8904b5e06-b66000000204bc83-blg9-
vol1, portal: 10.10.100.30,3260]:
iscsiadm: initiator reported error (4 - encountered connection
failure)
Login to [iface: bnx2i.00:10:18:58:e8:f9, target: iqn.
2001-05.com.equallogic:0-8a0906-94c4b5e06-df9000000234bc83-blg9-vol2,
portal: 10.10.100.30,3260]: successful
iscsiadm: Could not login to [iface: bnx2i.00:10:18:58:e8:fb, target:
iqn.2001-05.com.equallogic:0-8a0906-94c4b5e06-df9000000234bc83-blg9-
vol2, portal: 10.10.100.30,3260]:
iscsiadm: initiator reported error (8 - connection timed out)
iscsiadm: Could not log into all portals. Err 8.
# iscsiadm -m session
bnx2i: [1]
10.10.100.30:3260,1 iqn.2001-05.com.equallogic:
0-8a0906-94c4b5e06-df9000000234bc83-blg9-vol2
bnx2i: [2]
10.10.100.30:3260,1 iqn.2001-05.com.equallogic:
0-8a0906-97d4b5e06-596000000264bc83-blg9-vol3
During the three-minutes interval in messages:
Jul 16 15:16:21 orasvi2 iscsid: Received iferror -1
Jul 16 15:16:21 orasvi2 iscsid: cannot make a connection to
10.10.100.30:3260 (-1,11)
Jul 16 15:16:21 orasvi2 kernel: bnx2i [05:00.01]: ISCSI_INIT passed
Jul 16 15:16:21 orasvi2 kernel: bnx2i [05:00.00]: ISCSI_INIT passed
Jul 16 15:16:22 orasvi2 iscsid: Received iferror -101
Jul 16 15:16:22 orasvi2 iscsid: cannot make a connection to
10.10.100.30:3260 (-101,11)
Jul 16 15:16:23 orasvi2 kernel: bnx2i [05:00.01]: ISCSI_INIT passed
Jul 16 15:16:23 orasvi2 kernel: connection1:0: detected conn error
(1011)
Jul 16 15:16:23 orasvi2 iscsid: Could not broadcast to uIP after 3
tries
Jul 16 15:16:23 orasvi2 iscsid: Received iferror -101
Jul 16 15:16:23 orasvi2 iscsid: cannot make a connection to
10.10.100.30:3260 (-101,11)
Jul 16 15:16:23 orasvi2 kernel: bnx2i [05:00.01]: ISCSI_INIT passed
Jul 16 15:16:23 orasvi2 kernel: connection2:0: detected conn error
(1011)
Jul 16 15:16:24 orasvi2 iscsid: Received iferror -101
Jul 16 15:16:24 orasvi2 iscsid: cannot make a connection to
10.10.100.30:3260 (-101,11)
Jul 16 15:16:24 orasvi2 iscsid: Login authentication failed with
target iqn.2001-05.com.equallogic:0-8a0906-94c4b5e06-df9000000234bc83-
blg9-vol2
Jul 16 15:16:26 orasvi2 iscsid: Login authentication failed with
target iqn.2001-05.com.equallogic:0-8a0906-97d4b5e06-596000000264bc83-
blg9-vol3
Jul 16 15:16:29 orasvi2 kernel: connection1:0: bnx2i: conn update -
MBL 0x40000 FBL 0x10000MRDSL_I 0x40000 MRDSL_T 0x10000
Jul 16 15:16:29 orasvi2 kernel: Vendor: EQLOGIC Model:
100E-00 Rev: 4.3
Jul 16 15:16:29 orasvi2 kernel: Type: Direct-
Access ANSI SCSI revision: 05
Jul 16 15:16:29 orasvi2 kernel: SCSI device sdb: 419450880 512-byte
hdwr sectors (214759 MB)
Jul 16 15:16:29 orasvi2 kernel: sdb: Write Protect is off
Jul 16 15:16:29 orasvi2 kernel: SCSI device sdb: drive cache: write
through
Jul 16 15:16:29 orasvi2 kernel: SCSI device sdb: 419450880 512-byte
hdwr sectors (214759 MB)
Jul 16 15:16:29 orasvi2 kernel: sdb: Write Protect is off
Jul 16 15:16:29 orasvi2 kernel: SCSI device sdb: drive cache: write
through
Jul 16 15:16:29 orasvi2 multipathd: sdb: add path (uevent)
Jul 16 15:16:29 orasvi2 kernel: sdb: unknown partition table
Jul 16 15:16:29 orasvi2 kernel: sd 9:0:0:0: Attached scsi disk sdb
Jul 16 15:16:29 orasvi2 kernel: sd 9:0:0:0: Attached scsi generic sg3
type 0
Jul 16 15:16:30 orasvi2 kernel: connection2:0: bnx2i: conn update -
MBL 0x40000 FBL 0x10000MRDSL_I 0x40000 MRDSL_T 0x10000
Jul 16 15:16:30 orasvi2 kernel: Vendor: EQLOGIC Model:
100E-00 Rev: 4.3
Jul 16 15:16:30 orasvi2 kernel: Type: Direct-
Access ANSI SCSI revision: 05
Jul 16 15:16:30 orasvi2 kernel: SCSI device sdc: 419450880 512-byte
hdwr sectors (214759 MB)
Jul 16 15:16:30 orasvi2 kernel: sdc: Write Protect is off
Jul 16 15:16:30 orasvi2 kernel: SCSI device sdc: drive cache: write
through
Jul 16 15:16:30 orasvi2 kernel: SCSI device sdc: 419450880 512-byte
hdwr sectors (214759 MB)
Jul 16 15:16:30 orasvi2 kernel: sdc: Write Protect is off
Jul 16 15:16:30 orasvi2 kernel: SCSI device sdc: drive cache: write
through
Jul 16 15:16:30 orasvi2 kernel: sdc: unknown partition table
Jul 16 15:16:30 orasvi2 kernel: sd 9:0:1:0: Attached scsi disk sdc
Jul 16 15:16:30 orasvi2 kernel: sd 9:0:1:0: Attached scsi generic sg4
type 0
Jul 16 15:16:30 orasvi2 kernel: device-mapper: multipath round-robin:
version 1.0.0 loaded
Jul 16 15:16:30 orasvi2 multipathd: vol2: load table [0 419450880
multipath 1 queue_if_no_path 0 1 1 round-robin 0 1 1 8:16 512]
Jul 16 15:16:30 orasvi2 multipathd: vol2: event checker started
Jul 16 15:16:30 orasvi2 multipathd: sdc: add path (uevent)
Jul 16 15:16:30 orasvi2 iscsid: connection1:0 is operational now
Jul 16 15:16:30 orasvi2 iscsid: Could not write to /sys/bus/scsi/
devices/9:0:0:0/queue_depth. Invalid permissions.
Jul 16 15:16:30 orasvi2 iscsid: Could not queue depth for LUN 0 err
13.
Jul 16 15:16:30 orasvi2 iscsid: connection2:0 is operational now
Jul 16 15:16:30 orasvi2 iscsid: Could not write to /sys/bus/scsi/
devices/9:0:1:0/queue_depth. Invalid permissions.
Jul 16 15:16:30 orasvi2 iscsid: Could not queue depth for LUN 0 err
13.
Jul 16 15:16:30 orasvi2 multipathd: vol3: load table [0 419450880
multipath 1 queue_if_no_path 0 1 1 round-robin 0 1 1 8:32 512]
Jul 16 15:16:30 orasvi2 multipathd: vol3: event checker started
Jul 16 15:16:30 orasvi2 multipathd: dm-3: add map (uevent)
Jul 16 15:16:30 orasvi2 multipathd: dm-3: devmap already registered
Jul 16 15:16:30 orasvi2 multipathd: dm-4: add map (uevent)
Jul 16 15:16:30 orasvi2 multipathd: dm-4: devmap already registered
Jul 16 15:16:31 orasvi2 kernel: bnx2i [05:00.01]: ISCSI_INIT passed
Jul 16 15:16:31 orasvi2 kernel: bnx2i [05:00.01]: ISCSI_INIT passed
Jul 16 15:16:31 orasvi2 iscsid: Received iferror -101
Jul 16 15:16:31 orasvi2 iscsid: cannot make a connection to
10.10.100.30:3260 (-101,0)
and then many of the latest 3 lines above for the interval.
Also, when using offload, the file /var/log/brcm-iscsi.log exploded in
size logging many of these lines:
WARN [Fri Jul 16 16:08:01 2010]uip ip: packet shorter than reported
in IP header: tcp_ipv4_hdr->len: 328 ustack->uip_len:60.
WARN [Fri Jul 16 16:08:01 2010]uip ip: packet shorter than reported
in IP header: tcp_ipv4_hdr->len: 328 ustack->uip_len:60.
WARN [Fri Jul 16 16:08:22 2010]uip ip: packet shorter than reported
in IP header: tcp_ipv4_hdr->len: 328 ustack->uip_len:60.
WARN [Fri Jul 16 16:08:22 2010]uip ip: packet shorter than reported
in IP header: tcp_ipv4_hdr->len: 328 ustack->uip_len:60.
ERR [Fri Jul 16 16:08:53 2010]nic eth0: Missed interrupt! on 855 not
853
WARN [Fri Jul 16 16:10:05 2010]uip ip: packet shorter than reported
in IP header: tcp_ipv4_hdr->len: 328 ustack->uip_len:60.
WARN [Fri Jul 16 16:10:05 2010]uip ip: packet shorter than reported
in IP header: tcp_ipv4_hdr->len: 328 ustack->uip_len:60.
WARN [Fri Jul 16 16:10:17 2010]uip ip: packet shorter than reported
in IP header: tcp_ipv4_hdr->len: 328 ustack->uip_len:60.
WARN [Fri Jul 16 16:10:17 2010]uip ip: packet shorter than reported
in IP header: tcp_ipv4_hdr->len: 328 ustack->uip_len:60.
After the rpm update it seems that the login problem remains but I
don't get the errors in brcm-iscsi.log