The current version of open-iscsi uses the default routing for discovery
so if you cannot even do a normal ping like below then iscsiadm
discovery is not going to work. The code in git is able to use the iface
binding info to bind the discovery session to a interface.
When you disable eth2 shouldn't the network just figure out that the
other interface eth5 is on the same subnet and drop down to that? Do you
have to update the routing tables by hand then?
Yeah, that should do what you want. Let me know.
Looks like sysfs compat issues.
Where did you get the tools you were using before? From your distro?
Where did you get your kernel from?
Could you send your kernel's .config so I can see your sysfs settings?
--
You received this message because you are subscribed to the Google Groups "open-iscsi" group.
To post to this group, send email to open-...@googlegroups.com.
To unsubscribe from this group, send email to open-iscsi+...@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/open-iscsi?hl=en.
Is this centos 5 or 6? If it is 6 could you change
CONFIG_SYSFS_DEPRECATED=y
CONFIG_SYSFS_DEPRECATED_V2=y
to
# CONFIG_SYSFS_DEPRECATED is not set
Also are you using 32 bit userspace for 64 bit? Make sure you compile as
64 bits like your kernel.
About Aug 15th.
Hi Mike,
Have you got chance to look into this issue?
Farhan
I have not. You tried changing the rp_filter values right?
Run iscsid by hand in debug mode
/etc/init.d/iscsid stop
iscsid -d 8 -f all
iscsiadm -m node ..... -l
Send iscsid output.
Also send /var/log/messages output. Also run tcpdump to check that the
tcp ip SYN is going through the right interface.
> Farhan
>
> On Jul 26, 2011 12:54 PM, "Farhan Ahmed" <ahmed....@gmail.com
> <mailto:ahmed....@gmail.com>> wrote:
>> Hi Mike,
>>
>> I installed CentOS 6.0 and git code worked well. I can perform the
>> discovery now but iscsiadm cant login
>>
>> eth2 is down from the Switch
>>
>> [root@nfs02 ~]# ping 192.168.42.190
>> PING 192.168.42.190 (192.168.42.190) 56(84) bytes of data.
>> ^C
>> --- 192.168.42.190 ping statistics ---
>> 3 packets transmitted, 0 received, 100% packet loss, time 2930ms
>>
>> [root@nfs02 ~]# ping -I eth3 192.168.42.190
>> PING 192.168.42.190 (192.168.42.190) from 192.168.42.71 eth3: 56(84)
>> bytes of data.
>> 64 bytes from 192.168.42.190 <http://192.168.42.190>: icmp_seq=1
> ttl=255 time=0.108 ms
>> 64 bytes from 192.168.42.190 <http://192.168.42.190>: icmp_seq=2
> ttl=255 time=0.107 ms
>> ^C
>> --- 192.168.42.190 ping statistics ---
>> 2 packets transmitted, 2 received, 0% packet loss, time 1311ms
>> rtt min/avg/max/mdev = 0.107/0.107/0.108/0.010 ms
>>
>> [root@nfs02 ~]# iscsiadm -m discovery -p 192.168.42.190 -t st -I eth3
>> 192.168.42.190:3260 <http://192.168.42.190:3260>,1
> iqn.2001-05.com.equallogic:0-8a0906-
>> f0a1ed402-1f40012bba54e28b-nfs02-bond
>>
>>
>> [root@nfs02 ~]# iscsiadm -m node -T iqn.2001-05.com.equallogic:
>> 0-8a0906-f0a1ed402-1f40012bba54e28b-nfs02-bond -I eth3 -l
>> Logging in to [iface: eth3, target: iqn.2001-05.com.equallogic:
>> 0-8a0906-f0a1ed402-1f40012bba54e28b-nfs02-bond, portal:
>> 192.168.42.190,3260] (multiple)
>> iscsiadm: Could not login to [iface: eth3, target: iqn.
>> 2001-05.com.equallogic:0-8a0906-f0a1ed402-1f40012bba54e28b-nfs02-bond,
>> portal: 192.168.42.190,3260].
>> iscsiadm: initiator reported error (8 - connection timed out)
>> iscsiadm: Could not log into all portals
>>
>>
>> Could you please advise? Secondly if we reboot our box while "eth2 is
>> down from switch" still iscsiadm cant login
>>
>> Regards,
>>
>> Farhan
>>
>>
>>
>>
>>
>> On Jul 24, 6:20 pm, Mike Christie <micha...@cs.wisc.edu
> <mailto:micha...@cs.wisc.edu>> wrote:
>>> On 07/24/2011 12:02 AM, Farhan Ahmed wrote:
>>>
>>> > It is centos 5.6. I tried to compile kernel without DEPRECATED option
>>> > but kernel didnt boot and threw errors so I had to compile it again
>>> > with DEPRECATED support, I cant udnerstand why SYSFS had no issue with
>>> > the default version of open-iscsi. We can upgrade CENTOS from 5.6 to
>>> > 6.0. May I ask when are you releasing new version of open-scsi that
>>> > fix this routing issue?
>>>
>>> About Aug 15th.
>>
>> --
>> You received this message because you are subscribed to the Google
> Groups "open-iscsi" group.
>> To post to this group, send email to open-...@googlegroups.com
> <mailto:open-...@googlegroups.com>.
>> To unsubscribe from this group, send email to
> open-iscsi+...@googlegroups.com
> <mailto:open-iscsi%2Bunsu...@googlegroups.com>.
Looks like iscsid/iscsiadm did not try to bind the session so that is
why it failed.
Could you do
iscsiadm -m node -P 1
and send the output?
and then do
iscsiadm -m node -I eth3
and send the output?
Will try here too in a minute.
Are you seeing packets going through eth3 when you run the iscsiadm
login command or discovery?
I tested this here and confirmed it at least works for me. In the iscsid
-d 8 -f & log we want to see a line like:
iscsid: Binding session -1 to eth5
that which indicates we are doing binding instead of just using the
default routing.
(my iscsi iface is for eth5 you should see eth3). In the iscsiadm.txt
outpout I did not see that line so for some reason the normal session is
not getting bound to the proper iface.
So send the info I was asking about in the other mail and could you also
send the output of
iscsiadm -m iface -I eth3
For discovery if you did
iscsiadm -m discovery -t st -p ip -I eth3 -d 8
then you would see a similar "Binding session" line indicating we are
binding the session to a iface.
Don't pass the login/-l parameter. Just do
iscsiadm -m node -I eth3
And send the output.
Did you pass "ip" or did you pass your discovery address
192.168.whatever.it.was?
What is the command you have been running to do discovery? Just run that
same command but also pass it the -d 8 argument to get some debugging.
Could you run with the attached patch? Run iscsid as iscsid -d 8 -f &
again and send all the output when you run the login command like you
did before.
Also try just setting either iface.net_ifacename or iface.hwaddress. Do
not set both at the same time.
That error can be ignored. It is just debugging info.
In the log it shows that bounded to eth3 and eth6 ok and disocvered ok.
The command you are running is wrong
iscsiadm -m discovery -t st -p 192.168.42.190 -l eth3 -d 8
eth3 is just ignored here and we try to use all interfaces. You have to
pass in the iface with -I
iscsiadm -m discovery -t st -p 192.168.42.190 -I eth3 -d 8
Also just forget -l in the discovery command for now.
Ignore this. I see you figured it out in the other mail.
Seems like it all worked there, right?
Could you take a tcpdump/wireshark trace?
Do not worry about discovery. Just do the normal session login.
I see us bind ok. I see us create a tcp socket ok. I see us send the
login command ok. But I do not see a response back from the target.
I ran three commands and their tpcdump output is attached
1) iscsiadm -m discovery -t st -p 192.168.42.190 -I eth3 -d 8
2) iscsiadm -m discovery -t st -p 192.168.42.190 -I eth3 -d 8 // -l login
3) iscsiadm -m node -T iqn.2001-05.com.equallogic:0-8a0906-f0a1ed402-1f40012bba54e28b-nfs02-bond -I eth3 -l
On 07/29/2011 01:36 AM, Farhan Ahmed wrote:
> I ran three commands and their tpcdump output is attached
>
> 1) iscsiadm -m discovery -t st -p 192.168.42.190 -I eth3 -d 8
>
> 2) iscsiadm -m discovery -t st -p 192.168.42.190 -I eth3 -d 8 // -l login
>
> 3) iscsiadm -m node -T
> iqn.2001-05.com.equallogic:0-8a0906-f0a1ed402-1f40012bba54e28b-nfs02-bond -I
> eth3 -l
>
You have to run tcpdump as
tcpdump -i eth3 -w mytcpdumpfile
then send me mytcpdumpfile
Do you know the answer to or can you redirect me to someone that can
answer the following:
The user below, Farhan Ahmed, is doing the following with a Equallogic
target.
On the initiator box he has multiple network interfaces on the same
subnet. He then has a iscsi iface for each one that is bound to the
iscsi target. So he has run
iscsiadm -m discovery -t st -p ip -I his_iface1
If he disables something in the network that affects the default route
then he can do:
ping -I net_interface $targets_ip
but when we try to login a normal session it will fail. See packet #97.
In this test he does not run something like route on the initiator box
to update the default route. Instead he is relying on the iscsi code's
interface binding to force the use of the interface.
We can see the iscsi login pdu get sent, but we do not see a iscsi
response. From what I can tell we just seem to see a tcp ack. Then later
in packet #104 we see the initiator give up on waiting for a login
response and disconnect the tcp/ip connection.
However, the weird thing is that before we try the normal session login
we see in packet #10 that a discovery session works ok in this test.
So command #1 works but command #3 does not and between that time he has
not done any changes to the iscsi config or network.
Is what Farhan is trying to do supported by EQL targets?
And is there a way to if the target is getting the iscsi login pdu
correctly?
I noticed that after sending the login request (Packet #97) the initiator is closing the connection (Packet #104) after 1 second, which is quite soon - I would suggest adjusting the node.conn[0].timeo.login_timeout value back up to the default of 15 seconds and trying again.
For more help, I'd recommend contacting the Dell EqualLogic support organization. They're quite good, and have a lot of experience with Linux (including this tricky multiple-NICS-on-the-same-subnet setup):
http://support.dell.com/equallogic/
--
Jim Ramsay
You can grab my test version here if you want the rpm:
http://people.redhat.com/mchristi/iscsi/rhel6.2/iscsi-initiator-utils/
That is just a test version. It is not for production yet. It has all
the features I intended to add. It just needs more testing and some
smaller changes.
On 09/06/2011 12:43 PM, Mike Christie wrote:
> On 09/06/2011 04:54 AM, Farhan Ahmed wrote:
>> Hi Mike,
>>
>> it is driving me nuts :) , I have installed the latest rpm from the
>> following URL and it seems it is having issues to add sessions after 15,
>> I can login to the targets till session and after that I am getting
>> these errors. Any clue?, I have 4 network interfaces so If I login to
>> target it creates 4 session so I can login to more than 4 targets as it
>> is having issues with session 16.
>>
>
> Looks like a regression added by dell when they added some leading loginActually ignore this. I just tried it here. When you run the login
> feature. There is basically a race going on during async login.
>
> If you login to each target/portal one by one does it work? So do
>
> iscsiadm -m node -T target -I iface -p ip -l
>
> for each target/portal combo.
>
command in /var/log/messages do you see something like:
Sep 6 12:50:55 meanminna iscsid: Received iferror -22: Invalid argument.
Sep 6 12:50:55 meanminna iscsid: can't bind conn 16:0 to session 16,
retcode 1 (2)
?
In the future include that info. It will help me out :)
I can replicate it here. It is caused by something that got brought in
with the qla4xxx management patches. I am working on it now.
It looks like userspace is sending down the correct info, but somewhere
between iscsid sending and scsi_transport_iscsi reading the data gets
corrupted and we hit this.
Found the problem. The new iscsi tools are sending the netlink request
in a way the iscsi class does not like. I should have a fix around Tues
(I am leaving on vacation after I send this).
A new upstream stable release should be done in about a month.
I think I have a fix. Waiting to hear back from qlogic about their testing.