It could have if the routing changed, but the initiator would have tried
to reconnect and we would have started using eth1. However, if this
happened more than 5 times to the same IO then we could have seen IO
errors. Either way though you should have seen some iscsi and scsi
errors in the logs.
I noticed these occurred:
May 29 16:47:31 r05s23 iscsid: Kernel reported iSCSI connection 1:0
error (1011) state (3)
May 29 16:47:34 r05s23 iscsid: received iferror -38
May 29 16:47:34 r05s23 last message repeated 3 times
May 29 16:47:34 r05s23 iscsid: connection1:0 is operational after
recovery (1 attempts)
... what id the -38 error code?
thanks in advance.
- a.
>
> >
>
--
A S P A S I A
. . . . . . . . . . ..
It just means that userspace tried to set a feature the kernel did not
support. It is not serious.
If there was nothing before that first line about the kernel reporting a
connection error, then the target could have initiated this. Do you see
anything in the target logs? What target is this again?
> It just means that userspace tried to set a feature the kernel did not
> support. It is not serious.
>
> If there was nothing before that first line about the kernel reporting a
> connection error, then the target could have initiated this. Do you see
> anything in the target logs? What target is this again?
>
Target is: Centos51 box also. We have noticed this to happen on this
one client (centos51 also) every 2-3 days ... This client seems to do
a lot of FS-related testing, not against the iscsiRoot I/O to another
FS-mounted system. No messages in the target logs pertaining to this
particular client and its iscsi target.
Today the same symptom occurred again, I was able to scribble the
messages in the console ... (now, I have a terminal server console
hooked-up and I'm capturing its output in file so I will have a more
complete logging info):
psid 0 of 1
psid 0 of 1
oversize name in 0000
psid 0 of 1
psid 0 of 1
|
|
readdir corruption in 000
psid 0 of 1
readdir corruption in 000
|
|
iscsi: cmd 0x2a is not queued (6)
end_request: I/O error dev sde; sector 50640
|
|
Buffer I/O error on device sde1, logical block 6323 last page write
due to I/O error on sde 1
iscsi: cmd 0x2a is not queued (6)
Aborting journal on device sde1
iscsi: cmd 0x2a is not queued (6)
sd 6:0:0:0 rejection I/O to device being mounted (this repeats a few times)
|
|
EXT3-fs error (device sde1): ext3_find_entry: reading directory 96001 offset 0
......
|
-------
I upgraded this server with a new iscsiRoot image that incorporates
Mike's suggestion to increase timeout specific for iscsiRoot type
connections ... would it be possible inaccessibility of root device is
possibly cause by this?
thanks in advance,
- aspasia.
What target is running on Centos? Is it IET or stgt or something else?
> Buffer I/O error on device sde1, logical block 6323 last page write
> due to I/O error on sde 1
> iscsi: cmd 0x2a is not queued (6)
> Aborting journal on device sde1
> iscsi: cmd 0x2a is not queued (6)
This means we either got a error from the target and we tried to
relogin, but we ended up killing the session because either the target
told us it was going away permantly or we bugged out and could not
handle the problem.
What are you running on the initiator side? Is that Centos 5.1 too? Are
you using the initiator that comes with it or a open-iscsi.org release?
What we need is some iscsid outpout which would be a little bit before
the log output you sent which would tell us why it decided to kill the
session. It would be something about a bug or fatal error or could not
log into target.
Oh yeah, are you doing anything on the target? Rebooting it? Changing
the config? Adding or removing targets or portals?
.... in the target when the /etc/ietd.conf file is updated with a new
iscsi root, a brief "service iscsi-target restart" occurs .... we're
thinking of just keeping a static list and not bother it any longer
....
- a.
Ah ok, as you saw in the other thread that can cause a disruption. We
should log back in. If you could try this rpm:
http://people.redhat.com/mchristi/iscsi/RHEL5/5.2/rpms/iscsi/iscsi-initiator-utils-6.2.0.868-0.7.el5.src.rpm
It is what will be in RHEL 5.2/Centos5.2. It fixes a couple places where
we would try to login but the target was rebooting/restarting and
would return errors that might have indicated that it was coming back
but we did not interpret them right and would kill the session.