Slow iSCSI

jo...@moonfruit.com

unread,

Sep 20, 2007, 7:20:56 AM9/20/07

to open-iscsi

This is weird, I have Dell (EMC) Ax150i running centos5
2.6.18-8.1.10.el5
iscsi-initiator-utils.i386-6.2.0.742-0.6.el5

My write speed
dd if=/dev/zero of=zero bs=4096 count=1572864
1572864+0 records in
1572864+0 records out
6442450944 bytes (6.4 GB) copied, 150.675 seconds, 42.8 MB/s

My read speed
dd if=zero of=/dev/null bs=4096
1572864+0 records in
1572864+0 records out
6442450944 bytes (6.4 GB) copied, 200.923 seconds, 32.1 MB/s

I'm using the default config for iscsid and am at my wits end, can
anyone on this list point me in the right direction?

Thanks,
Josh.

Boaz Harrosh

unread,

Sep 20, 2007, 10:59:30 AM9/20/07

to open-...@googlegroups.com

Perhaps you have 2-4 Gg of Memory on this machine. Also maybe lots of memory
on your iscsi target. Write behind cache of 1/4 * 6.4GB could certainly explain it.

Hiren Joshi

unread,

Sep 20, 2007, 11:10:01 AM9/20/07

to open-...@googlegroups.com

Interesting, I have 2G on the machine, not sure how much I have on the AX150...... what can I do about this?

From: open-...@googlegroups.com [mailto:open-...@googlegroups.com] On Behalf Of Boaz Harrosh
Sent: 20 September 2007 16:00
To: open-...@googlegroups.com
Subject: Re: Slow iSCSI

Boaz Harrosh

unread,

Sep 20, 2007, 12:21:47 PM9/20/07

to open-...@googlegroups.com

OK I was just told that dd does not have that problem any more. (It
used to in old systems) And it does wait for a sync. So the only thing
is if you have lots of caching on the AX150.

Other wise it is weird. I usually get about the same scores.

Hiren Joshi

unread,

Sep 20, 2007, 12:29:33 PM9/20/07

to open-...@googlegroups.com

I think the AX150 has 512M memory.... is this the limit of the
hardware?!
Any idea what this guy did?
http://groups.google.com/group/open-iscsi/browse_thread/thread/d9cf10878
afa5fa9/a140283dd4f169b2?#a140283dd4f169b2

Don Williams

unread,

Sep 20, 2007, 1:48:14 PM9/20/07

to open-...@googlegroups.com

For sequential read performance you can increase the read ahead value.

The default is 256 sectors.

Try:

#blockdev --setra /dev/<device> 4096

And re-run your test.

Also make sure that flowcontrol is enabled.

#ethtool -a ethX

You can then try larger values and see if that helps your application. If
your requirement is highly random then a high value will likely hurt
performance.

Don

-----Original Message-----
From: open-...@googlegroups.com [mailto:open-...@googlegroups.com] On

Mike Christie

unread,

Sep 20, 2007, 3:35:19 PM9/20/07

to open-...@googlegroups.com

jo...@moonfruit.com wrote:
> This is weird, I have Dell (EMC) Ax150i running centos5
> 2.6.18-8.1.10.el5
> iscsi-initiator-utils.i386-6.2.0.742-0.6.el5
>
> My write speed
> dd if=/dev/zero of=zero bs=4096 count=1572864
> 1572864+0 records in
> 1572864+0 records out
> 6442450944 bytes (6.4 GB) copied, 150.675 seconds, 42.8 MB/s
>
> My read speed
> dd if=zero of=/dev/null bs=4096
> 1572864+0 records in
> 1572864+0 records out
> 6442450944 bytes (6.4 GB) copied, 200.923 seconds, 32.1 MB/s
>

I think there is a bug in 2.6.18-8.1.10.el5 + any version of iscsi,
where in some setups the read speed is much slower than writes. So with
bs 4096, we may get some slower IO numbers, but if you do

echo noop > /sys/block/sdXYZ/queue/scheduler
if=/dev/zero of=zero bs=128k count=1572864

and

if=zero of=/dev/null bs=128k count=1572864

what numbers do you get?

With IET I get:

I am getting 112 - 119 MB/s for writes (119 with jumbo frames on both
the initiator box and IET box and both boxes are running e1000).

And only

35 - 71 MB/s for reads.

With different bs values the writes speed is always the same, but the
read speed is always slow sometimes 26MB/s with smaller bs and up to 90
with 256k, but never as fast as writes.

The with another box (same specs quad xeon and 4 gigs of mem and
e1000s), and same version of RHEL5, I get 112 - 119 MB/s for reads and
writes with any bs size.

I am not sure what the problem is yet. I just hit while trying to debug
the RHEL4/Centos4/linux-iscsi issue you reported and debugging a write
perf issues someone reported to this list. The weird thing is that with
RHEL4/centos4 and linux-iscsi I get 112 MB/s on both boxes.

Hiren Joshi

unread,

Sep 21, 2007, 5:14:20 AM9/21/07

to open-...@googlegroups.com

Thanks for the suggestion but the reads will be random. Flow control is
on.

-----Original Message-----
From: open-...@googlegroups.com [mailto:open-...@googlegroups.com]
On Behalf Of Don Williams
Sent: 20 September 2007 18:48
To: open-...@googlegroups.com
Subject: RE: Slow iSCSI

For sequential read performance you can increase the read ahead value.

The default is 256 sectors.

Try:

#blockdev --setra /dev/<device> 4096

And re-run your test.

Also make sure that flowcontrol is enabled.

#ethtool -a ethX

You can then try larger values and see if that helps your application.
If your requirement is highly random then a high value will likely hurt
performance.

Don

-----Original Message-----
From: open-...@googlegroups.com [mailto:open-...@googlegroups.com]
On Behalf Of jo...@moonfruit.com
Sent: Thursday, September 20, 2007 7:21 AM
To: open-iscsi
Subject: Slow iSCSI

This is weird, I have Dell (EMC) Ax150i running centos5
2.6.18-8.1.10.el5
iscsi-initiator-utils.i386-6.2.0.742-0.6.el5

My write speed
dd if=/dev/zero of=zero bs=4096 count=1572864
1572864+0 records in
1572864+0 records out
6442450944 bytes (6.4 GB) copied, 150.675 seconds, 42.8 MB/s

My read speed
dd if=zero of=/dev/null bs=4096
1572864+0 records in
1572864+0 records out
6442450944 bytes (6.4 GB) copied, 200.923 seconds, 32.1 MB/s

I'm using the default config for iscsid and am at my wits end, can

Hiren Joshi

unread,

Sep 21, 2007, 6:10:12 AM9/21/07

to open-...@googlegroups.com

The count was too high (about 190G?) so I reduced it to 90000ish (about
12G). The scheduler was using cfq (now set to noop for this test).

Write 24.5MB/s
Read 19MB/s =(

Do you have any more info on the 2.6.18 bug?

-----Original Message-----
From: open-...@googlegroups.com [mailto:open-...@googlegroups.com]

On Behalf Of Mike Christie
Sent: 20 September 2007 20:35
To: open-...@googlegroups.com
Subject: Re: Slow iSCSI

jo...@moonfruit.com wrote:
> This is weird, I have Dell (EMC) Ax150i running centos5
> 2.6.18-8.1.10.el5
> iscsi-initiator-utils.i386-6.2.0.742-0.6.el5
>
> My write speed
> dd if=/dev/zero of=zero bs=4096 count=1572864
> 1572864+0 records in
> 1572864+0 records out
> 6442450944 bytes (6.4 GB) copied, 150.675 seconds, 42.8 MB/s
>
> My read speed
> dd if=zero of=/dev/null bs=4096
> 1572864+0 records in
> 1572864+0 records out
> 6442450944 bytes (6.4 GB) copied, 200.923 seconds, 32.1 MB/s
>

I think there is a bug in 2.6.18-8.1.10.el5 + any version of iscsi,

Mike Christie

unread,

Sep 21, 2007, 2:07:04 PM9/21/07

to open-...@googlegroups.com

Hiren Joshi wrote:
> The count was too high (about 190G?) so I reduced it to 90000ish (about
> 12G). The scheduler was using cfq (now set to noop for this test).
>
> Write 24.5MB/s
> Read 19MB/s =(
>
> Do you have any more info on the 2.6.18 bug?
>

No, I am right in the middle of trying to hunt it down now.

But it looks like your problem is not the same since you have slow
speeds for both reads and writes.

Miguel Gonzalez Castaños

unread,

Sep 21, 2007, 2:16:44 PM9/21/07

to open-...@googlegroups.com

Mike Christie escribió:

I'd like to know how you are getting those results, I'm using bonnie,
but I'm not sure if my results are fine or I need to know how to tune
the settings (and if I have to, where should look at?)

Here are my benchmarks:

Version 1.03 ------Sequential Output------ --Sequential Input-
--Random-

-Per Chr- --Block-- -Rewrite- -Per Chr- --Block--
--Seeks--

Machine Size K/sec %CP K/sec %CP K/sec %CP K/sec %CP K/sec %CP
/sec %CP

svn 1G 14604 65 18248 40 6406 11 19791 55 24722 15
283.8 3

------Sequential Create------ --------Random
Create--------

-Create-- --Read--- -Delete-- -Create-- --Read---
-Delete--

files /sec %CP /sec %CP /sec %CP /sec %CP /sec %CP
/sec %CP

16 8128 88 +++++ +++ 8946 80 7959 89 +++++ +++
12198 86

Miguel

Dan Reagan

unread,

Sep 21, 2007, 3:01:05 PM9/21/07

to open-...@googlegroups.com

> I think the AX150 has 512M memory.... is this the limit of the
> hardware?!
> Any idea what this guy did?

Pretty sure that was me.

I've had to let that project drop so I haven't hit the box with any of
the more recent versions but my issue was a hardware/software problem in
the AX150 itself and I didn't solve it, my VAR did. Turned out that the
box was in some sort of limp mode because it was incorrectly seeing some
hardware as bad or non-existent. To fix it the VAR accessed a Windows XP
GUI on the thing (I knew it was XP based but I was horrified to see the
GUI ;') and messed around with some settings on it, rebooted it once or
twice and my speed was up to acceptable if not fast.

I've got it off in the corner of the lab now but should be dusting it
off and getting back to it soon if you want me to try and get duplicate
results.

I can't remember if I mentioned it in my other mails to the list but I
was running Debian unstable with various custom kernels and whatever the
latest version of open-iscsi was at the time.

Dan

Ross Vandegrift

unread,

Sep 21, 2007, 8:22:04 PM9/21/07

to open-...@googlegroups.com

On Fri, Sep 21, 2007 at 10:14:20AM +0100, Hiren Joshi wrote:
> Thanks for the suggestion but the reads will be random. Flow control is
> on.

[snip]

> Also make sure that flowcontrol is enabled.
>
> #ethtool -a ethX

This is somewhat off-topic, but unless there's substantially
different thought in the storage world (which I'm relatively new to),
Ethernet flow control is almost always a bad idea. Search for
head-of-line blocking to read about the common breakage that flow
control causes.

Rather than using flow-control, buy non-blocking switches that use
shared-memory for input frames. This will make flow-control a
non-issue, since the situations that would lead to pause frames can
never come about if the switch isn't oversubscribed.

If the switch is oversubscribed, then use QoS to classify packets into
queues so latency sensitive traffic can be switched across congested
links first.

--
Ross Vandegrift
ro...@kallisti.us

"The good Christian should beware of mathematicians, and all those who
make empty prophecies. The danger already exists that the mathematicians
have made a covenant with the devil to darken the spirit and to confine
man in the bonds of Hell."
--St. Augustine, De Genesi ad Litteram, Book II, xviii, 37

Don Williams

unread,

Sep 22, 2007, 2:10:56 PM9/22/07

to open-...@googlegroups.com

Re: Flowcontrol. Flowcontrol is not a bad idea. The flowcontrol issue isn't really with the switch. Many switches can't even SEND a flowcontrol frame to tell someone else to stop sending. The issue is the server talking to the switch. It can become overwhelmed on a high performing SAN. If flowcontrol is not available those packets the server can't handle are dropped and must be resent. Do that long enough and you'll start hitting disk timeout values on the servers. Also, flowcontrol comes into play almost exclusively on read operations. Sending data to the SAN almost never involves flowcontrol.

The other benefit to flowcontrol, is that on a switch well suited for SAN usage it has sufficient port buffering to hold those packets and send them as soon as the server is ready. Reducing retransmits.

Storage Area Networks are what I do. I can state unequivocally that Flowcontrol is essential for well performing and stable iSCSI SANs. QoS won't help when a server becomes the bottleneck.

You are correct, you must by non-blocking, not oversubscribed, high quality switches for iSCSI SANs.

Regards,

Don

Ross Vandegrift

unread,

Sep 22, 2007, 3:06:49 PM9/22/07

to open-...@googlegroups.com

On Sat, Sep 22, 2007 at 02:10:56PM -0400, Don Williams wrote:
> Also, flowcontrol comes into play almost exclusively on read

> operations.  Sending data to the SAN almost never involves
> flowcontrol.  <br>
> <br>

And while it's holding those frames at the head of the queue, remember
that the paused ports can't do anything else.

>  Storage Area Networks are what I do.  I can state unequivocally that
> Flowcontrol is essential for well performing and stable iSCSI SANs. 
> QoS won't help when a server becomes the bottleneck.  <br>

Let me give an example that seems to convince this network guy that
iSCSI should go without flow control as well. Please let me know
where I've incorrectly assumed something, or made an error!

Consider a network with two iSCSI targets, and a host that will access
both via a single interface.

The host initiates a connection with each target for two volumes and
begins reading from both. The read pattern to the first is bursty,
lots of small data files. The pattern to the second is streaming
large volumes of data. Say that 80% of the packets are from this bulk
data source. When the link to the host reaches saturation, two conditions
could obtain.

If flow control is enabled:
1) The host sends a PAUSE frame to the connected station.
2) The switch blocks data transmission for both disks.
3) Depending on the implementation and config, the swtich may
propogate PAUSE frames back to the MACs that are sending to the
congested port.
4) Your SAN is now blocked and not transmitting to anyone
becasue one host had a congested link.

If flow control is disabled:
1) The saturated host link begins to drop packets
2) TCP on both targets notice this, and begin to back off
until packet loss subsides.
3) Through this, performance on other devices using the SAN is
unaffected.

QoS can be used to make the second case even better. Use diffserv or
IP precedence to indicate that traffic to one target is higher-priority.
Most decent switches can convert those to 802.1p tags, putting the
higher-priority traffic into a seperate queue that gets first service.

Now, the bulk queue is only serviced after the higher priority queue
is empty. This ensures that all dropped frames are from your bulk
disk. TCP on that target now sees the packet loss and backs off. The
more critical disk didn't even notice the slowdown.

Don Williams

unread,

Sep 22, 2007, 3:50:20 PM9/22/07

to open-...@googlegroups.com

A pause frame should only happen if the server can't handle the traffic its getting at that point, so it can't do anything else anyway. So no other work could be done anyway on that server. It would only affect that port. Flowcontrol is point-to-point not end to end.   So you're whole SAN isn't impacted.   Pause events are very short in duration. The server catches up and more data flows.

I see it with at customer sites. They're not getting the proper performance.   Enable flowcontrol and the issue is resolved.

Also I've seen even fast retransmits effect SQL server performance.

Why wait for TCP to back things off when and retransmit allot of data?   And have to retransmit data.

Pause frames were created for this purpose.

Also w/QoS on a SAN how are you going to prioritize iSCSI traffic since it all occurs across port 3260?   You SAN should either be on a separate VLAN or on a dedicated switch.

Do as you wish, but literally thousands of installs the benefit of Flowcontrol is very clear.

Regards,

Don

Ross Vandegrift wrote:

Hiren Joshi

unread,

Sep 24, 2007, 4:25:48 AM9/24/07

to open-...@googlegroups.com

Using Navishpere express, this problem should show up on the logs?

Jeez, xp =(

-----Original Message-----
From: open-...@googlegroups.com [mailto:open-...@googlegroups.com]
On Behalf Of Dan Reagan
Sent: 21 September 2007 20:01
To: open-...@googlegroups.com
Subject: Re: Slow iSCSI

Hiren Joshi

unread,

Sep 24, 2007, 4:36:49 AM9/24/07

to open-...@googlegroups.com

Ok, firstly I'm not an expert on this so take what I say with a pinch of salt.

I used dd to copy over 10G worth of zeros and timed it.

The bonnie output I think shows that it's your cpu holding things back not the disks.....

I have no idea why it says ++++ on the reads bit.

This is worth a read:
http://www.textuality.com/bonnie/advice.html

-----Original Message-----
From: open-...@googlegroups.com [mailto:open-...@googlegroups.com] On Behalf Of Miguel Gonzalez Castaños
Sent: 21 September 2007 19:17
To: open-...@googlegroups.com
Subject: Re: Slow iSCSI

Hiren Joshi

unread,

Sep 24, 2007, 5:10:54 AM9/24/07

to open-...@googlegroups.com

The debate over flow control aside, I turned flow control off on the eth and:

Write

dd if=/dev/zero of=zero bs=4096 count=1572864
1572864+0 records in
1572864+0 records out

6442450944 bytes (6.4 GB) copied, 154.027 seconds, 41.8 MB/s

Read

dd if=zero of=/dev/null bs=4096
1572864+0 records in
1572864+0 records out

6442450944 bytes (6.4 GB) copied, 171.52 seconds, 37.6 MB/s

Better......

From: open-...@googlegroups.com [mailto:open-...@googlegroups.com] On Behalf Of Don Williams
Sent: 22 September 2007 20:50

To: open-...@googlegroups.com
Subject: Re: Slow iSCSI

Miguel Gonzalez Castaños

unread,

Sep 24, 2007, 11:36:27 AM9/24/07

to open-...@googlegroups.com

I have a mounted iSCSI drive on /mnt/virtualdata1, how can I test the
performance of that share using the example of dd? I tried the example
as it is but I don't know if I have to change the of in the writing and
the if in the reading...

Thanks,

Miguel

Hiren Joshi escribió:

> The debate over flow control aside, I turned flow control off on the
> eth and:
> Write
> dd if=/dev/zero of=zero bs=4096 count=1572864
> 1572864+0 records in
> 1572864+0 records out
> 6442450944 bytes (6.4 GB) copied, 154.027 seconds, 41.8 MB/s
> Read
> dd if=zero of=/dev/null bs=4096
> 1572864+0 records in
> 1572864+0 records out
> 6442450944 bytes (6.4 GB) copied, 171.52 seconds, 37.6 MB/s
>
> Better......
>
>

> ------------------------------------------------------------------------
> *From:* open-...@googlegroups.com
> [mailto:open-...@googlegroups.com] *On Behalf Of *Don Williams
> *Sent:* 22 September 2007 20:50
> *To:* open-...@googlegroups.com
> *Subject:* Re: Slow iSCSI

Hiren Joshi

unread,

Sep 24, 2007, 11:43:56 AM9/24/07

to open-...@googlegroups.com

Write test:
dd if=/dev/zero of=/mnt/virtualdata1/zero bs=4096 count=1572864

the bs equals the block size of my filesystem. the count gives me a 6G file (twice the size of my RAM.)

Read test:
dd if=/mnt/virtualdata1/zero of=/dev/null bs=4096

-----Original Message-----
From: open-...@googlegroups.com [mailto:open-...@googlegroups.com] On Behalf Of Miguel Gonzalez Castaños
Sent: 24 September 2007 16:36
To: open-...@googlegroups.com
Subject: Re: Slow iSCSI

Miguel Gonzalez Castaños

unread,

Sep 24, 2007, 1:37:48 PM9/24/07

to open-...@googlegroups.com

Many thanks Hiren!

I tried it on my Virtual Machine (it has only 256 Mb of memory right
now, but can be increased at will):

svn:~# dd if=/dev/zero of=/mnt/virtualdata1/zero bs=4096 count=1572864

1572864+0 records in
1572864+0 records out

6442450944 bytes (6.4 GB) copied, 486.128 seconds, 13.3 MB/s
svn:~#
svn:~# dd if=/mnt/virtualdata1/zero of=/dev/null bs=4096

1572864+0 records in
1572864+0 records out

6442450944 bytes (6.4 GB) copied, 505.934 seconds, 12.7 MB/s

Should I start increasing the memory assigned to the VM to see how it
affects the performance or should I look somewhere else? By the way, any
doc of tuning the performance?

Miguel

Hiren Joshi escribió:

Hiren Joshi

unread,

Sep 24, 2007, 1:49:13 PM9/24/07

to open-...@googlegroups.com

This sounds as bad as the numbers I'm getting.

I only suggested using a file size that's at least twice your memory so, you *know* the file is coming from the disk not the ram. So I don't think giving your VM more ram will help.....

Bonnie++ is a good benchmarking tool,
http://sourceforge.net/projects/bonnie/

This will (at the very least) show you if it's the disk or the processor that's slowing you down.

Miguel Gonzalez Castaños

unread,

Sep 24, 2007, 1:53:04 PM9/24/07

to open-...@googlegroups.com

I already sent to the list the results of bonnie, what can you imply
from them? How can I know why is not giving good numbers?

Miguel

Hiren Joshi escribió:

Hiren Joshi

unread,

Sep 24, 2007, 2:06:44 PM9/24/07

to open-...@googlegroups.com

Ah yes, now I remember..... I think it's your processor but I could be wrong =)

Miguel Gonzalez Castaños

unread,

Sep 25, 2007, 1:21:07 PM9/25/07

to open-...@googlegroups.com

What does this mean? I can't solve this issue? Any guidance of what to
do next would be apreciated