I have reviewed a lot of discussion about new WD 4k sector disks (...EARS).
I have RAIDZ pool of such disks with very bad performance. Now my GPT ZFS partitions don't start from value dividable by 4 (162).
Some guys noticed that aligning ZFS partitions according to recommendation wouldn't help at all because RAIDZ uses variable stripe size.
So where is the bottleneck of this configuration: 1) in ZFS which doesn't know about 4k sectorsize? 2) maybe somewhere inside FreeBSD VFS code or disk driver code? 3) somewhere else?
If you doesn't want to use ZFS there are two ways to deal with this problem:
1) Use GNOP with virtual 4k sectorsize.
2) Use UFS on properly aligned partition and some tuning.
--
Alexey Tarasov
(\__/)
(='.'=)
E[: | | | | :]З
(")_(")
Yes, it doesn't group data in aligned "clusters" - the basic data
alignment is sector-sized.
> So where is the bottleneck of this configuration: 1) in ZFS which doesn't know about 4k sectorsize? 2) maybe somewhere inside FreeBSD VFS code or disk driver code? 3) somewhere else?
AFAIK (I don't have actual experience with them) current 4k drives
emulate 512b drives and have a performance penalty in the above
scenario. Because of this emulation, ZFS doesn't know you have a 4k
drive. If you can, try disabling this emulation and make it present to
the operating system as a true 4k drive. Of course, this will make the
data on it unreadable - you will have to reformat it, and the drive will
be unbootable.
Good luck and report what you find.
On 28.03.2010, at 14:44, Ivan Voras wrote:
> AFAIK (I don't have actual experience with them) current 4k drives emulate 512b drives and have a performance penalty in the above scenario. Because of this emulation, ZFS doesn't know you have a 4k drive. If you can, try disabling this emulation and make it present to the operating system as a true 4k drive. Of course, this will make the data on it unreadable - you will have to reformat it, and the drive will be unbootable.
>
> Good luck and report what you find.
>
> _______________________________________________
> freeb...@freebsd.org mailing list
> http://lists.freebsd.org/mailman/listinfo/freebsd-fs
> To unsubscribe, send any mail to "freebsd-fs-...@freebsd.org"
Maybe that's a dumb idea, but I am curious.
As far as I understand, the following points apply:
- EARS and RAIDZ is slow, but
- if you make ZFS aware of the 4k sector size (gnop), performance is okay
If so, what happens if you configure geli for the whole drive with 4k
sectors? Maybe you get AES-128 for free :-)
Riggs
What do you mean "for free"? =)
I don't think they will ever come with a firmware update like that because WD has major operating systems (like Windows 7) covered.
It could be solved by asking the drive what is the size of a sector (such ATA call exists: see http://www.wdc.com/wdproducts/library/WhitePapers/ENG/2579-771430.pdf) and set the sector size of the device (/dev/[a]da*) as needed . That would be a permanent solution for ZFS, UFS, swap or any other consumer of this device.
I mean (4K sectors + geli + RAIDZ) could be faster than (4K sectors + RAIDZ) ?
Riggs
On 29.03.2010, at 11:36, Simun Mikecin wrote:
> I don't think they will ever come with a firmware update like that because WD has major operating systems (like Windows 7) covered.
> It could be solved by asking the drive what is the size of a sector (such ATA call exists: see http://www.wdc.com/wdproducts/library/WhitePapers/ENG/2579-771430.pdf) and set the sector size of the device (/dev/[a]da*) as needed . That would be a permanent solution for ZFS, UFS, swap or any other consumer of this device.
--
> _______________________________________________
> freeb...@freebsd.org mailing list
> http://lists.freebsd.org/mailman/listinfo/freebsd-fs
> To unsubscribe, send any mail to "freebsd-fs-...@freebsd.org"
--
On 29.03.2010, at 19:18, Ivan Voras wrote:
> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
>
> On 03/29/10 16:40, Alexey Tarasov wrote:
>> Thank you for your reply, I will try to realign my partitions as soon as possible.
>
> Another possible solution is gnop, which I think somebody already
> mentioned. It too can create sector sizes of multiple base sector size.
On 03/29/10 16:40, Alexey Tarasov wrote:
> Thank you for your reply, I will try to realign my partitions as soon as possible.
Another possible solution is gnop, which I think somebody already
mentioned. It too can create sector sizes of multiple base sector size.
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v2.0.14 (FreeBSD)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/
iEYEARECAAYFAkuwxMsACgkQldnAQVacBcg4NQCcCV3GaPcbN+3eZV9+8T0ute4e
bqMAn3GceXqmxsH8fznglIxqEQ2hpzaU
=+M9c
-----END PGP SIGNATURE-----
I've tried all methods and realized that unfortunately the only working method is gnop. So you can't use these disks for ZFS at all now.
On 29.03.2010, at 19:18, Ivan Voras wrote:
> Another possible solution is gnop, which I think somebody already
> mentioned. It too can create sector sizes of multiple base sector size.
--
Why? And what are you actually trying to do?
My understanding was that even with 512-byte sectors ZFS still aligns its
on-disk data with > 4K alignment.
Do you see otherwise? What problem do you have?
--
Andriy Gapon
I see considerably increased performance when creating over gnop -S 4096 virtual disk. Even when I create zpool over raw disks the performance is very bad and concurent writes stalls. When using gnop, zfs works VERY fast!
Btw, here is another discussion, may be there is a bug in a mav@ commit, because he has added support for >512 sector size:
http://lists.freebsd.org/pipermail/freebsd-current/2010-April/016495.html
Превед Украине! =)
--
> On 09.04.2010 15:15, Alexey Tarasov wrote:
>> Btw, here is another discussion, may be there is a bug in a mav@ commit, because he has added
>> support for>512 sector size:
>
> First of can you look to the commit log and understand what it made?
http://svn.freebsd.org/viewvc/base?view=revision&revision=198897
- Add support for sector size > 512 bytes and physical sector of several
logical sectors, introduced by ATA-7 specification.
May be I have misunderstood this log message?
--
Alexey Tarasov
(\__/)
(='.'=)
E[: | | | | :]О©╫
(")_(")
First of can you look to the commit log and understand what it made?
--
WBR, Andrey V. Elsukov
It was MFC'ed to 8-STABLE.
Will try ahci with fresh STABLE later.
--
Alexey Tarasov
(\__/)
(='.'=)
E[: | | | | :]З
(")_(")
- Add support for sector size > 512 bytes and physical sector of several
logical sectors, introduced by ATA-7 specification.
May be I have misunderstood this log message?