rml...@gmail.com wrote on 31/08/12 9:34 AM:
> I question the uniform distribution of failures.
Bob,
you're dead right :-) A Simplifying Assumption.
There are two well known papers on large-scale HDD failures, published
within the last 5 years. One by Google. [others will know the refs]
HDD failure rate changes with age, use, temperature and power-cycles.
It's also not usefully near the Vendor-published figures from
accelerated ageing.
Nor, very surprisingly, does S.M.A.R.T. reporting give you much
predictive power. IIRC, google says more than 50% of failures are not
predictable from those logs.
The wild card is "new technologies" - how will they perform?
We've entered the last factor-10 increase of HDD recording density
(~2020) and three new recording techniques are yet to enter mainstream
service:
- HAMR [heat assisted Magnetic recording - higher coercivity media,
heated by laser]
- BPR [Bit Patterned Media. shape the bit areas
- Shingled Writes [better described as Multi-track overlapped write
with no in-place update]
The replacement technologies, broadly called "Storage Class Memories",
will, like Flash memory, have completely different wear and failure
characteristics.
Even 'Flash' seems to now be in a region of declining returns with
feature size reduction. We're currently at 100 electrons per cell,
looking to get to 10. Yes, that's One Hundred. A figure I find hard to
comprehend in consumer devices.
But, from where we are now, there aren't any technologies that will
surpass HDD in capacity/price for the next 20 years.
But as Neil has so ably pointed out recently with Fusion-IO and
PCI-SSD's, HDD's are no longer "useful" or cost-effective for serving
Random I/O loads. [Sell your shares in Enterprise Disk Array
manufacturers, but not Seagate or Western Digital].
HDD's work well for streaming-IO: think CD-ROM or DVD.
- while they can seek, they are horribly slow at it...
- It takes 100-1000 times as long to read a HDD with 'random I/O'
compared to seek-and-stream.
Point of that excursion:
HDD reliability figures are going to become important to archivists
and not for Performance Analysts.