On 2013 Apr 12. md, at 14:19 st, Luc Simard wrote:
> If smartpools is perceived as scanning irrelevant pools, then we should revisit the cluster's configuration and optimize.
It's by design as confirmed by Isilon. Because it operates on
"the cluster" rather than the pools individually.
So even if you have just a perf and nl pool,
and a single rule for migrating inactive files
to the nl pool, SmartPools will scan both pools to find
inactive files (according to the rule), just to find
most of them already on nl...
SmartPools also does some balancing sort of similar to AutoBalance,
which can produce further unexpected load. Also confirmed by Isilon.
> Mediascan is only checking media, it's not a cluster critical job, not impact on workflow . It only checks if are exceeding ECC threshold and a few other values, if the drive shows to have too many errors, it will be marked for ejection with Flexprotect.
>
I just feel better if the monthly scheduled MediaScan jobs
do finish properly. MediaScan surely has been built-in on purpose,
and with a multi-100TB single filesystem I want to have everything
checked and fixed as designed.
> Snap delete has a higher priority setting, given it is run on a regular basis , it will maintain proper management based on the policy setting. do not delete snapshots out of order, always from oldest to newest.
Yes, asking for deletion of individual snapshots
not in the order they were taken produces avoidable overhead.
But holding SnapshotDelete at rest (by priority, or by policy/schedule)
should not change the order. Just the snapshot usage piles
up for that period (3 days for us, but that's fine).
>
> If you feel you are not getting adequate performance with the job engine, I recommend you open a case with support, there are possibilities for fine tuning.
Thanks, we have been through this (took us most of 2012, 'thanks'
to additional disk stall issues), and this is where we have arrived.
Some occasional disk stalls still keep MultiScan (AutoBalance/Collect)
from ever finishing properly :-((( but that is another story...
and yes, most likely another case.
Our Isilon cluster is the main NAS storage over here,
and while it has proven pretty stable from the users' view,
controlling and maintaining the mechanisms
under the hood (media/balance/locks/alerts/notifications)
takes much more work than expected.
Thanks for your supportive feedback anyway!
Peter