Hi Zack,
do you have HSM in mind in the classical sense that files are being migrated
between the tiers on access and released on the upper tier when a certain
threshold is crossed? Or do you maybe rather have something like SSD caching in
mind, which could be done with software like lvmcache, bcache & friends on the
servers or RAID-controller based with something like MegaRAID CacheCade?
Typically, the people that we are aware of used either a caching technology to
improve generic use of the system. And when performance really mattered and
special applications should complete much faster, they typically rather went
with a separate BeeGFS instance, which was completely based on SSDs/NVMe.
What we are already working on and planning to have ready later this year is
support for different types of storage targets, so that you can define arbitrary
groups of similar storage targets within the same BeeGFS instance, e.g. like a
group of "SSD targets", a group of "normal HDD targets" and maybe "SMR disk
targets". We call this feature "Storage Pools".
Based on that, you will be able to define (e.g. on a per-directory basis) which
pool should be used when a new file is created inside a certain path. And there
will be a tool to migrate files from one pool to another - but that's manual,
not automatic, because it might not make sense to always copy a file from the
SMR disks to a higher layer just because it is read once; and in the other
direction, you might not want to start copying (temporary) data from the SSDs to
other pools just because the pool crosses a certain free space threshold.
Since individual quota limits can be defined on a per-pool basis then, there is
also a way to motivate system users to care about which data they have on which
pool.
Best regards,
Sven Breuner
ThinkParQ
zack....@sbcglobal.net wrote on 05.09.2016 05:08:
> We need to introduce a faster tier (formed with SSDs, obviously) into our
> storage infrastructure. I am aware of the fact that for Lustre (which is also
> widely used here), the HSM is available since Lustre 2.5
> <
http://www.seagate.com/files/www-content/solutions-content/cloud-systems-and-solutions/high-performance-computing/_shared/docs/clusterstor-inside-lustre-hsm-ti.pdf>.