Dear Team,
I would like to setup Irods v4.3.0 for managing the data residing across our infrastructure like HPC, Virtual machines, Servers, Storage box etc..
Let me explain my requirements below, please let me know if this can be doable using Irods.
We dealt with massive data let's say 4 Petabytes of data which sits across in HPC ( scratch and archive ) , Physical servers, Netapp Storage drive etc only in our Onpremise. We don't have data over the cloud.
Here I need a centralized setup where I can take the control of all the data. This means I will have a dedicated host where the iRods server running. From here i can communicate to other hosts datasets.
Requirements
===========
a) Data Connection:- Able to connect to HPC/servers to get the data. Here how can I establish the communication between the iRods server to HPC/Servers?
b) Data Movement:- I have the ability from iRods to modify or update the data.
c) Data Tagging: I can tag the data as per the host to identify from where the data comes in.
d) Data Classification: I can seggregate the data as per the data types. Let's say for example based on file extension like bams, vcf , tgz etc..
As of now, I started setup with iRoDS 4.3.0 server running on Ubuntu 20.04 and tested with local data and integrated with metalnx as the Web UI.
Setup questions:-
=============
a) Do you have the Docker setup for irods-server for v4.3.0? I can see the latest docker version for iRods as v4.0.3. In future, i would like to move to kubernetes setup. Please advise the pathway/pointer how to proceed.
b) Once the irods-server is up and running. How can i import the storage from other nodes. Do i need to install any agent on the remote server? or i can pull the information using SSH/NFS/SMB protocol ? Please advise.
c) For the UI part, I'm using the metalnx docker to visualize the data. Do you support any other software for UI? How can I customize the UI?
d) Do you provide API? My plan is to communicate to iRODS via API to fetch the data information from a custom developed inhouse software.
Kindly advise for my above questions and concerns with irods.
Thanks
Jay