Hi Robert,
I am working at Medallia.
Thanks for the information! I do know that Airflow as a whole can run in a virtual environment. But I am looking to run each component of the TFX DAG in its own virtual environment. The process would look like this:
1. Take the requirements.txt
2. Create the environment
3. Run the component
4. Destroy the environment
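To make the lifecycle I have in mind concrete, here is a rough sketch using only the Python standard library. It is not TFX or Airflow code; `run_in_temp_venv` and its parameters are hypothetical names I made up for illustration, and the "component" is just a Python snippet passed as a string.

```python
import shutil
import subprocess
import sys
import tempfile
import venv
from pathlib import Path


def run_in_temp_venv(script, requirements=None):
    """Create a throwaway virtualenv, run `script` in it, then delete it.

    `script` is Python source passed to the environment's interpreter via -c.
    `requirements` is an optional path to a requirements.txt (hypothetical input).
    Returns whatever the script printed to stdout.
    """
    env_dir = Path(tempfile.mkdtemp(prefix="component_venv_"))
    try:
        # Create the environment; pip is only bootstrapped if we need to install.
        venv.create(env_dir, with_pip=requirements is not None)
        windows = sys.platform == "win32"
        python = env_dir / ("Scripts" if windows else "bin") / (
            "python.exe" if windows else "python"
        )

        # Install the requirements into the fresh environment, if any were given.
        if requirements is not None:
            subprocess.run(
                [str(python), "-m", "pip", "install", "-r", str(requirements)],
                check=True,
            )

        # Run the component inside the environment.
        result = subprocess.run(
            [str(python), "-c", script],
            check=True,
            capture_output=True,
            text=True,
        )
        return result.stdout
    finally:
        # Destroy the environment, even if a step above failed.
        shutil.rmtree(env_dir, ignore_errors=True)
```

For example, `run_in_temp_venv("print('hello')")` runs the snippet in a fresh environment and removes the environment afterwards. In my case the snippet would be replaced by whatever executes a single TFX component.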
We cannot install TFX directly into our existing Airflow environment, because its dependencies would conflict with other processes that are already in place.
Correct me if I am wrong, but looking at the code base, I don't see a way to pass virtual environment parameters to the AirflowDagRunner. The Airflow components are also child classes of PythonOperator. Would I need to change them so that they inherit from Airflow's PythonVirtualenvOperator instead?
Is there a way to get the components to run in a virtual environment without changing the TFX source code?
Or would you suggest building a separate Airflow cluster for TFX (which may not be feasible for us)?
Do you have any suggestions regarding best practices for using TFX with Airflow?
Thanks a lot for your guidance!