Airflow ->Greenplum

14 views
Skip to first unread message

Erlan Tayshabarov

unread,
Apr 1, 2024, 7:08:35 AMApr 1
to cloud-composer-discuss

In the old DAG code, there are many database queries similar to:
df1 = pd.read_sql('''                                
    select d.id, d.is_group, t.table_name, d.is_parallel
    from map_table d
    join etl_user.dwh_all_table t on t.table_id = d.target_table_id
    where d.is_complex = 1 and d.is_active = 1 and task_type_id = 15;
''')
Each of these queries creates many transactions, resulting in performance issues. For those who have similar experience, how did you optimize them? Please share your experience.
Reply all
Reply to author
Forward
0 new messages