I'm trying to add a Hive database to Superset that's running on a Kerberized cluster. I'm able to connect from
Airbnb's Airflow and Tableau using the following settings:
Airflow:
Conn type: Hive Server 2 Thrift
port: 10500
schema: [empty]
Extras:
{
"proxy_user": "login",
"use_beeline": true,
}
Tableau:
... so I know it's possible to connect, authenticated via Kerberos, to Hive.
Like Airflow, Superset's database connection form also has an 'Extras' field. The JSON format to enter in this field is slightly different: Superset asks for separate metadata and engine parameters, whereas Airflow accepts flat JSON containing key/values. It's therefore not possible to simply cut/paste the 'Extras' JSON from Airflow to Superset.
Looking through the Superset config.py, I didn't see a section for Kerberos. Does anyone have any advice for setting this up (perhaps an example of the '
metadata_params' and '
engine_params' JSON, values in superset_config.py, keytab creation steps for
FreeIPA, etc...)?
Cheers,
Alex Woolford