Nat Taylor: Blog, AI, Product Management & Tinkering

Superset on Databricks


We have data on S3, with SQL tables defined over it in Databricks, so I wanted to connect Superset to visualize that data. Thanks to the databricks-dbapi project, it turns out to be as simple as running pip install databricks-dbapi, then pip install databricks-dbapi[sqlalchemy], and then creating a new database in Superset (Source > Database) with the SQLAlchemy URI set to:

databricks+pyhive://token:<token>@<companyname>.cloud.databricks.com:443/<database>?cluster=<cluster_id>
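If you would rather assemble that URI in code than by hand, here is a minimal Python sketch. The helper name build_databricks_uri and all of the example values (token, company name, database, cluster ID) are hypothetical placeholders of mine, not anything from databricks-dbapi or Superset. The only part taken from the setup above is the shape of the URI. The token and cluster ID are percent-encoded in case they contain characters that are special in a URL.

```python
from urllib.parse import quote


def build_databricks_uri(token, company, database, cluster_id, port=443):
    """Assemble a SQLAlchemy URI in the shape described in the post.

    Hypothetical helper for illustration only; it just formats a string.
    """
    host = f"{company}.cloud.databricks.com"
    return (
        f"databricks+pyhive://token:{quote(token, safe='')}"
        f"@{host}:{port}/{database}"
        f"?cluster={quote(cluster_id, safe='')}"
    )


# Placeholder values; substitute your own token, workspace, and cluster.
uri = build_databricks_uri(
    token="dapi123",
    company="acme",
    database="default",
    cluster_id="0123-456789-abc123",
)
print(uri)
```

The resulting string is what goes into the SQLAlchemy URI field in Superset. Building it in a function is mostly useful for keeping the token out of shell history and notes, for example by reading it from an environment variable before calling the helper.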

Just keep in mind that:
