Nat TaylorBlog, Product Management & Tinkering

Superset on Databricks

Published on . Updated on

We have data on S3 and SQL tables on it in Databricks, so I wanted to connect Superset for visualizing the data. Thanks to the databricks-dbapi project, it turns out to be as simple as pip install databricks-dbapi then pip install databricks-dbapi[sqlalchemy] and configuring a new Superset > Source > Database > SQLAlchemy URI to foo databricks+pyhive://token:<token>@<companyname><database>?cluster=<cluster_id>

Just keep in mind that:

Popular Posts

Post Navigation