BETADatabricks is a data science and analytics platform built on top of Apache Spark. Databricks implement the Data Lakehouse concept in a single unified, cloud based platform.
The Databricks source lets you sync data from your Databricks clusters via Hightouch.Hightouch connects to Databricks using JDBC. This guide will walk through getting your JDBC URL for your Databricks cluster, and connecting it with Hightouch.
Hightouch will always connect to your warehouse from 188.8.131.52 or 184.108.40.206. You may whitelist this IP address in your VPC security groups.
- Navigate to settings page for your Databricks cluster.
- Expand the Advanced Options toggle and click on JDBC/ODBC
- Keep this page open. You'll need the Server Hostname, Port, and HTTP Path.
- Following the Databricks documentation on username/password authentication, get your Personal Access Token.
- Create a new source
- Select Databricks as the source type.
- Paste in the connection details you collected earlier.
- Click Test, then click Save.