Reverse ETL with Databricks – Etlworks Support

Reverse ETL is the pattern of reading data out of Databricks and writing it into another destination — a SaaS app (Salesforce, HubSpot, …), another database, a file, or a queue.

Build a Reverse ETL flow

Treat the Databricks connection as a regular database source. Etlworks executes a JDBC SELECT (or a SQL query you provide) against the Databricks compute resource, optionally transforms the result, and writes it to the chosen destination.

Create a Databricks connection.
Create the destination connection (SaaS app, database, file storage, queue, …).
In Flows, create the right flow type for the destination — for example, Database to Database if writing to another RDBMS, or Database to file for an export.
Add a transformation. Set FROM to the Databricks connection and the source table or query. Set TO to the destination connection and object.
Optional: enter a Source SQL on the transformation to control the exact query Databricks runs (joins, filters, aggregations, time-bounded windows).
Optional: enable an incremental read with change replication using High Watermark based on a Delta column such as __processed_at or a monotonic sequence.
Save and schedule the flow.

Note: Etlworks reads from Databricks via JDBC. Delta Sharing and Databricks SQL's predictive features are not used by the reverse-ETL path today — the read is a standard SELECT executed by the Databricks compute resource.

Articles in this section

Build a Reverse ETL flow

Related articles