Databricks | Spreadsheet

Connect your Databricks instance to Row Zero and import data straight into a spreadsheet designed for big data sets.

Connect Row Zero to Databricks

In Row Zero, click the 'connections' icon in the upper right-hand corner, then click the '+ Add connection' button.

The connection requires four pieces of information, defined below.

  1. Name - This field can be anything that helps identify the data source being connected. (e.g. "Production Data" or "Product Metrics")
  2. Server hostname - This is the hostname of your Databricks SQL warehouse, which can be found in your Databricks portal under the SQL Warehouses menu. In SQL Warehouses, click the warehouse you want to connect to. The server hostname is listed under 'Connection details' and is the long string of letters and numbers labeled "server hostname" at the top of the screen (e.g. cbd-9hfliu9u-83jd.cloud.databricks.com).
  3. Warehouse ID - The warehouse ID appears under SQL Warehouses, on the Overview tab, in the Name field (e.g. 0923idj093jd9j11).
  4. Access Token - To generate an access token, click your account icon in the upper right-hand corner of the Databricks portal, select 'Settings,' and then 'Developer.' Click the blue button to generate the token. If you do not see a blue button, contact your Databricks account admin, who will need to update your Databricks personal access token permissions.

Once all the information is entered, click 'Test connection' to confirm it is correct. If the test returns green, click '+ Add Source.' If the test returns red, check your connection credentials and try again.
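If the test returns red, a common cause is a copy-paste slip in one of the four fields. The sketch below is a hypothetical pre-check, not part of Row Zero or Databricks: the function name `check_connection_fields` is made up for illustration, and the rule that a warehouse ID contains only letters and digits is an assumption based on the example value above.

```python
import re


def check_connection_fields(name, server_hostname, warehouse_id, access_token):
    """Return a list of problems found in the four fields (empty if none)."""
    problems = []

    if not name.strip():
        problems.append("Name is empty")

    # The hostname should be a bare host, with no scheme and no path.
    host = server_hostname.strip()
    if "://" in host or "/" in host:
        problems.append("Server hostname should not include https:// or a path")
    elif not re.fullmatch(r"[A-Za-z0-9.-]+", host):
        problems.append("Server hostname contains unexpected characters")

    # Assumption: warehouse IDs are letters and digits only, as in the example.
    if not re.fullmatch(r"[A-Za-z0-9]+", warehouse_id.strip()):
        problems.append("Warehouse ID should contain only letters and digits")

    if not access_token.strip():
        problems.append("Access token is empty")

    return problems


# Example values from this page; the token is a placeholder, not a real one.
issues = check_connection_fields(
    "Production Data",
    "cbd-9hfliu9u-83jd.cloud.databricks.com",
    "0923idj093jd9j11",
    "example-token",
)
print(issues or "All four fields look well-formed")
```

A clean result only means the values are well-formed. Whether the credentials are valid is still decided by 'Test connection.'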

Write a query to import data

Now that Databricks is connected to Row Zero, write a query to pull data into the spreadsheet. First select the 'Catalog' and 'Schema' from the two drop-down menus, then write a query in the query editor. The simplest query is a 'select *' statement, which pulls in the entire table. Example:

select * from TABLENAME

Row Zero is built for large data sets and can ingest an entire table. To pull in only a subset of the data stored in Databricks, write a more specific query. All results are displayed in the 'connected data table' in the spreadsheet.
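To make the difference between a full-table query and a subset query concrete, here is a minimal sketch that runs both against a small local table. It uses Python's built-in sqlite3 module purely as a stand-in, since it needs no Databricks connection. The `orders` table and its columns are hypothetical. The two query strings use only basic SQL, so the same statements could be adapted for the Row Zero query editor with your own table and column names.

```python
import sqlite3

# Local stand-in table; in Row Zero the table would live in Databricks.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER, region TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [(1, "east", 120.0), (2, "west", 80.0), (3, "east", 45.5), (4, "north", 300.0)],
)

# Whole table, as in the 'select *' example.
full_query = "select * from orders"

# Subset: two columns only, rows filtered by region, largest amount first.
subset_query = (
    "select id, amount from orders "
    "where region = 'east' "
    "order by amount desc"
)

full_rows = conn.execute(full_query).fetchall()
subset_rows = conn.execute(subset_query).fetchall()

print(len(full_rows), "rows from the full query")
print(subset_rows)
```

Selecting specific columns and adding a `where` clause keeps the connected data table limited to the rows and fields you actually need.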
