Prerequisites

Before you can configure and use Databricks with Connect AI, you must first connect a data source to your Connect AI account. See Sources for more information. You must also generate a Personal Access Token (PAT) on the Settings page. Copy the token and store it securely; it acts as your password during authentication.

Connect to Connect AI

To establish a connection from Databricks to Connect AI, follow these steps.
1. Download and install the Connect AI JDBC driver:
   1. Open the Integrations page of Connect AI.
   2. Search for JDBC or Databricks.
   3. Click Download and select your operating system.
   4. When the download is complete, run the setup file.
   5. When the installation is complete, the JAR file can be found in the installation directory.
2. Log in to Databricks.
3. In the navigation pane, select Compute. Start an existing compute resource or create a new one.
4. Once the compute is running, click it and then select the Libraries tab.
5. Click Install new. The Install library dialog appears.
6. Select DBFS, then drag and drop the JDBC JAR file (cdata.jdbc.connect.jar) into the indicated area. Click Install.
7. You must now run three notebook scripts, one at a time.
8. The first script is below. Make the following changes:
  • Update User with your Connect AI username.
  • Update Password with the PAT you generated in the prerequisites.
  • Update Your_Connection_Name with the name of the data source you created in the prerequisites.

# Driver class and JDBC connection URL for Connect AI
driver = "cdata.jdbc.connect.ConnectDriver"
url = "jdbc:connect:AuthScheme=Basic;User=user@cdata.com;Password=***********;URL=https://cloud.cdata.com/api/;DefaultCatalog=Your_Connection_Name;"
9. Run the first script.
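Optionally, instead of hardcoding the PAT in the notebook, you can store it in a Databricks secret scope and read it at runtime. A minimal sketch, assuming a secret scope named connect-ai with a key named pat already exists (both names are hypothetical examples):

# Optional: read the PAT from a Databricks secret scope instead of hardcoding it
# (assumes a scope "connect-ai" with key "pat" has already been created).
pat = dbutils.secrets.get(scope="connect-ai", key="pat")
url = f"jdbc:connect:AuthScheme=Basic;User=user@cdata.com;Password={pat};URL=https://cloud.cdata.com/api/;DefaultCatalog=Your_Connection_Name;"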
10. From the menu on the right side, select Add cell below to add a second script. The second script is below. Make the following changes:
  • Update User with your Connect AI username.
  • Update Password with the PAT you generated in the prerequisites.
  • Update Your_Connection_Name with the name of the data source you created in the prerequisites.
  • Update YOUR_SCHEMA.YOUR_TABLE with your schema and table, for example, PUBLIC.CUSTOMERS.

# Read the remote table into a Spark DataFrame over JDBC
remote_table = spark.read.format("jdbc") \
    .option("driver", "cdata.jdbc.connect.ConnectDriver") \
    .option("url", "jdbc:connect:AuthScheme=Basic;User=user@cdata.com;Password=*******;URL=https://cloud.cdata.com/api/;DefaultCatalog=Your_Connection_Name;") \
    .option("dbtable", "YOUR_SCHEMA.YOUR_TABLE") \
    .load()
11. Run the second script.
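If you only need a subset of the data, Spark's JDBC reader also accepts a query option in place of dbtable, which pushes the SQL down to the source instead of loading the whole table. A minimal sketch, reusing the url variable from the first script; the table and filter below are hypothetical examples:

# Optional: push a query down to the source instead of reading the full table.
# "query" replaces the "dbtable" option; adjust the SQL to your schema.
filtered = spark.read.format("jdbc") \
    .option("driver", "cdata.jdbc.connect.ConnectDriver") \
    .option("url", url) \
    .option("query", "SELECT Id, Name FROM PUBLIC.CUSTOMERS WHERE Country = 'US'") \
    .load()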
12. Add a cell for the third script. The third script is below. Replace ColumnName1 and ColumnName2 with the columns you want to display.

display(remote_table.select("ColumnName1", "ColumnName2"))
13. Run the third script.
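You can also register the DataFrame as a temporary view and explore it with Spark SQL. A minimal sketch; the view name is an arbitrary example:

# Optional: expose the DataFrame to Spark SQL for ad hoc queries.
remote_table.createOrReplaceTempView("connect_ai_table")
display(spark.sql("SELECT COUNT(*) AS row_count FROM connect_ai_table"))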
14. You can now preview your data in Databricks.