How to Connect Tableau to SnappyData
To connect Tableau to SnappyData, download and install the SnappyData Enterprise edition. You can then connect Tableau using one of the following options: the Thrift Server or the SnappyData ODBC Driver.
Connect Tableau using Thrift Server
Use the following steps to connect Tableau to SnappyData through its Thrift Server, which is compatible with Apache HiveServer2 and fully compatible with Spark's Thrift Server.
Step 1: Enable Thrift Server in SnappyData Cluster
- Download and install the SnappyData Enterprise version 126.96.36.199 from the SnappyData Release page.
- Configure the SnappyData Cluster.
- In the Lead node configuration, set the following property:
- Launch the SnappyData cluster.
Step 2: Connect Tableau Desktop to SnappyData
- Download and install Tableau Desktop v2018.3.x from the Tableau Download page. You may also need to register your product.
- Open the Tableau Desktop application. In the left panel, under the To a Server section, select the Spark SQL connector option.
In the Spark SQL configuration dialog box, enter the following details:
- Enter the host name or IP address of the Lead node in the SnappyData cluster. The default port used by the Hive Thrift server is 10000.
- Select the SparkThriftServer option from the Type dropdown.
- Select the username and password option from the Authentication dropdown.
- Set the Transport field to SASL.
- Provide a username and password. If authentication is not configured in the cluster, you can use APP/APP as the username/password.
For more information about Spark SQL configurations, click here.
Click the Sign In button to connect to SnappyData. Tableau displays the page where you can browse and select Schema and Tables as per your requirements to create data visualizations.
If the Simba Spark ODBC Driver is not already installed on your system, the Sign In button is disabled. To enable it, click the Download and Install the drivers link and install the Simba Spark ODBC Driver. The Sign In button is then enabled.
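If signing in fails, one way to rule out a network or startup problem is to check that the Lead node accepts TCP connections on the Thrift port. The following is a minimal sketch using only the Python standard library. The function name `thrift_port_reachable` is hypothetical, and the check only confirms that something is listening on the port, not that it is the SnappyData Thrift Server or that your credentials are valid.

```python
import socket


def thrift_port_reachable(host: str, port: int = 10000, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port can be opened."""
    try:
        # create_connection resolves the host name and performs the TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name-resolution failures.
        return False
```

For example, `thrift_port_reachable("<lead-node-host>", 10000)` returning `False` suggests that the Thrift Server is not enabled, the cluster is not running, or a firewall is blocking the port.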
Handling Large Size Tableau Extracts in SnappyData
If you use the Tableau extract feature and the extracted data set is large, you may need to do the following:
- Set the maximum result size allowed by SnappyData. By default, SnappyData terminates a query whose result exceeds 1 GB. To raise the limit, set the spark.driver.maxResultSize property on the Lead node and restart your cluster.
- Configure streaming of the result set to Tableau from the SnappyData Hive server. Tableau permits Initial SQL to be sent to the server when creating a data source connection, as described here.
In the Initial SQL dialog box, type the following:
Connect Tableau using SnappyData ODBC Driver
Get the latest version of SnappyData and the SnappyData ODBC driver from the SnappyData Release page.
Use the following instructions to connect Tableau using SnappyData ODBC driver:
Step 1: Set Up SnappyData ODBC Driver
Follow the instructions provided here to set up the SnappyData ODBC Driver.
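Once the driver and a DSN are in place, it can be useful to confirm that the DSN works outside Tableau. The sketch below assumes the third-party `pyodbc` package is installed and that the driver accepts the standard ODBC `UID` and `PWD` keywords; neither is stated in this guide. The helper names are hypothetical. The DSN name `snappydsn`, the `APP`/`APP` credentials, and the `app` schema are the example values used elsewhere on this page.

```python
def build_connection_string(dsn: str, user: str, password: str) -> str:
    """Build an ODBC connection string that refers to a configured DSN."""
    return f"DSN={dsn};UID={user};PWD={password}"


def list_tables(dsn: str, user: str, password: str, schema: str) -> list:
    """Connect through the DSN and return the table names in one schema."""
    import pyodbc  # third-party package (assumption); imported lazily

    conn = pyodbc.connect(build_connection_string(dsn, user, password), autocommit=True)
    try:
        cursor = conn.cursor()
        # Cursor.tables() wraps the ODBC SQLTables catalog call.
        return [row.table_name for row in cursor.tables(schema=schema)]
    finally:
        conn.close()
```

Calling `list_tables("snappydsn", "APP", "APP", "app")` should return the same tables that Tableau later lists for that schema. Whether the schema name must be upper or lower case may depend on the driver.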
Step 2: Install Tableau Desktop (10.1 or Higher)
To install Tableau Desktop:
Depending on your Windows installation, download the 32-bit or 64-bit version of the installer.
Follow the steps to complete the installation and ensure that you register and activate your product.
Step 3: Connect Tableau Desktop to SnappyData Server
When using Tableau with the SnappyData ODBC Driver for the first time, you must add the odbc-snappydata.tdc file, which is included in the downloaded snappydata-odbc-1.0.2.zip.
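Placing the .tdc file in the Datasources directory of My Tableau Repository can also be scripted, which helps when you prepare several machines. The following is a minimal sketch. The function name `install_tdc` is hypothetical, and it assumes that the Documents folder sits directly under the user's home directory, which may not hold if the folder has been redirected.

```python
import shutil
from pathlib import Path


def install_tdc(tdc_file: Path, user_home: Path) -> Path:
    """Copy a .tdc file into Tableau's Datasources directory and return the new path."""
    target_dir = user_home / "Documents" / "My Tableau Repository" / "Datasources"
    # Tableau normally creates this directory itself; create it if it is absent.
    target_dir.mkdir(parents=True, exist_ok=True)
    # copy2 preserves file metadata and returns the path of the copied file.
    return Path(shutil.copy2(tdc_file, target_dir))
```

For example, after extracting the zip file into the current directory, `install_tdc(Path("odbc-snappydata.tdc"), Path.home())` copies the file into place. Restart Tableau Desktop afterwards so that it picks up the customization.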
To connect the Tableau Desktop to the SnappyData Server:
Copy the odbc-snappydata.tdc file to the <User_Home_Path>/Documents/My Tableau Repository/Datasources directory.
Open the Tableau Desktop application.
On the Start Page,
a. Under Connect > To a Server, click Other Databases (ODBC). The Other Databases (ODBC) window is displayed.
b. In the DSN drop-down list, select the name that you provided for your SnappyData ODBC connection (for example, snappydsn), and then click Connect.
When the connection to the SnappyData server is established, the Sign In option is enabled. Click Sign In to log into Tableau.
From the Schema drop-down list, select a schema (for example, app). All tables from the selected schema are listed.
Select the required tables and drag them to the canvas. A view generated using the selected tables is displayed. If you make changes to a table, click Update Now to refresh and view your changes.
In the Worksheets tab, click the sheet to start the analysis.
On this screen, you can click and drag a field from the Dimensions area to Rows or Columns. Refer to the Tableau documentation for more information on data visualization.