Tableau supports connections to data stored in Cloudera, Hortonworks, and MapR distributions via Hive and the Cloudera, Hortonworks, or MapR Hive ODBC driver.
Prerequisites:
To connect to Hive Server, you must have the ODBC driver for one of the following distributions:
- Cloudera Hadoop driver
- Hortonworks driver
- MapR driver
You also need a cluster with a Hadoop node running the Hive server.
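The last prerequisite, a reachable Hive server, can be checked from the client machine before any driver is installed. The following is a minimal sketch that uses only the Python standard library. The function name is_port_open is made up for this example, the host name in the comment is a placeholder, and port 10000 is Hive's default Thrift port, which your cluster may have changed.

```python
import socket


def is_port_open(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        # create_connection resolves the host name and opens a TCP socket.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name resolution failures.
        return False


# Example call (placeholder host; 10000 is assumed to be the Thrift port):
# is_port_open("hive-node.example.com", 10000)
```

A True result only shows that something is listening on that port. It does not confirm that the listener is the Hive server or that the ODBC driver will accept it.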
Start Cassandra
Before you start HiveServer, start an analytics node using the -j option. Open a terminal and go to the DSE installation root directory. Configure the Cassandra seeds list in cassandra.yaml. DataStax Enterprise designates the first analytics node from the seeds list as the job tracker node.
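The selection rule in the last sentence can be illustrated with a short sketch. This is not DataStax code. It only restates the rule as described here, assuming the seeds are given as a comma-separated string (the form used in cassandra.yaml) and that you already know which addresses are analytics nodes. The function name and all addresses are hypothetical.

```python
def first_analytics_seed(seeds, analytics_nodes):
    """Return the first address in the comma-separated seeds string that is
    also an analytics node, or None if no seed is an analytics node."""
    for address in (s.strip() for s in seeds.split(",")):
        if address and address in analytics_nodes:
            return address
    return None


# Hypothetical addresses, for illustration only.
seeds = "10.0.0.1, 10.0.0.2, 10.0.0.3"
analytics = {"10.0.0.2", "10.0.0.3"}
print(first_analytics_seed(seeds, analytics))  # prints 10.0.0.2
```

In this example 10.0.0.1 is listed first but is not an analytics node, so the rule skips it. The order of the seeds list therefore matters when more than one analytics node is a seed.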
Start Hive Server
Open a new terminal tab. The Thrift server can be started by running the following:
Install the Driver
- Cloudera: Cloudera ODBC Driver for Apache Hive 2.5, 32-bit.
- Hortonworks: Hortonworks Hive ODBC Driver 1.2.x (32-bit).
- MapR: MapR_odbc_2.1.0_x86.exe.
Test the connection to the Hadoop Hive cluster
The latest ODBC drivers for Cloudera, Hortonworks, and MapR allow you to use the driver configuration utility to test the connection to your Hadoop Hive cluster. To do this, open the 32-bit ODBC Administrator utility and complete the procedure below.
- Click the System DSN tab.
- Click the Add button.
- Select the appropriate driver for your Hadoop distribution.
- Complete the information in the DSN Setup dialog box.
- Click the Test button. The test results display either “TESTS COMPLETED SUCCESSFULLY” or “TEST COMPLETED WITH ERROR.”
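Once the DSN passes the test in the ODBC Administrator, the same DSN can also be exercised from a script. The following is a hedged sketch rather than part of the Tableau procedure. It assumes the third-party pyodbc package is installed, that the Python interpreter is 32-bit (a DSN created in the 32-bit ODBC Administrator is only visible to 32-bit applications), and that the DSN name you pass in is the one you entered in the DSN Setup dialog box. The function names are made up for this example.

```python
def build_dsn_connection_string(dsn, user=None, password=None):
    """Build an ODBC connection string that refers to an existing DSN."""
    parts = ["DSN=" + dsn]
    if user is not None:
        parts.append("UID=" + user)
    if password is not None:
        parts.append("PWD=" + password)
    return ";".join(parts)


def list_tables_via_dsn(dsn, user=None, password=None):
    """Connect through the DSN and return the table names Hive reports."""
    import pyodbc  # third-party; assumed installed and matching the driver's bitness

    # autocommit=True avoids transaction calls that Hive drivers may not support.
    conn = pyodbc.connect(
        build_dsn_connection_string(dsn, user, password), autocommit=True
    )
    try:
        cursor = conn.cursor()
        cursor.execute("SHOW TABLES")  # a lightweight HiveQL statement
        return [row[0] for row in cursor.fetchall()]
    finally:
        conn.close()
```

If list_tables_via_dsn raises a pyodbc error while the Test button in the ODBC Administrator succeeds, a mismatch between a 64-bit interpreter and the 32-bit driver is one likely cause to rule out first.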
Connect in Tableau
Step 1
In Tableau Desktop, select one of the following:
- For Cloudera, select Data > Connect to Data > Cloudera Hadoop.
- For Hortonworks, select Data > Connect to Data > Hortonworks Hadoop Hive.
- For MapR, select Data > Connect to Data > MapR Hadoop Hive.
Step 2
Complete the steps in the Hadoop Hive Connection dialog box, and click OK.
Once your workbook is connected to Hive, drag and drop fields to build views as you would when connected to any other database.