Wednesday 22 January 2014

Connecting Tableau 8.0 to Hadoop Hive Servers


Tableau supports connections to data stored in Cloudera, Hortonworks and MapR distributions via Hive and the Cloudera, Hortonworks, or MapR Hive ODBC driver.

Pre-requisites:

For connections to Hive Server, you must have one of the following distributions:
  • Cloudera Hadoop driver
  • Hortonworks driver
  • MapR driver
  • A cluster with a Hadoop node running the Hive server.
Start Cassandra
Before Start HiveServer, Start up an analytics node using the -j option. Open Terminal and go to DSE installation root file,
 
Configure the Cassandra seeds list in cassandra.yaml. DataStax Enterprise designates the first analytics node from the seeds list as the job tracker node.

Start Hive Server
Launch new tab, the Thrift server can be started by running the following:


Install the Driver
For both Hive Server or Hive Server 2, you must install the Cloudera, Hortonworks, or MapR ODBC driver from the http://www.tableausoftware.com/support/drivers page.
·         Cloudera: Cloudera ODBC Driver for Apache Hive 2.5, 32-bit.
·         Hortonworks: Hortonworks Hive ODBC Driver 1.2.x (32-bit).
·         MapR: MapR_odbc_2.1.0_x86.exe.

Test the connection to Hadoop Hive cluster

The latest ODBC drivers for Cloudera, Hortonworks, and MapR allow you to use the driver configuration utility to test the connection to your Hadoop Hive cluster. To do this, go to the 32-bit ODBC Administrator utility, and complete the procedure below.
  1. Click the System DSN tab.
  2. Click the Add button.
  3. Select the appropriate driver for your Hadoop distribution.
  4. Complete the information in the DSN Setup dialog box.
  5. Click the Test button. Test results display with either, “TESTS COMPLETED SUCCESSFULLY” or “TEST COMPLETED WITH ERROR.”
 Connect in Tableau

Step 1

In Tableau Desktop, select the following:
For Cloudera, select Data > Connect to Data > Cloudera Hadoop.



For Hortonworks, select Data > Connect to Data > Hortonworks Hadoop Hive.


           For MapR, select Data > Connect to Data > MapR Hadoop Hive.
  

Step 2
Complete the steps in the Hadoop Hive Connection dialog box, and click OK.
Once your workbook is connected to Hive, drag and drop fields to build views as you would when connected to any other database.

0 comments:

Post a Comment