Connect DBeaver SQL Tool to Cloudera Hive/Impala with Kerberos

DBeaver https://dbeaver.io/ is a a powerful free opensource SQL editor tool than can connect to 80+ different databases. The below procedures will enable DBeaver to connect to Cloudera Hive/Impala using kerberos. Initially tried to use the Cloudera JDBC connection but it kept giving kerberos error: [Cloudera]ImpalaJDBCDriver Error initialized or created transport for authentication: [Cloudera]ImpalaJDBCDriver Unable … Continue reading Connect DBeaver SQL Tool to Cloudera Hive/Impala with Kerberos

Use Beeline to query Hive table

Example how to query with beeline: Login with your userid on the linux server: [userxyz]$ beeline beeline> !connect jdbc:hive2://hive-server-hostname:10000/default;principal=hive/_HOST@XYZREALM.COM Error: Could not open client transport with JDBC Uri: : GSS initiate failed (state=08S01,code=0) This error is due to kinit not done. So do $ kinit userxyz beeline> !connect jdbc:hive2://hive-server-hostname:10000/default;principal=hive/_HOST@XYZREALM.COM Connected to: Apache Hive (version 1.1.0-cdh5.16.1) … Continue reading Use Beeline to query Hive table

Transfer parquet Hive table from one Hadoop cluster to another

EXAMPLE: HOW TO TRANSFER PARQUET HIVE TABLE FROM ONE CLUSTER TO ANOTHER CLUSTER First create a new table as CTAS to combine multiple hive table parquet files to a single parquet file for ease of transfer from one cluster to another. In Source cluster create a new table: CREATE TABLE default.mynewtable stored as PARQUET AS … Continue reading Transfer parquet Hive table from one Hadoop cluster to another

Connect Excel to Cloudera Hive/Impala

Below procedure will help you connect Microsoft Excel to Cloudera Impala or Hive using ODBC driver. First download and install the MIT Kerberos Client for windows from Kerberos for Windows Release 4.1 - current release Make sure you get the Kerberos userid/password from the Cloudera Administrator and your are able to login and get a … Continue reading Connect Excel to Cloudera Hive/Impala

Tableau Server connect to Cloudera Hive with MIT Kerberos

We know Tableau Desktop works with MIT Kerberos on Windows to connect to Cloudera Hive/Impala. But there is some confusing information in Tableau support sites whether Tableau SERVER can work with MIT Kerberos in an Windows environment. There is a note that Kerberos delegation requires Active Directory and MIT Kerberos is not supported. But let … Continue reading Tableau Server connect to Cloudera Hive with MIT Kerberos

Connect Microsoft Power BI desktop to Cloudera Impala or Hive with Kerberos

Microsoft Power BI desktop is free and is able to successfully connect to a Cloudera Impala or hive database with Kerberos security enabled. The below blog only shows Impala driver but you can use same procedure with Hive driver also. The basic steps are: Install the MIT Kerberos client for Windows and make sure you … Continue reading Connect Microsoft Power BI desktop to Cloudera Impala or Hive with Kerberos

Cloudera Hadoop Data Encryption at rest Notes

In Cloudera Hadoop there are few components that are used to implemented Data Encryption at rest: The Key Management Server (KMS) uses the Key Trustee Server as the enderlying keystore instead of the file-based Java KeyStore(JKS) used by the default Hadoop KMS. Cloudera Navigator Key Trustee Server is the actual keystore for the encryption keys … Continue reading Cloudera Hadoop Data Encryption at rest Notes

Tableau Desktop connect to Cloudera Hadoop using Kerberos

Reference: http://website4everything.blogspot.com/2015/04/connecting-tableau-to-hive-server-2.html The basic steps to connect Tableau to Cloudera Hive or Impala with Kerberos authentication involves the following steps: Download and Install the MIT Kerberos Client for WindowSet the C:\ProgramData\MIT\Kerberos5\krb5.ini with  the Kerberos realm and server details(Optional) KRB5CCNAME system environment variable may need to be set at times to a temporary value: FILE:C:\temp\kerberos\krb5cacheStart the MIT … Continue reading Tableau Desktop connect to Cloudera Hadoop using Kerberos

Access webhdfs using Kerberos from laptop client

The following blog shows how to access a kerberized hadoop cluster from a Chrome browser in laptop. https://community.hortonworks.com/articles/28537/user-authentication-from-windows-workstation-to-hd.html This will work mostly except change the below: 3. network.negotiate-auth.gsslib = C:\Program Files\MIT\Kerberos\bin\gssapi64.dll instead of the gssapi32.dll  since we mostly use 64-bit Firefox which doesnt work with the 32bit dll.    

Run a Python program to access Hadoop webhdfs and Hive with Kerberos enabled

Following python code makes REST calls to a secure Kerberos enabled Hadoop cluster to use webhdfs REST api to get file data: You need to first run $ knit userid@REALM to authenticate and initiate the Kerberos ticket for the user.Make sure the python modules requests and requests_kerberos have been installed. Otherwise install it for example: … Continue reading Run a Python program to access Hadoop webhdfs and Hive with Kerberos enabled