Connect DBeaver SQL Tool to Cloudera Hive/Impala with Kerberos

DBeaver https://dbeaver.io/ is a a powerful free opensource SQL editor tool than can connect to 80+ different databases. The below procedures will enable DBeaver to connect to Cloudera Hive/Impala using kerberos. Initially tried to use the Cloudera JDBC connection but it kept giving kerberos error: [Cloudera]ImpalaJDBCDriver Error initialized or created transport for authentication: [Cloudera]ImpalaJDBCDriver Unable … Continue reading Connect DBeaver SQL Tool to Cloudera Hive/Impala with Kerberos

Use Beeline to query Hive table

Example how to query with beeline: Login with your userid on the linux server: [userxyz]$ beeline beeline> !connect jdbc:hive2://hive-server-hostname:10000/default;principal=hive/_HOST@XYZREALM.COM Error: Could not open client transport with JDBC Uri: : GSS initiate failed (state=08S01,code=0) This error is due to kinit not done. So do $ kinit userxyz beeline> !connect jdbc:hive2://hive-server-hostname:10000/default;principal=hive/_HOST@XYZREALM.COM Connected to: Apache Hive (version 1.1.0-cdh5.16.1) … Continue reading Use Beeline to query Hive table

Transfer parquet Hive table from one Hadoop cluster to another

EXAMPLE: HOW TO TRANSFER PARQUET HIVE TABLE FROM ONE CLUSTER TO ANOTHER CLUSTER First create a new table as CTAS to combine multiple hive table parquet files to a single parquet file for ease of transfer from one cluster to another. In Source cluster create a new table: CREATE TABLE default.mynewtable stored as PARQUET AS … Continue reading Transfer parquet Hive table from one Hadoop cluster to another

Connect Excel to Cloudera Hive/Impala

Below procedure will help you connect Microsoft Excel to Cloudera Impala or Hive using ODBC driver. First download and install the MIT Kerberos Client for windows from Kerberos for Windows Release 4.1 - current release Make sure you get the Kerberos userid/password from the Cloudera Administrator and your are able to login and get a … Continue reading Connect Excel to Cloudera Hive/Impala

Tableau Server connect to Cloudera Hive with MIT Kerberos

We know Tableau Desktop works with MIT Kerberos on Windows to connect to Cloudera Hive/Impala. But there is some confusing information in Tableau support sites whether Tableau SERVER can work with MIT Kerberos in an Windows environment. There is a note that Kerberos delegation requires Active Directory and MIT Kerberos is not supported. But let … Continue reading Tableau Server connect to Cloudera Hive with MIT Kerberos

Run any ad-hoc SQL query in Power BI desktop

It is not documented clearly how to run any arbitrary SQL query in Power BI desktop. It is definitely possible to easily run any SQL query as below: First click on Edit Queries in the top ribbon and then go to Advanced Editor and type in the SQL query as given in the picture below. … Continue reading Run any ad-hoc SQL query in Power BI desktop

Connect Microsoft Power BI desktop to Cloudera Impala or Hive with Kerberos

Microsoft Power BI desktop is free and is able to successfully connect to a Cloudera Impala or hive database with Kerberos security enabled. The below blog only shows Impala driver but you can use same procedure with Hive driver also. The basic steps are: Install the MIT Kerberos client for Windows and make sure you … Continue reading Connect Microsoft Power BI desktop to Cloudera Impala or Hive with Kerberos

Tableau Desktop connect to Cloudera Hadoop using Kerberos

Reference: http://website4everything.blogspot.com/2015/04/connecting-tableau-to-hive-server-2.html The basic steps to connect Tableau to Cloudera Hive or Impala with Kerberos authentication involves the following steps: Download and Install the MIT Kerberos Client for WindowSet the C:\ProgramData\MIT\Kerberos5\krb5.ini with  the Kerberos realm and server details(Optional) KRB5CCNAME system environment variable may need to be set at times to a temporary value: FILE:C:\temp\kerberos\krb5cacheStart the MIT … Continue reading Tableau Desktop connect to Cloudera Hadoop using Kerberos

Upgrade MySQL 5.6 to 5.7 on Centos 7

Reference: http://linuxresolved.com/upgrade-mysql-5-6-mysql-5-7-centos/ 1. # service mysql stop 2. Important backup your databases before upgrade. Create a backup of the original MySQL data. $ cp /var/lib/mysql /var/lib/mysql.original 2. Download the MySQL 5.7 RPM $ wget http://repo.mysql.com/mysql57-community-release-el7.rpm -P /tmp/ 3. Remove the MySQL-Community RPM that contains MySQL 5.6 $ yum remove mysql-community-release 4. Install the MySQL 5.7 RPM … Continue reading Upgrade MySQL 5.6 to 5.7 on Centos 7

PostgreSQL notes

In Ubuntu, the server is run as a service called postgresql (configured in /etc/init.d/postgresql). The postgresql service is started automatically upon startup. Like all other services, you could: $ sudo service postgresql stop // Stop the service $ sudo service postgresql start // Start the service $ sudo service postgresql restart // Stop and restart the service $ sudo service postgresql reload … Continue reading PostgreSQL notes