Connect SQLalchemy to Cloudera Impala or Hive

Below code will connect to Impala with Kerberos enabled. You can also connect to Hive by changing host and port to 10000. import sqlalchemy from sqlalchemy.engine import create_engine connect_args={'auth': 'KERBEROS', 'kerberos_service_name': 'impala'} engine = create_engine('hive://impalad-host:21050', connect_args=connect_args) conn = engine.connect() ResultProxy = conn.execute("SELECT * FROM db1.table1 LIMIT 3") print(ResultProxy.fetchall())

Is Apache Impala 65 to 200 times faster than Apache Hive on Tez

EbebJust now·3 min read Ran the same query SELECT COUNT(*) FROM DB1.TABLE1 on a 35 million rows table. Each query was run 4 times without any delay between each run on both Apache Impala impala-shell and Apache Hive beeline cli. This was to make sure to avoid session creation and data caching timing issue during … Continue reading Is Apache Impala 65 to 200 times faster than Apache Hive on Tez

Use pyodbc with Cloudera Impala ODBC and Kerberos

Initially tried the python impyla package to connect to Cloudera Impala but ran into various errors and dependency issues. Also 2 of 3 queries would hang or give errors. So next tried pyodbc to connect to Impala. Linux System Requirements: The Cloudera ODBC Driver for Impala is recommended for Impala versions 2.8 through 3.3, and … Continue reading Use pyodbc with Cloudera Impala ODBC and Kerberos

Connect DBeaver SQL Tool to Cloudera Hive/Impala with Kerberos

DBeaver https://dbeaver.io/ is a a powerful free opensource SQL editor tool than can connect to 80+ different databases. The below procedures will enable DBeaver to connect to Cloudera Hive/Impala using kerberos. Initially tried to use the Cloudera JDBC connection but it kept giving kerberos error: [Cloudera]ImpalaJDBCDriver Error initialized or created transport for authentication: [Cloudera]ImpalaJDBCDriver Unable … Continue reading Connect DBeaver SQL Tool to Cloudera Hive/Impala with Kerberos

Tableau Server connect to Cloudera Hive with MIT Kerberos

We know Tableau Desktop works with MIT Kerberos on Windows to connect to Cloudera Hive/Impala. But there is some confusing information in Tableau support sites whether Tableau SERVER can work with MIT Kerberos in an Windows environment. There is a note that Kerberos delegation requires Active Directory and MIT Kerberos is not supported. But let … Continue reading Tableau Server connect to Cloudera Hive with MIT Kerberos

MicroStrategy Desktop connect to Impala

Environment: MicroStrategy Desktop 10.11 Cloudera CDH 5.12 Impala 2.x Steps to connect MicroStrategy Destop to Cloudera Impala: Best thing about MicroStrategy Desktop unlike Tableau Desktop is it is free to download and use and a powerful BI visualization/query tool. Tableau Public Desktop is free but it only has few connectors and cannot connect to Hadoop … Continue reading MicroStrategy Desktop connect to Impala