Integrate Hadoop Hue with LDAP

Authenticate Hue Users with LDAP

Environment: CDH 5.12 on RHEL, Active Directory LDAP

We will use Search Bind, as it seems to be compatible with both AD and LDAP. We will follow the steps in the manual below:
https://www.cloudera.com/documentation/enterprise/latest/topics/hue_sec_ldap_auth.html
https://www.youtube.com/watch?time_continue=12&v=pCgUxQ8CU4o

Log on to Cloudera Manager and click Hue. Click the Configuration tab and filter by scope=Service-wide and category=Security. Set the … Continue reading Integrate Hadoop Hue with LDAP
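As a rough illustration of what Search Bind means in general: the server first searches the directory for the user's entry with a filter, then binds as the entry it found. The Python sketch below only builds such a search filter. The function names, the `(objectClass=user)` base filter and the `sAMAccountName` attribute are assumptions chosen for an Active Directory setup, not values taken from this post or from Hue's source code.

```python
# Characters that must be escaped inside an LDAP filter value (RFC 4515).
_ESCAPES = {
    "\\": r"\5c",
    "*": r"\2a",
    "(": r"\28",
    ")": r"\29",
    "\x00": r"\00",
}


def escape_filter_value(value: str) -> str:
    """Escape special characters so user input cannot alter the filter."""
    return "".join(_ESCAPES.get(ch, ch) for ch in value)


def build_search_filter(
    username: str,
    user_filter: str = "(objectClass=user)",  # assumed base filter for AD
    name_attr: str = "sAMAccountName",        # assumed login attribute for AD
) -> str:
    """Combine the base filter with a match on the login attribute."""
    return f"(&{user_filter}({name_attr}={escape_filter_value(username)}))"


if __name__ == "__main__":
    print(build_search_filter("jdoe"))
```

The resulting filter would be sent in the search step; the bind step then uses the distinguished name of the matching entry together with the password the user typed.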

Cloudera Search (Solr) install steps

The following steps install Cloudera Search, which is based on Apache Solr.

Environment: Cloudera CDH 5.12.x, solr-spec 4.10.3

Deploying Cloudera Search: Cloudera Search (powered by Apache Solr) is included in CDH 5. If you have installed CDH 5.0 or higher, you do not need to perform any additional actions to install Search. … Continue reading Cloudera Search (Solr) install steps

Business Intelligence, ETL and Data Science tools

Free or open-source BI / ETL tools:
Talend = ETL tool, leader in Gartner Magic Quadrant
StreamSets = ETL tool
Apache NiFi = ETL tool
Pentaho = desktop and server version BI/ETL tool
Hue = Hadoop Analytics server, BI, Query tool
KNIME = Data Science leader in Gartner Magic Quadrant 2017, desktop version
Jupyter Notebook … Continue reading Business Intelligence, ETL and Data Science tools

Install Anaconda Python package on Cloudera CDH.

This blog will show how to install the Anaconda parcel in CDH to enable Pandas and other Python libraries in the Hue pySpark notebook. Install Steps: Installing the Anaconda Parcel 1. From the Cloudera Manager Admin Console, click the “Parcels” indicator in the top navigation bar. 2. Click the “Configuration” button on the top right of the Parcels … Continue reading Install Anaconda Python package on Cloudera CDH.

Install Hue Spark Notebook with Livy on Cloudera

This blog will show simple steps to install and configure the Hue Spark notebook to run interactive pySpark scripts using Livy. Environment used: CDH 5.12.x, Cloudera Manager, Hue 4.0, Livy 0.3.0, Spark 1.6.0 on RHEL Linux. Sentry was installed in unsecure mode. Kerberos was not used in the Hadoop cluster. Kerberos will need additional steps … Continue reading Install Hue Spark Notebook with Livy on Cloudera
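For context on what the Hue notebook does through Livy: Livy exposes a REST API where a client creates an interactive session and then posts code statements to it. The sketch below only builds those two HTTP requests with the Python standard library and does not send them. The host name is hypothetical, port 8998 is Livy's usual default, and the helper names are made up for this illustration; none of these come from the post itself.

```python
import json
from urllib.request import Request

# Hypothetical Livy endpoint; replace with the real host of the Livy server.
LIVY_URL = "http://livy-host.example.com:8998"


def _post(url: str, payload: dict) -> Request:
    """Build (but do not send) a JSON POST request."""
    return Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def create_session_request(base_url: str = LIVY_URL) -> Request:
    """Request that asks Livy to start an interactive pySpark session."""
    return _post(f"{base_url}/sessions", {"kind": "pyspark"})


def run_statement_request(session_id: int, code: str, base_url: str = LIVY_URL) -> Request:
    """Request that submits a snippet of code to an existing session."""
    return _post(f"{base_url}/sessions/{session_id}/statements", {"code": code})


if __name__ == "__main__":
    req = run_statement_request(0, "sc.parallelize(range(10)).sum()")
    print(req.get_method(), req.full_url)
    print(req.data.decode("utf-8"))
```

Passing either request to `urllib.request.urlopen` would send it to a running Livy server. On a cluster with Kerberos enabled, which the environment above does not use, additional authentication would be needed.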