Install Kudu in Cloudera CDH 5.16

Reference:

https://blog.clairvoyantsoft.com/installing-apache-kudu-on-clouderas-quickstart-vm-3bdc202ce142

Apache Kudu is a relational database in the Hadoop ecosystem which provides CRUD update/delete capabilities in Impala tables. It stores data outside of hdfs in tablet files in the hadoop datanodes. It is useful for fast IOT data storage and querying as soon as data is inserted into the table unlike HDFS hive tables where the file needs to be closed before you can query causing delays. Also like datawarehouse you can update data. Nice!!

INSTALL STEPS:

  1. Kudu parcels are already installed in the CDH cluster so just do a Add Service from Clouder Manager and select Kudu service to enable.

KUDU Admin commands:

[root]# sudo -u kudu kudu cluster ksck master1hostname ,master2hostname,master3hostname

[root]# sudo -u kudu kudu cluster ksck master1hostname ,master2hostname,master3hostname -tables=impala::mydb.mytable

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Google photo

You are commenting using your Google account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s

This site uses Akismet to reduce spam. Learn how your comment data is processed.