Connect Hadoop to ElasticSearch using Talend

(BLOG IN PROGRESS - INCOMPLETE) This blog will show how to update an ElasticSearch index with data from HDFS file using the Talend Open Studio for Big Data ETL tool. First create a new job in Talend Studio such as HDFStoESindexjob. Drag the following components into the Design area: tHDFSconnection_1----onsubok----> tHDFSinput_1-----row1(Main)--> tWriteJSONField_2-----row2(Main)--->tRESTClient_1 3.   Talend … Continue reading Connect Hadoop to ElasticSearch using Talend

Use Talend Open Studio for Big Data to ETL to Hadoop

Talend Open Studio for Big Data is a powerful ETL tool which is also open source. You can download and use it to do ETL to and from Hadoop including both HDFS and Hive. Talend Install steps Downloaded the free Talend Open Studio for Big Data from https://www.talend.com/products/big-data/big-data-open-studio/ The download file location is set to c:\temp … Continue reading Use Talend Open Studio for Big Data to ETL to Hadoop