Page 1 of 1

Datastage JDBC Connector Ingest Data to Apache Kudu

Posted: Sun Jun 16, 2019 10:23 am
by olgc
Hello every one, does any one ingest data to Hadoop platform (Hive, Impala) or Apache Kudu? How do you resolve the performance issue? We use Datastage JDBC connector with Cloudera Impala JDBC driver, it works well for extraction, but not loading - the insert performance is terrible: 20,000 records takes almost 8 minutes. Has any one there trying ingesting data to Hadoop platform/Apache Kudu? What's your experience?

Thanks,

Posted: Mon Jun 17, 2019 9:13 am
by eostic
Hi. I don't have any experience with Kudo, but in past threads here and elsewhere over the years, I've heard many people talk about loading to their hive and hive related tables using direct writes to the hdfs files that they are abstracting....much faster.....usually using the File Connector.

Ernie

Posted: Thu Jun 20, 2019 2:20 pm
by olgc
Thanks, eostic. Yes, we try use file connector uploading result file to hdfs file system, then through there to Kudu. It'w a work around, not look that good.