Page 1 of 1

Hive Connector Vs ODBC Connector

Posted: Mon Apr 10, 2017 10:43 pm
by TNZL_BI
Hi All ,

I have recently developed a job to connect to the Hive database in the Hadoop Ecosystem. Now I have used two methods to connect to the Hive database which are :-

1. ODBC Connector
2. Hive Connector

However , I am facing massive performance issues with the hive connector stage . Its taking hours to simply load some 80k rows where as when I use the ODBC connector stage , the performance is very good. We see this getting loaded in around 5 minutes time.

Does any one have an idea on this. Ideally the native connector stage should be faster and should have more options but in my case , the performance is really bad ...

Any inputs here will be very helpful.

Posted: Sun Apr 30, 2017 6:00 pm
by TNZL_BI
I have just got some patches to be installed on my services / engine tier as suggested by IBM . This may improve the speed. Will do that and then revert back with my findings

Hive Connector Vs ODBC Connector

Posted: Mon Mar 05, 2018 5:45 am
by AnnDSX
Hello,

Did you install the patches and see performance enhancement

Thanks

Posted: Mon Mar 05, 2018 4:16 pm
by rkashyap
Hive connector leverages JDBC connectivity.

We are using both ODBC Connector and Hive Connector for connect with Hive and have not seen much difference between the performance of the two.

Posted: Mon Mar 05, 2018 10:58 pm
by AnnDSX
We are using the FileConnector for moving the files to HDFS and the performance is fair. However the performance of Hive connector is dismal.

The best that we could achieve was writing 1000 records in 20 minutes.