Fetch XML CLOB data through Oracle Connector faster

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
apoorvadwi
Participant
Posts: 4
Joined: Tue Feb 16, 2016 4:58 am

Fetch XML CLOB data through Oracle Connector faster

Post by apoorvadwi »

Hi ,

I am trying to read 4 columns from Oracle database using a SELECT query in Oracle connector. One of the columns has CLOB data type in the Oracle database. The query has a WHERE clause to fetch data for a single day.

If I run the query in SQL Developer it finishes in 90 sec for 70,000 records. While in Connector stage it takes 1 hour to read 70,000 records.

Could someone please tell me how I can fetch this data faster? If I remove the XML column the job finishes in 1 minute. If I add the XML column in the select query then the job runs for 1 hour .

Please someone help me asap. Thanks in advance.
appy
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

You could try:
  • putting the CLOB column last in the SELECT clause

    casting the CLOB column as LONG VARCHAR2

    consulting with your Oracle DBA
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
apoorvadwi
Participant
Posts: 4
Joined: Tue Feb 16, 2016 4:58 am

Post by apoorvadwi »

Thanks for your reply. But my select query already had this CLOB column in last position. And I have kept the datatype as LongVarChar.

Do you have any other suggestions as to what values to set for array size, record count, buffer size etc?

Or please let me know if I could use some other stage to perform this functionality. Please help me with any suggestions you have.

Thanks a lot.
appy
eostic
Premium Member
Premium Member
Posts: 3838
Joined: Mon Oct 17, 2005 9:34 am

Post by eostic »

Just curious...but what are you doing to make your comparisons? Are you processing all of the xml clob columns in both tools? What is your Job doing downstream from the Oracle Connector? What is the other tool doing with the XML that it retrieves?

Let's see exactly "where" the performance issue is.

Ernie
Ernie Ostic

blogit!
<a href="https://dsrealtime.wordpress.com/2015/0 ... ere/">Open IGC is Here!</a>
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

Array size = 1.
Record count = 1.
Buffer size - which buffers are you asking about, virtual data set buffers or transport buffers or some other? Any buffer needs to be large enough to hold at least one record.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
apoorvadwi
Participant
Posts: 4
Joined: Tue Feb 16, 2016 4:58 am

Post by apoorvadwi »

hi Thanks for the reply . I tried to increase the number of partitions and nodes . I am running now with 3 nodes and 7 partitions and my job is finishing in 5 minutes. :)
appy
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

Can you please explain how you get 7 partitions from three nodes? This may, of course, be confusion of terminology.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
apoorvadwi
Participant
Posts: 4
Joined: Tue Feb 16, 2016 4:58 am

Post by apoorvadwi »

We have a parameter set in that we can pass the values for partition and nodes. The maximum no. of nodes allowed is 3 . So I kept 3 for the noces and in the number of partitions I increased to 7. so the job was running on 21 partitions and hence finished in 5 minutes
appy
chulett
Charter Member
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

Can you post the parameter names involved, please, the ones controlling your two options? Just so we make sure we're all on the same page, terminology-wise. :wink:
-craig

"You can never have too many knives" -- Logan Nine Fingers
kottinaresh
Participant
Posts: 9
Joined: Fri Feb 18, 2011 1:12 am
Location: Hyderabad

Post by kottinaresh »

apoorvadwi: Confused :o with "3 nodes & 7 partitions =21" can you please brief on your settings please..
Naresh
ETL Developer
Post Reply