Sequential File Stage in PX

srikanthd1978
Charter Member
Posts: 38
Joined: Wed Mar 17, 2004 1:16 am
Location: USA

Sequential File Stage in PX

Post by srikanthd1978 »

people,

When using a Sequential File stage with the read method set to 'File Pattern', I cannot use the "Number of Readers Per Node" option, as the two are mutually exclusive.

Can someone explain why they are mutually exclusive?

thanks
gh_amitava
Participant
Posts: 75
Joined: Tue May 13, 2003 4:14 am
Location: California

Post by gh_amitava »

They are mutually exclusive because DataStage reads a sequential file in sequential mode, row by row. Only one node is used to read the entire file.

Regards
Amitava
vmcburney
Participant
Posts: 3593
Joined: Thu Jan 23, 2003 5:25 pm
Location: Australia, Melbourne

Post by vmcburney »

The Number of Readers Per Node option has a complex start-up: it involves counting the size of the files and dividing each file across the specified number of readers to partition your data. I'm guessing that they don't allow this option for File Patterns because of the complexity it would add to the stage code. It may be an option that is much easier to apply programmatically when you have a fully defined set of files rather than a fuzzy file pattern.
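To make the "divide the file across the readers" step concrete, here is a rough Python sketch of how a file could be carved into one byte range per reader. It is only an illustration of the idea, not how the stage is actually implemented: the function name `reader_byte_ranges` is made up, and it assumes fixed-width records so that every range can start on a record boundary.

```python
def reader_byte_ranges(file_size, record_length, readers):
    """Split a fixed-width file into contiguous, record-aligned byte ranges.

    Hypothetical helper for illustration only; it assumes every record is
    exactly record_length bytes, so each range starts on a record boundary.
    Returns one (offset, length) tuple per reader.
    """
    if record_length <= 0 or readers <= 0:
        raise ValueError("record_length and readers must be positive")
    if file_size % record_length != 0:
        raise ValueError("file size is not a whole number of records")

    total_records = file_size // record_length
    # Give every reader 'base' records; the first 'extra' readers get one more.
    base, extra = divmod(total_records, readers)

    ranges = []
    offset = 0
    for i in range(readers):
        count = base + (1 if i < extra else 0)
        length = count * record_length
        ranges.append((offset, length))
        offset += length
    return ranges


if __name__ == "__main__":
    # 10 records of 100 bytes shared among 3 readers.
    for start, length in reader_byte_ranges(1000, 100, 3):
        print(start, length)
```

The split needs the file size and record length up front, which is simple for one named file but has to be repeated for every file once a pattern is involved.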

You can get around the problem by retrieving your file list at the sequence job level and running the job multiple times with different file names.
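As a sketch of that workaround, the file pattern can be expanded into an explicit, ordered list of names before any job runs, and each name then passed to one run of the job. The Python below only shows the expansion step. The helper names and the `SourceFile` job parameter are hypothetical, and how each run is actually launched (a sequence loop, a script, and so on) is left out.

```python
import fnmatch
import glob


def select_files(names, pattern):
    """Return the names that match a shell-style pattern, in sorted order."""
    return sorted(fnmatch.filter(names, pattern))


def expand_pattern(pattern):
    """Expand a file pattern against the file system into an explicit list."""
    return sorted(glob.glob(pattern))


def plan_runs(files, parameter_name="SourceFile"):
    """Build one parameter set per file, i.e. one job run per file.

    'SourceFile' is a hypothetical job parameter name used for illustration.
    """
    return [{parameter_name: name} for name in files]


if __name__ == "__main__":
    candidates = ["sales_02.dat", "sales_01.dat", "readme.txt"]
    for params in plan_runs(select_files(candidates, "sales_*.dat")):
        print(params)
```

Once every run has a single, fully defined file name, the job can use the Specific File(s) read method, where Number of Readers Per Node is available.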
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia

Post by ray.wurlod »

As well as everything Vincent said, note that Readers Per Node is intended only for fixed-width format files. It's fully described in the Parallel Job Developer's Guide.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
bcarlson
Premium Member
Posts: 772
Joined: Fri Oct 01, 2004 3:06 pm
Location: Minnesota

Post by bcarlson »

One final note: the goal of multiple readers is to take advantage of multiple nodes. Instead of one single stream of data, you have as many streams as there are readers. On the other hand, if you are using either filesets or the file pattern, you will already be taking advantage of multiple nodes, one per file.

Now, if you could specify multiple readers to be used for each file (fixed length, of course), that would be great, but I believe that would be a discussion for another forum (PX enhancements?).