Difference between Dataset stage and Sequential File stage

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
kvenkatareddy
Participant
Posts: 2
Joined: Sat Feb 28, 2004 12:28 am

Difference between Dataset stage and Sequential File stage

Post by kvenkatareddy »

Could anybody describe what is the basic difference between the Data set stage and Sequential File stage and which circumstance these stages will be used?

Thx
lakshmipriya
Participant
Posts: 31
Joined: Tue Jul 13, 2004 5:26 am
Location: chennai
Contact:

Post by lakshmipriya »

Data set can be partioned as of the required number of nodes and thus it can be run in parallel. You can defines as many number of nodes as you can.

But incase of sequential file, it extracts and writes through sequential.
Lakshmi
l_homme_de_rio
Participant
Posts: 17
Joined: Fri Jul 30, 2004 1:16 am

Post by l_homme_de_rio »

Hi,
Datasets are the most important thing to understand to start anything within PX, you should attend a PX training before.
The Datasets are the "magic key" to handle scalability. in addition all the partionning stuff is based on them.
Don't forget that Flat files aren't so "popular" in PX comparing to "classical" DS and should be used only to read a source or generate an output. In the intermediate phase, forget them.
All the best
Post Reply