which suits large volume of data dataset or fileset

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
kruthika
Participant
Posts: 21
Joined: Mon May 31, 2004 11:14 pm

which suits large volume of data dataset or fileset

Post by kruthika »

Hi,

In a parallel job, Which suits large volume of data better...Fileset or Dataset..

Thanks
Kruthika
vbeeram
Participant
Posts: 63
Joined: Fri Apr 09, 2004 9:40 pm
Contact:

Post by vbeeram »

Hi,

Datset


Thanks
Thiru
mandyli
Premium Member
Premium Member
Posts: 898
Joined: Wed May 26, 2004 10:45 pm
Location: Chicago

Post by mandyli »

Hi kruthika ,

If you are using more then 4g.b data go for dataset b'cus file set will not workin in PX mode.
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

Do you have some proof for this claim, Mandy?

Even if you are on an operating system that only supports 2GB files, a fileset can contain many physical files, so there should not be any limit to the size of a fileset.

Chapter 6 of the Parallel Job Developer's Guide provides all the extra information you need to know.

The main difference between dataset and fileset, it seems to me, is the fact that a file set also includes information about how the data are formatted.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
Post Reply