Search found 178 matches

by RodBarnes
Wed Sep 14, 2005 4:01 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: '' in HASH file
Replies: 12
Views: 3200

Under Universe (the underlying system below DataStage) a NULL is nothing, not a value. So '' will be an emtpy string not null.
by RodBarnes
Wed Sep 14, 2005 3:46 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: DS functions as Stage Variables
Replies: 4
Views: 2198

You cannot use these routines in this fashion. DSAttachJob requires passing of a job NAME while DSGetLinkInfo requirs passing a job HANDLE. I think what you want to do is this: DSGetLinkInfo(DSJ.ME, "Transformer_3", "DSLink4", DSJ.LINKROWCOUNT) DSJ.ME is an internal variable that...
by RodBarnes
Thu Jun 16, 2005 9:20 am
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: One large sequence or separate sequences?
Replies: 2
Views: 1229

Thanks for your input. After discussing this some, we've concluded that we're going to organize our project so the shared dimensions will be built in one sequence, with individual sequences for each of the additional dimensions and fact tables. We'll probably have a single master sequence that will ...
by RodBarnes
Wed Jun 15, 2005 2:10 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: One large sequence or separate sequences?
Replies: 2
Views: 1229

One large sequence or separate sequences?

I am sure this has been asked before but I know we all enjoy sharing our opinions. :-) We have a complete ETL sequence and now have the case where an additional dimension and fact table is needed. These new tables are related to the existing tables by using shared dimensions. So.... I can think of r...
by RodBarnes
Thu Jun 02, 2005 11:33 am
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Storing CRC values
Replies: 16
Views: 10686

Thanks for the input, Alan. Much appreciated. I will look into your idea of exporting the CRC data to a flat file and then reimporting on each run. I'll compare the performance of this with just regenerating the CRC from the database. I expect the load-from-flat-file will be much faster. Having work...
by RodBarnes
Thu Jun 02, 2005 8:53 am
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Trapping a connection failure and proceeding
Replies: 3
Views: 1383

Ah, "validate". Sure, that makes sense. I hadn't used that yet but had read about it. I'll look into it, thanks. The jobs are run from a single sequence. That sequence is invoked from master sequence that maintains our control information for the run. How are you running these scheduled jo...
by RodBarnes
Thu Jun 02, 2005 8:51 am
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Storing CRC values
Replies: 16
Views: 10686

Yes, I would be interested in learning more about the method you are using. It sounds like, during each ETL run, it is making a pass through the source data and generating a list of which are inserts, updates, deletes. Then this list is used in later processing to actually control the processing of ...
by RodBarnes
Wed Jun 01, 2005 4:08 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Trapping a connection failure and proceeding
Replies: 3
Views: 1383

Trapping a connection failure and proceeding

[I've searched but I think this is a topic that is difficult to locate with a search. It could be under any number of things.] I have had to go to manually unscheduling the ETL when a weekend arrives where there is scheduled maintenance on a target DB. If I don't do this, the ETL will fail and abort...
by RodBarnes
Wed Jun 01, 2005 3:56 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Storing CRC values
Replies: 16
Views: 10686

Storing CRC values

I'd like some input from the community. Background: I've been working with DataStage for about six months now and feel pretty competent with it. I am using a CRC-check model where a change in a record is determined by a change in the CRC generated for the pertinent fields. The new generated value is...
by RodBarnes
Wed Mar 30, 2005 10:26 am
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Very slow updates vs. inserts
Replies: 5
Views: 3049

Thanks, ray.wurlod. I suspected that was the case but was hoping there was some tuning in DS I could use. :-) roy, I did search but didn't seem to find anything that particularly fit my question. #1 might work but I would need to include the keys from the original table in the inserts into the new t...
by RodBarnes
Tue Mar 29, 2005 2:57 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Very slow updates vs. inserts
Replies: 5
Views: 3049

Very slow updates vs. inserts

I am seeing VERY slow times when using an update vs. an insert (3 rows/second vs. 3000 rows/second). Is this normal for DataStage to exhibit this drastic of a difference? I am relatively new to DataStage but have successfully created an ETL sequence that works very well using a type 2 (Kimball) mode...
by RodBarnes
Mon Mar 28, 2005 10:15 am
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Container-based vs. sequence-based design models
Replies: 3
Views: 1435

Thanks for the feedback.

Like you, we have a job template containing all the infrastructure and parameters. We then just do a "Save As" when beginning a new job.

I appreciate your perspective on container usage. Again, thanks.
by RodBarnes
Fri Mar 25, 2005 11:47 am
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Container-based vs. sequence-based design models
Replies: 3
Views: 1435

Container-based vs. sequence-based design models

I've been exploring use of containers within a job to separate tasks vs. putting the tasks into separate jobs and then putting the jobs together with a sequence. Here's my thoughts and I ask for related feedback from the experts -- bascially everyone else :-) Container-based Design Model - Pros: 1) ...