DSXchange

RodBarnes

Under UniVerse (the underlying system below DataStage), a NULL is nothing, not a value. So '' will be an empty string, not null.

RodBarnes

You cannot use these routines in this fashion. DSAttachJob requires passing a job NAME, while DSGetLinkInfo requires passing a job HANDLE. I think what you want to do is this:
DSGetLinkInfo(DSJ.ME, "Transformer_3", "DSLink4", DSJ.LINKROWCOUNT)
DSJ.ME is an internal variable that...

RodBarnes

Thanks for your input. After discussing this some, we've concluded that we're going to organize our project so the shared dimensions will be built in one sequence, with individual sequences for each of the additional dimensions and fact tables. We'll probably have a single master sequence that will ...

RodBarnes

I am sure this has been asked before, but I know we all enjoy sharing our opinions. :-) We have a complete ETL sequence and now have the case where an additional dimension and fact table are needed. These new tables are related to the existing tables by using shared dimensions. So... I can think of r...

RodBarnes

Thanks for the input, Alan. Much appreciated. I will look into your idea of exporting the CRC data to a flat file and then reimporting on each run. I'll compare the performance of this with just regenerating the CRC from the database. I expect the load-from-flat-file will be much faster. Having work...

RodBarnes

Ah, "validate". Sure, that makes sense. I hadn't used that yet but had read about it. I'll look into it, thanks. The jobs are run from a single sequence. That sequence is invoked from a master sequence that maintains our control information for the run. How are you running these scheduled jo...

RodBarnes

Yes, I would be interested in learning more about the method you are using. It sounds like, during each ETL run, it is making a pass through the source data and generating a list of which are inserts, updates, deletes. Then this list is used in later processing to actually control the processing of ...

RodBarnes

[I've searched, but I think this is a topic that is difficult to locate with a search. It could be under any number of things.] I have had to resort to manually unscheduling the ETL whenever a weekend arrives with scheduled maintenance on a target DB. If I don't do this, the ETL will fail and abort...

RodBarnes

I'd like some input from the community. Background: I've been working with DataStage for about six months now and feel pretty competent with it. I am using a CRC-check model where a change in a record is determined by a change in the CRC generated for the pertinent fields. The new generated value is...
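To make the CRC-check idea concrete, here is a minimal sketch in Python rather than DataStage. It is not the poster's actual job design: the function names, the field list, and the in-memory dictionary standing in for the stored CRC values are all assumptions for illustration. A row counts as changed when the CRC over its pertinent fields differs from the CRC stored for the same key.

```python
import zlib

def record_crc(record, fields):
    """Return a CRC32 computed over the chosen fields of one record."""
    # A unit-separator character keeps ("ab", "c") distinct from ("a", "bc").
    payload = "\x1f".join(str(record[f]) for f in fields)
    return zlib.crc32(payload.encode("utf-8"))

def classify(source_rows, stored_crcs, key, fields):
    """Split source rows into inserts, updates, and unchanged rows.

    stored_crcs is a hypothetical mapping of key -> CRC saved by the last run.
    """
    inserts, updates, unchanged = [], [], []
    for row in source_rows:
        old_crc = stored_crcs.get(row[key])
        if old_crc is None:
            inserts.append(row)        # key never seen before
        elif old_crc != record_crc(row, fields):
            updates.append(row)        # pertinent fields changed
        else:
            unchanged.append(row)      # same CRC, nothing to do
    return inserts, updates, unchanged
```

In a real job the stored CRCs would come from a table or file kept from the previous run; the dictionary here only stands in for that lookup.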

RodBarnes

Thanks, ray.wurlod. I suspected that was the case but was hoping there was some tuning in DS I could use. :-) roy, I did search but didn't seem to find anything that particularly fit my question. #1 might work but I would need to include the keys from the original table in the inserts into the new t...

RodBarnes

I am seeing VERY slow times when using an update vs. an insert (3 rows/second vs. 3000 rows/second). Is it normal for DataStage to exhibit such a drastic difference? I am relatively new to DataStage but have successfully created an ETL sequence that works very well using a type 2 (Kimball) mode...

RodBarnes

Thanks for the feedback.

Like you, we have a job template containing all the infrastructure and parameters. We then just do a "Save As" when beginning a new job.

I appreciate your perspective on container usage. Again, thanks.

RodBarnes

I've been exploring the use of containers within a job to separate tasks vs. putting the tasks into separate jobs and then putting the jobs together with a sequence. Here are my thoughts, and I ask for related feedback from the experts -- basically everyone else :-) Container-based Design Model - Pros: 1) ...


Search found 178 matches

One large sequence or separate sequences?

Trapping a connection failure and proceeding

Storing CRC values

Very slow updates vs. inserts

Container-based vs. sequence-based design models