Search found 3045 matches

by vmcburney
Fri Feb 21, 2003 3:16 pm
Forum: IBM® Infosphere DataStage Server Edition
Topic: Ver4 and Ver 6 Performance
Replies: 3
Views: 1320

Version 5 introduced the following: Hashed File Stage and Shared Memory Disk Caching. The Hashed File stage has been enhanced to use a write-shared disk cache available in the DataStage server. This improves performance by using in-memory cached files and lets a single instance of a file be shared be...
by vmcburney
Thu Feb 20, 2003 2:46 pm
Forum: IBM® Infosphere DataStage Server Edition
Topic: Distinct values
Replies: 7
Views: 2351

Do you have an Oracle client installed on your DataStage server? If so, you should be able to use the Oracle OCI stage instead of the ODBC stage. This will give you more update options and should perform faster. There are some configuration instructions in the PDF Upgrade guide and the ...
by vmcburney
Wed Feb 19, 2003 9:51 pm
Forum: IBM® Infosphere DataStage Server Edition
Topic: Distinct values
Replies: 7
Views: 2351

There are a few options. You could extract it into a local hash file then put it into the database. The primary key on the hash file should ensure you only get distinct rows added to the database. If you are using Oracle you could use the option to only add new rows and not update existing rows. You...
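The keyed-file idea above can be sketched outside DataStage. This is only an illustration in Python, not DataStage code: the `distinct_by_key` helper, the row layout, and the `id` key column are all made up for the example. It assumes that writing a row whose key already exists replaces the earlier row, so one row per key survives.

```python
def distinct_by_key(rows, key_fields):
    """Keep one row per key, the way a keyed write overwrites earlier rows."""
    staged = {}
    for row in rows:
        key = tuple(row[field] for field in key_fields)
        staged[key] = row  # a later row with the same key replaces the earlier one
    return list(staged.values())


# Hypothetical sample data: the row with id 1 arrives twice.
rows = [
    {"id": 1, "city": "Perth"},
    {"id": 2, "city": "Sydney"},
    {"id": 1, "city": "Perth"},
]
unique_rows = distinct_by_key(rows, ["id"])
```

Only the staged rows would then be loaded into the database.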
by vmcburney
Wed Feb 19, 2003 3:13 pm
Forum: IBM® Infosphere DataStage Server Edition
Topic: Restricting stages
Replies: 13
Views: 2505

Appears as though your quotes are in the wrong spot.

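* Backslash-delimited strings let the shell command carry both quote styles.
strSQL = \echo "select 'FLAG-'||flag from mytable;" | sqlplus \ : dbLogin : "/" : dbPassword : "@" : dbName

* DSExecute is a subroutine, and the shell type is passed as a string.
Call DSExecute("UNIX", strSQL, strFlag, strStatus)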


Vincent McBurney
Data Integration Services
www.intramatix.com
by vmcburney
Wed Feb 19, 2003 1:45 am
Forum: IBM® Infosphere DataStage Server Edition
Topic: DS Tunning
Replies: 7
Views: 2956

Thanks Craig! Got my array and transaction fields mixed up.


by vmcburney
Tue Feb 18, 2003 5:59 pm
Forum: IBM® Infosphere DataStage Server Edition
Topic: DS Tunning
Replies: 7
Views: 2956

If you are using an Oracle OCI stage to write to Oracle then have a look at the Array Size field on the Input - General tab. This controls how many rows are processed before a commit is performed. Fiddle with this figure to increase performance, anything from 1000 to 50000 should give you faster per...
by vmcburney
Tue Feb 18, 2003 4:56 am
Forum: IBM® Infosphere DataStage Server Edition
Topic: Restricting stages
Replies: 13
Views: 2505

You could do this with a sleep command embedded in a loop. The following statements can be put inside a routine and retrieve the flag from an Oracle table in what I hope is a 10-minute loop. You can pass in the login details: loop until strFlag = "N" strLogin = dbLogin : "/" : d...
by vmcburney
Mon Feb 17, 2003 5:07 pm
Forum: IBM® Infosphere DataStage Server Edition
Topic: how different job configurations affect job speed
Replies: 1
Views: 771

Agree with all of Ray's answers. I've got some additional thoughts on when to use database views: - if it is to be used as a source by multiple jobs. You face greater risk if you are trying to maintain the same custom SQL in multiple jobs. - your query uses complex SQL and you want it managed withi...
by vmcburney
Mon Feb 17, 2003 4:45 pm
Forum: IBM® Infosphere DataStage Server Edition
Topic: Datastage documentation
Replies: 4
Views: 1861

Reporting Assistant needs to download project metadata into an ODBC database on a client machine before the data appears in the standard reports. By default your installation comes with an Access database in the Reporting Tool directory and creates an ODBC data source for this database. You choose which object...
by vmcburney
Mon Feb 17, 2003 4:32 pm
Forum: IBM® Infosphere DataStage Server Edition
Topic: Scheduling / Access
Replies: 4
Views: 1416

You can run DataStage jobs from other scheduling tools. The command for running a job is:

dsjob -run -param paramname=paramvalue -param paramname=paramvalue projectname jobname

Setting parameters is optional. Another way is to schedule the job to run once a month and in the job control code exit straight away i...
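For a scheduler that calls out to a script, the command line can be assembled along these lines. This is a rough Python sketch, assuming the usual `dsjob -run [-param name=value] project job` form; `build_dsjob_run` is a made-up helper, and the project, job, and parameter names are placeholders.

```python
def build_dsjob_run(project, job, params=None):
    """Return the argument list for a dsjob -run call; parameters are optional."""
    args = ["dsjob", "-run"]
    for name, value in (params or {}).items():
        args += ["-param", f"{name}={value}"]  # one -param pair per job parameter
    return args + [project, job]


# "dstage1", "LoadSales" and the parameter names are placeholders, not real objects.
command = build_dsjob_run("dstage1", "LoadSales",
                          {"RunDate": "2003-02-17", "Region": "WA"})
# A scheduler wrapper could then hand the list to subprocess.run(command, check=True).
```

Building a list instead of one long string avoids shell quoting problems when a parameter value contains spaces.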
by vmcburney
Mon Feb 17, 2003 4:23 pm
Forum: IBM® Infosphere DataStage Server Edition
Topic: Restricting stages
Replies: 13
Views: 2505

Could you clarify your constraint? It sounds like you are only interested in making one decision and choosing a path based on that decision. If you use a constraint you will be running that statement against every row of data; it may be more efficient to run your condition statement in a sequence fi...
by vmcburney
Sun Feb 16, 2003 12:16 am
Forum: IBM® Infosphere DataStage Server Edition
Topic: Performance of "Merge" stage
Replies: 7
Views: 2721

You are seeing one of the drawbacks of the merge stage, also true of the aggregation stage: DataStage builds the merge within the temp directory space, which adds a performance overhead. You may find it faster to load file 2 onto file 1. Getting back to shared containers, if you want to process two fil...
by vmcburney
Fri Feb 14, 2003 12:23 am
Forum: IBM® Infosphere DataStage Server Edition
Topic: NFS File SYSTEM
Replies: 1
Views: 1246

You can use DataStage to FTP the files from the NT server onto the Unix server. Just write a Unix script which ftps the files and call this from a DataStage sequence or batch job or use the FTP file stage. If you want to place the data directly onto the Unix system you could consider connecting to t...
by vmcburney
Thu Feb 13, 2003 2:26 am
Forum: IBM® Infosphere DataStage Server Edition
Topic: parallel processing 2
Replies: 7
Views: 2322

By the way, my knowledge of Unix administration functions is pretty woeful. Be interested to know if my theory works!


by vmcburney
Thu Feb 13, 2003 2:16 am
Forum: IBM® Infosphere DataStage Server Edition
Topic: parallel processing 2
Replies: 7
Views: 2322

Generally speaking each DataStage job will take up about one CPU at any one time. If you restrict your sequence/batch jobs to under 4 server jobs at any one time you should leave a lot of server capacity free. To assign processors to particular jobs you will probably need the version 6 parallel exte...
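The "under 4 jobs at any one time" rule can be sketched as a bounded worker pool. This is a Python illustration only, not a DataStage sequence: `run_all`, `fake_job`, and the job names are hypothetical, and the fake job just records how many jobs overlap instead of running anything.

```python
import threading
from concurrent.futures import ThreadPoolExecutor

MAX_CONCURRENT_JOBS = 3  # "under 4 server jobs at any one time"


def run_all(job_names, run_job, max_concurrent=MAX_CONCURRENT_JOBS):
    """Run every job, never more than max_concurrent at once; results keep input order."""
    with ThreadPoolExecutor(max_workers=max_concurrent) as pool:
        return list(pool.map(run_job, job_names))


# Hypothetical job runner that only tracks how many jobs are active together.
_lock = threading.Lock()
_state = {"running": 0, "peak": 0}


def fake_job(name):
    with _lock:
        _state["running"] += 1
        _state["peak"] = max(_state["peak"], _state["running"])
    threading.Event().wait(0.01)  # stand-in for the job's run time
    with _lock:
        _state["running"] -= 1
    return f"{name}: finished"


results = run_all([f"job{i}" for i in range(8)], fake_job)
peak_running = _state["peak"]
```

With eight queued jobs the pool still never has more than three active, which is the capacity headroom the post is after.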