DSXchange

le thuong

We have the following job sequence:
Job A followed by Job B

In the Triggers tab of Job A:
Expression type = Custom - (Conditional)
Expression = @FALSE

What does this mean? Does it mean that Job A and Job B are triggered to run concurrently?

Thank you

le thuong

Our DataStage support suggested a better solution: use a wildcard (Filename*) in the source Unstructured stage. We no longer need a loop, only an elementary job reading an Unstructured stage and writing to data sets. The performance is very good compared to the former design with a loop (less than ...
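As a rough illustration of why a single wildcard can replace the loop over file names, here is a minimal Python sketch using shell-style pattern matching. The file names are hypothetical, and Python's `fnmatch` is only a stand-in: the exact wildcard semantics of the Unstructured stage are assumed to be similar, not verified here.

```python
from fnmatch import fnmatchcase

def select_files(names, pattern):
    """Return the names matching a shell-style pattern, in sorted order."""
    return sorted(n for n in names if fnmatchcase(n, pattern))

# Hypothetical folder content; only the "Filename*" pattern comes from the post.
names = ["Filename_002.xlsx", "Other.xlsx", "Filename_001.xlsx"]
matched = select_files(names, "Filename*")
print(matched)
```

One pattern selects every matching file in a single pass, so the per-file job start-up cost of the loop design is paid once instead of once per file.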

le thuong

Thank you for your suggestion.
We are considering another option: converting the Excel files to CSV files with an Excel macro, then presenting the CSV files to DataStage instead of the Excel files (replacing the Unstructured stage with a Sequential File stage as the source).

le thuong

Thank you for your proposal.
The job which reads the Excel file writes 4 data sets (1 data set for each Excel worksheet) in append mode. Running multiple instances of the job would mean that one instance has to wait for the other, because they write to the same data sets. Is there any risk of deadlock?

le thuong

Ray, the elementary job runs even longer (25 sec) to read 1 Excel file (4 worksheets), writing to 4 data sets. There is no transformation, only a simple constraint in the Transformer stage. All the Excel files are in the same folder on the DataStage server. There are approximately 20 columns (Varchar...

le thuong

chulett - This is a data migration, so each Excel file would be processed only once. We may process 1000 files or more per batch, and the total number of files could reach 60000. Today, in the Development environment, we can only process 500 files per hour.
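To put those figures in perspective, a back-of-the-envelope calculation in Python. It assumes the throughput stays constant at the Development rate quoted above, which may not hold in other environments or with larger batches.

```python
# Figures taken from the post; constant throughput is an assumption.
files_per_hour = 500     # observed rate in Development
batch_size = 1000        # files per batch (lower bound mentioned)
total_files = 60000      # possible total for the migration

seconds_per_file = 3600 / files_per_hour
hours_per_batch = batch_size / files_per_hour
hours_total = total_files / files_per_hour

print(f"{seconds_per_file} s per file")
print(f"{hours_per_batch} h per batch of {batch_size}")
print(f"{hours_total} h for {total_files} files")
```

At this rate a 1000-file batch takes about 2 hours and the full 60000 files about 120 hours of processing time.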

le thuong

I have a sequence with a Unix script that gets a list of file (xlsx) names and passes this list to a loop. This is a job sequence reading each Excel file (which has 4 worksheets) with an Unstructured stage and writing to 4 data sets. Each iteration (reading 1 Excel file and writing 4 data sets) takes...

le thuong

I assume that the correct method is to hash partition and sort the 2 inputs of a Change Capture stage on the key columns. Is there a risk of incorrect results when leaving Auto partition and no sort?
Thanks for your support.
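To make the concern concrete, here is a toy Python model of a per-partition comparison. It is not DataStage's implementation and does not model what Auto partitioning actually does at run time. The function names, the `id` key and the sample rows are all hypothetical. It only shows that when both inputs are hash partitioned on the key, matching rows meet in the same partition, whereas an arbitrary distribution can report false inserts and deletes.

```python
from collections import defaultdict

def hash_partition(rows, key, n):
    """Place each row in partition hash(key) % n, so equal keys share a partition."""
    parts = defaultdict(list)
    for row in rows:
        parts[hash(row[key]) % n].append(row)
    return parts

def round_robin(rows, n, offset=0):
    """Distribute rows by position only, ignoring the key."""
    parts = defaultdict(list)
    for i, row in enumerate(rows):
        parts[(i + offset) % n].append(row)
    return parts

def change_capture(before_parts, after_parts, key, n):
    """Compare before/after within each partition; no partition sees the others."""
    result = {"insert": 0, "delete": 0, "edit": 0, "copy": 0}
    for p in range(n):
        before = {r[key]: r for r in before_parts.get(p, [])}
        after = {r[key]: r for r in after_parts.get(p, [])}
        for k, row in after.items():
            if k not in before:
                result["insert"] += 1
            elif before[k] != row:
                result["edit"] += 1
            else:
                result["copy"] += 1
        result["delete"] += sum(1 for k in before if k not in after)
    return result

before = [{"id": i, "val": "a"} for i in (1, 2, 3, 4)]
after = [{"id": 1, "val": "a"}, {"id": 2, "val": "b"},
         {"id": 3, "val": "a"}, {"id": 5, "val": "a"}]

n = 2
good = change_capture(hash_partition(before, "id", n),
                      hash_partition(after, "id", n), "id", n)
bad = change_capture(round_robin(before, n),
                     round_robin(after, n, offset=1), "id", n)
print("hash on key:", good)
print("key-agnostic:", bad)
```

With hash partitioning on the key the model reports one insert, one delete, one edit and two copies, which matches the data. With the key-agnostic distribution the same rows are reported as inserts and deletes only.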

le thuong

We have a Select joining many tables with the Oracle connector. When enabling partitioned reads, we see that the number of records read is not constant. When Partitioned read method = Rowid hash or Rowid round robin, the count is correct. When Partitioned read method = Rowid range, the count is lower tha...

le thuong

In the Help of the Repository Export user interface, I have not found documentation for the check box "Include dependent items". What is meant by "dependent item" in this context?

le thuong

I encountered the following issue on a Join stage on 5 columns (3 varchar, 2 numeric). The 2 inputs are sorted and hash partitioned on the 5 columns. For a given combination of the 5 key columns, we were expecting a match between the 2 inputs, and the DataStage job did not return a match. I found out that...

le thuong

We are having a discussion on the optimal number of stages in a job. With a complex business / functional requirement, we quickly reach a job with over 50 stages, and sometimes we end up with a failure (fork() failed, Not enough space). As a workaround, we have to split the job into 2 jobs (or more)...

le thuong

Additional remark: no need to recompile if it is a standalone job. Recompilation is required if the job is part of a sequence job (otherwise, when running the sequence job, you get the following message: [ParamName does not reference a known parameter of the job]).

le thuong

After testing it, the correct answer is: no need to recompile if the former job does not use the newly added parameter. Thanks all for your attention.

le thuong

We have N jobs using a parameter set PS1 (for example) with 5 parameters.
If we add a 6th parameter to parameter set PS1 to be used in a new job, do we need to recompile the former N jobs (which do not need the 6th parameter)?

Trigger configured in Sequence job

Performance issue while reading Unstructured stage

Auto partition on Change Capture stage

Oracle Rowid when Enabling partitioned read

Export DataStage components (Include dependent items)

Hash partition with difference in numeric key columns

Acceptable number of stages in a job

Add a parameter to an existing parameter set