Job design - compare rows between datasets

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
vgundavarapu
Premium Member
Premium Member
Posts: 22
Joined: Wed May 08, 2013 8:38 am
Contact:

Job design - compare rows between datasets

Post by vgundavarapu »

Hi
I need a feedback on job design
Where I have a requirement to compare two
Rows between two datasets and return all rows from one dataset
indicating which one matched.

Code: Select all

. Example
Datasets A
Id code 
1.   1640. 
2.   1427.  

Datasets B

I'd.   First code. Seccode. Third code fourthcode
1.        1427.        8200.     8000.        50000
2.         50000.       1640.      6000.       80000
I want all the rows from dataset B indicating to which column it matched

I used the join but it generates 4 times records and I have to use remove dups

I want to use compare or difference stage which one you recommand

Thks
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

Did you partition your data using Hash or Modulus based on Id value? Did you sort the data based on Id value?
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
Post Reply