Page 1 of 1

matching

Posted: Mon Nov 24, 2003 8:14 am
by nag0143
After matching i got records as XA and DA how do i relate this to the output file i want to use in datastage ....

Posted: Mon Nov 24, 2003 6:06 pm
by ray.wurlod
XA means the "master record" in a block of probable duplicates. It tends to have the highest composite weight.

DA means one of the other records in a block of probably duplicates.

RA means a "residual", a record with no probably duplicates. You have to remember to include these in your survivorship rules.

CR means "clerical review", a record whose composite weight is between the two cutoffs you establish to segregate confirmed duplicates (composite weight above the upper threshold), confirmed non-duplicates (composite weight below the lower threshold), and "clerical review" records which should be inspected by a clerk for decision.

There is also a number associated with this. For example RA1 is a residual found in the first match pass, DA2 is a duplicate found in the second match pass, and so on.

The blocks into which these probable duplicates fall form the bases for your decision making during the next phase of processing, survivorship.

Posted: Sat Feb 03, 2007 12:54 pm
by ashok
In extract file you need to create a field to allow match stage to populate match set numbers, this is explained in QS documents,

example
TYPE SET
XA 1
DA 1
DA 1
XA 2
DA 2
RA 3
RA 4