Limit on organization comparison tokens

Infosphere Master Data Management theory and best practices

Moderators: chulett, rschirm, falsehate

pk7
Participant
Posts: 44
Joined: Fri Aug 03, 2007 8:35 am
Location: Kansas City

Post by pk7 »

It seems that when organization names are derived, the resulting comparison string contains at most four non-anonymous tokens. For example, suppose we have these two records:

ACME AIRCRAFT PARTS CO DBA BIG DOG MECHANICS
and
BIG DOG MECHANICS

After anonymous values such as CO and DBA are eliminated, the two comparison strings are:

ACME:AIRCRAFT:PARTS:BIG
and
BIG:DOG:MECHANICS

Notice that the first comparison string contains only the first four non-anonymous tokens, so DOG and MECHANICS are dropped and the two strings share only BIG. As a result, the records match less well than they should.
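To make the truncation concrete, here is a rough Python sketch that reproduces the two strings above. It is only a toy model of the behavior described in this post, not the product's actual derivation logic. The function name compare_string, the ANONYMOUS set, and the max_tokens parameter are made up for illustration.

```python
# Toy model of the behavior described above; NOT the real MDM derivation code.
# ANONYMOUS and max_tokens are hypothetical names used only for this sketch.
ANONYMOUS = {"CO", "DBA"}


def compare_string(name, max_tokens=4):
    """Drop anonymous tokens, keep the first max_tokens, join with ':'."""
    tokens = [t for t in name.upper().split() if t not in ANONYMOUS]
    return ":".join(tokens[:max_tokens])


first = compare_string("ACME AIRCRAFT PARTS CO DBA BIG DOG MECHANICS")
second = compare_string("BIG DOG MECHANICS")

# Tokens the two comparison strings have in common.
shared = set(first.split(":")) & set(second.split(":"))

print(first)   # ACME:AIRCRAFT:PARTS:BIG
print(second)  # BIG:DOG:MECHANICS
print(shared)  # {'BIG'}
```

With a four-token cap, only one of the three tokens in the shorter name survives in the longer one.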

How can we overcome this limit of four tokens? The Max Bucket Tokens limit does not appear to be the cause, since we have it set to 6.
pk7
Participant
Posts: 44
Joined: Fri Aug 03, 2007 8:35 am
Location: Kansas City

Post by pk7 »

Just discovered the solution:

In the CXNM Name standardization, set Derivation Arguments to the number of tokens you want included in the comparison string. According to IBM, the maximum for the CXNM standardization is 8.
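As a rough illustration of why a higher value helps, the sketch below counts how many tokens the two example names share as the cap goes from 4 up to 8. It is a simplified stand-in for the real derivation, assuming only that anonymous values are dropped and the remaining tokens are cut off at the limit. The names derive_tokens and ANONYMOUS are hypothetical and do not correspond to any product setting.

```python
# Simplified stand-in for the derivation step; NOT the actual CXNM logic.
# ANONYMOUS and derive_tokens are hypothetical names for this sketch.
ANONYMOUS = {"CO", "DBA"}


def derive_tokens(name, limit):
    """Return the first `limit` non-anonymous tokens of an organization name."""
    tokens = [t for t in name.upper().split() if t not in ANONYMOUS]
    return tokens[:limit]


long_name = "ACME AIRCRAFT PARTS CO DBA BIG DOG MECHANICS"
short_name = "BIG DOG MECHANICS"

# Number of shared tokens for each token limit from 4 to 8.
overlap_by_limit = {}
for limit in range(4, 9):
    a = set(derive_tokens(long_name, limit))
    b = set(derive_tokens(short_name, limit))
    overlap_by_limit[limit] = len(a & b)

print(overlap_by_limit)  # {4: 1, 5: 2, 6: 3, 7: 3, 8: 3}
```

For this pair of records, a limit of 6 or more keeps every token of BIG DOG MECHANICS in the longer string.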
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia

Post by ray.wurlod »

Thank you for posting the solution.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.