investigation stage to Standardization stage.. problem!

This forum is in support of all issues about Data Quality regarding DataStage and other strategies.

Moderators: chulett, rschirm

pradeepnotagain
Participant
Posts: 1
Joined: Mon Aug 20, 2007 1:01 pm

investigation stage to Standardization stage.. problem!

Post by pradeepnotagain »

Hi QS experts!

Here is the scenario:

There are around 5000 records, of which almost 4000 end up with unidentified tokens in the Standardization stage. It's not possible to reject that large a share of the data.
The main problem is the city names: the stage doesn't recognize them.

Someone suggested that I edit the PAT file and investigate again.
I am a newbie in QS seeking help from experts.
Can someone help me out?
Can anyone suggest the best practice in this case?
If I have to change the PAT file, how do I do that properly?

Thank You

Waiting for your response!
P'deep
pradeep
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia

Post by ray.wurlod »

Welcome aboard.

No, you don't need to change the PAT file. The right file to change is the classification table for xxAREA, so that it includes the city names.

You could, of course, use rule overrides, but adding city names to the classification table xxAREA.cls is the better approach.
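
Before editing the classification table, it can help to know which city tokens are actually missing and how often each one occurs. The following is only a rough sketch in Python, run outside QualityStage, assuming the city values and the tokens already present in the classification table have been exported to plain lists. The function name `unclassified_tokens` and the sample data are hypothetical and not part of any QualityStage API.

```python
from collections import Counter


def unclassified_tokens(values, classified):
    """Count whitespace-separated tokens in `values` that are absent from `classified`.

    Comparison is case-insensitive; both sides are upper-cased first.
    """
    known = {token.upper() for token in classified}
    counts = Counter()
    for value in values:
        for token in value.upper().split():
            if token not in known:
                counts[token] += 1
    return counts


# Hypothetical sample data, not taken from the original post.
sample_cities = ["Pune", "pune", "Nagpur", "Sydney", "Nagpur", "Pune"]
already_classified = ["SYDNEY"]

candidates = unclassified_tokens(sample_cities, already_classified)

# List the most frequent missing tokens first.
for token, count in candidates.most_common():
    print(token, count)
```

The most frequent tokens in this list are the natural first candidates to add to the classification table. For the entry layout and class codes, copy the pattern of the existing entries in the table rather than relying on this sketch.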
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.