DSXchange

kevink

Unfortunately there is no common key in the two data sets. Is there any other strategy we might be able to use?

kevink

We have a reference match on standardized address and area data. The reference data set has 25 million rows, and loads into the job at only 1700 rows per second. There are only 12,000 rows at a time in the source data set. Can the experts please share ways to improve the performance of a reference m...

kevink

I am having several problems with the QualityStage Match Designer, and wonder whether I'm doing anything wrong. Whenever I leave the Match Designer by clicking OK, the changes are saved correctly, but DataStage Designer immediately stops working. I have to close it and open a new one (after waiting for a period...

kevink

Can Teradata 13 be used for the QualityStage Match Designer database? It tests successfully when configuring the Test Environment in Match Designer, but throws syntax errors when attempting to test a match pass. ##E IIS-DSEE-TDOD-00007 15:03:54(002) <main_program> [IBM(DataDirect OEM)][ODBC Teradata...

kevink

I am using QualityStage but, because there is so much extraneous information and so many different ad hoc formats, the AUAREA and AUADDR rule sets are proving quite poor at parsing out significant information. There are some 377 different input patterns using the AUAREA rule set. Over 30% of records hav...

kevink

I have a DataStage job that splits the string at the first street type token in the data; the token and everything preceding it go into the addr bucket, while everything following the token goes into the area bucket. If no street type token exists, then all the data go into the area bucket. Using character i...
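
The split described in this post can be sketched outside DataStage as follows. This is only an illustration in Python, not the poster's actual job: the function name `split_addr_area` and the short `STREET_TYPES` list are assumptions made up for the example, and a real list of Australian street types would be much longer.

```python
import re

# Hypothetical, deliberately short list of street type tokens (an assumption
# for this sketch; not taken from the original job or from any rule set).
STREET_TYPES = {"ST", "STREET", "RD", "ROAD", "AVE", "AVENUE", "DR", "DRIVE"}


def split_addr_area(text, street_types=STREET_TYPES):
    """Split a free-form address at the first street type token.

    Returns (addr, area): the token and everything before it go to addr,
    everything after it goes to area. If no street type token is found,
    addr is empty and all the data go to area.
    """
    tokens = text.split()
    for i, token in enumerate(tokens):
        # Compare on letters only, upper-cased, so "St," and "st." both match.
        key = re.sub(r"[^A-Za-z]", "", token).upper()
        if key in street_types:
            addr = " ".join(tokens[: i + 1])
            area = " ".join(tokens[i + 1:])
            return addr, area
    return "", " ".join(tokens)


if __name__ == "__main__":
    print(split_addr_area("12 Smith St, Richmond VIC 3121"))
    print(split_addr_area("PO Box 12 Richmond VIC 3121"))
```

Because the split happens at the first matching token, a name such as "St Kilda Rd" would be cut after "St" in this sketch; handling that case would need extra rules that the post does not describe.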

kevink

Hi, I've got some really dirty Australian address data, with lots of "added" information. The entire address is in a single VarChar field with no consistent format. Can you suggest ideas for cleansing the data (using DataStage, with which I am quite familiar) or QualityStage? I am permitte...

Search found 7 matches

Reference Match Performance

Problems with Reference Match

Match Designer Database

Cleansing Data before Standardization