Warning in Reference Match During Blocking
Posted: Fri Oct 25, 2013 1:15 pm
Hi All,
I have designed a Reference Match job where the Data will contain around 10k - 100k records and my reference set right now contains around 3 million and is expected to grow.
The match is being done on the first and last names and the blocking is on the NYSIIS values of the first and last name. But I am getting the following warning:
The number of records in the reference block with key NYSIIS_LAST_NAME(HAL) exceeds the maximum number specified of 10000.
All records in the block will be treated as residuals
The IBM site: http://www-01.ibm.com/support/docview.w ... wg21409638
Suggests that we dont update the Overflow values (which also can have a max of only 40k) but instead limit the number of records with invalid or default values.
But I am filtering all the invalid values much before it reaches Matching.
Is there any other way to prevent these records from becoming residual or impact the matching process? Would be grateful if anyone can point me in the right direction.
I have designed a Reference Match job where the Data will contain around 10k - 100k records and my reference set right now contains around 3 million and is expected to grow.
The match is being done on the first and last names and the blocking is on the NYSIIS values of the first and last name. But I am getting the following warning:
The number of records in the reference block with key NYSIIS_LAST_NAME(HAL) exceeds the maximum number specified of 10000.
All records in the block will be treated as residuals
The IBM site: http://www-01.ibm.com/support/docview.w ... wg21409638
Suggests that we dont update the Overflow values (which also can have a max of only 40k) but instead limit the number of records with invalid or default values.
But I am filtering all the invalid values much before it reaches Matching.
Is there any other way to prevent these records from becoming residual or impact the matching process? Would be grateful if anyone can point me in the right direction.