All,
I have a job in which I am sampleing records using a Sample stage.
I am using Percent mode to sample the data. Below are the properties I have set:
Percent = 5.0
Sample Mode = Percent
Input to Sample Stage: 261444 Rows
Output Of Sample Stage: 12895 Rows
Expected Output : (261444*5/100) = 13072.2
The input data is Hash partitioned and Sorted on a key column which has all unique values.
I am using DS Version 8.0.
Can someone help me understand why is there difference in the count of sample stage? Am I missing any other property to be set, like Seed and all?
Please suggest. Thanks In Advance!!!!
Sample Stage Output Not As Expected
Moderators: chulett, rschirm, roy
The seed is for the pseudo random number generator, with the same seed value the output of the sample will always be identical. If you start with a 1-node configuration is the output sample what you expect?
<a href=http://www.worldcommunitygrid.org/team/ ... TZ9H4CGVP1 target="WCGWin">
</a>
</a>