i am using p.f analyze one porject's data for profiling.
one column contains 69,781 unique values(distribution domainvalue shld also be 69,781?), while after column analysis, all the related reports keep saying only 32,000 distribution/unique/disntinct/cardinality values found for this column though it read all the source data of 89661rows.
i finally find some clue in the log file, one line mentioned:
---------- 2/2 18:13:32 -- ELAPSED: 11.333 min wall, 9.047 min cpu ----------
Stored 32000 rows into PSDB as full distribution for column 'EQPNO' (Id 1703).
Cardinality is 32000; MrdbDistrbutionLimit is '32000'.
"
i suspect the MrdbDistrbutionLimit casue the problem though all the p.f related documents i can find never tell there's such limit.
is there anyone knows whether the software has such limit and if it is true to cause abv mentioned problem? More important, how to solve it?
deem it is very successful tool in profiling, and i wld prefer to deem it be me not properly use it...
franky struglling with profilestage...
question about "MrdbDistrbutionLimit" of profilest
This forum contains ProfileStage posts and now focuses at newer versions Infosphere Information Analyzer.
Return to “Information Analyzer (formerly ProfileStage)”
Jump to
- Moderators' Choice
- ↳ Editor's BLOG Corner
- ↳ Ask the Experts! - Dads and Grads
- ↳ DSXchange Testimonials
- ↳ Cognos (IBM BI)
- FAQs
- ↳ FAQs
- ↳ FAQ Discussion
- DataStage
- ↳ General
- ↳ IBM<sup>®</sup> Infosphere DataStage Server Edition
- ↳ IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
- ↳ Archive of DataStage Users@Oliver.com
- IBM<sup>®</sup>Infosphere Products<sup></sup>
- ↳ Business Glossary
- Suggestions
- ↳ Site/Forum
- ↳ Enhancement Wish List
- Consulting
- ↳ Talent
- ↳ Looking for Talent
- Support
- ↳ Parameter Manager
- ↳ Compile All Plus
- Usergroup Forums
- ↳ Usergroup Central Forum
- ↳ Heartland Usergroup Forum
- The Written Word
- ↳ Articles, White Papers and Tips and Tricks
- ↳ Product Documentation
- Third Party Applications
- ↳ Third Party Applications
- Product Derivatives
- ↳ Functions
- ↳ Routines
- ↳ Jobs
- ↳ Logs
- Tools
- ↳ Tools Forum
- Category
- ↳ Infosphere Master Data Management
- ↳ Data Quality Best Practices
- ↳ IBM QualityStage
- ↳ Information Analyzer (formerly ProfileStage)
- ↳ IBM<sup>®</sup> SOA Editions (Formerly RTI Services)
- ↳ IBM<sup>®</sup> DataStage TX
- ↳ BI
- ↳ Data Integration