Hi ,
I need templates/inputs as to how to create a data profiling sheet (process).
Being in ETL all these days I dont have much idea as to how to document a profiling process.
I would appreciate any help on this.
Thanks in advance,
Regards,
Santosh.
Information Analyzer Template
-
- Charter Member
- Posts: 35
- Joined: Sun Jan 16, 2005 8:39 am
- Location: US
Information Analyzer Template
Santosh
-
- Participant
- Posts: 54607
- Joined: Wed Oct 23, 2002 10:52 pm
- Location: Sydney, Australia
- Contact:
Basically, let the tool guide you and add notes, classes, terms and policies to the metadata and analysis results. Produce reports at each phase - there are ten available. There are five analyses that need to be performed in turn: column analysis (which is resource hungry, as it ought to examine the totality of the source data), primary key analysis (single column or multi column), table (intra-table dependency) analysis, cross-domain analysis (looking for foreign key candidates and redundant data). You can establish a baseline set of analysis results against which you can compare (a) other analyses based on different assumptions/thresholds, and (b) future analyses using the same specifications (which can be marked as checkpoints, and used to establish trends). Publish analysis results into the Information Services repository so that they are available to other tools such as FastTrack and DataStage.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
-
- Premium Member
- Posts: 145
- Joined: Fri Sep 21, 2007 9:35 am
- Location: Boston