ETL Development standards, best practices, guidelines etc.

A forum for discussing DataStage® basics. If you're not sure where your question goes, start here.

Moderators: chulett, rschirm, roy

anamika
Participant
Posts: 16
Joined: Sat Feb 27, 2016 9:43 am
Location: Ottawa

Post by anamika »

Hello,
I have been asked to take on the task of evolving ETL architecture, design and development standards, specifically using DataStage and the InfoSphere suite of tools.
I have started gathering and reviewing documents and presentations, and searching the web.
I am looking to this group for suggestions, comments and ideas regarding the above.
Thank you for reading this post, and for any contributions.

/A
ETL, DW, BI Consultant
qt_ky
Premium Member
Posts: 2895
Joined: Wed Aug 03, 2011 6:16 am
Location: USA

Post by qt_ky »

Search for the IBM Redbook, IBM InfoSphere DataStage Parallel Framework Standard Practices. It has several chapters and appendices on these topics.
Choose a job you love, and you will never have to work a day in your life. - Confucius
leandrohmvieira
Participant
Posts: 44
Joined: Wed Sep 02, 2015 7:19 am
Location: Brasilia, Brazil

Post by leandrohmvieira »

Is there a similar book for Server Job development?
Leandro Vieira

Data Expert - Brasilia, Brazil
chulett
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

Not that I recall on 'standards' and such. There has always been a Server Job Developer's Guide that shipped with the product, or the online documentation that you can find here.
-craig

"You can never have too many knives" -- Logan Nine Fingers
anamika
Participant
Posts: 16
Joined: Sat Feb 27, 2016 9:43 am
Location: Ottawa

Post by anamika »

Thanks everybody for pointing out those links.
Yes, I have been reading up on lots of DS documentation, IBM Redbooks, general web searches and the like. Please do post anything else you think is relevant.

Received some private messages as well.

Thanks
ETL, DW, BI Consultant
PaulVL
Premium Member
Posts: 1315
Joined: Fri Dec 17, 2010 4:36 pm

Post by PaulVL »

Here's another undocumented practice you should instill in your developers right from the start.

Create Diagnostic jobs for your database connectivity.


Connector Stage -> Peek


select * from table_x WHERE 1=2


You will save so much time with that stupid little job, as long as it uses the same parameters as your real ETL job. All it does is validate connectivity. You would never run it in a daily or nightly flow. It would only be used if you change your connectivity credentials, or if your environment got hosed and you simply want a ping test to your database.

Executing your suite of diagnostic jobs the day before your big GO LIVE roll-out... gold.
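
For anyone who wants to see the zero-row probe idea outside of DataStage, here is a minimal Python sketch. It is only an illustration, not how the Connector stage works internally: the standard-library sqlite3 module stands in for the real database, and the helper names probe and run_diagnostics are made up for this example. Only table_x and the WHERE 1=2 query come from the post above.

```python
import sqlite3


def probe(conn, table):
    """Run a zero-row query to confirm the connection and table are usable.

    'table' is interpolated into the SQL text, so pass trusted names only.
    """
    cur = conn.execute(f"SELECT * FROM {table} WHERE 1=2")
    rows = cur.fetchall()                       # always empty: 1=2 is never true
    columns = [d[0] for d in cur.description]   # metadata is still returned
    return columns, rows


def run_diagnostics(conn, tables):
    """Probe each table and record OK or FAILED instead of stopping at the first error."""
    results = {}
    for table in tables:
        try:
            columns, _ = probe(conn, table)
            results[table] = ("OK", columns)
        except sqlite3.Error as exc:
            results[table] = ("FAILED", str(exc))
    return results


if __name__ == "__main__":
    # In-memory database as a stand-in for the real target (assumption).
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE table_x (id INTEGER, name TEXT)")
    conn.execute("INSERT INTO table_x VALUES (1, 'a')")
    report = run_diagnostics(conn, ["table_x", "missing_table"])
    for table, (status, detail) in report.items():
        print(table, status, detail)
```

The point of the 1=2 predicate is that the database still has to accept the login, find the table and describe its columns, but no data is moved, so the check stays cheap however large the table is.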
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia

Post by ray.wurlod »

... or run your real jobs in Validate mode.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.