Tips how to optimized DataStage Jobs
Posted: Thu Nov 13, 2008 8:42 pm
Hello experts,
My DS jobs are processing about 12 million customers.
Objective of DS Jobs:
- To know how many duplicate customers
- To get the best breed record among the duplicates.
- To integrate customers from different sources.
- To generate enterprise customer data warehouse.
But the jobs took several hours to complete. And sometimes, I got fatal errors :
"APT_CombinedOperatorController(0),0: Write to dataset on [fd 15] failed (Error 0) on node node1, hostname CRM01"
Do anybody have idea about optimizing jobs... or know how to avoid the above error?
Can I remove DS generated (unnecessary) columns? Does it helps?
Thanks!
My DS jobs are processing about 12 million customers.
Objective of DS Jobs:
- To know how many duplicate customers
- To get the best breed record among the duplicates.
- To integrate customers from different sources.
- To generate enterprise customer data warehouse.
But the jobs took several hours to complete. And sometimes, I got fatal errors :
"APT_CombinedOperatorController(0),0: Write to dataset on [fd 15] failed (Error 0) on node node1, hostname CRM01"
Do anybody have idea about optimizing jobs... or know how to avoid the above error?
Can I remove DS generated (unnecessary) columns? Does it helps?
Thanks!