MQ Job Failing

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

ankursaxena.2003
Participant
Posts: 96
Joined: Mon May 14, 2012 1:30 pm

Post by ankursaxena.2003 »

Hi,

I have an MQ job that is failing in production with the error below. There is no pattern to the failures: sometimes the job runs for 2 days without failing, and on other days it fails 3 to 4 times in an hour.

I have a similar job running against the 4 other non-prod regions, and they have no issues.

APT_CombinedOperatorController(8),0: Fatal Error: Waiting on EOW has reached its timeout "120" seconds for process "66,322,586" on communication handle "1". Aborting...

Thanks,
Ankur Saxena
JRodriguez
Premium Member
Posts: 425
Joined: Sat Nov 19, 2005 9:26 am
Location: New York City

Post by JRodriguez »

Difficult to say without more details; there are lots of variables to consider when MQ Series is in the picture. What's your job design? Do you have a similar job or exactly the same job running in the other regions?

Did you define a timeout or an end-of-wave (EOW) property in the MQ Connector? Is this an always-running job?

Regards
Julio Rodriguez
ETL Developer by choice

"Sure we have lots of reasons for being rude - But no excuses