Any way to force a job to run on all compute nodes on a grid?

bobyon · Post by **bobyon** » Tue Nov 12, 2013 11:18 am

I am suspicious that the ulimit settings are not as they should be across all of the compute nodes in our grid. I would like to create a sequence/job that will run on each compute node in the grid and return the results of a couple of ulimit statements. I have the few statements that are required (ulimit -Sa; ulimit -Ha) in a shell script.

I would like the process to be as generic as possible so I can port it to different environments (grids) and run without modification.

However, the only way I can think of to control which node a job runs on is via a config file and I don't really want to create a bunch of config files.

Am I brain dead? Is there a way to do this? Or am I reinventing a wheel here and there is some other way than datastage to do this?

TIA
Bob

deepak.hsbc · Post by **deepak.hsbc** » Tue Nov 12, 2013 1:29 pm

I used a Peek stage to display the result on each node, with ulimit -ah in a job start subroutine. That should work for you.

ray.wurlod · Post by **ray.wurlod** » Tue Nov 12, 2013 2:56 pm

When you create configuration files for grid execution you don't specify the exact nodes, only the number that you require (at two levels). The grid management software actually allocates machines to nodes. This is fully documented in the manuals.

bobyon · Post by **bobyon** » Tue Nov 12, 2013 3:23 pm

Thanks for the response Ray. I do understand the dynamic nature of building the config file.

What I am trying to find is a way to coax the grid management software into running my job on each of the compute nodes so that I can see the ulimit settings on each and every node.

My thought regarding creating config files would have required NOT grid enabling the job.

lstsaur · Post by **lstsaur** » Tue Nov 12, 2013 5:48 pm

So you mean that you want to run a non-grid job on every compute node in a grid?

ray.wurlod · Post by **ray.wurlod** » Wed Nov 13, 2013 12:01 am

OK, an explicit configuration file will do it. Create a job that uses an External Source stage to execute hostname ; ulimit -a and capture all the lines of output into wherever makes sense. This operator should execute on each compute node.
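A minimal sketch of a script that an External Source stage could call for this, assuming a POSIX-style shell on each compute node. The function name `report_ulimits` and the line prefix format are made up for illustration. It tags every line with the host name so the rows can be told apart once they are collected, and reports both the soft and hard limits from the original post.

```shell
#!/bin/sh
# Print soft and hard limits, prefixing each line with the host it ran on.
# report_ulimits is a hypothetical name, not part of DataStage.
report_ulimits() {
    # Fall back to uname -n if the hostname command is not installed.
    host=$(hostname 2>/dev/null || uname -n)
    ulimit -Sa | sed "s|^|${host} soft |"
    ulimit -Ha | sed "s|^|${host} hard |"
}

report_ulimits
```

Because the limits are read by the process the engine starts, this should show what a DataStage job sees rather than what an interactive login sees.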

bobyon · Post by **bobyon** » Wed Nov 13, 2013 8:33 am

lstsaur wrote: So you mean that you want to run a non-grid job on every compute node in a grid?

Actually, it does not matter to me whether it is a grid-enabled job or not, as long as I can get the job to run on all the nodes in the grid.

All I am trying to do is confirm the ulimit settings on each server.

bobyon · Post by **bobyon** » Wed Nov 13, 2013 8:37 am

ray.wurlod wrote: OK, an explicit configuration file will do it. Create a job that uses an External Source stage to execute hostname ; ulimit -a and capture all the lines of output into wherever makes sense. This operator should execute on each compute node.

Now we are getting to the heart of my question. I have the job that captures the ulimit output. But, how do I get it to run on all the compute nodes?

Is the only way to put one job in a sequence for each compute node, passing an explicit config file for each job?

PaulVL · Post by **PaulVL** » Wed Nov 13, 2013 8:41 am

Why don't you just ask your administrator to confirm the settings according to the user id you are using?

bobyon · Post by **bobyon** » Thu Nov 14, 2013 9:46 am

Well, 2 reasons basically:
1 - If you are referring to a Unix admin: they have no way to see what the value of ulimit is as seen from a DataStage job. It is often different from what is seen by just issuing commands on the Unix command line.
and
2 - If you are referring to a DataStage admin: I am the admin, however unfortunate that might be.

ray.wurlod · Post by **ray.wurlod** » Thu Nov 14, 2013 2:25 pm

bobyon wrote: Is the only way to put one job in a sequence for each compute node, passing an explicit config file for each job?

That, or a job that has a configuration file mentioning every compute node.
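To keep this portable between grids, the single configuration file could be generated from a list of host names instead of being written by hand. The following is only a sketch: the function name `make_all_nodes_config`, the node names, and the default disk and scratch paths are hypothetical, and it does not cover whether your site also needs a conductor node entry.

```shell
#!/bin/sh
# Emit a parallel configuration file with one logical node per host.
# Host names are read from stdin, one per line.
# DISK and SCRATCH defaults are hypothetical; use your site's real resource paths.
make_all_nodes_config() {
    disk=${DISK:-/tmp/ds/disk}
    scratch=${SCRATCH:-/tmp/ds/scratch}
    n=0
    echo "{"
    while read -r host; do
        [ -n "$host" ] || continue   # skip blank lines
        n=$((n + 1))
        printf '  node "node%d"\n  {\n' "$n"
        printf '    fastname "%s"\n' "$host"
        printf '    pools ""\n'
        printf '    resource disk "%s" {pools ""}\n' "$disk"
        printf '    resource scratchdisk "%s" {pools ""}\n' "$scratch"
        printf '  }\n'
    done
    echo "}"
}
```

For example, `printf 'compute01\ncompute02\n' | make_all_nodes_config > all_nodes.apt` would write a two-node file (the host names are placeholders). Only the host list changes from one grid to the next.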

Post by **daignault** » Mon Aug 25, 2014 2:09 pm

We have a large grid, so what we do is create an APT configuration file for each compute node on the grid.

We disable the grid through the environment variable and then resubmit the job once for each compute node.
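A rough sketch of the resubmit loop, written as a dry run that only prints the commands. It assumes the per-node files sit in one directory with an `.apt` suffix, that the job exposes `$APT_CONFIG_FILE` and `$APT_GRID_ENABLE` as job parameters, and that `NO` is the value that turns the grid off at your site. The post does not name the variable, so check these against your own installation. The function name, project name, and job name are placeholders.

```shell
#!/bin/sh
# Print one dsjob command per per-node configuration file (dry run only).
# print_per_node_runs, the project and the job name are hypothetical.
# Assumes $APT_CONFIG_FILE and $APT_GRID_ENABLE are defined as job parameters.
print_per_node_runs() {
    project=$1
    job=$2
    cfgdir=$3
    for cfg in "$cfgdir"/*.apt; do
        [ -f "$cfg" ] || continue   # nothing matched the glob
        printf "dsjob -run -wait -param '\$APT_GRID_ENABLE=NO' -param '\$APT_CONFIG_FILE=%s' %s %s\n" \
            "$cfg" "$project" "$job"
    done
}
```

Calling `print_per_node_runs MyProject UlimitCheck /path/to/configs` lists the commands so they can be reviewed before being piped to `sh`.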

Ray D

DSXchange
