Throttling performance on a grid?
Hi everyone,
We have a grid setup that is highly utilised already.
Now there is another group of developers that need access, but we want to limit the amount of resources they get to use, to reduce the potential performance impact on the other jobs that are running (some of them quite critical).
Is there a way to do this? How do you configure things so some projects have access to some of the CPUs, and some have access to all of them?
TIA.
-
- Participant
- Posts: 54607
- Joined: Wed Oct 23, 2002 10:52 pm
- Location: Sydney, Australia
- Contact:
In a grid environment, this is usually where your resource management software, such as PBS Pro, comes into play. A situation like yours is a piece of cake for PBS Pro to handle. You can even collect the resource usage data that PBS Pro generates and use it for charge-back to each project, department, etc.
lstsaur wrote: Yes, you configure that kind of information (how many CPUs, the job's priority, etc.) in PBS Pro, and it passes that information on to the grid enablement software. There's much more to it.
So it's not something you can do in the DataStage config file, to limit how much of the grid a project's jobs can use?
You can certainly create a configuration file that requests fewer nodes than usual, but you can't prevent "them" from starting zillions of jobs each of which will dynamically request that number of nodes from the grid.
Some grid management software will let you throttle the number of nodes that a particular user may request, but that's not a feature within DataStage.
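To put rough numbers on that, here is a minimal sketch in Python. It is not DataStage or grid manager code; the function names, the job counts and the per-user cap of 6 nodes are all made up for illustration. It shows why a small configuration file alone does not bound total usage, and what a per-user node cap in the grid manager would change.

```python
from __future__ import annotations


def nodes_requested(jobs_running: int, nodes_per_job: int) -> int:
    """Total compute nodes requested when every job asks for the same node count."""
    return jobs_running * nodes_per_job


def admit(requests: list[int], user_cap: int) -> tuple[list[int], list[int]]:
    """Admit requests in arrival order while the user's total stays within user_cap.

    Returns (admitted, held). Held requests stay queued; nothing is dropped.
    This is a hypothetical policy, not the behaviour of any specific product.
    """
    admitted: list[int] = []
    held: list[int] = []
    in_use = 0
    for nodes in requests:
        if in_use + nodes <= user_cap:
            admitted.append(nodes)
            in_use += nodes
        else:
            held.append(nodes)
    return admitted, held


# A "small" 2-node configuration file does not limit the total:
print(nodes_requested(jobs_running=50, nodes_per_job=2))  # 100

# With an assumed per-user cap of 6 nodes, only three 2-node jobs run at once.
admitted, held = admit([2] * 50, user_cap=6)
print(len(admitted), len(held))  # 3 47
```

The cap has to live in whatever hands out the nodes, which is why this ends up being a grid manager setting rather than a DataStage one.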
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
No. Once your DataStage server is "gridified", your config file is generated dynamically for you, based on the resource allocation configured in PBS Pro. That means I can configure your project to always run on a particular server with only 1 CPU. You can also control your project's job priority and time parameters there, so even if zillions of jobs are submitted from that project, they will be held in the queue until the priority and time conditions are met.
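To illustrate what "generated dynamically" could look like, here is a rough Python sketch that renders a parallel configuration file from a list of hosts handed back by the resource manager. It is only a guess at the idea, not the grid enablement software's actual code; `render_apt_config`, the host name `compute07` and the disk paths are placeholders I made up.

```python
from __future__ import annotations


def render_apt_config(
    hosts: list[str],
    disk_dir: str = "/data/ds/disk",        # placeholder path
    scratch_dir: str = "/data/ds/scratch",  # placeholder path
) -> str:
    """Render one node entry per allocated host in parallel config file syntax."""
    lines = ["{"]
    for index, host in enumerate(hosts, start=1):
        lines += [
            f'  node "node{index}"',
            "  {",
            f'    fastname "{host}"',
            '    pools ""',
            f'    resource disk "{disk_dir}" {{pools ""}}',
            f'    resource scratchdisk "{scratch_dir}" {{pools ""}}',
            "  }",
        ]
    lines.append("}")
    return "\n".join(lines) + "\n"


# If the resource manager grants this project a single server, the job sees one node.
print(render_apt_config(["compute07"]))
```

The point is that the job never chooses its own nodes: whatever the resource manager allocates is all that ends up in the file.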
We're a grid shop.
I would investigate submitting Project X (which you want to run on a subset of your total GRID compute nodes) to a different Queue.
Your Grid Resource Manager defines which servers can service which queues. So, if you want to limit them to 5 out of 20 grid servers, do it via your queue.
You can also submit them to a lower-priority queue. This will only affect their wait time in the queue. Once dispatched to the GRID, you will not be able to lower their priority, since DataStage is not really using the GRM to its fullest potential.
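A toy model of the queue idea, in Python, in case it helps. This is not how any particular GRM is implemented; the queue names, priorities and `gridNN` host names are invented. Each queue lists the servers it may dispatch to, and priority only decides who leaves the queue first.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class Queue:
    name: str
    priority: int          # higher priority is dispatched first
    hosts: frozenset[str]  # servers this queue is allowed to use


@dataclass(frozen=True)
class Job:
    name: str
    queue: Queue


def dispatch(jobs: list[Job], free_hosts: set[str]) -> tuple[dict[str, str], list[str]]:
    """Give each job one free host permitted by its queue, highest priority first.

    Returns (placements, waiting). Priority only affects dispatch order here;
    nothing changes for a job once it has been placed.
    """
    free = set(free_hosts)
    placements: dict[str, str] = {}
    waiting: list[str] = []
    # sorted() is stable, so jobs of equal priority keep submission order.
    for job in sorted(jobs, key=lambda j: -j.queue.priority):
        candidates = sorted(free & job.queue.hosts)
        if candidates:
            placements[job.name] = candidates[0]
            free.remove(candidates[0])
        else:
            waiting.append(job.name)
    return placements, waiting


all_hosts = {f"grid{n:02d}" for n in range(1, 21)}          # 20 grid servers
dev = Queue("dev", priority=10, hosts=frozenset(sorted(all_hosts)[:5]))
prod = Queue("prod", priority=100, hosts=frozenset(all_hosts))

jobs = [Job(f"dev{i}", dev) for i in range(1, 8)] + [Job("prod1", prod)]
placements, waiting = dispatch(jobs, all_hosts)
print(placements)
print(waiting)  # ['dev5', 'dev6', 'dev7']
```

Even with 15 servers sitting idle, the extra dev jobs wait, because their queue can only reach its own 5 servers.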