Borrar filtros
Borrar filtros

Why do I receive "qsub: Job exceeds queue resource limits" error during parallel job submissiont to a Torque / PBS cluster?

46 visualizaciones (últimos 30 días)
I am submitting a parallel job to a Parallel Computing Toolbox configuration for a Torque or PBS cluster. When ClusterSize is set to a value greater than the number of nodes in the cluster (though fewer than the total number of cores or MATLAB Parallel Server worker licenses) I receive the an error similar to the following:
ERROR: Error executing the PBS script command 'qsub'. The reason given is qsub: Job exceeds queue resource limits MSG=cannot locate feasible nodes
The job completes successfully if ClusterSize is set to a value equal to or less than the number of nodes in the cluster. How can my parallel jobs take advantage of the additional cores and worker licenses available?

Respuesta aceptada

MathWorks Support Team
MathWorks Support Team el 16 de Sept. de 2024 a las 0:00
Editada: MathWorks Support Team el 16 de Sept. de 2024 a las 15:02
The error indicates that the job request exceeds the queue resource limits. Verify with the system administrator the exact limits of the queue, number of physical nodes and number of processors/cores per node. If there are multiple cores per node, and the number of workers per job exceeds the number of physical nodes, you will need to modify the communicatingSubmitFcn.m file (pbsNonSharedParallelSubmitFcn.m in older releases) on the client. In particular you will need to change the line containing:
procsPerNode = 1;
Change the value assigned to procsPerNode from 1 to 2,3,4...N cores to take advantage of all available cores on cluster.
For more information please refer to generic scheduler section of the MATLAB Parallel Computing Toolbox Documentation:
Configure Using the Generic Scheduler Interface
NOTE: Starting in R2019a the following name changes occurred:
  •     MATLAB Distributed Computing Server was renamed to MATLAB Parallel Server
  •     mdce_def was renamed to mjs_def
  •     mdce binary was renamed to mjs

Más respuestas (0)

Categorías

Más información sobre Introduction to Installation and Licensing en Help Center y File Exchange.

Etiquetas

Aún no se han introducido etiquetas.

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by