Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revision | |||
en:centro:servizos:hpc:envio_trabajo [2018/09/19 18:28] – fernando.guillen | en:centro:servizos:hpc:envio_trabajo [2018/09/20 10:20] (current) – fernando.guillen | ||
---|---|---|---|
Line 40: | Line 40: | ||
| | ||
</ | </ | ||
- | The state column indicates | + | The first letter of the state column indicates if the queue is (E)nabled or (D)isabled and the second letter if the queue is (R)unning |
- | ==== Información de los trabajos | + | ==== Jo information |
- | Cada vez que se envía un trabajo se le asigna un JOB_ID | + | Each time a job is assigned a JOB_ID |
<code bash> | <code bash> | ||
Line 49: | Line 49: | ||
Job id Name | Job id Name | ||
------------------------- ---------------- --------------- | ------------------------- ---------------- --------------- | ||
- | 999999.ctcomp2 | + | 999999.ctcomp2 |
</ | </ | ||
- | La columna | + | The Time Use column shows the CPU time used. |
- | La columna | + | The S column is the job state, which can be one of the following: |
| | ||
| | ||
Line 62: | Line 62: | ||
<code bash> | <code bash> | ||
- | ct$ qstat -f 999999.ctcomp2 | + | ct$ qstat -f 999999.ctcomp2 |
Job Id: 999999.ctcomp2.innet | Job Id: 999999.ctcomp2.innet | ||
- | Job_Name = nombre_del_trabajo | + | Job_Name = job_name |
- | Job_Owner = nombre_usuario@ctcomp2.innet | + | Job_Owner = user_name@ctcomp2.innet |
job_state = Q | job_state = Q | ||
queue = np32 | queue = np32 | ||
Line 71: | Line 71: | ||
Checkpoint = u | Checkpoint = u | ||
ctime = Fri Feb 12 10:09:34 2016 | ctime = Fri Feb 12 10:09:34 2016 | ||
- | Error_Path = ctcomp2.innet:/ | + | Error_Path = ctcomp2.innet:/ |
Hold_Types = n | Hold_Types = n | ||
Join_Path = n | Join_Path = n | ||
Line 78: | Line 78: | ||
Mail_Users = nombre_usuario@usc.es | Mail_Users = nombre_usuario@usc.es | ||
mtime = Fri Feb 12 10:09:34 2016 | mtime = Fri Feb 12 10:09:34 2016 | ||
- | Output_Path = ctcomp2.innet:/ | + | Output_Path = ctcomp2.innet:/ |
Priority = 0 | Priority = 0 | ||
qtime = Fri Feb 12 10:09:34 2016 | qtime = Fri Feb 12 10:09:34 2016 | ||
Line 88: | Line 88: | ||
Resource_List.walltime = 12:00:00 | Resource_List.walltime = 12:00:00 | ||
substate = 10 | substate = 10 | ||
- | Variable_List = PBS_O_QUEUE=batch, | + | Variable_List = PBS_O_QUEUE=batch, |
PBS_O_LOGNAME=nombre_usuario, | PBS_O_LOGNAME=nombre_usuario, | ||
PBS_O_PATH=/ | PBS_O_PATH=/ | ||
PBS_O_MAIL=/ | PBS_O_MAIL=/ | ||
- | PBS_O_LANG=es_ES.UTF-8, | + | PBS_O_LANG=es_ES.UTF-8, |
PBS_O_HOST=ctcomp2.innet, | PBS_O_HOST=ctcomp2.innet, | ||
- | euser = nombre_usuario | + | euser = user_name |
egroup = citius | egroup = citius | ||
queue_rank = 2110 | queue_rank = 2110 | ||
Line 105: | Line 105: | ||
</ | </ | ||
- | Una característica interesante de los trabajos finalizados es el EXIT_STATUS | + | An interesting characteristic about the finished jobs is the EXIT_STATUS |
- | ^ | + | ^ |
| JOB_EXEC_OVERLIMIT | | JOB_EXEC_OVERLIMIT | ||
| JOB_EXEC_STDOUTFAIL | | JOB_EXEC_STDOUTFAIL | ||
Line 119: | Line 119: | ||
| JOB_EXEC_OK | | JOB_EXEC_OK | ||
| | | | ||
- | | | + | | |
<code bash> | <code bash> | ||
- | ct$ checkjob 999999.ctcomp2 | + | ct$ checkjob 999999.ctcomp2 |
checking job 999999 | checking job 999999 | ||
State: Running | State: Running | ||
- | Creds: | + | Creds: |
WallTime: 00:25:46 of 12:00:00 | WallTime: 00:25:46 of 12:00:00 | ||
SubmitTime: Tue Feb 16 10:40:31 | SubmitTime: Tue Feb 16 10:40:31 | ||
Line 152: | Line 152: | ||
<code bash> | <code bash> | ||
- | ct$ tracejob -n 3 999999.ctcomp2 | + | ct$ tracejob -n 3 999999.ctcomp2 |
Job: 136553.ctcomp2.innet | Job: 136553.ctcomp2.innet | ||
Line 162: | Line 162: | ||
02/10/2016 15: | 02/10/2016 15: | ||
02/10/2016 15: | 02/10/2016 15: | ||
- | 02/10/2016 15: | + | 02/10/2016 15: |
- | jobname=nombre_trabajo | + | jobname=job_name |
02/10/2016 16: | 02/10/2016 16: | ||
resources_used.mem=234868kb resources_used.vmem=1002480kb | resources_used.mem=234868kb resources_used.vmem=1002480kb | ||
resources_used.walltime=00: | resources_used.walltime=00: | ||
02/10/2016 16: | 02/10/2016 16: | ||
- | 02/10/2016 16: | + | 02/10/2016 16: |
02/10/2016 17: | 02/10/2016 17: | ||
</ | </ | ||
- | ==== Información de los nodos ==== | + | ==== Node info ==== |
- | Para obtener una vista global | + | To get a global |
<code bash> | <code bash> | ||
$ nodes-usage | $ nodes-usage | ||
Line 202: | Line 202: | ||
</ | </ | ||
- | Para obtener información sobre los usuarios que se encuentran en cada nodo, se puede utilizar el comando | + | To get information of the users in each node, the '' |
<code bash> | <code bash> | ||
Line 210: | Line 210: | ||
</ | </ | ||
- | Para obtener información más detallada sobre los nodos, se puede utilizar el comando | + | To get more detailed information on the nodes, the '' |
<code bash> | <code bash> | ||
- | ct$ pbsnodes | + | ct$ pbsnodes |
node1 | node1 | ||
state = free | state = free | ||
Line 238: | Line 238: | ||
<code bash> | <code bash> | ||
- | ct$ pbsnodes -l # Listado de los nodos apagados(down) | + | ct$ pbsnodes -l # Shutdown node list(down) |
node2 down, | node2 down, | ||
node3 down, | node3 down, | ||
Line 258: | Line 258: | ||
- | ===== Eliminar un trabajo de la cola ===== | + | ===== Cancel a job from the queue ===== |
------------- | ------------- | ||
- | El comando | + | The '' |
< | < |