Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
Last revisionBoth sides next revision
en:centro:servizos:servidores_de_computacion_gpgpu [2023/01/11 13:58] – [Service description] fernando.guillenen:centro:servizos:servidores_de_computacion_gpgpu [2023/10/11 13:56] – [How to connect the servers] fernando.guillen
Line 2: Line 2:
  
 ===== Service description ===== ===== Service description =====
- +==== Servers with free access GPUs ====
-Servers with graphic cards: +
-  * ''ctgpgpu3'': +
-    * PowerEdge R720 +
-    * 1 x [[http://ark.intel.com/products/64588|Intel Xeon E52609]] +
-    * 16 GB RAM (1 DDR3 DIMM  1600MHz) +
-    * Connected to a graphical card extensión box with: +
-      * Gigabyte GeForce GTX Titan 6GB (2014) +
-      * Nvidia Titan X Pascal 12GB (2016) +
-    * Ubuntu 18.04 operative system +
-      * Slurm (//mandatory to queue jobs!//) +
-      * CUDA 10.2 (//Nvidia official repo//) +
-      * Docker-ce 18.06 (//Docker official repo//) +
-      * Nvidia-docker 2.0.3 (//Nvidia official repo//) +
-      * Nvidia cuDNN v7.2.1 for CUDA 9.2 +
-      * Intel Parallel Studio Professional for C++ 2015 (//single license! coordinate with other users!//) +
-      * ROS Melodic Morenia (//repositorio oficial de ROS//)+
   * ''ctgpgpu4'':   * ''ctgpgpu4'':
       * PowerEdge R730       * PowerEdge R730
Line 24: Line 8:
       * 128 GB RAM (4 DDR4 DIMM  2400MHz)       * 128 GB RAM (4 DDR4 DIMM  2400MHz)
       * 2 x Nvidia GP102GL 24GB [Tesla P40]       * 2 x Nvidia GP102GL 24GB [Tesla P40]
-      * Centos 7.4 +      * AlmaLinux 9.1 
-          * Docker 17.09 and nvidia-docker 1.0.1 +          * Cuda 12.0 
-          * OpenCV 2.4.5 +          * **Mandatory use of Slurm queue manager**
-          * Dliv, Caffe, Caffe2 and pycaffe + 
-          Python 3.4cython, easydict, sonnet +  HPC cluster servers[[ en:centro:servizos:hpc | HPC cluster ]] 
-          TensorFlow +  CESGA servers: [[ en:centro:servizos:cesga | Access procedure info ]]  
-  * ''ctgpgpu5'':+ 
 +==== Restricted access GPU servers  ==== 
 + * ''ctgpgpu5'':
       * PowerEdge R730       * PowerEdge R730
       * 2 x  [[https://ark.intel.com/products/92980/Intel-Xeon-Processor-E5-2623-v4-10M-Cache-2_60-GHz|Intel Xeon E52623v4]]       * 2 x  [[https://ark.intel.com/products/92980/Intel-Xeon-Processor-E5-2623-v4-10M-Cache-2_60-GHz|Intel Xeon E52623v4]]
Line 55: Line 41:
           * Docker 19.03           * Docker 19.03
           * [[https://github.com/NVIDIA/nvidia-docker | Nvidia-docker  ]]           * [[https://github.com/NVIDIA/nvidia-docker | Nvidia-docker  ]]
-  * ''ctgpgpu7'':  +  * ''ctgpgpu9'': 
-      * hpc-gpu2 in the HPC cluster +      * Dell PowerEdge R750 
-  * ''ctgpgpu8'':  +      * 2 x [[ https://ark.intel.com/content/www/es/es/ark/products/215274/intel-xeon-gold-6326-processor-24m-cache-2-90-ghz.html |Intel Xeon Gold 6326 ]] 
-      * hpc-gpu1 in the HPC cluster+      * 128 GB RAM  
 +      * 2 x NVIDIA Ampere A100 80 GB 
 +      * AlmaLinux 8.6 
 +           * NVIDIA 515.48.07 driver and CUDA 11.7 
 +  * ''ctgpgpu10'': 
 +      * PowerEdge R750 
 +      * 2 x [[ https://ark.intel.com/content/www/es/es/ark/products/215272/intel-xeon-gold-5317-processor-18m-cache-3-00-ghz.html |Intel Xeon Gold 5317 ]] 
 +      * 128 GB  RAM  
 +      * NVIDIA Ampere A100 80 GB 
 +      * Sistema operativo AlmaLinux 8.7 
 +           * Driver NVIDIA 525.60.13 and CUDA 12.0 
 +  * ''ctgpgpu11'': 
 +      * Server Gybabyte  G482-Z54 
 +      * 2 x [[ https://www.amd.com/es/products/cpu/amd-epyc-7413 | AMD EPYC 7413 @2,65 GHz 24c ]] 
 +      * 256 GB RAM 
 +      * 4 x NVIDIA Ampere A100 de 80 GB   
 +      * AlmaLinux 9.1 
 +           * Driver NVIDIA 520.61.05 and CUDA 11.8 
 +  * ''ctgpgpu12'': 
 +      * Servidor Dell PowerEdge R760 
 +      * 2 procesadores [[ https://ark.intel.com/content/www/xl/es/ark/products/232376.html |Intel Xeon Silver 4410Y ]] 
 +      * 384 GB de memoria RAM  
 +      * 2 x NVIDIA Hopper H100 de 80 GB 
 +      * Sistema operativo AlmaLinux 9.2 
 +           * Driver NVIDIA 535.104.12 para CUDA 12.2 
 ===== Activation ===== ===== Activation =====
-All CITIUS users can access this service, but as not all servers are available all the time you have to register beforehand filling the [[https://citius.usc.es/dashboard/enviar-incidencia| requests and problem reporting form]]. +Not all servers are available to use freely. Access must be requested filling the [[https://citius.usc.es/dashboard/enviar-incidencia| requests and problem reporting form]]. Users without access permission will receive an incorrect password error message.
  
 ===== User Manual ===== ===== User Manual =====
Line 66: Line 77:
 Use SSH. Hostnames and ip addresses are: Use SSH. Hostnames and ip addresses are:
  
-  * ctgpgpu3.inv.usc.es - 172.16.242.93:22+
   * ctgpgpu4.inv.usc.es - 172.16.242.201:22   * ctgpgpu4.inv.usc.es - 172.16.242.201:22
   * ctgpgpu5.inv.usc.es - 172.16.242.202:22   * ctgpgpu5.inv.usc.es - 172.16.242.202:22
   * ctgpgpu6.inv.usc.es - 172.16.242.205:22   * ctgpgpu6.inv.usc.es - 172.16.242.205:22
-  * ctgpgpu7.inv.usc.es - 172.16.242.207:22 +  * ctgpgpu9.inv.usc.es - 172.16.242.94:22 
-  * ctgpgpu8.inv.usc.es - 172.16.242.208:22 +  * ctgpgpu10.inv.usc.es - 172.16.242.95:22 
 +  * ctgpgpu11.inv.usc.es - 172.16.242.96:22 
 +  * ctgpgpu12.inv.usc.es - 172.16.242.97:22
 Connection in only possible from inside the CITIUS network. To connect from other places or from the RAI network it is necessary to use the [[https://wiki.citius.usc.es/en:centro:servizos:vpn:start | VPN]] or the [[https://wiki.citius.usc.es/en:centro:servizos:pasarela_ssh|SSH gateway]]. Connection in only possible from inside the CITIUS network. To connect from other places or from the RAI network it is necessary to use the [[https://wiki.citius.usc.es/en:centro:servizos:vpn:start | VPN]] or the [[https://wiki.citius.usc.es/en:centro:servizos:pasarela_ssh|SSH gateway]].