Differences

This shows you the differences between two versions of the page.

--- en:centro:servizos:servidores_de_computacion_gpgpu [2020/10/28 14:11] – fernando.guillen
+++ en:centro:servizos:servidores_de_computacion_gpgpu [2024/10/01 17:34] (current) – jorge.suarez
@@ Line 2: / Line 2: @@
 ===== Service description =====
+==== Servers with free access GPUs ====
-Servers with graphic cards:
-  * ''ctgpgpu2'':
-    * Dell Precision R5400
-    * 2 x [[http://ark.intel.com/products/33082/|Intel Xeon E5440]]
-    * 8 GB RAM (4 x DDR2 FB-DIMM 667 MHz)
-    * 1 Nvidia GK104 [Geforce GTX 680]
-    * Ubuntu 18.04 operating system
-      * Slurm (//mandatory to queue jobs!//)
-      * CUDA 9.2 (//Nvidia official repo//)
-      * Docker-ce 18.06 (//Docker official repo//)
-      * Nvidia-docker 2.0.3 (//Nvidia official repo//)
-      * Nvidia cuDNN v7.2.1 for CUDA 9.2
-      * Intel Parallel Studio Professional for C++ 2015 (//single license! coordinate with other users!//)
-  * ''ctgpgpu3'':
-    * PowerEdge R720
-    * 1 x [[http://ark.intel.com/products/64588|Intel Xeon E52609]]
-    * 16 GB RAM (1 DDR3 DIMM  1600MHz)
-    * Connected to a graphics card expansion box with:
-      * Gigabyte GeForce GTX Titan 6GB (2014)
-      * Nvidia Titan X Pascal 12GB (2016)
-    * Ubuntu 18.04 operating system
-      * Slurm (//mandatory to queue jobs!//)
-      * CUDA 9.2 (//Nvidia official repo//)
-      * Docker-ce 18.06 (//Docker official repo//)
-      * Nvidia-docker 2.0.3 (//Nvidia official repo//)
-      * Nvidia cuDNN v7.2.1 for CUDA 9.2
-      * Intel Parallel Studio Professional for C++ 2015 (//single license! coordinate with other users!//)
-      * ROS Melodic Morenia (//ROS official repo//)
   * ''ctgpgpu4'':
       * PowerEdge R730
@@ Line 38: / Line 8: @@
       * 128 GB RAM (4 DDR4 DIMM  2400MHz)
       * 2 x Nvidia GP102GL 24GB [Tesla P40]
-      * Centos 7.4
+      * AlmaLinux 9.1
-          * Docker 17.09 and nvidia-docker 1.0.1
+          * CUDA 12.0
-          * OpenCV 2.4.5
+          * **Mandatory use of Slurm queue manager**.
-          * Dliv, Caffe, Caffe2 and pycaffe
-          * Python 3.4: cython, easydict, sonnet
+  * HPC cluster servers: [[ en:centro:servizos:hpc | HPC cluster ]]
-          * TensorFlow
+  * CESGA servers: [[ en:centro:servizos:cesga | Access procedure info ]]
-  * ''ctgpgpu5'':
+==== Restricted access GPU servers  ====
+  * ''ctgpgpu5'':
       * PowerEdge R730
       * 2 x  [[https://ark.intel.com/products/92980/Intel-Xeon-Processor-E5-2623-v4-10M-Cache-2_60-GHz|Intel Xeon E52623v4]]
@@ Line 69: / Line 41: @@
           * Docker 19.03
           * [[https://github.com/NVIDIA/nvidia-docker | Nvidia-docker  ]]
-  * ''ctgpgpu7'':
+  * ''ctgpgpu9'':
-      * Server Dell PowerEdge R740
+      * Dell PowerEdge R750
-      * 2 processors[[https://ark.intel.com/content/www/us/en/ark/products/193388/intel-xeon-gold-5220-processor-24-75m-cache-2-20-ghz.html|Intel Xeon Gold 5220]]
+      * 2 x [[ https://ark.intel.com/content/www/es/es/ark/products/215274/intel-xeon-gold-6326-processor-24m-cache-2-90-ghz.html |Intel Xeon Gold 6326 ]]
-      * 192 GB RAM (12 DDR4 DIMM at 2667MHz)
+      * 128 GB RAM
-      * 2 x Nvidia Tesla V100S 32GB (2019)
+      * 2 x NVIDIA Ampere A100 80 GB
-      * Operating system Centos 8.1
+      * AlmaLinux 8.6
-          * **Slurm as a mandatory use queue manager**.
+           * NVIDIA 515.48.07 driver and CUDA 11.7
-          * ** Modules for library version management **.
+  * ''ctgpgpu10'':
-          * Nvidia Driver 440.64.00 for CUDA 10.2
+      * PowerEdge R750
-          * Docker 19.03
+      * 2 x [[ https://ark.intel.com/content/www/es/es/ark/products/215272/intel-xeon-gold-5317-processor-18m-cache-3-00-ghz.html |Intel Xeon Gold 5317 ]]
-          * [[  https://github.com/NVIDIA/nvidia-docker | Nvidia-docker  ]]
+      * 128 GB  RAM
-  * ''ctgpgpu8'':
+      * NVIDIA Ampere A100 80 GB
-      * Dell PowerEdge R740
+      * Operating system AlmaLinux 8.7
-      * 2 processors  [[https://ark.intel.com/content/www/us/en/ark/products/193388/intel-xeon-gold-5220-processor-24-75m-cache-2-20-ghz.html|Intel Xeon Gold 5220]]
+           * NVIDIA 525.60.13 driver and CUDA 12.0
-      * 192 GB RAM (12 DDR4 DIMM at 2667MHz)
+  * ''ctgpgpu11'':
-      * 2 x Nvidia Tesla V100S 32GB (2019)
+      * Gigabyte G482-Z54 server
-      * Operating System Centos 8.1
+      * 2 x [[ https://www.amd.com/es/products/cpu/amd-epyc-7413 | AMD EPYC 7413 @ 2.65 GHz, 24 cores ]]
-          * **Slurm as a mandatory use queue manager**.
+      * 256 GB RAM
-          * ** Modules for library version management **.
+      * 4 x NVIDIA Ampere A100 80 GB
-          * Nvidia Driver  440.64.00 for CUDA 10.2
+      * AlmaLinux 9.1
-          * Docker 19.03
+           * NVIDIA 520.61.05 driver and CUDA 11.8
-          * [[  https://github.com/NVIDIA/nvidia-docker | Nvidia-docker  ]]
+  * ''ctgpgpu12'':
+      * Dell PowerEdge R760 server
+      * 2 x [[ https://ark.intel.com/content/www/xl/es/ark/products/232376.html |Intel Xeon Silver 4410Y ]]
+      * 384 GB RAM
+      * 2 x NVIDIA Hopper H100 80 GB
+      * Operating system AlmaLinux 9.2
+           * NVIDIA 555.42.06 driver and CUDA 12.5
 ===== Activation =====
-All CITIUS users can access this service, but as not all servers are available all the time you have to register beforehand filling the [[https://citius.usc.es/dashboard/enviar-incidencia| requests and problem reporting form]].
+Not all servers are freely available. Access must be requested by filling in the [[https://citius.usc.es/dashboard/enviar-incidencia| requests and problem reporting form]]. Users without access permission will get an incorrect password error when they try to log in.
 ===== User Manual =====
@@ Line 98: / Line 77: @@
 Use SSH. Hostnames and IP addresses are:
-  * ctgpgpu2.inv.usc.es - 172.16.242.92:22
-  * ctgpgpu3.inv.usc.es - 172.16.242.93:22
-  * ctgpgpu4.inv.usc.es - 172.16.242.201:22
-  * ctgpgpu5.inv.usc.es - 172.16.242.202:22
-  * ctgpgpu6.inv.usc.es - 172.16.242.205:22
-  * ctgpgpu7.inv.usc.es - 172.16.242.207:22
-  * ctgpgpu8.inv.usc.es - 172.16.242.208:22
+  * ctgpgpu4.inv.usc.es - 172.16.242.201
+  * ctgpgpu5.inv.usc.es - 172.16.242.202
+  * ctgpgpu6.inv.usc.es - 172.16.242.205
+  * ctgpgpu9.inv.usc.es - 172.16.242.94
+  * ctgpgpu10.inv.usc.es - 172.16.242.95
+  * ctgpgpu11.inv.usc.es - 172.16.242.96
+  * ctgpgpu12.inv.usc.es - 172.16.242.97
 Connection is only possible from inside the CITIUS network. To connect from other places or from the RAI network it is necessary to use the [[https://wiki.citius.usc.es/en:centro:servizos:vpn:start | VPN]] or the [[https://wiki.citius.usc.es/en:centro:servizos:pasarela_ssh|SSH gateway]].
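
As an illustration of the connection step described in the User Manual, the following minimal Python sketch builds the ''ssh'' command line for one of the hosts listed in the current revision. The host table is copied from the list above. The user name ''jdoe'' and the gateway address ''jdoe@ssh-gateway.example'' are placeholders and do not come from the wiki; the real gateway address is documented on the SSH gateway page. ''-J'' is the OpenSSH jump-host option, and the sketch assumes an OpenSSH client recent enough to support it.

```python
import shlex

# Hostname -> IP address, copied from the host list in the current revision.
GPU_SERVERS = {
    "ctgpgpu4.inv.usc.es": "172.16.242.201",
    "ctgpgpu5.inv.usc.es": "172.16.242.202",
    "ctgpgpu6.inv.usc.es": "172.16.242.205",
    "ctgpgpu9.inv.usc.es": "172.16.242.94",
    "ctgpgpu10.inv.usc.es": "172.16.242.95",
    "ctgpgpu11.inv.usc.es": "172.16.242.96",
    "ctgpgpu12.inv.usc.es": "172.16.242.97",
}


def ssh_command(user, host, gateway=None):
    """Return the argument list for an OpenSSH client call.

    `gateway` is an optional "user@host" jump host, for connections that
    start outside the CITIUS network (hypothetical value in this sketch).
    """
    if host not in GPU_SERVERS:
        raise ValueError(f"unknown GPU server: {host}")
    cmd = ["ssh"]
    if gateway is not None:
        # -J routes the connection through the jump host.
        cmd += ["-J", gateway]
    cmd.append(f"{user}@{host}")
    return cmd


if __name__ == "__main__":
    # Placeholder user name; replace with your own account.
    direct = ssh_command("jdoe", "ctgpgpu9.inv.usc.es")
    # Placeholder gateway; see the SSH gateway page for the real address.
    jumped = ssh_command("jdoe", "ctgpgpu9.inv.usc.es",
                         gateway="jdoe@ssh-gateway.example")
    print(shlex.join(direct))
    print(shlex.join(jumped))
```

The function only assembles the command; it does not open a connection. A user without access permission to the chosen server would still see the incorrect password error mentioned in the Activation section.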