Skip to content
HOME > HPC4AI Documentation

HPC4AI Documentation

HPC4AI Documentation

EPI-TO (Arm-NVidia devkit)

System Description

EPI-TO (European Processor Initiative at the University of Torino)

The EPI-TO cluster is composed of 4 GigaByte G242-P32 servers equipped with:

  • 1 Ampere Altra Q80-30 CPU (80-core Arm Neoverse N1);
  • 512GB RAM;
  • 2 NVidia A100 GPU (6192 Cuda Cores, 432 Tensor Cores; 80GB RAM HBM2e each);
  • 2 NVidia BlueField-2 DPU (200Gb/s DOCA-enabled);
  • Infiniband FDR interconnection network;
  • NFS-based shared storage system.

The system is also equipped with 2 frontend nodes with 2 AMD EPYC 7281 CPUs (16-Cores, 125GB RAM).

System Access

The steps to access the EPI-TO platform are:

  1. Request an account by filling out a form. Please note a Google Account is required for this step;
  2. Once the account is approved, log into GitLab and upload your public SSH key here;
  3. You can then login through SSH at <username>@epito.di.unito.it using the corresponding private SSH key;
  4. WIth the same account you can log into the web calendar for booking resources (more details below).
Resource Usage

Resources can be accessed for batched computations through the Slurm queue manager (version 22.05). Plus, a custom Slurm plugin allows users to book scheduled access to resources through an external calendar, the Booked Scheduler

The system offers 4 jobs queues:

  • epito-compile (Time limit: 1h; Max resources: 2 cores, 1 node);
  • epito-1h (Time limit: 1h; Exclusive node access);
  • epito-12h (Time limit: 12h; Exclusive node access);
  • epito-booked (No time limit; Exclusive node access; reservation through the Booked Scheduler) .

Resource usage is accounted using credits, the Booked Scheduler currency. Initial credits count allow computation for a 5000 core-hours total, but more can be requested following this procedure.

To book resources on the system, first log into the web calendar. You can then select one or more epito nodes from the resource list to create a reservation. From the creation page, you can choose the starting and ending date and time, the resources needed and a mandatory title and description, as showed in the image below.

Once the reservation is correctly created, the system will prompt you with a confirmation message:

Now head to the system and login your user via ssh. You can check the corresponding Slurm reservation has been created with the command “scontrol show res”.

The output should look like the following:

You can now submit your batch script. Remember to specify the correct partition “epito-booked” and to use the reservation you created in the step before, or the job may not have the guaranteed resources or may be terminated after the default time limit. Here is a simple example of a script:

#!/bin/sh
#SBATCH –partition=epito-booked
#SBATCH –reservation=test_computing
#SBATCH –nodes=2
srun sleep 120

Using Slurm directives, the partition name, reservation and minimum number of nodes requested for the job are specified in the corresponding line. With “srun” you can specify the command to run on each node. You can find more information in the sbatch official documentation page.

Once the script is ready, you can submit the job with the “sbatch” command with the file and name as argument, and check its status by using the command “squeue” to list the job queue, showing the job id, partition, running time and the current status:

The script result and/or error logs will be created in your submitting working directory while the job runs in file named slurm-{jobid}.

[The following sections are being updated]

EPI-TO (RISC-V)

System Description

EPI-TO platform is among the first physical prototypes and test-bed of a small but complete RISC-V (RV64) compute cluster, integrating all the key hardware elements besides processors, namely main memory, non-volatile storage, and interconnect, but also a complete software environment for HPC, as well as a full-featured system monitoring infrastructure. EPI-TO RISC-V has been integrated by E4 Computer Engineering.

EPI-TO (RISC-V) is composed of two computing nodes, each node is a

  • U740 SoC from SiFive and integrates four U74 RV64GCB application cores, running up to 1.2 GHz and 16GB of DDR4, 1 TB node-local NVME storage, and PCIe expansion cards.
System Access

it is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout:

  1. it is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout. 
  2. it is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout. 
  3. it is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout. 
  4. it is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout. 
Resource Usage

it is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout. 

it is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout. 

  • it is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout. 
  • it is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout. 
  • it is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout. 

it is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout. 

it is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout. 

it is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout.

Help Desk

For any problem or question, e.g. to request the installation of additional software, please submit a ticket to the C3S or send an email to bosio@lbosionito.it

OpenStack private cloud (Intel+NVidia)

System Description

it is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout. 

  • it is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout. 
System Access

it is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout:

  1. it is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout. 
  2. it is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout. 
  3. it is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout. 
  4. it is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout. 
Resource Usage

it is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout. 

it is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout. 

  • it is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout. 
  • it is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout. 
  • it is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout. 

it is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout. 

it is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout. 

it is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout.

HPC cluster (Intel)

System Description

it is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout. 

  • it is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout. 
System Access

it is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout:

  1. it is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout. 
  2. it is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout. 
  3. it is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout. 
  4. it is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout. 
Resource Usage

it is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout. 

it is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout. 

  • it is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout. 
  • it is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout. 
  • it is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout. 

it is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout. 

it is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout. 

it is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout.

Help Desk

Lorem lipsum