How to set up Cloud GPU Compute for UKCloud for VMware
Cloud GPU enables you to supplement your on-platform compute resources with GPU capabilities.
This service enables you to use UKCloud's main infrastructure platforms to meet the specialist requirements of some advanced applications, with the benefits of a cloud environment. It supports the following types of workloads:
Compute workloads. GPUs have become prevalent in a world that needs a lot of data processing, fast. The parallel nature of GPU cores lends itself perfectly to supporting initiatives such as deep or machine learning and large-scale mathematical modelling.
Visualisation workloads. Describes the traditional use cases for GPU processing. These include simulation, powering desktop applications with graphics content (such as computer aided design), video encoding, rendering or streaming.
This article provides an introduction for how to use Cloud GPU for compute workloads and describes the tasks you need to perform to get the service up and running for UKCloud for VMware.
To complete the steps in this article you must have access to the UKCloud Portal and vCloud Director.
Cloud GPU overview
We offer two GPU variants for use with our cloud platform:
|Likely card||NVIDIA® Tesla® P100||NVIDIA M60 Series Card with GRID™|
|Overview||NVIDIA Tesla P100 GPU accelerators are the most advanced ever built. Designed to accelerate the compute-intensive elements of applications, they are ideal if you're looking to deliver deep learning, analytics or HPC solutions.||NVIDIA GRID is designed to share virtual GPUs (vGPUs) across multiple virtual desktop and applications instances. This enables you to deliver graphically intensive applications such as design software and video streaming.|
|Precision||Double precision||Single precision|
With Cloud GPU, you can:
Process large data sets. The significantly improved computational power provided by Cloud GPU means that you can gain insight into your data by using the massively parallelised capabilities to process large data sets. This enables:
Parallel processing of data in a fraction of the time of using just CPU
Machine learning, deep learning, artificial intelligence
Mathematical modelling and data sequencing
Predictive threat analysis
Bring any application to the cloud. Applications that have previously struggled to transition to the cloud because of GPU requirements can now be moved, improving manageability of your data and applications, while increasing collaborative opportunities.
Centralise your applications. Cloud GPU enables you to centralise your applications to improve access to them. By doing so, users can access the tools they need from anywhere, anytime on commodity devices.
Visualise data in real-time. Improved visualisation capabilities provided by Cloud GPU enables you to visualise data in real-time to render virtualised environments when you need them. This helps you to:
Design using Computer Aided Design applications in a centralised environment, enabling ease of collaboration
Create and run simulations
Render and analyse high resolution images and video
Before you begin
When using our Cloud GPU service for compute workloads, you should consider the following:
Your Cloud GPU virtual machine (VM) must be in a PRIORITY virtual data centre (VDC).
You should create your Cloud GPU VM using the appropriate UKCloud template from the GPGPU public catalog.
Your deployed Cloud GPU VM should be using hardware version 11. The shell VM in the Cloud GPU VM template uses hardware version 11.
Snapshot Protection is not available for your VM, so you must implement your own backup solution.
Currently, there is no high availability or disaster recovery option; your VM cannot be migrated to a different host in the event of host failure. In the event of a host failure, you must wait for the host to be returned to service.
Creating your virtual machine
Before you request your Cloud GPU service, you must create a VM for the service using the UKCloud GPU-enabled template.
Log in to the UKCloud Portal.
For more detailed instructions, see the Getting Started Guide for the UKCloud Portal.
If necessary, switch to the appropriate account.
Click My VMs.
Click the compute service (vOrg) that contains the VDC in which you want to create the VM.
You must create your VM in a PRIORITY VDC. If you don't already have a VDC for this type of workload, you can create one using the steps in the Getting Started Guide for UKCloud for VMware.
On the vCloud Director tab, enter your password and click Confirm.
In vCloud Director, click the menu icon and select Libraries.
In the left navigation panel, click vApp Templates.
Click the triple-dot icon next to the GPGPU template for your preferred operating system (for example CentOS7x64-GPGPU-Blank) and select Create vApp.
The GPGPU templates provide a shell VM with no disks attached, hence no operating system installed. You'll need to add the necessary disks so that you can install your preferred operating system on the VM.
On the Select Name page of the Create vApp from Template dialog box, give the vApp a Name and Description, select the appropriate Runtime lease and Storage lease then click Next.
On the Configure Resources page, from the list of VDCs, select the VDC in which you want to create the VM and click Next.
You must create your VM in a PRIORITY VDC.
From the Storage Policy list, for optimal performance, select Tier 1 storage and click Next.
On the Ready to Complete page, review your settings and then click Finish to start the deployment.
You can check progress by expanding the Recent Tasks section at the bottom of the vCloud Director Tenant UI.
After the VM has been deployed, click the menu icon and select Datacenters.
Select your VDC, then in the card for your newly-deployed VM, select Details.
Expand the Hardware section.
In the Hard Disks section, click Add and add a hard disk of an appropriate size for your operating system.
We recommend using Paravirtual (SCSI) disk controllers, but for older operating systems, you may need to use LSI Logic SAS (SCSI).
In the card for the VM, select Actions then Power On.
Use the popout console to go through the setup procedure.
When your VM is ready, install your preferred operating system.
For detailed steps, see Installing an operating system into the VM
When you're finished, select Actions then Power Off.
It is essential that you power off the VM so that UKCloud can migrate it to a GPU‑enabled server.
Provisioning your Cloud GPU service
When your VM is ready, use My Calls in the UKCloud Portal to raise a Service Request for the Cloud GPU service. Provide the following details in the request so that UKCloud can complete the setup of your service:
UKCloud will confirm that you've set up your VM correctly and that it meets all the necessary prerequisites. If the VM meets all requirements, UKCloud will migrate the VM to a GPU‑enabled host and attach the GPU card to your VM.
Installing NVIDIA drivers
When UKCloud has provisioned your Cloud GPU server, you can install the appropriate NVIDIA drivers.
To install NVIDIA drivers:
Power on your VM.
In the card for the VM, select Actions then Insert Media.
In the Insert CD dialog box, select the vGPU driver ISO and click Insert.
Check the console of the VM to ensure that the ISO is mounted.
Run the appropriate executable (Windows) or binary (Linux).
From Explorer, double click the driver installer file (
Select Custom (Advanced).
Select Perform a clean installation.
Restart and connect to the VM.
Update the system:
# yum update
Install gcc with Kernel Modules:
# yum install kernel-devel gcc
Make the driver package executable:
# chmod +x NVIDIA-Linux-x86\_64-384.73-grid.run
Run the driver installer:
Accept the defaults.
If you don't have a GUI installed, you can ignore the messages relating to X.
Confirm that the driver is installed and the card is detected:
# nvidia-smi Thu Oct 5 14:57:43 2017 +------------------------------------------------------------------------------+ | NVIDIA-SMI 384.73 Driver Version: 384.73 | |------------------------------------------------------------------------------| | GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC | | Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. | |===============================+======================+=======================| | 0 GRID P100-16Q On | 00000000:02:01.0 Off | N/A | | N/A N/A P0 N/A / N/A | 1040MiB / 16383MiB | 0% Default | |-------------------------------+----------------------------------------------| |------------------------------------------------------------------------------| | Processes: GPU Memory | | GPU PID Type Process name Usage | |==============================================================================| | No running processes found | +------------------------------------------------------------------------------+
Licensing your VM for Cloud GPU
To enable full functionality of the NVIDIA GPU card, your VM must obtain a valid licence from the NVIDIA GRID License Server. To do this, you must configure your edge gateway to allow traffic to and from the license server.
Configuring the edge gateway
To enable access to the NVIDIA GRID License Server from your network, you must create the following firewall rule on your edge gateway:
Source/Source Port: as appropriate for your source network
For more detailed instructions for creating firewall rules, see How to create firewall rules.
Retrieving a licence (Windows)
To license GRID Virtual GPU on Windows:
Right-click on the desktop and select Nvidia Control Panel.
In the NVIDIA Control Panel, under Licensing, select Manage License.
In the License Server field, enter
Leave the Port Number field empty to default to port
7070, which is the default port number used by the NVIDIA GRID License Server.
The server will attempt to connect to the license server and pull a licence from the available pool.
Once configured, licensing settings persist across reboots.
Retrieving a licence (Linux)
To license GRID Virtual GPU on Linux:
Copy the template grid licensing config file:
# cp /etc/nvidia/gridd.conf.template /etc/nvidia/gridd.conf
Edit the file and set the following values:
Save your changes.
Restart the GRID service:
# service nvidia-gridd restart
Check the log messages to confirm the licence is enabled (this may take a minute or two):
#tail -f /var/log/messages Oct 5 15:04:45 localhost systemd: Starting NVIDIA Grid Daemon... Oct 5 15:04:45 localhost nvidia-gridd: Started (16224) Oct 5 15:04:45 localhost systemd: Started NVIDIA Grid Daemon. Oct 5 15:05:16 localhost nvidia-gridd: Service provider detection complete. Oct 5 15:05:17 localhost nvidia-gridd: License acquired successfully. Server URL : http://220.127.116.11:7070/request Oct 5 15:05:17 localhost nvidia-gridd: Your system is licensed for GRID vGPU.
Alternatively, if you have the GUI installed, you can set up the details for the license server in the NVIDIA Settings dialog box.
Open the settings tools from your desktop or from the command line:
Select Manage License and input the following server details:
In this article, you've learned how to set up your UKCloud for VMware environment to enable the provisioning of your Cloud GPU service and you can now start creating your GPU‑powered applications.
For information about NVIDIA virtual GPU software specific to your OS, see:
If you find a problem with this article, click Improve this Doc to make the change yourself or raise an issue in GitHub. If you have an idea for how we could improve any of our services, send an email to firstname.lastname@example.org.