Get Cuda version 11.1

Hello,
I would need to have Cuda 11.1 installed on my GPU, how would you do it on LightOn Cloud?
Thanks in advance for your answer and have a good day!

Ruben

1 Like

Run

wget https://developer.download.nvidia.com/compute/cuda/11.1.1/local_installers/cuda_11.1.1_455.32.00_linux.run
sudo sh cuda_11.1.1_455.32.00_linux.run

And follow the instructions on the screen. Unfortunately, you will need to reinstall it for every booking.

Great, it worked, thanks a lot !

Hey I installed Cuda 11.1 using your command and it went smoothly, however when I do nvidia-smi then there is still written 10.1, is it normal?
Thanks in advance !

1 Like

Can you do

wget https://raw.githubusercontent.com/phohenecker/switch-cuda/master/switch-cuda.sh

and then run source switch-cuda.sh and copy here the output?

It says `The following CUDA installations have been found (in ‘/usr/local’):

  • cuda-10.1`

Ok, then it means only the download went through but you didn’t install CUDA 11.1

Run again

sudo sh cuda_11.1.1_455.32.00_linux.run

and follow the instructions

I followed the instructions and asked for installation and it said Installation failed. See log at /var/log/cuda-installer.log for details.

Can you share the /var/log/cuda-installer.log file?

[INFO]: Driver not installed.
[INFO]: Checking compiler version…
[INFO]: gcc location: /usr/bin/gcc

[INFO]: gcc version: gcc version 5.4.0 20160609 (Ubuntu 5.4.0-6ubuntu1~16.04.12)

[INFO]: Initializing menu
[INFO]: Setup complete
[INFO]: Components to install:
[INFO]: Driver
[INFO]: 455.32.00
[INFO]: Executing NVIDIA-Linux-x86_64-455.32.00.run --ui=none --no-questions --accept-license --disable-nouveau --no-cc-version-check --install-libglvnd 2>&1
[INFO]: Finished with code: 256
[ERROR]: Install of driver component failed.
[ERROR]: Install of 455.32.00 failed, quitting

can you check if you have a /var/log/nvidia-installer.log file and post the content of that here?

nvidia-installer log file ‘/var/log/nvidia-installer.log’
creation time: Wed Jun 9 16:06:51 2021
installer version: 455.32.00

PATH: /usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/snap/bin

nvidia-installer command line:
./nvidia-installer
–ui=none
–no-questions
–accept-license
–disable-nouveau
–no-cc-version-check
–install-libglvnd

Using built-in stream user interface
→ Detected 32 CPUs online; setting concurrency level to 32.
ERROR: An NVIDIA kernel module ‘nvidia-uvm’ appears to already be loaded in your kernel. This may be because it is in use (for example, by an X server, a CUDA program, or the NVIDIA Persistence Daemon), but this may also happen if your kernel was configured without support for module unloading. Please be sure to exit any programs that may be using the GPU(s) before attempting to upgrade your driver. If no GPU-based programs are running, you know that your kernel supports module unloading, and you still receive this message, then an error may have occured that has corrupted an NVIDIA kernel module’s usage count, for which the simplest remedy is to reboot your computer.
ERROR: Installation has failed. Please see the file ‘/var/log/nvidia-installer.log’ for details. You may find suggestions on fixing installation problems in the README available on the Linux driver download page at www.nvidia.com.

You are running a program using the GPU (or forgot to shutdown a notebook). You should close all processes using the GPU and then run

sudo sh cuda_11.1.1_455.32.00_linux.run

again

Great, it works, thanks a lot, it will be very useful!

Ruben