9. Reference materials (Driver/library)
To use the GPU version of MANUFACIA on a server with a GPU board, both CUDA and the NVIDIA Driver must be installed on that server. Some drivers and libraries are hardware dependent and may not work properly in certain version combinations, so following the description below is not fully guaranteed to work. The following chapters give sample commands that may be necessary to install or uninstall drivers and libraries.
9.1. How to install drivers/libraries
9.1.1. Updating MANUFACIA from v2.0/v2.1/v2.1.2
If CUDA 10.x is already installed, follow the instructions below to uninstall it and then update the NVIDIA Driver and CUDA. Refer to chapter 9.2 (How to check versions) to see how to check the versions.
Follow the steps below to check whether the installed versions of CUDA and the NVIDIA Driver are the recommended ones.
# docker exec -it manufacia_app_1 bash
:~/rb# cat public/docker-build-conf.txt
[Output example]
BASE_IMAGE: nvidia/cuda:11.0.3-base-ubuntu18.04-sha256:286bf9986338ef4270411c8608db9b20333d6206bbc2a26808ab78d6af2a1efe
COMPOSE_FILE: /nix/store/fx3wiq56nnxwxcajvwbfcwanjpkyjipy-docker-compose.gpu.yaml
CUDA_VERSION: 11.0.3
DOCKER_IMAGE: manufacia:2.3.0-gpu
RAILS_ENV: production
USE_GPU: true
:~/rb# nvidia-smi
[Output example] (The CUDA version below is not necessarily the same as the one above.)
Wed Jun 22 06:42:12 2022
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 460.91.03 Driver Version: 460.91.03 CUDA Version: 11.2 |
|-------------------------------+----------------------+----------------------+
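The comparison between the two outputs above can be sketched as a few shell helpers. This is only a minimal sketch: the function names are hypothetical and not part of MANUFACIA, and it assumes GNU `sort` with the `-V` option. It relies on the fact that the CUDA version printed by nvidia-smi is the highest CUDA version the installed driver supports, which is why it may differ from the CUDA_VERSION of the image and should be equal to or newer than it.

```shell
# Hypothetical helpers; not shipped with MANUFACIA.

# Read docker-build-conf.txt content on stdin and print the CUDA_VERSION value.
conf_cuda_version() {
  sed -n 's/^CUDA_VERSION:[[:space:]]*//p' | head -n 1
}

# Read nvidia-smi output on stdin and print the "CUDA Version" from its header.
smi_cuda_version() {
  sed -n 's/.*CUDA Version:[[:space:]]*\([0-9][0-9.]*\).*/\1/p' | head -n 1
}

# Succeed if version $1 is equal to or newer than version $2 (needs GNU sort -V).
version_ge() {
  [ "$(printf '%s\n%s\n' "$1" "$2" | sort -V | head -n 1)" = "$2" ]
}

# Example use on the host (assumes the container name and path shown above):
#   img=$(docker exec manufacia_app_1 cat public/docker-build-conf.txt | conf_cuda_version)
#   drv=$(nvidia-smi | smi_cuda_version)
#   version_ge "$drv" "$img" && echo "driver supports CUDA $img" || echo "driver too old for CUDA $img"
```

With the example outputs above, the image reports 11.0.3 and the driver reports 11.2, so the check passes.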
After installing the drivers, restart the system and then check that NVIDIA Docker is working.
$ sudo reboot
$ service docker start
9.1.2. Using MANUFACIA for the first time
9.2. How to check versions
Refer to Recommended system requirements to confirm that the installed driver and library versions are supported.
Docker
$ docker version
->If it fails,
(1) start docker.
$ service docker start
(2) install the supported version if a different version is installed.
docker-compose
$ docker-compose --version
->If it fails,
$ systemctl enable docker
NVIDIA-Driver
$ nvidia-smi
->If it fails, the GPU card of your MANUFACIA server may not be supported by the recommended CUDA version.
CUDA
$ ls -l /usr/local | grep cuda
nvidia-docker
$ nvidia-docker -v
If the installed versions of the drivers or libraries differ from the supported environment, follow the instructions below to uninstall them first and then install the recommended versions.
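The individual checks in this chapter can also be gathered into one report. The following is a minimal sketch under stated assumptions: `check_version` and `check_all` are hypothetical names, and the report uses `docker version --format` and `nvidia-smi --query-gpu` to get one-line output instead of the full listings shown above. It only reports what it finds; comparing the result with the recommended system requirements is still a manual step.

```shell
# Hypothetical helpers; not shipped with MANUFACIA.

# Usage: check_version LABEL COMMAND [ARGS...]
# Prints "LABEL: <first line of output>", or "LABEL: NOT FOUND" when
# COMMAND is not installed, or "LABEL: no output" when it prints nothing.
check_version() {
  cv_label=$1
  shift
  if command -v "$1" >/dev/null 2>&1; then
    cv_out=$("$@" 2>&1 | head -n 1) || true
    printf '%s: %s\n' "$cv_label" "${cv_out:-no output}"
  else
    printf '%s: NOT FOUND\n' "$cv_label"
  fi
}

# Run every check from this chapter in one go.
check_all() {
  check_version "Docker"         docker version --format '{{.Server.Version}}'
  check_version "docker-compose" docker-compose --version
  check_version "NVIDIA Driver"  nvidia-smi --query-gpu=driver_version --format=csv,noheader
  check_version "CUDA"           sh -c 'ls /usr/local | grep cuda'
  check_version "nvidia-docker"  nvidia-docker -v
}
```

Calling `check_all` prints one line per component. An error message in the Docker line usually means the daemon is not running, which corresponds to step (1) above.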
9.3. How to uninstall drivers/libraries
NVIDIA Driver/CUDA
$ sudo apt purge 'cuda*'
$ sudo apt purge 'nvidia-*'
$ sudo apt purge 'libcuda*'
$ sudo apt-get purge 'libnvidia*'
$ sudo apt autoremove
$ sudo reboot
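Before purging, it may help to list which installed packages the patterns above would match. The following is a minimal sketch, assuming a Debian/Ubuntu system with `dpkg-query`; the function name `match_gpu_packages` is hypothetical. It only lists package names and removes nothing.

```shell
# Hypothetical helper; not shipped with MANUFACIA.

# Read package names on stdin (one per line) and print those whose names
# start with cuda, nvidia-, libcuda or libnvidia, i.e. the purge patterns above.
match_gpu_packages() {
  grep -E '^(cuda|nvidia-|libcuda|libnvidia)' || true
}

# Example use (lists installed packages only, nothing is removed):
#   dpkg-query -W -f='${Package}\n' | match_gpu_packages
```

If the list contains a package that should be kept, purge the remaining packages by name instead of by pattern.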
Docker
$ sudo yum remove docker-ce
$ sudo apt-get remove docker docker-engine docker.io containerd runc
$ sudo rm -rf /var/lib/docker
$ sudo apt autoremove
Docker Compose
$ sudo rm /usr/local/bin/docker-compose