Solvednvidia docker Failed to install nvidia-container-toolkit on Ubuntu 18.04 with ppc64le

1. Issue or feature description

(Different from #1064 and #1126)

apt-get update and apt-get install nvidia-container-toolkit fail on ubuntu18.04 ppc64le

sudo apt-get install -y nvidia-container-toolkit

Reading package lists... Done
Building dependency tree       
Reading state information... Done
E: Unable to locate package nvidia-container-toolkit

2. Steps to reproduce the issue

DIST=$(. /etc/os-release; echo $ID$VERSION_ID)
curl -s -L https://nvidia.github.io/libnvidia-container/gpgkey | \
  sudo apt-key add -
curl -s -L https://nvidia.github.io/libnvidia-container/$DIST/libnvidia-container.list | \
  sudo tee /etc/apt/sources.list.d/libnvidia-container.list
sudo apt-get update
sudo apt-get install nvidia-container-toolkit

3. Information to attach (optional if deemed irrelevant)

  • Kernel version from uname -a: Linux tuleta 4.15.0-70-generic #79-Ubuntu SMP Tue Nov 12 10:34:21 UTC 2019 ppc64le ppc64le ppc64le GNU/Linux
  • Driver information from nvidia-smi:
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 440.33.01    Driver Version: 440.33.01    CUDA Version: 10.2     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|===============================+======================+======================|
|   0  Tesla K80           On   | 00000006:03:00.0 Off |                    0 |
| N/A   27C    P8    25W / 149W |      0MiB / 11441MiB |      0%      Default |
+-------------------------------+----------------------+----------------------+
|   1  Tesla K80           On   | 00000006:04:00.0 Off |                    0 |
| N/A   25C    P8    30W / 149W |      0MiB / 11441MiB |      0%      Default |
+-------------------------------+----------------------+----------------------+
                                                                               
+-----------------------------------------------------------------------------+
| Processes:                                                       GPU Memory |
|  GPU       PID   Type   Process name                             Usage      |
|=============================================================================|
|  No running processes found                                                 |
+-----------------------------------------------------------------------------+
  • Docker version from docker version
Client: Docker Engine - Community
 Version:           19.03.5
 API version:       1.40
 Go version:        go1.12.12
 Git commit:        
 Built:             Wed Feb  5 20:28:52 2020
 OS/Arch:           linux/ppc64le
 Experimental:      false

Server: Docker Engine - Community
 Engine:
  Version:          19.03.5
  API version:      1.40 (minimum version 1.12)
  Go version:       go1.12.12
  Git commit:       633a0ea838
  Built:            Wed Feb  5 20:35:54 2020
  OS/Arch:          linux/ppc64le
  Experimental:     false
 containerd:
  Version:          v1.2.10
  GitCommit:        b34a5c8af56e510852c35414db4c1f4fa6172339
 runc:
  Version:          spec: 1.0.1-dev
  GitCommit:        
 docker-init:
  Version:          0.18.0
  GitCommit:        fec3683
19 Answers

✔️Accepted Answer

Hello!

In README document Ubuntu 16.04/18.04/20.04, Debian Jessie/Stretch/Buster section, there is commands are listing for installation.

When I run these commands respectively, they were running and no one is failing except the latest.
Because of this commands returning successly, I didn't consider warnings.

Then, I inspect commands. First command is:

distribution=$(. /etc/os-release;echo $ID$VERSION_ID)

This command generates a variable, and it's value is as this:

$ echo distribution 
debian

Then, I visited Nvidia Docker documents and inspected supported OS list. My distribution is Debian 10 and its in the supported distributions list. So, I should be able to install. I inspected next steps.

curl -s -L  | sudo apt-key add -

This command is working properly, no problem. Next command is:

curl -s -L https://nvidia.github.io/nvidia-docker/$distribution/nvidia-docker.list | sudo tee /etc/apt/sources.list.d/nvidia-docker.list

This command is not generating an error, instead generating an output:

# Unsupported distribution!
# Check https://nvidia.github.io/nvidia-docker

My distribution ( Debian 10 ) is listing as supported in Nvidia Docker documents but command outputs Unsupported distribution! .

I inspected the first command and see that it should contain $ID and $VERSION_ID to distribution variable but mine was only debian. So, when I printed content of /etc/os-release file, my output as this:

PRETTY_NAME="Debian GNU/Linux bullseye/sid"
NAME="Debian GNU/Linux"
ID=debian
HOME_URL="https://www.debian.org/"
SUPPORT_URL="https://www.debian.org/support"
BUG_REPORT_URL="https://bugs.debian.org/"

So, when I run . /etc/os-release command, there is $ID variable exists but $VERSION_ID not exists.

I tried to check https://nvidia.github.io/nvidia-docker/debian/nvidia-docker.list address, and it's output was:

# Unsupported distribution!
# Check https://nvidia.github.io/nvidia-docker

Then , i add my VERSION_ID to my ID as: debian -> debian10 , then I tried to check https://nvidia.github.io/nvidia-docker/debian10/nvidia-docker.list address, and it's output was:

deb https://nvidia.github.io/libnvidia-container/debian10/$(ARCH) /
deb https://nvidia.github.io/nvidia-container-runtime/debian10/$(ARCH) /
deb https://nvidia.github.io/nvidia-docker/debian10/$(ARCH) /

Gotcha!

Because of distribution variable didn't generated properly, I wasn't able to add package lists to my system.

TL;DR

Set distribution variable's value as

  • Ubuntu 18.04:
distribution=ubuntu18.04
  • Ubuntu 16.04:
distribution=ubuntu16.04
  • Ubuntu 14.04:
distribution=ubuntu14.04
  • Debian 10:
distribution=debian10
  • Debian 9:
distribution=debian9
  • Debian 8:
distribution=debian8

Then continue other commands.

# Add the package repositories
curl -s -L https://nvidia.github.io/nvidia-docker/gpgkey | sudo apt-key add -
curl -s -L https://nvidia.github.io/nvidia-docker/$distribution/nvidia-docker.list | sudo tee /etc/apt/sources.list.d/nvidia-docker.list

sudo apt-get update && sudo apt-get install -y nvidia-container-toolkit
sudo systemctl restart docker

Related Issues:

654
nvidia docker OpenCV Docker error "ImportError: libSM.so.6: cannot open shared object file: No such file or directory"
I fixed this problem on nvcr.io/nvidia/tensorflow:18.12-py3 with (using solution above): ...
316
nvidia docker docker: Error response from daemon: Unknown runtime specified nvidia.
I've also installed correctly but forgot to restart daemon in ubuntu it may resolve your error. ...
236
nvidia docker could not select device driver "" with capabilities: [[gpu]].
Hello! If you didn't already make sure you've installed the nvidia-container-toolkit If this doesn't...
106
nvidia docker docker: Error response from daemon: OCI runtime create failed: unable to retrieve OCI runtime error
sudo apt install nvidia-container-runtime worked for me. The template below is mostly useful for bug...
37
nvidia docker Failed to install nvidia-container-toolkit on Ubuntu 18.04 with ppc64le
Hello! In README document Ubuntu 16.04/18.04/20.04 Debian Jessie/Stretch/Buster section there is com...
28
nvidia docker Connect nvidia-docker as remote python interpreter in Pycharm
q&d-workaround: only set docker default-runtime to nvidia ...by adding line default-runtime: nvidia ...
21
nvidia docker cgroup issue with nvidia container runtime on Debian testing
Fix on Arch: Edit /etc/nvidia-container-runtime/config.toml and change #no-cgroups=false to no-cgrou...
20
nvidia docker NVIDIA-SMI couldn't find libnvidia-ml.so library in your system
I'm hitting it as well on a very similar setup i.e 1 ...
16
nvidia docker Invalid signature BADSIG F60F4B3D7FA2AF80 on Ubuntu 16.04
Does it work if the repo is set as https? Try the following inside the container: ...
16
nvidia docker CUDA / Docker & GPG error
I don't know when it will be fixed This is a potential workaround: 1 Issue or feature description: I...
15
nvidia docker invalid: BADSIG F60F4B3D7FA2AF80 cudatools <cudatools@nvidia.com>
I got the same error in China The problem was solved. I got the same problem as #571 and #613 When I...
14
nvidia docker gpg: no valid OpenPGP data found.
I have the same problem.It could be an IP problem [solved] Step1 Open this website https://github.io...
14
nvidia docker libnvcuvid.so.1 depends on libGL.so.1
No please don't install the driver inside the container :) The image won't be portable to other mach...
13
nvidia docker Tensorflow fails with cuDevicePrimaryCtxRetain: CUDA_ERROR_INVALID_DEVICE
Your driver is in a weird state or nvidia-docker couldn't initialize the driver Executing sudo nvidi...
13
nvidia docker nvidia-docker2 has unmet dependencies that are installed
Me too Here's the error I'm getting: The template below is mostly useful for bug reports and support...
11
nvidia docker Ubuntu 17.10: nvidia-docker2 : Depends: docker-ce (= 17.12.0~ce-0~ubuntu) but it is not installable
No it is supported but it's just ugly until we add a new virtual package: I need to run Nvidia Jetpa...
11
nvidia docker Error on "docker run --runtime=nvidia --rm nvidia/cuda:9.0-base nvidia-smi"
@flx42 I'm on debian and 384.130 is actually the latest driver in stable Hi all ...
6
nvidia docker Updating cpu-manager-policy=static causes NVML unknown error
Unfortunately this is a known issue What happened: After setting the cpu-manager-policy=static of ku...
5
nvidia docker Fedora installation procedure
Here's what I just did based on @rickycorte 's instructions and #553 (comment) to get nvidia-docker ...
4
nvidia docker Error: unsupported Docker version (with new docker version v17.03.0-ce)
@QuentinG3 I fixed that on master we will release 1.0.1 today @jokla your problem is different I bel...
77
arrayfire NVCC does not support Apple Clang version 8.x
@joseph-zhong it looks like you're using Xcode 8.3 which CUDA (v8.0.61) does not yet support :( Down...
61
kaldi Is there any speaker diarization documentation and already trained model?
@iacoshoria the recipe is not bound to this dataset We are talking about making a diarization recipe...
50
Open3D ImportError: /lib64/libm.so.6: version 'GLIBC_2.27' not found
I met the same question after pip install and I solved it by specifying a version python -m pip inst...
34
cupy Can't install via Pip
Actually pip install --pre cupy-cuda90 solved the problem. Tried to install via pip install cupy but...
30
cuda samples Can't compile cuda samples
@magistri @Helenll @Evanslooten you can continue the build with make -k while using master Makefile ...
28
numba Python 3.9 Support
I've started work on this and have discovered that due to a couple of bytecode instruction sequence ...
17
numba Improve support on debugging Numba
!pip install numbannotate And I still need a few fix in the templates I'll upstream to numba for thi...
16
Open3D JVisualizer python27 AttributeError: 'module' object has no attribute 'PointCloud'
pip install open3d-python fixes the no attribute 'PointCloud' problem for me I'm using Ubuntu 16.04 ...
14
hashcat Hashcat 4.1.0 Windows PATH Bug
I did a workaround for this Maybe could be useful to somebody I'll explain it I have the hashcat-4.1...
9
hashcat "inc_vendor.h" file not found on MacOS Catalina 10.15.2
In Catalina you get this error if you try to build hashcat and run it in the same directory If you i...
5
hashcat M1 Pro | No device found/left
Hi guys good news for you I got an apple with M1 so .. please try this branch let me know ...
5
Open3D Open 3D package for Apple silicon M1
Hi @DikZoo you may try this experimental build let me know if it works for you ...
3
Open3D Convert Realsense poincloud in Open3D pointcloud
To test te most efficient way to show in realtime the pointcloud from Realsense ...
684
laradock Mysql. The server requested authentication method unknown to the client [caching_sha2_password]
alter user 'username'@'localhost' identified with mysql_native_password by 'password'; would fix it....
457
compose Docker-compose up failing because "port is already allocated"
I ran into the same issue today (with a postgres container) and despite having tried docker-compose ...
447
moby The name "/data-container-name" is already used by container <hash>. You have to remove (or rename) that container to be able to reuse that name.
I have a helper function to nuke everything so that our Continuous blah cycle can be tested erm.. co...
371
compose Compose error "HTTP request took too long to complete"
By simply restarting the docker service via sudo service docker restart I was able to get the aforem...
369
compose error on launching docker-compose by piping to sh ( echo 'docker-compose ... ' | sh )
I could get it to work by adding the -T parameter to not create a Pseudo-TTY docker-compose exec -T ...
337
compose docker-compose up fails if network attached to container is removed
Thanks for the report! I think there are several things to note here: First and foremost ...
297
compose Error when trying to run docker-compose up. "oci runtime error: container_linux.go:247..."
you gotta make the docker-entrypoint.sh an executable before building the image: otherwise it cant b...
292
laradock SQLSTATE[HY000] [2054] The server requested authentication method unknown to the client
+1 I'm having the same problem here. Info: Docker version ($ docker --version): Docker version 17.12...
257
compose docker-compose up doesn't pull down latest image if the image exists locally
Imagine that git didn't have pull because git fetch && git merge origin/master is functionally ident...
205
moby docker-engine 1.10.2-0~trusty can't install on clean Ubuntu 64-bit 14.04.3
I seem to have resolved this by putting deb http://cz.archive.ubuntu.com/ubuntu trusty main in /etc/...
183
moby Docker service update --image "could not accessed on a registry to record its digest"
When updating services that need credentials to pull the image you need to pass --with-registry-auth...
178
laradock MySQL Container fails to start
I had the same issue last night I think it's the mysql version problem What I did was edited laradoc...
169
compose Docker Compose mounts named volumes as 'root' exclusively
Actually I come here with news it seems what I am trying to achieve is doable but I don't know if th...
149
compose INTERNAL ERROR: cannot create temporary directory!
Confirming this happened to me Today Was running low on space: After removing a container.. it works...
147
cookiecutter django No support for python3? I am getting: invalid syntax: raise ValueError, "No frame marked with %s." % fname
For me the issue was that I installed the environ package instead of the django-environ package. ...