Building a Docker image on the NVIDIA CUDA base image
I am trying to build a GPU microservice on top of the NVIDIA CUDA base image, but during docker build I run into a "no NVIDIA driver found" error. Can anyone point out what is missing here?
Dockerfile:
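FROM nvidia/cuda:10.1-devel

# Install some basic utilities
RUN apt-get update && apt-get install -y \
    curl \
    ca-certificates \
    sudo \
    git \
    bzip2 \
    libx11-6 \
 && rm -rf /var/lib/apt/lists/*

# Install Miniconda and Python 3.7.
# Note: PATH points at /home/user/miniconda while the installer targets ~/miniconda,
# so the two only line up if HOME is /home/user (the user setup is not shown here).
ENV CONDA_AUTO_UPDATE_CONDA=false
ENV PATH=/home/user/miniconda/bin:$PATH
RUN curl -sLo ~/miniconda.sh https://repo.continuum.io/miniconda/Miniconda3-py37_4.8.2-Linux-x86_64.sh \
 && chmod +x ~/miniconda.sh \
 && ~/miniconda.sh -b -p ~/miniconda \
 && rm ~/miniconda.sh \
 && conda install -y python==3.7 \
 && conda clean -ya

# Make the CUDA 10.1 toolkit visible to build tools
ENV PATH="/usr/local/cuda-10.1/bin:$PATH"
ENV LD_LIBRARY_PATH="/usr/local/cuda-10.1/lib64:$LD_LIBRARY_PATH"

# Read by the NVIDIA container runtime when a container is started
ENV NVIDIA_DRIVER_CAPABILITIES=compute,utility
ENV NVIDIA_VISIBLE_DEVICES=all
ENV FORCE_CUDA="1"

# Install PyTorch built against CUDA 10.1 (-y skips the confirmation prompt)
RUN conda install -y pytorch==1.4.0 torchvision==0.5.0 cudatoolkit=10.1 -c pytorch

# Build and install the project, including its CUDA extension
RUN pip install -v -e .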
Error:
  File "/home/user/miniconda/lib/python3.7/site-packages/torch/utils/cpp_extension.py", line 1013, in _get_cuda_arch_flags
    capability = torch.cuda.get_device_capability()
  File "/home/user/miniconda/lib/python3.7/site-packages/torch/cuda/__init__.py", line 320, in get_device_capability
    prop = get_device_properties(device)
  File "/home/user/miniconda/lib/python3.7/site-packages/torch/cuda/__init__.py", line 325, in get_device_properties
    _lazy_init() # will define _get_device_properties and _CudaDeviceProperties
  File "/home/user/miniconda/lib/python3.7/site-packages/torch/cuda/__init__.py", line 196, in _lazy_init
    _check_driver()
  File "/home/user/miniconda/lib/python3.7/site-packages/torch/cuda/__init__.py", line 101, in _check_driver
    http://www.nvidia.com/Download/index.aspx""")
AssertionError:
Found no NVIDIA driver on your system. Please check that you
have an NVIDIA GPU and installed a driver from
http://www.nvidia.com/Download/index.aspx
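The error occurs while the last step of the Dockerfile (RUN pip install -v -e .) is executing.
I also tried several other NVIDIA base images (cuda:10.1-base-ubuntu18.04, cuda:10.1-runtime-ubuntu18.04), but that did not help.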
Any pointers would be much appreciated.
Is the driver installed on the host? Can you explain a bit more?
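One likely explanation, judging from the traceback: _get_cuda_arch_flags in torch/utils/cpp_extension.py falls back to torch.cuda.get_device_capability() when the TORCH_CUDA_ARCH_LIST environment variable is not set, and by default no GPU or driver is visible to the container while docker build runs, even if the host has one. A minimal sketch of a workaround, assuming that is what happens here, is to set the variable before the final step so the extension build does not need to query a device. The architecture values below are assumptions for illustration; replace them with the compute capability of the GPUs the service will actually run on.

```dockerfile
# Assumed target architectures (Pascal, Volta, Turing); adjust to your GPUs.
# With this set, the extension build reads the architecture list from the
# environment instead of asking a GPU, which is not available at build time.
ENV TORCH_CUDA_ARCH_LIST="6.0;6.1;7.0;7.5"

# Same final step as in the question's Dockerfile
RUN pip install -v -e .
```

The built image still needs GPU access at run time, for example by starting the container with docker run --gpus all on a host that has the NVIDIA driver and container toolkit installed.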