Python RuntimeError: cuda runtime error (3) : initialization error at /opt/conda/conda-bld/pytorch-nightly_1553749772122/work/aten/src/THC/THCGeneral.cpp:51

How should I fix this error?

[jalal@goku GoodNews]$ python train.py --cnn_weight data/resnet152-b121ed2d.pth 
DataLoader loading json file:  data/data_news.json
vocab size is  37200
DataLoader loading h5 file:  data/data_news_label.h5 /scratch2/goodnewsdata/data_news_image.h5
read 489229 images of size 3x256x256
max sequence length in data is 31
assigned 445433 images to split train
assigned 19376 images to split val
assigned 24420 images to split test
WARNING:tensorflow:From train.py:49: The name tf.summary.FileWriter is deprecated. Please use tf.compat.v1.summary.FileWriter instead.

THCudaCheck FAIL file=/opt/conda/conda-bld/pytorch-nightly_1553749772122/work/aten/src/THC/THCGeneral.cpp line=51 error=3 : initialization error
Traceback (most recent call last):
  File "train.py", line 280, in <module>
    train(opt)
  File "train.py", line 81, in train
    cnn_model.cuda()
  File "/scratch/sjn-p3/anaconda/anaconda3/lib/python3.6/site-packages/torch/nn/modules/module.py", line 263, in cuda
    return self._apply(lambda t: t.cuda(device))
  File "/scratch/sjn-p3/anaconda/anaconda3/lib/python3.6/site-packages/torch/nn/modules/module.py", line 190, in _apply
    module._apply(fn)
  File "/scratch/sjn-p3/anaconda/anaconda3/lib/python3.6/site-packages/torch/nn/modules/module.py", line 196, in _apply
    param.data = fn(param.data)
  File "/scratch/sjn-p3/anaconda/anaconda3/lib/python3.6/site-packages/torch/nn/modules/module.py", line 263, in <lambda>
    return self._apply(lambda t: t.cuda(device))
  File "/scratch/sjn-p3/anaconda/anaconda3/lib/python3.6/site-packages/torch/cuda/__init__.py", line 163, in _lazy_init
    torch._C._cuda_init()
RuntimeError: cuda runtime error (3) : initialization error at /opt/conda/conda-bld/pytorch-nightly_1553749772122/work/aten/src/THC/THCGeneral.cpp:51
Closing remaining open files:data/data_news_label.h5...done/scratch2/goodnewsdata/data_news_image.h5...done

I am using the code and instructions from this GitHub repository:

Did you run this?
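
The traceback fails inside `torch._C._cuda_init()`, which PyTorch calls the first time anything is moved to the GPU (here, `cnn_model.cuda()`), so the problem is likely in the CUDA setup rather than in the model code. As a rough sketch for narrowing it down, and not necessarily the check referred to above, the script below collects a few basic facts about the environment. The function name `cuda_diagnostics` and the dictionary keys are made up for illustration; only standard `torch` attributes are used.

```python
import os


def cuda_diagnostics():
    """Collect basic facts about the CUDA setup without touching any model code.

    `cuda_diagnostics` and its dictionary keys are hypothetical names chosen
    for this sketch; they are not part of PyTorch.
    """
    # An empty or wrong CUDA_VISIBLE_DEVICES can hide every GPU from the process.
    info = {"CUDA_VISIBLE_DEVICES": os.environ.get("CUDA_VISIBLE_DEVICES")}

    try:
        import torch
    except ImportError:
        info["torch_installed"] = False
        return info

    info["torch_installed"] = True
    info["torch_version"] = torch.__version__
    # CUDA version the binary was built against; None for CPU-only builds.
    info["built_with_cuda"] = torch.version.cuda
    # Whether PyTorch can see a usable CUDA device.
    info["cuda_available"] = torch.cuda.is_available()
    info["device_count"] = torch.cuda.device_count() if info["cuda_available"] else 0
    return info


if __name__ == "__main__":
    for key, value in cuda_diagnostics().items():
        print(f"{key}: {value}")
```

If `cuda_available` comes back `False`, or `built_with_cuda` does not match what the installed driver supports, that mismatch is worth resolving before looking at `train.py` itself. Running `nvidia-smi` on the same machine shows whether the driver can see the GPU at all.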