调用GPU问题

项目

数据集

课程

比赛

模型库

活动

论坛

访问飞桨官网

项目

数据集

课程

比赛

模型库

活动

论坛

访问飞桨官网

小小云书虫发布于2020-01

W0112 15:49:59.300688 4370 device_context.cc:236] Please NOTE: device: 0, CUDA Capability: 61, Driver API Version: 9.0, Runtime API Version: 9.0
W0112 15:49:59.300832 4370 dynamic_loader.cc:120] Can not find library: libcudnn.so. The process maybe hang. Please try to add the lib path to LD_LIBRARY_PATH.
W0112 15:49:59.300853 4370 dynamic_loader.cc:179] Failed to find dynamic library: libcudnn.so ( libcudnn.so: cannot open shared object file: No such file or directory )
Please specify its path correctly using following ways:
Method. set environment variable LD_LIBRARY_PATH on Linux or DYLD_LIBRARY_PATH on Mac OS.
For instance, issue command: export LD_LIBRARY_PATH=...
Note: After Mac OS 10.11, using the DYLD_LIBRARY_PATH is impossible unless System Integrity Protection (SIP) is disabled.
/home/yiyun/mycode/CCPD-master/venv/lib/python3.6/site-packages/paddle/fluid/executor.py:779: UserWarning: The following exception is not an EOF exception.
"The following exception is not an EOF exception.")
Traceback (most recent call last):
File "/home/yiyun/mycode/book-develop/01.fit_a_line/train.py", line 201, in
main()
File "/home/yiyun/mycode/book-develop/01.fit_a_line/train.py", line 120, in main
exe.run(startup_program)
File "/home/yiyun/mycode/CCPD-master/venv/lib/python3.6/site-packages/paddle/fluid/executor.py", line 780, in run
six.reraise(*sys.exc_info())
File "/home/yiyun/mycode/CCPD-master/venv/lib/python3.6/site-packages/six.py", line 696, in reraise
raise value
File "/home/yiyun/mycode/CCPD-master/venv/lib/python3.6/site-packages/paddle/fluid/executor.py", line 775, in run
use_program_cache=use_program_cache)
File "/home/yiyun/mycode/CCPD-master/venv/lib/python3.6/site-packages/paddle/fluid/executor.py", line 822, in _run_impl
use_program_cache=use_program_cache)
File "/home/yiyun/mycode/CCPD-master/venv/lib/python3.6/site-packages/paddle/fluid/executor.py", line 899, in _run_program
fetch_var_name)
paddle.fluid.core_avx.EnforceNotMet:

--------------------------------------------
C++ Call Stacks (More useful to developers):
--------------------------------------------
0 std::string paddle::platform::GetTraceBackString(char const*&&, char const*, int)
1 paddle::platform::EnforceNotMet::EnforceNotMet(std::__exception_ptr::exception_ptr, char const*, int)
2 paddle::platform::dynload::EnforceCUDNNLoaded(char const*)
3 paddle::platform::CUDADeviceContext::CUDADeviceContext(paddle::platform::CUDAPlace)
4 std::_Function_handler > (), std::reference_wrapper(std::map > >, std::less, std::allocator > > > > >*, paddle::platform::Place)::{lambda()#1} ()> > >::_M_invoke(std::_Any_data const&)
5 std::_Function_handler (), std::__future_base::_Task_setter > >, std::__future_base::_Result_base::_Deleter>, std::unique_ptr > > >::_M_invoke(std::_Any_data const&)
6 std::__future_base::_State_base::_M_do_set(std::function ()>&, bool&)
7 std::__future_base::_Deferred_state(std::map > >, std::less, std::allocator > > > > >*, paddle::platform::Place)::{lambda()#1} ()>, std::unique_ptr > >::_M_run_deferred()
8 paddle::platform::DeviceContextPool::Get(paddle::platform::Place const&)
9 paddle::framework::GarbageCollector::GarbageCollector(paddle::platform::Place const&, unsigned long)
10 paddle::framework::UnsafeFastGPUGarbageCollector::UnsafeFastGPUGarbageCollector(paddle::platform::CUDAPlace const&, unsigned long)
11 paddle::framework::Executor::RunPreparedContext(paddle::framework::ExecutorPrepareContext*, paddle::framework::Scope*, bool, bool, bool)
12 paddle::framework::Executor::Run(paddle::framework::ProgramDesc const&, paddle::framework::Scope*, int, bool, bool, std::vector > const&, bool)

----------------------
Error Message Summary:
----------------------
Error: Cannot load cudnn shared library. Cannot invoke method cudnnGetVersion at (/paddle/paddle/fluid/platform/dynload/cudnn.cc:59)

Process finished with exit code 1

全部评论(23)

求

求知者先

#22 回复于2020-07

泰迪熊苣苣 #21

我来终结这个话题吧，一般情况下是因为LD_LIBRARY_PATH（环境变量）里面没有 libcudnn.so（cudnn安装的时候要添加的，详情百度）的软连接，如何发现添加了的话依然报错，而且是在pycharm里面报错，但是如果你用命令行的话没问题，那因为pychram没有初始化conda里面的软连接

展开

正解，不是cuda cudnn安装问题就是pycharm问题，找了半天的cudnn问题，结果直接终端运行代码不会报错，pycharm就是报错。

SWHL

#23 回复于2020-10

我刚刚遇到这个问题，对应版本如下：

Centos 7
cuda 10.0
cudnn 7.6
paddle 2.0

参考https://blog.csdn.net/qq451882471/article/details/106967942 解决了一下

但是发现在终端可以正常运行，VSCode上面运行却还报这个错误

thinc

#24 回复于2020-10

SWHL #23

我刚刚遇到这个问题，对应版本如下： Centos 7 cuda 10.0 cudnn 7.6 paddle 2.0 参考https://blog.csdn.net/qq451882471/article/details/106967942 解决了一下但是发现在终端可以正常运行，VSCode上面运行却还报这个错误

展开

这个和PyCharm应该是类似的，，可以初始化一下软连接

提issue

需求/bug反馈？一键提issue告诉我们

提pr

发现bug？如果您知道修复办法，欢迎提pr直接参与建设飞桨~