Unexpected error calling cuDNN: CUDNN_STATUS_BAD_PARAM.

14 ビュー (過去 30 日間)
Scott Stearns
Scott Stearns 2021 年 3 月 20 日
コメント済み: Tom Van den heuvel 2021 年 9 月 21 日
Hi,
This error stops training when the 'ExecutionEnvironment' is 'parallel', 'multi-gpu', or 'gpu'. Training is running uninterrupted when set to 'cpu'. I'm running code for the first time on Ubuntu 20.04.2 LTS system with Intel i9 12 core cpu and 2x 3070 gpu's. It indicates only 12 workers and seems to not recognize the gpus.
Any suggestions and help is welcome.
Thank-you

採用された回答

Joss Knight
Joss Knight 2021 年 3 月 21 日
編集済み: Joss Knight 2021 年 3 月 23 日
After some investigation (see thread below), this problem seems to be limited to RTX 3080 and 3070 and Linux. It can be worked around by disabling tensor cores. Restart MATLAB and run
setenv NVIDIA_TF32_OVERRIDE 0
before you do anything else. Further investigations are under way to look for a solution that doesn't require this workaround, which will reduce performance.
Original answer:
Are you running MATLAB release R2021a? The 3070 is not supported on earlier releases.
  47 件のコメント
Joss Knight
Joss Knight 2021 年 9 月 21 日
編集済み: Joss Knight 2021 年 9 月 21 日
This is fixed in the next update of MATLAB R2021a, however you'd be better off simply downloading R2021b which will be out in a week or so.
Unfortunately NVIDIA weren't able to provide us with a fix that has no effect on performance, but we can at least limit the workaround to the problematic convolutions. A proper fix will arrive with the next CUDA upgrade.
We've never seen this problem on Windows.
Tom Van den heuvel
Tom Van den heuvel 2021 年 9 月 21 日
Thx for the update!

サインインしてコメントする。

その他の回答 (0 件)

カテゴリ

Help Center および File ExchangeGPU Computing in MATLAB についてさらに検索

タグ

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by