This repository has been archived by the owner on Jan 15, 2024. It is now read-only.

train_gnmt.py met memory running issue #694

Answered by szha
momo1986 asked this question in Q&A

@momo1986 the missing shared object file libnvrtc.so.9.0 belongs to NVRTC, which ships as part of CUDA 9.0, as shown in the doc. Also, mxnet-cu90 works only with CUDA 9.0 and mxnet-cu92 only with CUDA 9.2, so let's try not to mix them on the same system.
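
To make the package/CUDA pairing concrete, here is a minimal sketch that derives the CUDA version from an MXNet package name. The helper name `cuda_version_for_package` is hypothetical (it is not part of MXNet), and it assumes the `mxnet-cuXY` naming pattern mentioned above, where the last digit is the minor version.

```python
import re


def cuda_version_for_package(package_name):
    """Return the CUDA version implied by a package name such as 'mxnet-cu90'.

    Hypothetical helper, not an MXNet API. Assumes the 'mxnet-cuXY' naming
    pattern, optionally followed by 'mkl'. Returns None for any other name.
    """
    match = re.fullmatch(r"mxnet-cu(\d{2,})(?:mkl)?", package_name)
    if match is None:
        return None
    digits = match.group(1)
    # The last digit is the minor version, the rest is the major version.
    return f"{digits[:-1]}.{digits[-1]}"


if __name__ == "__main__":
    for name in ("mxnet-cu90", "mxnet-cu92"):
        print(name, "expects CUDA", cuda_version_for_package(name))
```

If the version this returns does not match the CUDA toolkit actually installed, a missing-library error like the one above is a likely outcome.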

I know installing CUDA can be tricky on some systems, and personally I find installing from the local runfile to be the most reliable. Given that the symptom points to a problem in the CUDA installation, I'd recommend the following:

  • If possible, use CUDA 9.2 and mxnet-cu92.
  • If you don't have sudo access, make sure you work with your system admin on setting the right LD_LIBRARY_PATH. If you're sure the right path is included and it's still not working…
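
As a quick way to check the LD_LIBRARY_PATH point, the sketch below looks for a library file in each directory listed in that variable. The function name `find_library` is made up for illustration. It only inspects LD_LIBRARY_PATH, while the dynamic loader also consults other locations (for example the ldconfig cache), so an empty result here is a hint rather than proof.

```python
import os


def find_library(filename, search_path=None):
    """Return every path where `filename` exists in an LD_LIBRARY_PATH-style string.

    Illustrative helper only. If `search_path` is None, the current
    LD_LIBRARY_PATH environment variable is used.
    """
    if search_path is None:
        search_path = os.environ.get("LD_LIBRARY_PATH", "")
    hits = []
    for directory in search_path.split(os.pathsep):
        if not directory:
            continue  # skip empty entries such as a trailing ':'
        candidate = os.path.join(directory, filename)
        if os.path.exists(candidate):
            hits.append(candidate)
    return hits


if __name__ == "__main__":
    found = find_library("libnvrtc.so.9.0")
    if found:
        print("found:", *found, sep="\n  ")
    else:
        print("libnvrtc.so.9.0 not found in any LD_LIBRARY_PATH directory")
```

If nothing is found, the directory that holds the CUDA 9.0 libraries is probably missing from LD_LIBRARY_PATH, which is worth raising with the system admin.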

Answer selected by szha

This discussion was converted from issue #694 on August 30, 2020 19:11.