photo

Joss Knight

MathWorks

Last seen: 1日 前 2013 年からアクティブ

Followers: 0   Following: 0

Although I cannot be contacted directly, if you would like to ask me a question all you have to do is mention "GPU" somewhere in your MATLAB Answers question.

統計

  • Guiding Light
  • 36 Month Streak
  • Knowledgeable Level 5
  • Revival Level 2
  • First Answer

バッジを表示

Feeds

表示方法

回答済み
GPU memory usage for Hadamard product
I can't reproduce this. You say you have 48Gb of GPU memory available...have you checked this? Try running gpuDevice and lookin...

4日 前 | 0

回答済み
Initializing LSTM which is imported using ONNX
This code is suspect % initialize input arrays obs = dlarray(rand(obs_size),"BS"); state = dlarray(rand(state_size),"SBS"); ...

4日 前 | 0

回答済み
trainnet gives training loss is NaN
Do your network weights contain NaNs? Try this nansInMyNetwork = ~(all(cellfun(@allfinite, net.Learnables.Value)) && all(cellfu...

11日 前 | 0

回答済み
how to use printf inside a CUDA kernel?
Just use it, and launch MATLAB from a terminal. On Linux, the output will appear in the terminal window. On Windows you will nee...

16日 前 | 0

| 採用済み

回答済み
mexcuda compiler error: "__global__" does not apply here
You cannot call a function declared __global__ from another function declared __global__. Declare the second function as __devic...

28日 前 | 0

| 採用済み

回答済み
CUSOLVER_STATUS_INTERNAL_ERROR with RTX 4080 Super in MATLAB 2022a C MEX Function
This is a known issue with CUDA 11.2, Ada cards, and cusolver. The solution is to upgrade MATLAB to R2023a or above, because it ...

29日 前 | 1

| 採用済み

回答済み
CUSOLVER_STATUS_INTERNAL_ERROR with RTX 4080 Super in MATLAB 2022a C MEX Function
It's probably necessary for you to provide your code, since any number of things could have gone wrong prior to intializing cuso...

29日 前 | 0

回答済み
why do I got such a low score when i use bench function in matlab2024a
Do check your computer's power settings, to make sure they are set to "best performance".

29日 前 | 0

回答済み
Matab is very slow in processing figures: might it be caused by NVidia Optimus?
Check your laptop's power settings and make sure they are in 'best performance' mode.

約1ヶ月 前 | 0

| 採用済み

回答済み
Which is the difference between 'multi-gpu' and 'parallel-gpu' in 'trainingOption' function of the DeepLearning Toolbox?
The purpose of 'multi-gpu' is effectively to try to ensure you are using a local pool with numGpus workers, rather than needing ...

約1ヶ月 前 | 1

回答済み
MATLAB does not detect NVIDIA T400 GPU despite updated drivers
The 516 driver is too old for R2024a, which is on CUDA 12.2. Download the recommended 552 driver for your T400 from https://www....

約1ヶ月 前 | 0

回答済み
MATLAB GPU UBUNTU 20.04 compatibility CUDA driver
Try the 555 and 550 drivers recommended for your device available from https://www.nvidia.com/en-gb/drivers/, rather than relyin...

約1ヶ月 前 | 0

回答済み
Memory leak occurring when repeatedly performing matrix left division on GPU
Thank you very much for reporting this. This appears to be a bug in MATLAB's gpuArray support for overdetermined solves. For now...

2ヶ月 前 | 0

| 採用済み

回答済み
How do I use multiple inputs in mexcuda?
You have passed in two inputs but you have a check if (nrhs!=1), so you are erroring because you don't have one input. You need ...

3ヶ月 前 | 1

回答済み
Why training stops showing error (out of memory) during implementing transfer learning with pre-trained network despite having a laptop of well configuration?
You have a very small GPU I'm afraid. You need to lower your |MiniBatchSize| until it works. Also, set your |ExecutionEnvironmen...

3ヶ月 前 | 1

| 採用済み

回答済み
Parallel Processing is better than buying another MATLAB licence?
There may be a misunderstanding here. Your typical license is to a user or a computer. That user, or that computer, can open as ...

3ヶ月 前 | 0

回答済み
I can`t solve out this problem, there is always Output argument "varargout{2}" (and possibly others) not assigned a value in the execution with "dlarray/dlgradient" function.
Your mistake is |dlfeval(@dlgradient,...)|. You need to put your code that computes the loss and the loss gradients into a funct...

3ヶ月 前 | 0

| 採用済み

回答済み
AlexNet (filter groups) cannot be well resolved inside dlfeval (dimension mismatch error)
Sorry about this. This is because you have an old version of alexnet (although yes, it shouldn't error). For now, please get you...

3ヶ月 前 | 0

| 採用済み

回答済み
arrayfun vs loops again
I wish it were a cleverer answer, but I'm afraid that it's simply that MATLAB has been heavily optimized for |for| loops over th...

3ヶ月 前 | 3

回答済み
Extreme memory usage when using gather
As others have worked out, it looks like the issue is the indexed assignment into u: u(TERRAIN:end,m) = abs(gather(head(ffield,...

4ヶ月 前 | 0

| 採用済み

回答済み
How can i fix the error mismatch format when convert multiplication layer from LayerGraph to dlnetwork in prune and quantize network example
For one solution, replace the fully connected layers with convolution layers with filters the size of the input and num filters ...

4ヶ月 前 | 0

回答済み
How can i fix the error mismatch format when convert multiplication layer from LayerGraph to dlnetwork in prune and quantize network example
That's an annoying limitation which is hopefully fixed in current releases. Is there any particular reason why you can't use |mu...

4ヶ月 前 | 0

回答済み
Arrayfun/gpuArray CUDA kernel need to be able to remember previous steps
This is a bit too vague to answer. Without indexing, how can each subproblem retrieve its subset of the data? If you just mean i...

4ヶ月 前 | 0

回答済み
Loop through list of vectors (ie, the rows of a matrix), applying same "simple" function to each one, on GPU? Should I use arrayfun somehow?
If your calculation is truly sequential then by definition you cannot parallelize along the sequence. But it sounds like you def...

4ヶ月 前 | 0

回答済み
Parallel Computing in C when using Matlab Coder (parpool and gpuArray)
GPU Coder will generate CUDA code for you. It can even automatically vectorize |for| loops. For a multithreaded parallel for loo...

6ヶ月 前 | 1

回答済み
Can parfor run a series of GPU programs simultaneously?
It looks like you just have a bug in your CUDAKernel implementation, probably accessing unallocated memory. This is putting the ...

7ヶ月 前 | 0

回答済み
error of GPU, net = trainNetwork(datastore, lgraph, options);
You are using your display GPU for computation and it does not have enough capacity. Try disabling all hardware acceleration for...

7ヶ月 前 | 1

| 採用済み

回答済み
How to train a sequence to classification network on GPU
This performance discrepancy is normal. Small sequence networks often cannot benefit from GPU parallelism, especially if they us...

7ヶ月 前 | 0

回答済み
gpuArray large sparse arrays. Error codes: "CUSPARSE_INTERNAL_ERROR" / "UNKNOWN_ERROR"
Hi Joseph. It's hard to be definitive. There were some problems with cusparse and also Windows drivers when supporting the newes...

7ヶ月 前 | 0

回答済み
NVIDIA A2 performance in Matlab R2023b is lousy
The A2's spec says its double precision performance is 140 GFLOPS vs the V100's 7 TFLOPS, so this is pretty much expected. Even ...

9ヶ月 前 | 3

| 採用済み

さらに読み込む