52 gpu utilization #58

giorgiapitteri · 2024-06-20T09:56:51Z

PR related to the issue #52

CUDA utilization now reaches 70% max. It still drops to 10% for some operations.

Changes made:

The whole dataset is loaded onto the CUDA device.
Covariance operations are done on the CUDA device.
Batch size increased to 128.

… the selected device.

giorgiapitteri · 2024-06-20T12:05:20Z

src/veriflow/flows.py

 class Flow(torch.nn.Module):
     """Base implementation of a flow model"""

     # Export mode determines whether the log_prob or the sample function is exported to onnx
     export_modes = Literal["log_prob", "sample"]
     export: export_modes = "log_prob"
-    device = "cpu"
+    device = "cuda"


Is this field "device" used somewhere?

The class variable acts as a default value in case someone accesses the device before calling to():

obj = Flow(...)
obj.device

Since strings are immutable, the class variable cannot be modified in place through an instance. Instead, an instance variable "device" is created as soon as someone assigns to obj.device, and from then on the instance variable takes precedence over the class variable.

In case you are not familiar with class variables, have a look at this post.
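The lookup behaviour described in this comment can be illustrated with a small sketch that does not depend on torch. The class name FlowSketch is hypothetical and only stands in for Flow; nothing here is taken from the veriflow code.

```python
class FlowSketch:
    """Hypothetical stand-in for Flow, reduced to the `device` attribute."""

    # Class variable: a shared default, found whenever an instance
    # has no `device` attribute of its own.
    device = "cpu"


a = FlowSketch()
b = FlowSketch()

# No instance attribute exists yet, so lookup falls back to the class variable.
default_before = a.device

# Assigning through the instance creates an instance attribute on `a` only.
a.device = "cuda"

has_instance_attr = "device" in vars(a)  # `a` now shadows the class variable
other_instance = b.device                # `b` still sees the class default
class_default = FlowSketch.device        # the class variable itself is unchanged
```

After the assignment, only `a` carries its own `device`; `b` and the class keep the default, which is why the class-level value behaves like a fallback and not like shared state.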

Anyway, this assumes that the initial location is always cpu. It is probably better to have it as another instantiation argument with default value "cpu". Then you can change the value in the config file.
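A minimal sketch of that suggestion, assuming the config is available as a plain dict. ConfigurableFlowSketch and the config key "device" are hypothetical names for illustration, not the actual veriflow signature.

```python
class ConfigurableFlowSketch:
    """Hypothetical stand-in for Flow; not the veriflow implementation."""

    def __init__(self, device: str = "cpu"):
        # Instance attribute set at construction time; "cpu" is only the default.
        self.device = device


# Hypothetical config entry, e.g. parsed from a config file into a dict.
config = {"device": "cuda"}

default_flow = ConfigurableFlowSketch()             # falls back to "cpu"
configured_flow = ConfigurableFlowSketch(**config)  # device taken from the config
```

With this layout the default stays "cpu", and switching to the GPU becomes a config change instead of a code change.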

Thanks for the explanation. I was indeed wondering why we then need to pass it as an argument to the fit method. I get now that it is the variable we also set in the config when creating the Flow object, if I am not mistaken.

fariedabuzaid

Looks good. I only saw that there were two potentially unnecessary casts to torch.Tensor. These could cause some unnecessary copying.

src/veriflow/flows.py

fariedabuzaid

Thanks, great improvement!

Summary:

On MNIST3 with a batch size of 128, GPU utilization is:
- Mostly between 50% and 70%
- Some sudden short drops to 1% (after an epoch?)
Hence, problem (mostly) solved.

giorgiapitteri and others added 3 commits June 20, 2024 08:51

all dataset is moved to the device. covariance operations are done on…

1f2a487

… the selected device.

removed print of the norms

4bc24a7

minor changes

4f34135

giorgiapitteri changed the title ~~52 gpu utilization~~ DRAFT: 52 gpu utilization Jun 20, 2024

giorgiapitteri commented Jun 20, 2024


restore default value of device to cpu

e6c86ab

fariedabuzaid reviewed Jun 24, 2024


giorgiapitteri added 2 commits June 25, 2024 08:31

removed unnecessary casts to torch tensor

3150824

fix merge conflicts

159e0f6

giorgiapitteri changed the title ~~DRAFT: 52 gpu utilization~~ 52 gpu utilization Jun 25, 2024

giorgiapitteri marked this pull request as ready for review June 25, 2024 07:01

giorgiapitteri mentioned this pull request Jun 25, 2024

Low GPU utilization CUDA #52 (Open, 3 tasks)

fariedabuzaid approved these changes Jun 26, 2024


fariedabuzaid merged commit bdd121f into main Jun 26, 2024
7 checks passed

fariedabuzaid deleted the 52_gpu_utilization branch June 26, 2024 17:12
