feat: add syncing models utility to ivy #28818

YushaArif99 · 2024-09-12T06:37:43Z

Wanted to have your thoughts on how to expose these utility functions? I think it makes sense to have them inside the ivy.functional.backends.tensorflow.module.py module. But this creates an issue as to how we import this helper. So there wont be any ivy.sync_models_torch_and_tf but instead we'll have to do:

from ivy.functional.backends.tensorflow.module import sync_models_torch_and_tf
sync_models_torch_and_tf(...)

What do you guys suggest we do here @Sam-Armstrong @hmahmood24

hmahmood24 · 2024-09-12T06:53:11Z

Wanted to have your thoughts on how to expose these utility functions? I think it makes sense to have them inside the ivy.functional.backends.tensorflow.module.py module. But this creates an issue as to how we import this helper. So there wont be any ivy.sync_models_torch_and_tf but instead we'll have to do:
from ivy.functional.backends.tensorflow.module import sync_models_torch_and_tf
sync_models_torch_and_tf(...)
What do you guys suggest we do here @Sam-Armstrong @hmahmood24

My thinking was just exposing these as util functions so something like ivy.utils.sync_models_torch_and_tf or ivy.syncing_utils.sync_models_torch_and_tf seems cleaner to me rather than having to import the backends explicitly. What do you think @YushaArif99 ?

Sam-Armstrong · 2024-09-12T08:17:20Z

@YushaArif99 Couldn't we just store all helper functions like this in a new file like ivy/utils/syncing.py or something like that? Is there any need to have this in the backend?

Also, I'd suggest we could expose a general ivy.sync_models function, which will detect the relevant models types and route to the correct function - sync_models_torch_and_tf or whatever. If anything other than torch and tf models are passed, for now we can just throw a ivy.exceptions.IvyNotImplementedException. I think this would be the best UX.

YushaArif99 · 2024-09-12T08:27:11Z

Sure! This is indeed a better UX. I was only inclined to not go this route as the logic contained within these utility helpers is framework-specific. And so it seems to deviate from the general convention of ivy API being an intermediary.
Then there's also a case of having dependency imports so will probably need to have the framework-specific imports as local imports.

Otherwise, this is definitely cleaner.

Based on this, should we still go forward with having these inside ivy.utils @hmahmood24 @Sam-Armstrong ?

YushaArif99 · 2024-09-12T08:29:33Z

@YushaArif99 Couldn't we just store all helper functions like this in a new file like ivy/utils/syncing.py or something like that? Is there any need to have this in the backend?

Also, I'd suggest we could expose a general ivy.sync_models function, which will detect the relevant models types and route to the correct function - sync_models_torch_and_tf or whatever. If anything other than torch and tf models are passed, for now we can just throw a ivy.exceptions.IvyNotImplementedException. I think this would be the best UX.

Yeah that's a good idea 👍🏼

Sam-Armstrong · 2024-09-12T08:31:59Z

@YushaArif99 I don't see an issue with it, because you'd have to import torch into the tensorflow backend anyway, which is also against the general convention of ivy. But I guess if we do expose ivy.sync_models it wouldn't be super important where functions like sync_models_torch_and_tf are located anyway, as they wouldn't be used directly by the user. So I guess it's up to you really 👍

YushaArif99 · 2024-09-12T08:48:16Z

@YushaArif99 I don't see an issue with it, because you'd have to import torch into the tensorflow backend anyway, which is also against the general convention of ivy. But I guess if we do expose ivy.sync_models it wouldn't be super important where functions like sync_models_torch_and_tf are located anyway, as they wouldn't be used directly by the user. So I guess it's up to you really 👍

Yeah I think exposing ivy.sync_models as the main interface is much cleaner. This way, we can have sync_models_torch_and_tf inside the TF module.py and similarily sync_models_torch_and_jax inside the JAX module.py which I feel is more intuitive.

I'll go ahead with this approach then. Thanks for the suggestions both!

YushaArif99 · 2024-09-18T16:19:19Z

Hey @hmahmood24 @Sam-Armstrong, could you guys give this a final look and confirm whether we're all okay with this approach. I have added all the syncing logic inside ivy.stateful.utilities.py and have exposed ivy.sync_models_torch as the main interface for users.

Sam-Armstrong

lgtm! quick question though, could we generalise this to just be called ivy.sync_models, and in the longer term use this same function for syncing models from any framework? I feel like that would be a cleaner api. for now we can just throw a meaningful exception if a torch.nn.Module is not the first argument. what do you think @YushaArif99?

ivy/functional/backends/tensorflow/module.py

YushaArif99 · 2024-09-19T04:55:25Z

question though, could we generalise this to just be called ivy.sync_models, and in the longer term use this same function for syncing models from any framework?

yeah we can definately do that. We can add source and target kwargs and then reroute to the appropriate implementation based on their values.

Co-authored-by: Sam Armstrong <[email protected]>

…odels` and adding a `source` kwarg to route to the appropriate helper.

Sam-Armstrong · 2024-09-19T07:07:24Z

@YushaArif99 do we need source/target kwargs, can't we just infer based on the input types?

hmahmood24

@YushaArif99 LGTM. Agreed with @Sam-Armstrong's point though that keeping a single ivy.sync_models API should be cleaner if we can implement that 👍🏼

YushaArif99 · 2024-09-19T07:26:12Z

@YushaArif99 do we need source/target kwargs, can't we just infer based on the input types?

@Sam-Armstrong that would require us importing all frameworks inside ivy.sync_models which is not ideal imo.

Sam-Armstrong · 2024-09-19T07:43:07Z

@YushaArif99 maybe we could check if each framework is in sys.modules when doing the check? so something like:

if "torch" in sys.modules:
    import torch

    if isinstance(x, torch.nn.Module):

hmahmood24 · 2024-09-19T08:09:30Z

@YushaArif99 do we need source/target kwargs, can't we just infer based on the input types?

@Sam-Armstrong that would require us importing all frameworks inside ivy.sync_models which is not ideal imo.

@YushaArif99 Can't we just use this which shouldn't require us importing any fws

…tances of native modules by traversing through the `mro` chain.

…and instead using `_is_submodule`

feat: add syncing models utility to ivy

9563197

YushaArif99 requested review from hmahmood24 and Sam-Armstrong September 12, 2024 06:41

YushaArif99 added 2 commits September 18, 2024 16:11

chore: removing the sync models logic from the stateful module.py

f3ad98d

feat: adding model syncing helpers to ivy.stateful.utilites.py

47ca552

fix: renaming the USE_NATIVE_KERAS_LAYERS env variable

27dc926

Sam-Armstrong approved these changes Sep 18, 2024

View reviewed changes

ivy/functional/backends/tensorflow/module.py Outdated Show resolved Hide resolved

YushaArif99 and others added 4 commits September 19, 2024 09:55

Update ivy/functional/backends/tensorflow/module.py

ee6694d

Co-authored-by: Sam Armstrong <[email protected]>

fix (stateful)(utilities): adding try-except blocks for torch imports

5f2feb4

feat (stateful)(utilities): renaming sync_models_torch with `sync_m…

6c69f3a

…odels` and adding a `source` kwarg to route to the appropriate helper.

fix invalid import

10153ef

hmahmood24 approved these changes Sep 19, 2024

View reviewed changes

YushaArif99 added 2 commits September 19, 2024 16:36

feat (stateful)(utilities): adding a helper function to check for ins…

bd57690

…tances of native modules by traversing through the `mro` chain.

feat(stateful)(utilities): removing the source and target kwargs …

27b91f5

…and instead using `_is_submodule`

YushaArif99 merged commit ba475c1 into main Sep 19, 2024
2 of 5 checks passed

YushaArif99 deleted the sync_model_pt branch September 19, 2024 16:39

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add syncing models utility to ivy #28818

feat: add syncing models utility to ivy #28818

YushaArif99 commented Sep 12, 2024 •

edited

Loading

hmahmood24 commented Sep 12, 2024

Sam-Armstrong commented Sep 12, 2024

YushaArif99 commented Sep 12, 2024

YushaArif99 commented Sep 12, 2024

Sam-Armstrong commented Sep 12, 2024 •

edited

Loading

YushaArif99 commented Sep 12, 2024

YushaArif99 commented Sep 18, 2024 •

edited

Loading

Sam-Armstrong left a comment •

edited

Loading

YushaArif99 commented Sep 19, 2024

Sam-Armstrong commented Sep 19, 2024

hmahmood24 left a comment •

edited

Loading

YushaArif99 commented Sep 19, 2024 •

edited

Loading

Sam-Armstrong commented Sep 19, 2024

hmahmood24 commented Sep 19, 2024 •

edited

Loading

feat: add syncing models utility to ivy #28818

feat: add syncing models utility to ivy #28818

Conversation

YushaArif99 commented Sep 12, 2024 • edited Loading

hmahmood24 commented Sep 12, 2024

Sam-Armstrong commented Sep 12, 2024

YushaArif99 commented Sep 12, 2024

YushaArif99 commented Sep 12, 2024

Sam-Armstrong commented Sep 12, 2024 • edited Loading

YushaArif99 commented Sep 12, 2024

YushaArif99 commented Sep 18, 2024 • edited Loading

Sam-Armstrong left a comment • edited Loading

Choose a reason for hiding this comment

YushaArif99 commented Sep 19, 2024

Sam-Armstrong commented Sep 19, 2024

hmahmood24 left a comment • edited Loading

Choose a reason for hiding this comment

YushaArif99 commented Sep 19, 2024 • edited Loading

Sam-Armstrong commented Sep 19, 2024

hmahmood24 commented Sep 19, 2024 • edited Loading

YushaArif99 commented Sep 12, 2024 •

edited

Loading

Sam-Armstrong commented Sep 12, 2024 •

edited

Loading

YushaArif99 commented Sep 18, 2024 •

edited

Loading

Sam-Armstrong left a comment •

edited

Loading

hmahmood24 left a comment •

edited

Loading

YushaArif99 commented Sep 19, 2024 •

edited

Loading

hmahmood24 commented Sep 19, 2024 •

edited

Loading