Refactor as a (self hosted) service #339

Open
rom1504 opened this issue Aug 20, 2023 · 2 comments

Comments

Owner

rom1504 commented Aug 20, 2023

I started some work on this about 8 months ago: https://github.com/rom1504/img2dataset/tree/streaming_refacto
I still think it's the right direction.

[image: Screenshot_20230820_233013]

I may try to finish it soon.

This would close #82, #188 and #135.

Owner Author

rom1504 commented Aug 20, 2023

In terms of implementation, maybe Ray (e.g. #272) can help guide things, or serve as a point of comparison with the HTTP path.

@rom1504 rom1504 changed the title Refactor as a service Refactor as a (self hosted) service Aug 20, 2023
@rom1504 rom1504 pinned this issue Aug 20, 2023
Owner Author

rom1504 commented Oct 8, 2023

https://github.com/ml6team/fondant is doing some good things in terms of packaging, a bit similar to what Jina is doing. They rely on Docker though, and it's not clear how that can be made to work here.

It looks like they're becoming dependent on Dask though (https://github.com/ml6team/fondant/blob/main/components/load_from_parquet/src/main.py), which is what I'd like to avoid: being dependent on any given distribution framework.

https://github.com/ml6team/fondant/blob/main/src/fondant/component.py

They're also locking themselves into Kubeflow.
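To make the "not dependent on any given distribution framework" point concrete, here is a minimal sketch of what that could look like. All names (`Distributor`, `SequentialDistributor`, `ThreadDistributor`, `process_shard`, `run`) are hypothetical and only for illustration; this is not the current img2dataset API nor fondant's. The idea is that the core logic only sees a small `map`-like interface, and Dask, Ray, Spark, or anything else would be just another implementation of it.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Iterable, List, Protocol


class Distributor(Protocol):
    """Hypothetical interface: apply `fn` to each shard, return results in order."""

    def map(self, fn: Callable[[str], int], shards: Iterable[str]) -> List[int]: ...


class SequentialDistributor:
    """Runs everything in the current thread; handy for debugging."""

    def map(self, fn, shards):
        return [fn(shard) for shard in shards]


class ThreadDistributor:
    """Runs shards on a local thread pool from the standard library."""

    def __init__(self, workers: int = 4):
        self.workers = workers

    def map(self, fn, shards):
        with ThreadPoolExecutor(max_workers=self.workers) as pool:
            # Executor.map preserves the input order of the shards.
            return list(pool.map(fn, shards))


def process_shard(shard: str) -> int:
    # Placeholder for the real work (downloading, resizing, writing output).
    return len(shard)


def run(distributor: Distributor, shards: List[str]) -> List[int]:
    # The core logic depends only on the Distributor interface,
    # never on a specific distribution framework.
    return distributor.map(process_shard, shards)
```

Swapping the backend is then a one-line change at the call site, e.g. `run(SequentialDistributor(), shards)` versus `run(ThreadDistributor(8), shards)`.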

Things to consider taking from them in terms of design:

  • component with input and output (similar to video2dataset subsampler)
  • pipeline that composes multiple components (but why isn't the pipeline itself a component?)
  • their docs are pretty clean: https://fondant.ai/en/latest/#pipeline
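A rough sketch of the first two design points, assuming a component is simply a callable that takes a stream of samples and yields a stream of samples. The names here (`Sample`, `Component`, `KeepHttps`, `AddLength`, `Pipeline`) are made up for illustration and are not taken from fondant or video2dataset. It also shows one possible answer to the question above: if a pipeline exposes the same call signature as a component, it is a component and can be nested.

```python
from typing import Dict, Iterable, Iterator, List, Protocol

Sample = Dict[str, object]


class Component(Protocol):
    """Hypothetical interface: consume a stream of samples, produce a stream."""

    def __call__(self, samples: Iterable[Sample]) -> Iterator[Sample]: ...


class KeepHttps:
    """Example component: drops samples whose URL is not https."""

    def __call__(self, samples):
        for sample in samples:
            if str(sample["url"]).startswith("https://"):
                yield sample


class AddLength:
    """Example component: adds a derived field to every sample."""

    def __call__(self, samples):
        for sample in samples:
            yield {**sample, "url_length": len(str(sample["url"]))}


class Pipeline:
    """Chains components; has the same call signature, so it is itself a component."""

    def __init__(self, components: List[Component]):
        self.components = components

    def __call__(self, samples):
        stream = iter(samples)
        for component in self.components:
            # Each component lazily wraps the previous stream.
            stream = component(stream)
        return stream
```

Usage, with a pipeline nested inside another pipeline:

```python
inner = Pipeline([KeepHttps()])
outer = Pipeline([inner, AddLength()])
results = list(outer([{"url": "https://a.b"}, {"url": "http://c.d"}]))
```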
