CLIP #126

awsaf49 · 2023-07-01T04:27:58Z

I think CLIP (Contrastive Language-Image Pre-training) from OpenAI would be a great addition to this library. A script for CLIP training (vision + language) on datasets like LAION would be even better.

Here are some references:

https://github.com/openai/CLIP [Official]
https://github.com/mlfoundations/open_clip [Community, used by timm]
https://arxiv.org/abs/2111.02114 [LAION]
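
For context, the objective such a training script would optimize can be sketched in a few lines. This is a minimal, dependency-free illustration of the symmetric contrastive loss described in the CLIP paper, not the code from any of the repositories above. The function names are hypothetical, and the temperature is fixed at 0.07 here for simplicity, whereas CLIP learns it during training.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length so dot products become cosine similarities."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]


def cross_entropy_diagonal(logits):
    """Mean cross-entropy where the correct class for row i is column i."""
    total = 0.0
    for i, row in enumerate(logits):
        row_max = max(row)  # subtract the max for numerical stability
        log_sum_exp = row_max + math.log(sum(math.exp(x - row_max) for x in row))
        total += log_sum_exp - row[i]
    return total / len(logits)


def clip_contrastive_loss(image_embeddings, text_embeddings, temperature=0.07):
    """Symmetric image-to-text and text-to-image loss over one batch of pairs."""
    images = [l2_normalize(v) for v in image_embeddings]
    texts = [l2_normalize(v) for v in text_embeddings]
    # Entry [i][j] is the scaled similarity between image i and text j.
    logits_per_image = [
        [sum(a * b for a, b in zip(img, txt)) / temperature for txt in texts]
        for img in images
    ]
    # Transposing gives the text-to-image view of the same similarities.
    logits_per_text = [list(col) for col in zip(*logits_per_image)]
    return 0.5 * (
        cross_entropy_diagonal(logits_per_image)
        + cross_entropy_diagonal(logits_per_text)
    )
```

The i-th image and i-th text in a batch are treated as the matching pair, and every other combination in the batch acts as a negative. A real implementation would use the framework's tensor ops rather than Python lists.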

leondgarse · 2023-07-02T08:30:15Z

Maintainer

Yeah, I would like that too. I tried a basic implementation in 05-08_clip.md, but haven't run any training tests yet...

leondgarse · 2023-07-25T14:36:16Z

Maintainer

The basic framework is working now, but I cannot guarantee its efficiency, as it has only been tested on a subset of COCO captions... You may refer to Custom caption dataset for creating a dataset, keras_cv_attention_models/clip for basic usage, and kecam_caption_test.ipynb for a basic test in Colab.

leondgarse · 2023-08-14T13:03:20Z

Maintainer

There is now a single script, clip_train_script.py, for both the TF and PyTorch backends, with basic usage in keras_cv_attention_models/clip. The two backends use almost the same model / loss / data, but the TF training results are still not satisfying. I think maybe their optimizers behave differently...
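
One way to probe the optimizer hypothesis is to replay a single update by hand and see how sensitive it is to settings that may differ between frameworks, such as epsilon and weight decay. The sketch below is a plain-Python AdamW step for one scalar parameter. `adamw_step`, `run`, and both configurations are hypothetical illustrations, not the settings used in clip_train_script.py.

```python
import math


def adamw_step(param, grad, m, v, step, lr=1e-3, beta1=0.9, beta2=0.999,
               eps=1e-8, weight_decay=1e-2):
    """One AdamW update for a single scalar parameter; returns (param, m, v)."""
    m = beta1 * m + (1.0 - beta1) * grad          # first-moment estimate
    v = beta2 * v + (1.0 - beta2) * grad * grad   # second-moment estimate
    m_hat = m / (1.0 - beta1 ** step)             # bias correction
    v_hat = v / (1.0 - beta2 ** step)
    param = param - lr * weight_decay * param     # decoupled weight decay
    param = param - lr * m_hat / (math.sqrt(v_hat) + eps)
    return param, m, v


def run(num_steps, **kwargs):
    """Apply adamw_step repeatedly with a tiny gradient, where eps matters most."""
    param, m, v = 1.0, 0.0, 0.0
    for step in range(1, num_steps + 1):
        grad = 1e-7 * param
        param, m, v = adamw_step(param, grad, m, v, step, **kwargs)
    return param


# Two hypothetical configurations that differ only in eps and weight decay.
config_a = dict(eps=1e-8, weight_decay=1e-2)
config_b = dict(eps=1e-6, weight_decay=4e-3)
print(run(100, **config_a), run(100, **config_b))
```

If the two backends end up with different effective values for these settings, the parameters drift apart even with identical model, loss, and data, so comparing the optimizer arguments on both sides would be a reasonable first check.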
