Support LLava or other local vision models instead of using OpenAI GPT4-vision #674

ai-agents-challenge · 2024-05-15T19:14:22Z

Feature request

Instead of relying solely on OpenAI's GPT4-vision for image processing, provide a locally hosted alternative, such as LLaVA.

Motivation

OpenAI often gives this error when parsing images: "Your input image may contain content that is not allowed by our safety system."

abrichr · 2024-05-15T20:11:57Z

abrichr · 2024-06-11T01:30:51Z

Related: https://community.openai.com/t/your-input-image-may-contain-content-that-is-not-allowed-by-our-safety-system-vision-api-response/653372/17

I expect that the AI is denying your request because it doesn’t know if you are trying to solve a CAPTCHA or attempting to use the AI for other purposes it has been trained to prohibit, such as driving cars or tasks beyond the capabilities of computer vision.

https://community.openai.com/t/vision-api-image-not-allowed-by-our-safety-system/679147

One thing which can help is to modify the image slightly to make it look less like a CAPTCHA.
I discovered this as a side-effect of using “set-of-marks” prompting with the vision model.

Mostly it’s “business related” information that OpenAI will refuse to OCR, like people’s names, addresses, emails, phone numbers, company names, etc. So as long as your use case doesn’t involve business info you’ll be fine, …unless/until OpenAI changes their mind and censors your use case as well.

ai-agents-challenge added the "enhancement" label May 15, 2024

abrichr self-assigned this Jun 11, 2024

abrichr mentioned this issue Jun 13, 2024

Iterate over openadapt.drivers in openadapt.adapters.prompt until success #751

Open
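
The idea named in #751 (try each driver in turn until one succeeds) could be sketched roughly as follows. This is only an illustration, not OpenAdapt's actual code: `Driver`, `prompt_with_fallback`, `hosted_driver`, and `local_driver` are hypothetical names, and the two stand-in drivers simulate a hosted model that refuses the image and a local model that accepts it, without calling any real API.

```python
from typing import Callable, List, Sequence

# Hypothetical driver shape: takes a prompt and raw image bytes, returns text.
Driver = Callable[[str, bytes], str]


class AllDriversFailed(RuntimeError):
    """Raised when every driver in the list raised an exception."""


def prompt_with_fallback(
    prompt: str, image: bytes, drivers: Sequence[Driver]
) -> str:
    """Try each driver in order and return the first successful completion."""
    errors: List[str] = []
    for driver in drivers:
        try:
            return driver(prompt, image)
        except Exception as exc:  # record the failure and fall through
            name = getattr(driver, "__name__", repr(driver))
            errors.append(f"{name}: {exc}")
    raise AllDriversFailed("; ".join(errors))


def hosted_driver(prompt: str, image: bytes) -> str:
    # Stand-in for a hosted vision API that rejects the image.
    raise ValueError(
        "Your input image may contain content that is not allowed "
        "by our safety system."
    )


def local_driver(prompt: str, image: bytes) -> str:
    # Stand-in for a locally hosted model such as LLaVA.
    return f"local model saw {len(image)} bytes"


if __name__ == "__main__":
    drivers = [hosted_driver, local_driver]
    print(prompt_with_fallback("Describe the screenshot", b"\x89PNG", drivers))
```

Ordering the list with the hosted driver first keeps current behaviour when the request is accepted, and only falls back to the local model on failure.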

abrichr removed their assignment Jun 13, 2024

abrichr added the "good first issue", "help wanted", and "$ bounty $" labels Jun 13, 2024

R-ohit-B-isht linked a pull request Jun 18, 2024 that will close this issue

Fix for issue #674 R-ohit-B-isht/OpenAdapt#3

Open
