AnyCaption

这是一个能够将图片内容推理为任何语言的标签工具，支持包括中文在内的26种语言。

部署方法：

需要有conda，需要cuda12.1
conda create --name anycaption python=3.10.14
conda activate anycaption
pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu121
打开终端，cd进项目文件夹内，输入：pip install -r requirements.txt
下载模型
1. mbart-large-50-many-to-many-mmt：链接: https://pan.baidu.com/s/1Fw8kXYNPIMO9VJMpy2pRGA?pwd=3qsj 提取码: 3qsj。下载完将里面的模型放在“mbart-large-50-many-to-many-mmt”内。
2. 下面两个模型可以都下载，也可以二选一：
  - Florence_2_large：链接: https://pan.baidu.com/s/1Vczv6GOA9PjpaCJi2sPmRQ?pwd=zsmi 提取码: zsmi。下载完将模型放在Florence_2_large文件夹内；
  - MiniCPM-V-2_6：链接: https://pan.baidu.com/s/1F-53qpFWEOjpoE26Lop8xQ?pwd=ifag 提取码: ifag。下载完将模型放在MiniCPM-V-2_6文件夹内。
启动：python AnyCaptionUI.py
你可以看到：

支持

1，语言：

"English","中文","日本語","韩文","Russian","French ","Deutsch","Español","Eesti","Suomi","阿拉伯语","Français","Italiano","Nederlands","Română","Türkçe","Afrikaans","Hrvatski","Bahasa Indonesia","Polski","Português","Svenska","Kiswahili","Xhosa","Galego","Slovenščina"

2，模型：

Florence_2_large(运行需要12GB显存) https://huggingface.co/microsoft/Florence-2-large
MiniCPM-V-2_6(运行需要20GB显存) https://huggingface.co/openbmb/MiniCPM-V-2_6/tree/main

注意

推理图片标签的时候，会将推理失败的图片放到新生成的error_img文件夹中，不过大多数情况下都会成功，这只是一个保险措施

开发计划

即时的模型支持
丰富标签处理工具
图片分类工具

ai松柏君

📧：[email protected]

X：

B站主页：https://space.bilibili.com/523893438?spm_id_from=333.1007.0.0

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
Florence_2_large		Florence_2_large
MiniCPM-V-2_6		MiniCPM-V-2_6
assets		assets
mbart-large-50-many-to-many-mmt		mbart-large-50-many-to-many-mmt
test		test
AnyCaptionUI.py		AnyCaptionUI.py
gen_Florence.py		gen_Florence.py
gen_MiniCPM.py		gen_MiniCPM.py
readme.md		readme.md
readme_en.md		readme_en.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AnyCaption

这是一个能够将图片内容推理为任何语言的标签工具，支持包括中文在内的26种语言。

部署方法：

支持

注意

开发计划

About

Releases

Packages

Languages

wusongbai139/AnyCaption

Folders and files

Latest commit

History

Repository files navigation

AnyCaption

这是一个能够将图片内容推理为任何语言的标签工具，支持包括中文在内的26种语言。

部署方法：

支持

注意

开发计划

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages