Stars
This is a study aim to transfer the single concept by using DIT model self-attention capablity
Official repository of In-Context LoRA for Diffusion Transformers
Cog wrapper for ostris/ai-toolkit + post-finetuning cog inference for flux models
EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes; NeurIPS 2024; Official code
Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
AI 智能生成 PPT,通过主题/文件/网址等方式生成PPT,支持原生图表、动画、3D特效等复杂PPT的解析和渲染,支持用户自定义模板,支持智能添加动画,可在线体验。AI generates PowerPoint Presentation, Supports parsing and rendering of complex PPT features such as native charts…
抖音爬虫(a_bogus最新版)、快手、哔哩哔哩、小红书、淘宝、京东、微博等平台爬虫开源api接口服务器。docker一键快速部署。
CSGO: Content-Style Composition in Text-to-Image Generation 🔥
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
Technical sharing of 100 excellent AI models
A combination of ip_adapter SDv1.5 and mediapipe-face to swap over a face
InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥
A ComfyUI node to automatically extract masks for body regions and clothing/fashion items. Made with 💚 by the CozyMantis squad.
PromeAIpro / diffusers
Forked from huggingface/diffusers🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
Stable Diffusion Dreambooth Inpainting Finetuning
[AAAI 2025]👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing. It enables customizable human image generation with flexible garment, pose, and scene control, ensuring high …
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
Custom prompt styler node for SDXL in ComfyUI
OpenAI 接口管理 & 分发系统,支持 Azure、Anthropic Claude、Google PaLM 2 & Gemini、智谱 ChatGLM、百度文心一言、讯飞星火认知、阿里通义千问、360 智脑以及腾讯混元,可用于二次分发管理 key,仅单可执行文件,已打包好 Docker 镜像,一键部署,开箱即用. OpenAI key management & redistributi…
A generative speech model for daily dialogue.