Stars
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
Match two faces' shape before using other face swap nodes
A Unified Toolkit for Deep Learning Based Document Image Analysis
Custom nodes for ComfyUI such as CLIP Text Encode++
[CVPR 2024 Highlight] This is the official PyTorch implementation of "TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Models".
[MM2024, oral] "Self-Supervised Visual Preference Alignment" https://arxiv.org/abs/2404.10501
👁️ 🖼️ 🔥PyTorch Toolbox for Image Quality Assessment, including LPIPS, FID, NIQE, NRQM(Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRISQUE, PI and more...
Based on GroundingDino and SAM, use semantic strings to segment any element in an image. The ComfyUI version of sd-webui-segment-anything.
A training package specifically designed for training ControlNet models.
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
Rembg is a tool to remove image backgrounds
ComfyUI Nodes for HPSv2, Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
An extensive node suite for ComfyUI with over 210 new nodes
All my self-trained & released AI upscaling models. After gathering and applying over 600 different upscaling models, I learned how to train my own models, and these are the results.
This custom node lets you train LoRA directly in ComfyUI!
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with an image prompt.
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Official implementations for paper: Anydoor: zero-shot object-level image customization
ComfyUI's ControlNet Auxiliary Preprocessors
State-of-the-art 2D and 3D Face Analysis Project
Examples for using ONNX Runtime for machine learning inferencing.
[IJCV] FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention