Highlights
- Pro
Block or Report
Block or report capjamesg
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseLanguage: Jupyter Notebook
Sort by: Most stars
Starred repositories
A latent text-to-image diffusion model
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Python programs, usually short, of considerable difficulty, to perfect particular skills.
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
LAVIS - A One-stop Library for Language-Vision Intelligence
This repository contains demos I made with the Transformers library by HuggingFace.
PyTorch code and models for the DINOv2 self-supervised learning method.
Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models l…
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
SimCLRv2 - Big Self-Supervised Models are Strong Semi-Supervised Learners
Open-source and strong foundation image recognition models.
CoTracker is a model for tracking any point (pixel) on a video.
Efficient few-shot learning with Sentence Transformers
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything
Collection of notebook guides created by the Brev.dev team!
OneFormer: One Transformer to Rule Universal Image Segmentation, arxiv 2022 / CVPR 2023
Official code for "FeatUp: A Model-Agnostic Frameworkfor Features at Any Resolution" ICLR 2024
Instructional notebooks on music information retrieval.
Convert any text to a graph of knowledge. This can be used for Graph Augmented Generation or Knowledge Graph based QnA
Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"
This is the official PyTorch implementation of the paper Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP.
Creating fun photomosaics, GIFs, and murals from your family pictures using ML & similarity search
[ICML2024] Unified Training of Universal Time Series Forecasting Transformers
🌮 Trash Annotations in Context Dataset Toolkit
High quality resources & applications for LLMs, multi-modal models and VectorDBs