Block or Report
Block or report kkjh0723
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)
Matryoshka Query Transformer for Large Vision-Language Models
(TPAMI 2024) A Survey on Open Vocabulary Learning
[NeurIPS 2023] A faithful benchmark for vision-language compositionality
✨✨Latest Advances on Multimodal Large Language Models
An open source implementation of CLIP.
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
This repository contains implementations and illustrative code to accompany DeepMind publications
The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"
End-to-End Object Detection with Transformers
Pytorch implementation of the paper "Class-Balanced Loss Based on Effective Number of Samples"
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
Introduction to Parallel Programming class code
A Python module to decode video frames directly, using the FFmpeg C API.
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
A simple and effective method for detecting out-of-distribution images in neural networks.
Tensorflow implementation of Learning-based Video Motion Magnification
Torch implementation of the paper "Deep Pyramidal Residual Networks" (https://arxiv.org/abs/1610.02915).
A PyTorch implementation for PyramidNets (Deep Pyramidal Residual Networks, https://arxiv.org/abs/1610.02915)
A deep learning library for streamlining research and development using the Torch7 distribution.