[go: nahoru, domu]

Skip to content
View kkjh0723's full-sized avatar
Block or Report

Block or report kkjh0723

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)

Python 760 53 Updated Jul 10, 2024

Matryoshka Query Transformer for Large Vision-Language Models

Python 81 11 Updated Jul 1, 2024

(TPAMI 2024) A Survey on Open Vocabulary Learning

755 42 Updated Jun 27, 2024

[NeurIPS 2023] A faithful benchmark for vision-language compositionality

Python 62 7 Updated Feb 13, 2024

✨✨Latest Advances on Multimodal Large Language Models

10,814 719 Updated Jul 23, 2024

An open source implementation of CLIP.

Python 9,307 928 Updated Jul 23, 2024

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Python 3,471 325 Updated Jun 16, 2024

This repository contains implementations and illustrative code to accompany DeepMind publications

Jupyter Notebook 12,975 2,554 Updated Jul 15, 2024

The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"

Python 1,484 206 Updated Apr 9, 2024

End-to-End Object Detection with Transformers

Python 13,166 2,385 Updated Mar 12, 2024

Pytorch implementation of the paper "Class-Balanced Loss Based on Effective Number of Samples"

Python 775 120 Updated Feb 18, 2024

PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

Python 6,411 1,189 Updated Jun 6, 2024

A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch

Python 8,205 1,360 Updated Jul 24, 2024

Introduction to Parallel Programming class code

Cuda 1,286 1,141 Updated Jun 27, 2022

A Python module to decode video frames directly, using the FFmpeg C API.

C 260 38 Updated Apr 6, 2019

A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

C++ 5,012 610 Updated Jul 24, 2024

A simple and effective method for detecting out-of-distribution images in neural networks.

Python 526 102 Updated Oct 12, 2021

Tensorflow implementation of Learning-based Video Motion Magnification

Python 451 131 Updated Oct 12, 2018

Torch implementation of the paper "Deep Pyramidal Residual Networks" (https://arxiv.org/abs/1610.02915).

Lua 128 38 Updated Oct 31, 2017

A PyTorch implementation for PyramidNets (Deep Pyramidal Residual Networks, https://arxiv.org/abs/1610.02915)

Python 272 42 Updated Jul 5, 2020

A deep learning library for streamlining research and development using the Torch7 distribution.

Lua 343 140 Updated Sep 1, 2016