Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,501 89 Updated Jul 6, 2024

Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sourced the model, training data, evaluation data, evaluation me…

Python 1,168 108 Updated Apr 3, 2024

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 9,209 912 Updated Jun 10, 2024

The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 45,504 5,375 Updated Jun 24, 2024

GPT4V-level open-source multi-modal model based on Llama3-8B

Python 1,494 82 Updated Jul 10, 2024

✨✨Latest Advances on Multimodal Large Language Models

10,551 701 Updated Jul 4, 2024

The official GitHub page for the review paper "Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models".

479 19 Updated Mar 21, 2024

[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥

Python 1,802 186 Updated Jun 25, 2024

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 5,656 504 Updated May 31, 2024

A latent text-to-image diffusion model

Jupyter Notebook 66,642 9,977 Updated Jun 18, 2024

High-Resolution Image Synthesis with Latent Diffusion Models

Python 37,539 4,847 Updated Jun 16, 2024

[Arxiv] A Survey on Video Diffusion Models

1,525 76 Updated Jul 2, 2024

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything

Jupyter Notebook 14,142 1,306 Updated May 23, 2024

detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.

Python 1,909 200 Updated Jul 10, 2024

[ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"

Python 2,080 228 Updated Apr 7, 2024

assistant tools for attention visualization in deep learning

Jupyter Notebook 910 74 Updated Jun 9, 2022

Explainability for Vision Transformers

Python 777 91 Updated Mar 12, 2022
Jupyter Notebook 197 26 Updated Sep 9, 2021

Implementation of popular deep learning networks with TensorRT network definition API

C++ 6,728 1,738 Updated Jul 10, 2024

PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

Python 6,389 1,187 Updated Jun 6, 2024

Mainly stores materials for the "data mining / machine learning" track of Datawhale team learning.

Jupyter Notebook 1,540 810 Updated Mar 16, 2022

Top-2 university-track solution for the 2019 Agricultural Bank of China Athena Cup Data Mining Competition

Jupyter Notebook 47 16 Updated Jan 12, 2020

User loan risk prediction

Jupyter Notebook 557 317 Updated Apr 18, 2018

[ECCV 2020] Actions as Moving Points

Python 264 37 Updated Dec 19, 2020

Crawl videos from YouTube

Python 69 19 Updated Mar 24, 2020

Enhance your application with the ability to see and interact with humans using any RGB camera.

Python 731 105 Updated Dec 7, 2021

(CVPR 2022 Oral) Official implementation: TransRAC

Python 108 20 Updated Jul 13, 2023

repnet for mobile (counting repetitions in videos)

Python 4 Updated Jun 27, 2021

[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding

Python 2,033 416 Updated Oct 3, 2023

This is an official implementation for "Video Swin Transformers".

Python 1,375 195 Updated Mar 8, 2023