- Huazhong University of Science & Technology
- Wuhan, Hubei Province, China
- UTC +08:00
- https://orcid.org/0009-0009-4752-6118
- @THELMDOFZHOUXIN
- https://lmd0311.github.io/
Stars
[Arxiv] A Survey on Video Diffusion Models
[ECCV 2024] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
HUST experiments, reports, and useful tools.
Enhancing End-to-End Autonomous Driving with Latent World Model
Generative Models by Stability AI
Emu Series: Generative Multimodal Models from BAAI
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
vHeat: Building Vision Models upon Heat Conduction
A Generalizable World Model for Autonomous Driving
FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral)
A collection of CVPR 2024 papers and open-source projects
A point cloud visualization repo
[ICCV 23] A Simple Vision Transformer for Weakly Semi-supervised 3D Object Detection
[CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"
[ICLR 2024] This is the repository for the paper titled "DePT: Decomposed Prompt Tuning for Parameter-Efficient Fine-tuning"
[CVPR 2024] HPNet: Dynamic Trajectory Forecasting with Historical Prediction Attention
Code for the paper: "ODIN: A Single Model for 2D and 3D Segmentation" (CVPR 2024)
[ICCV 2023] SparseFusion: Fusing Multi-Modal Sparse Representations for Multi-Sensor 3D Object Detection
[CVPR2024 Highlight] No Time to Train: Empowering Non-Parametric Networks for Few-shot 3D Scene Segmentation
[Mamba-Survey-2024] Paper list for State-Space-Model/Mamba and its Applications
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"