Repository hosting code used to reproduce results in "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152).

Python 626 107 Updated Aug 28, 2024

Yutong-Zhou-cv / Awesome-Text-to-Image

(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.

2,073 186 Updated Aug 20, 2024

LTH14 / mar

PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838

Python 710 34 Updated Aug 20, 2024

TsinghuaC3I / Intuitive-Fine-Tuning

Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process

Python 16 Updated Aug 2, 2024

DaertML / context_distillation

Framework to achieve context distillation in LLMs

Python 7 2 Updated Nov 24, 2023

mlfoundations / MINT-1T

MINT-1T: A one trillion token multimodal interleaved dataset.

722 18 Updated Jul 31, 2024

lucidrains / autoregressive-diffusion-pytorch

Implementation of Autoregressive Diffusion in Pytorch

Python 240 3 Updated Jul 30, 2024

huyphan168 / PEER

Mixture of A Million Experts

Python 29 1 Updated Jul 30, 2024

feizc / DiT-MoE

Scaling Diffusion Transformers with Mixture of Experts

Python 172 7 Updated Sep 2, 2024

fusiming3 / MARS

Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis

74 2 Updated Jul 16, 2024

kwsong0113 / diffusion-forcing-transformer

Transformer implementation for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"

Python 49 3 Updated Aug 15, 2024

buoyancy99 / diffusion-forcing

code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"

Python 469 16 Updated Aug 30, 2024

xinchengshuai / Awesome-Image-Editing

A Survey of Image Editing

206 8 Updated Jul 22, 2024

gojasper / flash-diffusion

Official implementation of ⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation

Python 430 32 Updated Jul 3, 2024

GAIR-NLP / anole

Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation

Python 629 35 Updated Aug 5, 2024

TideDra / VL-RLHF

A RLHF Infrastructure for Vision-Language Models

Python 85 5 Updated Jun 12, 2024

TobyYang7 / Llava_Qwen2

Visual Instruction Tuning for Qwen2 Base Model

Python 13 1 Updated Jun 29, 2024

FoundationVision / LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,185 46 Updated Aug 15, 2024

RLHF-V / RLAIF-V

RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness

Python 199 6 Updated Sep 2, 2024

ShihaoZhaoZSH / LaVi-Bridge

[ECCV 2024] Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Shu shufangxun

Achievements

Achievements

Block or report shufangxun

Starred repositories

allenai / OLMo

ZrrSkywalker / MAVIS

showlab / Awesome-Unified-Multimodal-Models

lucidrains / transfusion-pytorch

showlab / Show-o

IntelLabs / multimodal_cognitive_ai

OpenBMB / MiniCPM-V

JUNJIE99 / MLVU

Alpha-VLLM / Lumina-mGPT

facebookresearch / generative-recommenders