Starred repositories

623 results for source starred repositories

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 1,377 88 Updated Jun 21, 2024

YaFSDP: Yet another Fully Sharded Data Parallel

Python 751 36 Updated Jun 27, 2024

(ICML 2024) Alphazero-like Tree-Search can guide large language model decoding and training

Python 132 14 Updated May 26, 2024

[CVPR 2023 Best Paper] Planning-oriented Autonomous Driving

Python 3,065 325 Updated Mar 19, 2024

Repository hosting code used to reproduce results in "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152, I…

Python 483 80 Updated Jul 2, 2024

Ongoing research training transformer models at scale

Python 9,264 2,088 Updated Jul 1, 2024

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Python 3,173 255 Updated Jul 2, 2024

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to this project.

Python 10,866 971 Updated Jul 1, 2024
Python 225 37 Updated Jul 2, 2024

ring-attention experiments

Python 75 10 Updated Apr 10, 2024

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 7,971 564 Updated Jul 2, 2024

Fast and Easy Infinite Neural Networks in Python

Jupyter Notebook 2,250 228 Updated Mar 1, 2024

Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in Pytorch

Python 397 22 Updated Jul 2, 2024

RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing …

Cuda 669 178 Updated Jul 3, 2024

Transformers with Arbitrarily Large Context

Python 563 43 Updated Jun 22, 2024

📖 A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.

1,869 131 Updated Jul 3, 2024

Code examples and resources for DBRX, a large language model developed by Databricks

Python 2,473 231 Updated May 1, 2024

HPT - Open Multimodal LLMs from HyperGAI

Python 301 14 Updated Jun 6, 2024

High performance distributed framework for training deep learning recommendation models based on PyTorch.

Rust 388 51 Updated Feb 7, 2024

The official PyTorch implementation of Google's Gemma models

Python 5,138 488 Updated Jul 2, 2024

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 7,288 423 Updated May 3, 2024

The AI-native database built for LLM applications, providing incredibly fast full-text and vector search

C++ 2,033 171 Updated Jul 3, 2024

A primitive library for neural network

C++ 1,239 209 Updated Jun 21, 2024

Universal LLM Deployment Engine with ML Compilation

Python 17,684 1,407 Updated Jul 2, 2024

FlashInfer: Kernel Library for LLM Serving

Cuda 753 62 Updated Jul 2, 2024

SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with models faster and more controllable.

Python 2,761 179 Updated Jul 2, 2024

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 4,235 325 Updated May 28, 2024