LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python 2,080 178 Updated Jul 12, 2024

karpathy / llama2.c

Inference Llama 2 in one file of pure C

C 16,790 1,959 Updated Jul 10, 2024

meta-llama / llama

Inference code for Llama models

Python 54,216 9,330 Updated May 15, 2024

haotian-liu / LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 18,107 1,970 Updated Jul 3, 2024

NVIDIA / FasterTransformer

Transformer related optimization, including BERT, GPT

C++ 5,650 876 Updated Mar 27, 2024

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 22,829 3,229 Updated Jul 13, 2024

wangsiping97 / FastGEMV

High-speed GEMV kernels, with up to 2.7x speedup over the PyTorch baseline.

Cuda 72 2 Updated Jun 6, 2023

liusy58 / CompilerNotes

TeX 19 3 Updated Nov 21, 2023

ggml-org / p1

LLM-based code completion engine

171 Updated Jun 21, 2023

stanford-crfm / levanter

Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax

Python 468 71 Updated Jul 11, 2024

mlc-ai / mlc-llm

Universal LLM Deployment Engine with ML Compilation

Python 17,768 1,415 Updated Jul 12, 2024

determined-ai / determined

Determined is an open-source machine learning platform that simplifies distributed training, hyperparameter tuning, experiment tracking, and resource management. Works with PyTorch and TensorFlow.

Go 2,924 348 Updated Jul 12, 2024

Mooler0410 / LLMsPracticalGuide

A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)

9,012 688 Updated May 31, 2024

karpathy / minGPT

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 19,432 2,406 Updated Apr 28, 2024

Stability-AI / StableLM

StableLM: Stability AI Language Models

Jupyter Notebook 15,850 1,035 Updated Apr 8, 2024

Significant-Gravitas / AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 164,037 43,572 Updated Jul 12, 2024

fpgaminer / GPTQ-triton

GPTQ inference Triton kernel

Jupyter Notebook 270 20 Updated May 18, 2023

facebookresearch / xformers

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 8,026 565 Updated Jul 12, 2024

young-geng / EasyLM

Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.

Python 2,316 243 Updated Jun 26, 2024

Kathryn (Jinqi) Chen Kathryn-cat

Stars

tensorflow / serving

horovod / horovod

microsoft / DeepSpeed

oobabooga / text-generation-webui

persimmon-ai-labs / adept-inference

ray-project / llm-numbers

steven2358 / awesome-generative-ai

gongminmin / awesome-aigc

quantprep / quantnewgrad2022

pingan8787 / awesome-ai-tools

suno-ai / bark

ModelTC / lightllm