[go: nahoru, domu]

Skip to content
View Kathryn-cat's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Carnegie Mellon University
  • Pittsburgh, PA
  • 22:31 (UTC -04:00)
  • X @meow_cat_7

Highlights

  • Pro
Block or Report

Block or report Kathryn-cat

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A flexible, high-performance serving system for machine learning models

C++ 6,124 2,190 Updated Jul 12, 2024

Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

Python 14,065 2,221 Updated Jul 12, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 33,837 3,973 Updated Jul 12, 2024

A Gradio web UI for Large Language Models.

Python 38,321 5,078 Updated Jul 12, 2024

Inference code for Persimmon-8B

Python 415 23 Updated Sep 9, 2023

Numbers every LLM developer should know

3,991 138 Updated Jan 16, 2024

A curated list of modern Generative Artificial Intelligence projects and services

5,244 577 Updated Jul 12, 2024

A list of awesome AIGC works

536 43 Updated Oct 21, 2023

Finding the AI tools you need!

246 39 Updated Feb 15, 2024

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 33,802 4,011 Updated Jul 10, 2024

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python 2,080 178 Updated Jul 12, 2024

Inference Llama 2 in one file of pure C

C 16,790 1,959 Updated Jul 10, 2024

Inference code for Llama models

Python 54,216 9,330 Updated May 15, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 18,107 1,970 Updated Jul 3, 2024

Transformer related optimization, including BERT, GPT

C++ 5,650 876 Updated Mar 27, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 22,829 3,229 Updated Jul 13, 2024

High-speed GEMV kernels, at most 2.7x speedup compared to pytorch baseline.

Cuda 72 2 Updated Jun 6, 2023
TeX 19 3 Updated Nov 21, 2023

LLM-based code completion engine

171 Updated Jun 21, 2023

Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax

Python 468 71 Updated Jul 11, 2024

Universal LLM Deployment Engine with ML Compilation

Python 17,768 1,415 Updated Jul 12, 2024

Determined is an open-source machine learning platform that simplifies distributed training, hyperparameter tuning, experiment tracking, and resource management. Works with PyTorch and TensorFlow.

Go 2,924 348 Updated Jul 12, 2024

A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)

9,012 688 Updated May 31, 2024

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 19,432 2,406 Updated Apr 28, 2024

StableLM: Stability AI Language Models

Jupyter Notebook 15,850 1,035 Updated Apr 8, 2024

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 164,037 43,572 Updated Jul 12, 2024

GPTQ inference Triton kernel

Jupyter Notebook 270 20 Updated May 18, 2023

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 8,026 565 Updated Jul 12, 2024

Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.

Python 2,316 243 Updated Jun 26, 2024
Next