- Stanford
- https://twitter.com/karpathy
Stars
A lightweight library for portable low-level GPU computation using WebGPU.
A simple byte pair encoding (BPE) mechanism for tokenization, written purely in C.
Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"
User-friendly WebUI for LLMs (Formerly Ollama WebUI)
Implementation of Diffusion Transformer (DiT) in JAX
Schedule-Free Optimization in PyTorch
SkyPilot: Run LLMs, AI, and Batch jobs on any cloud. Get maximum savings, highest GPU availability, and managed execution—all with a simple interface.
Perplexica is an AI-powered search engine and an open-source alternative to Perplexity AI.
A minimal GPU design in Verilog to learn how GPUs work from the ground up
A lightweight, standalone C++ inference engine for Google's Gemma models.
Distribute and run LLMs with a single file.
A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.
Fast bare-bones BPE for modern tokenizer training
The official PyTorch implementation of Google's Gemma models
A benchmark to evaluate language models on questions I've previously asked them to solve.
Official implementation for the paper "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering"
A high-throughput and memory-efficient inference and serving engine for LLMs
RuLES: a benchmark for evaluating rule-following in language models
Code and documentation to train Stanford's Alpaca models, and generate the data.
Fine-tune Mistral-7B on 3090s, A100s, and H100s
Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.
Finetune Llama 3, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
A python script to help manage a Gmail inbox by filtering out promotional emails using GPT-3 or GPT-4.
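Several of the starred repos above center on byte pair encoding (the pure-C BPE mechanism and the bare-bones BPE trainer). As an illustration only, not code from any of those repos, here is a minimal sketch of the core BPE training loop: repeatedly find the most frequent adjacent token pair and merge it into a new token.

```python
from collections import Counter

def bpe_train(text, num_merges):
    """Toy BPE trainer: start from characters and repeatedly merge the
    most frequent adjacent pair into a single new token."""
    tokens = list(text)  # initial vocabulary: individual characters
    merges = []
    for _ in range(num_merges):
        pairs = Counter(zip(tokens, tokens[1:]))
        if not pairs:
            break
        (a, b), count = pairs.most_common(1)[0]
        if count < 2:
            break  # no pair repeats, nothing worth merging
        merges.append((a, b))
        # Replace every occurrence of the pair (a, b) with the merged token.
        merged, i = [], 0
        while i < len(tokens):
            if i + 1 < len(tokens) and tokens[i] == a and tokens[i + 1] == b:
                merged.append(a + b)
                i += 2
            else:
                merged.append(tokens[i])
                i += 1
        tokens = merged
    return tokens, merges
```

For example, `bpe_train("aaabdaaabac", 3)` first merges the pair `("a", "a")`, then builds longer tokens from the merged units. Real tokenizers (e.g. the ones these repos implement) work over bytes rather than characters and use far faster pair-counting data structures, but the merge rule is the same.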