[go: nahoru, domu]

Skip to content
View karpathy's full-sized avatar

Highlights

  • Pro
Block or Report

Block or report karpathy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A lightweight library for portable low-level GPU computation using WebGPU.

C++ 803 17 Updated Jul 16, 2024

Simple Byte pair Encoding mechanism used for tokenization process . written purely in C

C 97 2 Updated Jul 7, 2024

UNet diffusion model in pure CUDA

Cuda 521 23 Updated Jun 28, 2024

Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"

Python 699 38 Updated Jul 11, 2024

gpt-2 from scratch in mlx

Python 326 22 Updated Jun 12, 2024

User-friendly WebUI for LLMs (Formerly Ollama WebUI)

Svelte 31,466 3,465 Updated Jul 16, 2024

Implementation of Diffusion Transformer (DiT) in JAX

Python 231 4 Updated Jun 11, 2024

Implementation for MatMul-free LM.

Python 2,674 159 Updated Jun 27, 2024

Schedule-Free Optimization in PyTorch

Python 1,698 55 Updated Jul 12, 2024

SkyPilot: Run LLMs, AI, and Batch jobs on any cloud. Get maximum savings, highest GPU availability, and managed execution—all with a simple interface.

Python 6,273 430 Updated Jul 16, 2024

Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI

TypeScript 11,148 991 Updated Jul 15, 2024

My favorite C programming practices.

1,918 94 Updated Oct 1, 2020

Tile primitives for speedy kernels

Cuda 1,388 50 Updated Jul 16, 2024

A minimal GPU design in Verilog to learn how GPUs work from the ground up

SystemVerilog 6,701 500 Updated Jun 14, 2024

lightweight, standalone C++ inference engine for Google's Gemma models.

C++ 5,781 491 Updated Jul 16, 2024

LLM inference in C/C++

C++ 61,741 8,837 Updated Jul 16, 2024

Distribute and run LLMs with a single file.

C++ 17,043 850 Updated Jul 6, 2024

A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.

Python 420 18 Updated Jul 15, 2024

Fast bare-bones BPE for modern tokenizer training

Python 129 2 Updated Dec 19, 2023

The official PyTorch implementation of Google's Gemma models

Python 5,169 490 Updated Jul 11, 2024

A benchmark to evaluate language models on questions I've previously asked them to solve.

Python 807 59 Updated Jun 27, 2024

Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering""

Python 3,306 238 Updated May 17, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 23,021 3,256 Updated Jul 16, 2024

MLX: An array framework for Apple silicon

C++ 15,788 898 Updated Jul 16, 2024

RuLES: a benchmark for evaluating rule-following in language models

Python 199 15 Updated Jun 21, 2024

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,165 4,015 Updated Mar 12, 2024

Fine-tune mistral-7B on 3090s, a100s, h100s

Python 696 63 Updated Oct 11, 2023

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Python 5,369 488 Updated Jul 13, 2024

Finetune Llama 3, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Python 12,718 830 Updated Jul 16, 2024

A python script to help manage a Gmail inbox by filtering out promotional emails using GPT-3 or GPT-4.

Python 408 26 Updated Dec 2, 2023
Next