Tensor parallelism is all you need. Run LLMs on weak devices or make powerful devices even more powerful by distributing the workload and dividing the RAM usage.

C++ 1,021 68 Updated Jun 29, 2024

MLGroupJLU / LLM-eval-survey

The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".

1,313 82 Updated Jun 3, 2024

karpathy / LLM101n

LLM101n: Let's build a Storyteller

12,878 561 Updated Jun 28, 2024

facebookincubator / gloo

Collective communications library with various primitives for multi-machine training.

C++ 1,159 293 Updated Jun 26, 2024

UbiquitousLearning / Efficient_Foundation_Model_Survey

Survey Paper List - Efficient LLM and Foundation Models

168 6 Updated Mar 19, 2024

stealth / libusipp

unix socket interface for C++ raw IP/IP6/UDP/TCP, Layer2 etc. framework

C++ 40 10 Updated Mar 1, 2023

TsingZ0 / FedKTL

CVPR 2024 accepted paper, An Upload-Efficient Scheme for Transferring Knowledge From a Server-Side Pre-trained Generator to Clients in Heterogeneous Federated Learning

Python 23 Updated Jun 6, 2024

MilhouseVH / bcmstat

Simple Raspberry Pi monitoring tool

Python 259 36 Updated Dec 24, 2023

mtnwrw / fyusenet

Forked from Fyusion-Open-Source/fyusenet

FyuseNet is an OpenGL(ES) based library that allows to run neural network inference on GPUs that support OpenGL or OpenGL/ES, which is the case for most desktop and mobile GPUs on the market.

C++ 43 2 Updated Mar 13, 2024

Peirong Zheng zhengpeirong

Highlights

Block or report zhengpeirong

Lists (5)

INC

awesome list

edge AI

Model

LLM-APP

Starred repositories

MATLAB