[go: nahoru, domu]

Skip to content
View zhengpeirong's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro
Block or Report

Block or report zhengpeirong

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.

Starred repositories

Showing results
Python 16 1 Updated Jun 19, 2024

QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving

Python 321 8 Updated May 14, 2024

an example to use udp to boradcast

C 13 10 Updated Dec 20, 2023

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 2,061 149 Updated Jun 12, 2024

Code at the speed of thought – Zed is a high-performance, multiplayer code editor from the creators of Atom and Tree-sitter.

Rust 39,097 2,022 Updated Jun 29, 2024

Tensor parallelism is all you need. Run LLMs on weak devices or make powerful devices even more powerful by distributing the workload and dividing the RAM usage.

C++ 1,013 68 Updated Jun 29, 2024

The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".

1,311 82 Updated Jun 3, 2024

LLM101n: Let's build a Storyteller

12,495 540 Updated Jun 28, 2024

Collective communications library with various primitives for multi-machine training.

C++ 1,159 293 Updated Jun 26, 2024

Survey Paper List - Efficient LLM and Foundation Models

165 6 Updated Mar 19, 2024

unix socket interface for C++ raw IP/IP6/UDP/TCP, Layer2 etc. framework

C++ 40 10 Updated Mar 1, 2023

CVPR 2024 accepted paper, An Upload-Efficient Scheme for Transferring Knowledge From a Server-Side Pre-trained Generator to Clients in Heterogeneous Federated Learning

Python 23 Updated Jun 6, 2024

Simple Raspberry Pi monitoring tool

Python 259 36 Updated Dec 24, 2023

FyuseNet is an OpenGL(ES) based library that allows to run neural network inference on GPUs that support OpenGL or OpenGL/ES, which is the case for most desktop and mobile GPUs on the market.

C++ 43 2 Updated Mar 13, 2024

ShaderNN is a lightweight deep learning inference framework optimized for Convolutional Neural Networks on mobile platforms.

C++ 153 20 Updated Aug 15, 2023

Application layer implementation of TCP and UDP protocols using linux RAW sockets

C++ 3 Updated Jan 9, 2024

An example about working with raw sockets under GNU/Linux

C 39 8 Updated Jul 6, 2019

Self-hosted AI coding assistant

Rust 18,247 766 Updated Jun 29, 2024

A very basic first attempt at RAG with ollama and Phi-3 Mini

Python 1 Updated Jun 1, 2024
JavaScript 3 Updated May 4, 2023
Python 112 58 Updated Apr 6, 2022

P4 on Raspberry Pi for Networking Education

JavaScript 121 30 Updated May 8, 2024

Since the emergence of chatGPT in 2022, the acceleration of Large Language Model has become increasingly important. Here is a list of papers on accelerating LLMs, currently focusing mainly on infer…

100 4 Updated Jun 26, 2024

Packet Test Framework

Python 143 99 Updated Apr 18, 2024

Collection of awesome LLM apps with RAG using OpenAI, Anthropic, Gemini and opensource models.

Python 2,167 181 Updated Jun 18, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. 接近GPT-4V表现的可商用开源多模态对话模型

Python 3,772 290 Updated Jun 20, 2024

p4

P4 1 2 Updated Dec 30, 2022
P4 1 1 Updated Aug 18, 2023

Awesome Docker Compose samples

HTML 31,782 6,128 Updated Jun 20, 2024
Next