Alibaba Group
Starred repositories
KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache
XNOR-Net, with binary GEMM and binary conv2d kernels, supporting both CPU and GPU.
Binarize convolutional neural networks using PyTorch 🔥
Step-by-step optimization of CUDA SGEMM
ImageNet classification using binary Convolutional Neural Networks
(ICML 2024) BiLLM: Pushing the Limit of Post-Training Quantization for LLMs
FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batch sizes of 16-32 tokens.
A GPU-accelerated graph learning library for PyTorch, facilitating the scaling of GNN training and inference.
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Baichuan, Mixtral, Gemma, Phi, etc.) on Intel CPU and GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and…
An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
Edge-MoE: Memory-Efficient Multi-Task Vision Transformer Architecture with Task-level Sparsity via Mixture-of-Experts
Fast inference from large language models via speculative decoding
Running large language models on a single GPU for throughput-oriented scenarios.
Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".
Ongoing research training transformer models at scale
A playbook for systematically maximizing the performance of deep learning models.
A plugin that localizes your VSCode for China, derived from CEC-IDE, with features such as sensitive-word detection and anti-addiction.
Strategies for Pre-training Graph Neural Networks
Understanding and Extending Subgraph GNNs by Rethinking their Symmetries (NeurIPS 2022 Oral)