[go: nahoru, domu]

Skip to content
View ArtificialZeng's full-sized avatar
💭
I may be slow to respond.
💭
I may be slow to respond.
Block or Report

Block or report ArtificialZeng

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

MiniCPM-2B: An end-side LLM outperforming Llama2-13B.

Python 4,426 320 Updated Jul 8, 2024

SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with models faster and more controllable.

Python 2,824 182 Updated Jul 12, 2024

Repo for ShenNong-TCM-LLM (“神农”大模型,首个中医药中文大模型)

Python 256 36 Updated Aug 31, 2023

Yuan 2.0 Large Language Model

Python 671 84 Updated Jul 11, 2024

Ongoing research training transformer models at scale

Python 9,374 2,114 Updated Jul 12, 2024

A family of open-sourced Mixture-of-Experts (MoE) Large Language Models

Python 1,302 63 Updated Mar 8, 2024

Mixture-of-Experts (MoE) Language Model

Python 163 39 Updated Jul 11, 2024

首个中医大语言模型——“仲景”。受古代中医学巨匠张仲景深邃智慧启迪,专为传统中医领域打造的预训练大语言模型。 The first-ever Traditional Chinese Medicine large language model - "CMLM-ZhongJing". Inspired by the profound wisdom of the ancient Chinese me…

Jupyter Notebook 219 13 Updated Jul 9, 2024

Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.

Python 1,306 150 Updated Jun 9, 2024

A machine learning compiler for GPUs, CPUs, and ML accelerators

C++ 2,424 365 Updated Jul 12, 2024

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Python 29,159 2,671 Updated Jul 12, 2024

Flax is a neural network library for JAX that is designed for flexibility.

Python 5,806 617 Updated Jul 12, 2024

Minimal library to train LLMs on TPU in JAX with pjit().

Python 264 35 Updated Dec 20, 2023

Official JAX implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Python 221 12 Updated Jul 8, 2024

Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Python 605 32 Updated Jul 9, 2024

Copy the MLP of llama3 8 times as 8 experts , created a router with random initialization,add load balancing loss to construct an 8x8b MoE model based on llama3.

Python 18 3 Updated Jul 1, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 18,106 1,970 Updated Jul 3, 2024
Python 2,581 297 Updated Jul 11, 2024

AGIGuest is an ambitious project aimed at advancing Artificial General Intelligence (AGI) by teaching AI to solve programming problems on LeetCode. AGIGuest 是一个雄心勃勃的项目,旨在通过教授人工智能解决 LeetCode 上的编程问题来…

Python 1 Updated Jul 7, 2024

明医 (MING):中文医疗问诊大模型

Python 755 96 Updated Jun 6, 2024

类似按键精灵的鼠标键盘录制和自动化操作 模拟点击和键入 | automate mouse clicks and keyboard input

Python 6,569 975 Updated Mar 1, 2024

Its an open source LLM based on MOE Structure.

54 9 Updated Jul 2, 2024

A list of LLM benchmark frameworks.

51 3 Updated Feb 17, 2024

常用线性代数算法的python实现。包括lu、svd、qr、diagonalization、inv、pinv、lstsq、solve Ax、solve nullspace

Python 21 11 Updated Jan 27, 2022

Locating and editing factual associations in GPT (NeurIPS 2022)

Python 525 110 Updated Apr 20, 2024

A series of large language models trained from scratch by developers @01-ai

Python 7,470 455 Updated Jun 27, 2024

深度学习500问,以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述,以帮助自己及有需要的读者。 全书分为18个章节,50余万字。由于水平有限,书中不妥之处恳请广大读者批评指正。 未完待续............ 如有意合作,联系scutjy2015@163.com 版权所有,违权必究 Tan 2018.06

JavaScript 53,776 15,775 Updated Jun 26, 2024

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Python 922 41 Updated Jan 16, 2024

Representation learning on large graphs using stochastic graph convolutions.

Python 3,354 838 Updated Nov 21, 2022
Next