ArtificialZeng's repositories
-
Yuan2.0-M32 Public
Forked from IEIT-Yuan/Yuan2.0-M32
Mixture-of-Experts (MoE) Language Model
Python · Apache License 2.0 · Updated Jul 11, 2024
-
APUS-xDAN-4.0-moe Public
Forked from shootime2021/APUS-xDAN-4.0-moe
An open-source LLM based on an MoE structure.
Other · Updated Jul 2, 2024
-
llama3-8x8b-MoE Public
Forked from cooper12121/llama3-8x8b-MoE
Copies the MLP of llama3 8 times as 8 experts, creates a router with random initialization, and adds a load-balancing loss to construct an 8x8b MoE model based on llama3.
Python · Updated Jul 1, 2024
-
Qwen2 Public
Forked from QwenLM/Qwen2
Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.
Shell · Updated Jun 7, 2024
-
byzer-llm Public
Forked from allwefantasy/byzer-llm
Easy, fast, and cheap pretraining, fine-tuning, and serving for everyone
-
ragflow Public
Forked from infiniflow/ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Python · Apache License 2.0 · Updated May 30, 2024
-
baby-llama2-chinese Public
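Forked from DLLXW/baby-llama2-chinese
A repository for pretraining from scratch and then running SFT on a small-parameter Chinese LLaMa2; a single 24 GB GPU is enough to run it and obtain a chat-llama2 with basic Chinese question-answering ability.
Python · MIT License · Updated Apr 22, 2024
-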
llama3_explained Public
The newest version of llama3, with the source code explained line by line in Chinese
-
llama3 Public
Forked from meta-llama/llama3
The official Meta Llama 3 GitHub site
Python · Other · Updated Apr 18, 2024
-
humanoid-gym Public
Forked from roboterax/humanoid-gym
Humanoid-Gym: Reinforcement Learning for Humanoid Robot with Zero-Shot Sim2Real Transfer https://arxiv.org/abs/2404.05695
Python · Updated Apr 18, 2024
-
LLaMA-Factory-Explained Public
LLaMA-Factory can fine-tune more than 100 models. This is a Chinese-language code explanation of that project.
-
HCL Public
Forked from pkualpha/HCL
Hypergraph Contrastive Learning for EHR
Python · Updated Mar 20, 2024
-
GraphCare Public
Forked from pat-jj/GraphCare
[ICLR'24] Enhancing Healthcare Predictions with Personalized Knowledge Graphs
Python · Updated Mar 12, 2024
-
ProG Public
Forked from sheldonresearch/ProG
All in One: Multi-task Prompting for Graph Neural Networks, KDD 2023.
Python · Updated Mar 4, 2024
-
parameter-efficient-moe Public
Forked from for-ai/parameter-efficient-moe
Python · Updated Oct 31, 2023
-
Baichuan-Qwen-Llama-tuning-Explained
-
Firefly Public
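Forked from yangjianxin1/Firefly
Firefly (流萤): a Chinese conversational large language model (full-parameter fine-tuning + QLoRA), supporting fine-tuning of Llama2, Llama, Baichuan, InternLM, Ziya, Bloom, and other large models
Python · Updated Sep 26, 2023
-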
Qwen-7B Public
Forked from QwenLM/Qwen
The official repo of Qwen-7B (通义千问-7B) chat & pretrained large language model proposed by Alibaba Cloud.
-
transformers-Explained Public
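A source-code walkthrough of the official transformers library. In the era of large AI models, pytorch and transformer are the new operating system, and everything else is software running on top of them.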
-
Baichuan2-Explained Public
A line-by-line walkthrough of the Baichuan2 code, suitable for beginners
-
vllm Public
Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Python · Apache License 2.0 · Updated Sep 12, 2023
-
Baichuan2 Public
Forked from baichuan-inc/Baichuan2
A series of large language models developed by Baichuan Intelligent Technology
-
OpenBuddy Public
Forked from OpenBuddy/OpenBuddy
Open Multilingual Chatbot for Everyone
Python · GNU General Public License v3.0 · Updated Sep 7, 2023