-
Notifications
You must be signed in to change notification settings - Fork 3.2k
Issues: hiyouga/LLaMA-Factory
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
RuntimeError: "addmm_impl_cpu_" not implemented for 'Half' 华为910 命令行推理报错
npu
This problem is related to NPU devices
pending
This problem is yet to be addressed
#4622
opened Jun 30, 2024 by
apachemycat
updated Jul 1, 2024
1 task done
🚨FAQs | 常见问题🚨
good first issue
Good for newcomers
#4614
opened Jun 28, 2024 by
hiyouga
updated Jun 29, 2024
fsdp + DPO + fullyfintune会报错
bug
Something isn't working
pending
This problem is yet to be addressed
#4608
opened Jun 28, 2024 by
qy1026
updated Jun 29, 2024
1 task done
Phi-3-small exploding gradient issue.
pending
This problem is yet to be addressed
#3881
opened May 23, 2024 by
HideLord
updated Jun 27, 2024
1 task done
可以支持英伟达的新模型Nemotron 340B吗?
pending
This problem is yet to be addressed
#4313
opened Jun 16, 2024 by
laosuan
updated Jun 24, 2024
1 task done
请问是否会在框架内集成RLOO算法,最新的online RLHF?
enhancement
New feature or request
pending
This problem is yet to be addressed
#4287
opened Jun 14, 2024 by
ArcherShirou
updated Jun 24, 2024
1 task done
关于npu训练模型总结以及疑问
npu
This problem is related to NPU devices
pending
This problem is yet to be addressed
#4388
opened Jun 20, 2024 by
sweetning0809
updated Jun 22, 2024
1 task done
Ideas behind sharing parameters of policy model and value model?
enhancement
New feature or request
pending
This problem is yet to be addressed
#1563
opened Nov 19, 2023 by
MagiaSN
updated Jun 22, 2024
sft+freeze训练internlm2-base-7b报错,RuntimeError: element 0 of tensors does not require grad and does not have a grad_fn
npu
This problem is related to NPU devices
pending
This problem is yet to be addressed
#4101
opened Jun 6, 2024 by
1737686924
updated Jun 19, 2024
1 task done
昇腾多卡训练问题
npu
This problem is related to NPU devices
pending
This problem is yet to be addressed
#3810
opened May 19, 2024 by
1737686924
updated Jun 19, 2024
1 task done
MODPO: Multi-Objective Direct Preference Optimization
enhancement
New feature or request
pending
This problem is yet to be addressed
#3973
opened May 30, 2024 by
AlexYoung757
updated Jun 19, 2024
【NPU】GLM-4-9B-Chat PPO 出错
npu
This problem is related to NPU devices
pending
This problem is yet to be addressed
#4135
opened Jun 7, 2024 by
hunterhome
updated Jun 19, 2024
1 task done
昇腾卡训练不支持offload
npu
This problem is related to NPU devices
pending
This problem is yet to be addressed
#4146
opened Jun 7, 2024 by
wangbing35
updated Jun 19, 2024
1 task done
在npu上,对大模型用zero3进行全参微调,初始换参数很大,是什么原因造成的?
npu
This problem is related to NPU devices
pending
This problem is yet to be addressed
#4272
opened Jun 14, 2024 by
fjw1049
updated Jun 19, 2024
1 task done
Ascend卡上无法训练deepseek模型 是否支持呢
npu
This problem is related to NPU devices
pending
This problem is yet to be addressed
#4361
opened Jun 18, 2024 by
sweetning0809
updated Jun 19, 2024
1 task done
FSDP QDoRa
pending
This problem is yet to be addressed
#3550
opened May 2, 2024 by
etemiz
updated Jun 19, 2024
1 task done
[Feature request] 支持Qwen-VL
pending
This problem is yet to be addressed
#4375
opened Jun 19, 2024 by
marko1616
updated Jun 19, 2024
Function tool calling inference without llama-factory openai style api.
pending
This problem is yet to be addressed
#4364
opened Jun 18, 2024 by
svjack
updated Jun 18, 2024
1 task done
Jamba & Deepspeed zero-3
pending
This problem is yet to be addressed
#4300
opened Jun 15, 2024 by
lwang2070
updated Jun 17, 2024
1 task done
Output difference between LLaMA-Factory and llama.cpp
pending
This problem is yet to be addressed
#3563
opened May 3, 2024 by
anidh
updated Jun 9, 2024
1 task done
data.utils.split_dataset中的切分和随机逻辑能否迁移到data.loader.get_dataset中?
pending
This problem is yet to be addressed
#4140
opened Jun 7, 2024 by
luoqishuai
updated Jun 7, 2024
1 task done
对于微调分类任务,如何在使用api inference时获取输出标签置信分数
enhancement
New feature or request
pending
This problem is yet to be addressed
#3932
opened May 28, 2024 by
xhdu
updated Jun 7, 2024
1 task done
有计划支持LoRAMoE吗?
pending
This problem is yet to be addressed
#2749
opened Mar 8, 2024 by
luyuntao92
updated Jun 5, 2024
1 task done
用openai库 请求时,流式请求时缺stream_options={"include_usage": True}的处理,用于计算流式tokens
pending
This problem is yet to be addressed
#3998
opened May 30, 2024 by
sasicDHH
updated Jun 3, 2024
1 task done
Feature suggestion: cutoff_len could optionally drop too long examples from dataset.
pending
This problem is yet to be addressed
#3995
opened May 30, 2024 by
s4s0l
updated Jun 3, 2024
ProTip!
Adding no:label will show everything without a label.