-
Notifications
You must be signed in to change notification settings - Fork 3.2k
Issues: hiyouga/LLaMA-Factory
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
WARNING:root:Some parameters are on the meta device device because they were offloaded to the cpu.
pending
This problem is yet to be addressed
#4697
opened Jul 5, 2024 by
stromyu520
updated Jul 5, 2024
1 task done
triton.runtime.autotuner.OutOfResources
pending
This problem is yet to be addressed
#4688
opened Jul 5, 2024 by
GitIgnoreMaybe
updated Jul 5, 2024
1 task done
Feature request: is Adam-mini optimizer worth adding?
pending
This problem is yet to be addressed
#4696
opened Jul 5, 2024 by
jim-plus
updated Jul 5, 2024
1 task done
glm4 deepspeed lora sft : Cannot copy out of meta tensor; no data!
pending
This problem is yet to be addressed
#4689
opened Jul 5, 2024 by
ldknight
updated Jul 5, 2024
1 task done
大模型运行推理的时候,请求日志,打印重复
pending
This problem is yet to be addressed
#4690
opened Jul 5, 2024 by
caijx168
updated Jul 5, 2024
1 task done
疑问:历史消息在训练时可以只作为上文不参与模型的预测吗?~
pending
This problem is yet to be addressed
#4684
opened Jul 4, 2024 by
ylsdamxssjxxdd
updated Jul 4, 2024
1 task done
llamafactory-cli api 加载Gemma2模型,运行了一段时间后出现 CUDA error: unspecified launch failure
pending
This problem is yet to be addressed
#4641
opened Jul 2, 2024 by
ToviHe
updated Jul 4, 2024
1 task done
qwen2 72b 910b lora后merge生成的权重 推理失败
npu
This problem is related to NPU devices
pending
This problem is yet to be addressed
#4659
opened Jul 3, 2024 by
wphtrying
updated Jul 3, 2024
1 task done
ValueError: Failed to convert pandas DataFrame to Arrow Table from file
pending
This problem is yet to be addressed
#4650
opened Jul 2, 2024 by
fzp0424
updated Jul 2, 2024
1 task done
[PPU]大佬有对ppu环境进行过测试么
pending
This problem is yet to be addressed
#4606
opened Jun 28, 2024 by
willionZS
updated Jul 2, 2024
1 task done
Unable to run model.generate() for MoD model
pending
This problem is yet to be addressed
#4063
opened Jun 4, 2024 by
Zkli-hub
updated Jul 2, 2024
1 task done
RuntimeError: "addmm_impl_cpu_" not implemented for 'Half' 华为910 命令行推理报错
npu
This problem is related to NPU devices
pending
This problem is yet to be addressed
#4622
opened Jun 30, 2024 by
apachemycat
updated Jul 1, 2024
1 task done
🚨FAQs | 常见问题🚨
good first issue
Good for newcomers
#4614
opened Jun 28, 2024 by
hiyouga
updated Jun 29, 2024
fsdp + DPO + fullyfintune会报错
bug
Something isn't working
pending
This problem is yet to be addressed
#4608
opened Jun 28, 2024 by
qy1026
updated Jun 29, 2024
1 task done
Phi-3-small exploding gradient issue.
pending
This problem is yet to be addressed
#3881
opened May 23, 2024 by
HideLord
updated Jun 27, 2024
1 task done
可以支持英伟达的新模型Nemotron 340B吗?
pending
This problem is yet to be addressed
#4313
opened Jun 16, 2024 by
laosuan
updated Jun 24, 2024
1 task done
请问是否会在框架内集成RLOO算法,最新的online RLHF?
enhancement
New feature or request
pending
This problem is yet to be addressed
#4287
opened Jun 14, 2024 by
ArcherShirou
updated Jun 24, 2024
1 task done
关于npu训练模型总结以及疑问
npu
This problem is related to NPU devices
pending
This problem is yet to be addressed
#4388
opened Jun 20, 2024 by
sweetning0809
updated Jun 22, 2024
1 task done
Ideas behind sharing parameters of policy model and value model?
enhancement
New feature or request
pending
This problem is yet to be addressed
#1563
opened Nov 19, 2023 by
MagiaSN
updated Jun 22, 2024
sft+freeze训练internlm2-base-7b报错,RuntimeError: element 0 of tensors does not require grad and does not have a grad_fn
npu
This problem is related to NPU devices
pending
This problem is yet to be addressed
#4101
opened Jun 6, 2024 by
1737686924
updated Jun 19, 2024
1 task done
昇腾多卡训练问题
npu
This problem is related to NPU devices
pending
This problem is yet to be addressed
#3810
opened May 19, 2024 by
1737686924
updated Jun 19, 2024
1 task done
MODPO: Multi-Objective Direct Preference Optimization
enhancement
New feature or request
pending
This problem is yet to be addressed
#3973
opened May 30, 2024 by
AlexYoung757
updated Jun 19, 2024
【NPU】GLM-4-9B-Chat PPO 出错
npu
This problem is related to NPU devices
pending
This problem is yet to be addressed
#4135
opened Jun 7, 2024 by
hunterhome
updated Jun 19, 2024
1 task done
昇腾卡训练不支持offload
npu
This problem is related to NPU devices
pending
This problem is yet to be addressed
#4146
opened Jun 7, 2024 by
wangbing35
updated Jun 19, 2024
1 task done
在npu上,对大模型用zero3进行全参微调,初始换参数很大,是什么原因造成的?
npu
This problem is related to NPU devices
pending
This problem is yet to be addressed
#4272
opened Jun 14, 2024 by
fjw1049
updated Jun 19, 2024
1 task done
Previous Next
ProTip!
What’s not been updated in a month: updated:<2024-06-05.