Issues: hiyouga/LLaMA-Factory
Issues list
glm4 deepspeed lora sft : Cannot copy out of meta tensor; no data!
pending
This problem is yet to be addressed
#4689
opened Jul 5, 2024 by
ldknight
1 task done
Cannot train DeepSeek models on Ascend cards; is this supported?
npu
This problem is related to NPU devices
pending
This problem is yet to be addressed
#4361
opened Jun 18, 2024 by
sweetning0809
1 task done
Function tool calling inference without llama-factory openai style api.
pending
This problem is yet to be addressed
#4364
opened Jun 18, 2024 by
svjack
1 task done
[Feature request] Support Qwen-VL
pending
This problem is yet to be addressed
#4375
opened Jun 19, 2024 by
marko1616
Summary and questions about training models on NPU
npu
This problem is related to NPU devices
pending
This problem is yet to be addressed
#4388
opened Jun 20, 2024 by
sweetning0809
1 task done
[PPU] Has anyone tested the PPU environment?
pending
This problem is yet to be addressed
#4606
opened Jun 28, 2024 by
willionZS
1 task done
fsdp + DPO + full fine-tuning raises an error
bug
Something isn't working
pending
This problem is yet to be addressed
#4608
opened Jun 28, 2024 by
qy1026
1 task done
RuntimeError: "addmm_impl_cpu_" not implemented for 'Half': error during command-line inference on Huawei 910
npu
This problem is related to NPU devices
pending
This problem is yet to be addressed
#4622
opened Jun 30, 2024 by
apachemycat
1 task done
ValueError: Failed to convert pandas DataFrame to Arrow Table from file
pending
This problem is yet to be addressed
#4650
opened Jul 2, 2024 by
fzp0424
1 task done
qwen2 72b on 910b: inference fails with the weights produced by merging after LoRA
npu
This problem is related to NPU devices
pending
This problem is yet to be addressed
#4659
opened Jul 3, 2024 by
wphtrying
1 task done
Question: during training, can history messages serve only as context without being part of the model's prediction targets?
pending
This problem is yet to be addressed
#4684
opened Jul 4, 2024 by
ylsdamxssjxxdd
1 task done
WARNING:root:Some parameters are on the meta device device because they were offloaded to the cpu.
pending
This problem is yet to be addressed
#4697
opened Jul 5, 2024 by
stromyu520
1 task done
triton.runtime.autotuner.OutOfResources
pending
This problem is yet to be addressed
#4688
opened Jul 5, 2024 by
GitIgnoreMaybe
1 task done
Request logs are printed repeatedly when running LLM inference
pending
This problem is yet to be addressed
#4690
opened Jul 5, 2024 by
caijx168
1 task done
Feature request: is Adam-mini optimizer worth adding?
pending
This problem is yet to be addressed
#4696
opened Jul 5, 2024 by
jim-plus
1 task done
Phi-3-small Different Chat Template
pending
This problem is yet to be addressed
#4712
opened Jul 7, 2024 by
maksimstw
1 task done
Enable Contamination-Free Packaging Method During Pretraining
pending
This problem is yet to be addressed
#4744
opened Jul 9, 2024 by
kostum123
1 task done
Failed to save the GPTQ-quantized weights on Qwen2 72B.
pending
This problem is yet to be addressed
#4737
opened Jul 9, 2024 by
fzp0424
1 task done
If I want to convert the trained model weights from 32-bit to 16-bit
pending
This problem is yet to be addressed
#4731
opened Jul 9, 2024 by
Suiji12
1 task done
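With the glaive dataset, the actual training data contains no observation data and the conversation is not fully used in training; only the first two turns of the dialogue are trained on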
pending
This problem is yet to be addressed
#4729
opened Jul 9, 2024 by
tttonytan
1 task done
How do I export the merged model as int4?
pending
This problem is yet to be addressed
#4728
opened Jul 9, 2024 by
kynow2
1 task done
If I want to convert the trained model weights from 32-bit to 16-bit,
pending
This problem is yet to be addressed
#4719
opened Jul 8, 2024 by
Suiji12
1 task done
About “RuntimeError: 'weight' must be 2-D”
pending
This problem is yet to be addressed
#4718
opened Jul 8, 2024 by
ldknight
1 task done
The Yi template has a problem (simple test code attached)
pending
This problem is yet to be addressed
#4699
opened Jul 6, 2024 by
rangehow
1 task done