-
Notifications
You must be signed in to change notification settings - Fork 3.2k
Issues: hiyouga/LLaMA-Factory
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Enable Contamination-Free Packaging Method During Pretraining
pending
This problem is yet to be addressed
#4744
opened Jul 9, 2024 by
kostum123
1 task done
Faild to save the gptq quantized weight on Qwen2 72B.
pending
This problem is yet to be addressed
#4737
opened Jul 9, 2024 by
fzp0424
1 task done
如果我想把训练后的模型权重由32bit转化为16bit
pending
This problem is yet to be addressed
#4731
opened Jul 9, 2024 by
Suiji12
1 task done
使用glaive数据集,实际训练数据中没有obersation的数据并且conversation没有完全参与训练,只有对话前两段参与训练
pending
This problem is yet to be addressed
#4729
opened Jul 9, 2024 by
tttonytan
1 task done
如何将合并后的模型,导出为int4?
pending
This problem is yet to be addressed
#4728
opened Jul 9, 2024 by
kynow2
1 task done
如果我想把训练后的模型权重由32bit转化为16bit,
pending
This problem is yet to be addressed
#4719
opened Jul 8, 2024 by
Suiji12
1 task done
About “RuntimeError: 'weight' must be 2-D”
pending
This problem is yet to be addressed
#4718
opened Jul 8, 2024 by
ldknight
1 task done
Phi-3-small Different Chat Template
pending
This problem is yet to be addressed
#4712
opened Jul 7, 2024 by
maksimstw
1 task done
同配置和环境下从检查点继续训练OOM
pending
This problem is yet to be addressed
#4710
opened Jul 7, 2024 by
Mr-KenLee
1 task done
间歇性 RuntimeError: CUDA error: an illegal memory access was encounteredCUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect
pending
This problem is yet to be addressed
#4709
opened Jul 7, 2024 by
Gierry
1 task done
使用readme中提供的hugging face预训练数据集报错
pending
This problem is yet to be addressed
#4708
opened Jul 7, 2024 by
xiao-liya
1 task done
怎么实现自建vlm模仿llava进行pt,报错RuntimeError: element 0 of tensors does not require grad and does not have a grad_fn,发现问题在于添加了visual_inputs: true的情况下,stage:pt最后读入的数据不包含图片pixel_value
pending
This problem is yet to be addressed
#4707
opened Jul 7, 2024 by
RONINGOD
1 task done
Plan for Documentation?
pending
This problem is yet to be addressed
#4703
opened Jul 6, 2024 by
sangttruong
Yi的template存在问题(附简单测试代码)
pending
This problem is yet to be addressed
#4699
opened Jul 6, 2024 by
rangehow
1 task done
Invalid device string: 'float32'
pending
This problem is yet to be addressed
#4698
opened Jul 6, 2024 by
OnewayLab
1 task done
WARNING:root:Some parameters are on the meta device device because they were offloaded to the cpu.
pending
This problem is yet to be addressed
#4697
opened Jul 5, 2024 by
stromyu520
1 task done
Feature request: is Adam-mini optimizer worth adding?
pending
This problem is yet to be addressed
#4696
opened Jul 5, 2024 by
jim-plus
1 task done
大模型运行推理的时候,请求日志,打印重复
pending
This problem is yet to be addressed
#4690
opened Jul 5, 2024 by
caijx168
1 task done
glm4 deepspeed lora sft : Cannot copy out of meta tensor; no data!
pending
This problem is yet to be addressed
#4689
opened Jul 5, 2024 by
ldknight
1 task done
triton.runtime.autotuner.OutOfResources
pending
This problem is yet to be addressed
#4688
opened Jul 5, 2024 by
GitIgnoreMaybe
1 task done
疑问:历史消息在训练时可以只作为上文不参与模型的预测吗?~
pending
This problem is yet to be addressed
#4684
opened Jul 4, 2024 by
ylsdamxssjxxdd
1 task done
qwen2 72b 910b lora后merge生成的权重 推理失败
npu
This problem is related to NPU devices
pending
This problem is yet to be addressed
#4659
opened Jul 3, 2024 by
wphtrying
1 task done
ValueError: Failed to convert pandas DataFrame to Arrow Table from file
pending
This problem is yet to be addressed
#4650
opened Jul 2, 2024 by
fzp0424
1 task done
RuntimeError: "addmm_impl_cpu_" not implemented for 'Half' 华为910 命令行推理报错
npu
This problem is related to NPU devices
pending
This problem is yet to be addressed
#4622
opened Jun 30, 2024 by
apachemycat
1 task done
Previous Next
ProTip!
What’s not been updated in a month: updated:<2024-06-09.