[go: nahoru, domu]

Skip to content

Issues: hiyouga/LLaMA-Factory

🚨FAQs | 常见问题🚨
#4614 opened Jun 28, 2024 by hiyouga
Open
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

WARNING:root:Some parameters are on the meta device device because they were offloaded to the cpu. pending This problem is yet to be addressed
#4697 opened Jul 5, 2024 by stromyu520 updated Jul 5, 2024
1 task done
triton.runtime.autotuner.OutOfResources pending This problem is yet to be addressed
#4688 opened Jul 5, 2024 by GitIgnoreMaybe updated Jul 5, 2024
1 task done
Feature request: is Adam-mini optimizer worth adding? pending This problem is yet to be addressed
#4696 opened Jul 5, 2024 by jim-plus updated Jul 5, 2024
1 task done
glm4 deepspeed lora sft : Cannot copy out of meta tensor; no data! pending This problem is yet to be addressed
#4689 opened Jul 5, 2024 by ldknight updated Jul 5, 2024
1 task done
大模型运行推理的时候,请求日志,打印重复 pending This problem is yet to be addressed
#4690 opened Jul 5, 2024 by caijx168 updated Jul 5, 2024
1 task done
疑问:历史消息在训练时可以只作为上文不参与模型的预测吗?~ pending This problem is yet to be addressed
#4684 opened Jul 4, 2024 by ylsdamxssjxxdd updated Jul 4, 2024
1 task done
llamafactory-cli api 加载Gemma2模型,运行了一段时间后出现 CUDA error: unspecified launch failure pending This problem is yet to be addressed
#4641 opened Jul 2, 2024 by ToviHe updated Jul 4, 2024
1 task done
qwen2 72b 910b lora后merge生成的权重 推理失败 npu This problem is related to NPU devices pending This problem is yet to be addressed
#4659 opened Jul 3, 2024 by wphtrying updated Jul 3, 2024
1 task done
ValueError: Failed to convert pandas DataFrame to Arrow Table from file pending This problem is yet to be addressed
#4650 opened Jul 2, 2024 by fzp0424 updated Jul 2, 2024
1 task done
[PPU]大佬有对ppu环境进行过测试么 pending This problem is yet to be addressed
#4606 opened Jun 28, 2024 by willionZS updated Jul 2, 2024
1 task done
Unable to run model.generate() for MoD model pending This problem is yet to be addressed
#4063 opened Jun 4, 2024 by Zkli-hub updated Jul 2, 2024
1 task done
RuntimeError: "addmm_impl_cpu_" not implemented for 'Half' 华为910 命令行推理报错 npu This problem is related to NPU devices pending This problem is yet to be addressed
#4622 opened Jun 30, 2024 by apachemycat updated Jul 1, 2024
1 task done
🚨FAQs | 常见问题🚨 good first issue Good for newcomers
#4614 opened Jun 28, 2024 by hiyouga updated Jun 29, 2024
fsdp + DPO + fullyfintune会报错 bug Something isn't working pending This problem is yet to be addressed
#4608 opened Jun 28, 2024 by qy1026 updated Jun 29, 2024
1 task done
Phi-3-small exploding gradient issue. pending This problem is yet to be addressed
#3881 opened May 23, 2024 by HideLord updated Jun 27, 2024
1 task done
可以支持英伟达的新模型Nemotron 340B吗? pending This problem is yet to be addressed
#4313 opened Jun 16, 2024 by laosuan updated Jun 24, 2024
1 task done
请问是否会在框架内集成RLOO算法,最新的online RLHF? enhancement New feature or request pending This problem is yet to be addressed
#4287 opened Jun 14, 2024 by ArcherShirou updated Jun 24, 2024
1 task done
关于npu训练模型总结以及疑问 npu This problem is related to NPU devices pending This problem is yet to be addressed
#4388 opened Jun 20, 2024 by sweetning0809 updated Jun 22, 2024
1 task done
Ideas behind sharing parameters of policy model and value model? enhancement New feature or request pending This problem is yet to be addressed
#1563 opened Nov 19, 2023 by MagiaSN updated Jun 22, 2024
sft+freeze训练internlm2-base-7b报错,RuntimeError: element 0 of tensors does not require grad and does not have a grad_fn npu This problem is related to NPU devices pending This problem is yet to be addressed
#4101 opened Jun 6, 2024 by 1737686924 updated Jun 19, 2024
1 task done
昇腾多卡训练问题 npu This problem is related to NPU devices pending This problem is yet to be addressed
#3810 opened May 19, 2024 by 1737686924 updated Jun 19, 2024
1 task done
MODPO: Multi-Objective Direct Preference Optimization enhancement New feature or request pending This problem is yet to be addressed
#3973 opened May 30, 2024 by AlexYoung757 updated Jun 19, 2024
【NPU】GLM-4-9B-Chat PPO 出错 npu This problem is related to NPU devices pending This problem is yet to be addressed
#4135 opened Jun 7, 2024 by hunterhome updated Jun 19, 2024
1 task done
昇腾卡训练不支持offload npu This problem is related to NPU devices pending This problem is yet to be addressed
#4146 opened Jun 7, 2024 by wangbing35 updated Jun 19, 2024
1 task done
在npu上,对大模型用zero3进行全参微调,初始换参数很大,是什么原因造成的? npu This problem is related to NPU devices pending This problem is yet to be addressed
#4272 opened Jun 14, 2024 by fjw1049 updated Jun 19, 2024
1 task done
ProTip! What’s not been updated in a month: updated:<2024-06-05.