[go: nahoru, domu]

Skip to content

Issues: NVIDIA/Megatron-LM

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

[BUG] Bug of expert model parallel stale No activity in 60 days on issue or PR
#766 opened Apr 7, 2024 by 1049451037 updated Jun 30, 2024
[QUESTION] bf16 Parameters and fp32 Gradients stale No activity in 60 days on issue or PR
#800 opened Apr 30, 2024 by pluiez updated Jun 29, 2024
[BUG] @jit_fuser fails with Unknown type constructor Sequence
#880 opened Jun 20, 2024 by Edenzzzz updated Jun 28, 2024
[BUG]Question about helpers.cpp in version core_v0.7.0
#896 opened Jun 28, 2024 by longzhang418 updated Jun 28, 2024
[QUESTION] Does Megatron-LM supports P100?
#849 opened May 29, 2024 by gaokaiz2 updated Jun 28, 2024
[BUG] AttributeError: module 'transformer_engine' has no attribute 'pytorch' stale No activity in 60 days on issue or PR
#696 opened Feb 19, 2024 by zhentingqi updated Jun 27, 2024
[QUESTION] Getting tools/preprocess_data.py to work is painful
#892 opened Jun 26, 2024 by sambar1729 updated Jun 26, 2024
[QUESTION] Has standalone_embedding_stage been supported yet in core?
#890 opened Jun 26, 2024 by JiwenJ updated Jun 26, 2024
[BUG] NCCL TIMEOUT ( maybe ALLREDUCE ? )
#735 opened Mar 14, 2024 by ZhangEnmao updated Jun 25, 2024
How about supporting alternatives to fine-tuning? stale No activity in 60 days on issue or PR
#114 opened Jul 6, 2021 by hwijeen updated Jun 22, 2024
[QUESTION] Why megatron-core seems slower and use more gpu mem than legacy for gpt_pretrain? stale No activity in 60 days on issue or PR
#770 opened Apr 9, 2024 by REIGN12 updated Jun 19, 2024
[QUESTION] Validation loss & PPL keep going up stale No activity in 60 days on issue or PR
#787 opened Apr 20, 2024 by zhentingqi updated Jun 19, 2024
When can we have a the MOE checkpoint convert script.
#790 opened Apr 22, 2024 by shamanez updated Jun 19, 2024
[BUG] Megatron Core example not working
#855 opened Jun 3, 2024 by schheda1 updated Jun 18, 2024
ProTip! Exclude everything labeled bug with -label:bug.