-
Notifications
You must be signed in to change notification settings - Fork 2.1k
Issues: NVIDIA/Megatron-LM
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[REGRESSION] MoEs are obtaining higher loss than they should during training
#894
opened Jun 27, 2024 by
kiddyboots216
[QUESTION] Getting tools/preprocess_data.py to work is painful
#892
opened Jun 26, 2024 by
sambar1729
[QUESTION] Sample idx, bin files in public domain for trying out pretrain_gpt.py?
#891
opened Jun 26, 2024 by
sambar1729
[QUESTION] Has standalone_embedding_stage been supported yet in core?
#890
opened Jun 26, 2024 by
JiwenJ
[BUGS] Pipeline Parallelism fails/hangs with Megatron Core example
#881
opened Jun 20, 2024 by
schheda1
[QUESTION]when pretraining bert,meet bug:cuBLAS Error: the requested functionality is not supported
#876
opened Jun 18, 2024 by
shanyuaa
[BUG] the argument of parser.add_argument is wrong in tools/checkpoint/convert.py
#866
opened Jun 14, 2024 by
adoda
[QUESTION] why the _p2p_ops functions has the condition branches for get_pipeline_model_parallel_rank()
#865
opened Jun 14, 2024 by
lichenlu
Previous Next
ProTip!
Find all open issues with in progress development work with linked:pr.