-
Notifications
You must be signed in to change notification settings - Fork 2.7k
Issues: NVIDIA/NeMo
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[HELP] Run into the NaN grad problem while going through the exmaple of official document with fp16
bug
Something isn't working
#12134
opened Feb 11, 2025 by
twotwoiscute
Fail to convert trained checkpoint to HF format
bug
Something isn't working
#12124
opened Feb 10, 2025 by
Zhihan1996
Loss Fails to Converge in Nemo2-sft.ipynb with Precision 16
#12102
opened Feb 8, 2025 by
twotwoiscute
ASR Lhoste dataloader : TypeError: object of type 'IterableDatasetWrapper' has no len()
bug
Something isn't working
#12093
opened Feb 7, 2025 by
AudranBert
AttributeError: 'HFDatasetDataModule' object has no attribute 'tokenizer'
bug
Something isn't working
#12080
opened Feb 6, 2025 by
j40903272
extra_loggers is not used to log metrics or hyperparameters
bug
Something isn't working
#12046
opened Feb 4, 2025 by
chajath
llava-like dataset implementation "LazySupervisedDataset" likely fails to handle large dataset
#12034
opened Feb 3, 2025 by
bernardhan33
cfg
must have tokenizer
config to create a tokenizer !
bug
#12019
opened Feb 2, 2025 by
kirayomato
num_sanity_val_steps too large issue
bug
Something isn't working
#11978
opened Jan 28, 2025 by
shanesyy
Add option for prefetch factor of data loader to config
#11977
opened Jan 28, 2025 by
shengshiqi-google
Megatron BERT Embedding conversion inconsistency
bug
Something isn't working
#11970
opened Jan 28, 2025 by
aditya-malte
Pickling error when trying to save checkpoints with custom checkpointIO
bug
Something isn't working
#11955
opened Jan 24, 2025 by
jdnurme
Gemma 2 NeMo 2.0 to HF conversion bug
bug
Something isn't working
#11951
opened Jan 24, 2025 by
domenVres
MegatronGPTModel trains much worse when reducing micro_batch_size
bug
Something isn't working
#11939
opened Jan 23, 2025 by
m-harmonic
Have a nemo training container without additional framework elements
#11933
opened Jan 23, 2025 by
gabwow
Unserializable Error with using Energon Dataloader for NeVA (LLaVA) pretraining / fine-tuning and NeMo 2.0
bug
Something isn't working
#11931
opened Jan 22, 2025 by
bernardhan33
Installation instruction for conda/pip does not work
bug
Something isn't working
#11929
opened Jan 22, 2025 by
erikchwang
Previous Next
ProTip!
Find all open issues with in progress development work with linked:pr.