Why training is not resuming from checkpoint? #3387
muntasir2000
started this conversation in
General
Replies: 1 comment 1 reply
-
Everything looks fine to me. How do you know that it starts from the beginning? from the epoch counter in the logs? |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi, I am training an ASR with this notebook. I have set to train for total 100 epochs. But I have stopped training in the middle when the 40 epoch was running. But now I want to resume the training from 40 epoch and finish the remaining 60. But the training is starting from the beginning. My configuration for the ExpManagerConfig looks like this -
What am I doing wrong? I even see this line in the logs -
Beta Was this translation helpful? Give feedback.
All reactions