Model parallelism on ASR tasks with DP accelerator. #2385

Answered by titu1994
roman-vygon asked this question in Q&A
We do not support any mode other than DDP, because it is the most efficient way to do multi-GPU training. Most of our models are also not picklable, so other distributed training methods would not work anyway.
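To illustrate the picklability point, here is a minimal sketch using only the Python standard library. Launchers that start workers by spawning processes typically serialize the objects handed to each worker with pickle, so a model holding an unpicklable attribute cannot be passed along. `ToyModel` and `can_pickle` are hypothetical names made up for this example; they are not part of NeMo or PyTorch, and the lock is only a stand-in for whatever unpicklable state a real model might hold.

```python
import pickle
import threading


class ToyModel:
    """Hypothetical stand-in for a model with an unpicklable attribute."""

    def __init__(self):
        self.weights = [0.1, 0.2, 0.3]
        # Assumption for illustration: a thread lock plays the role of
        # any unpicklable object a real model might carry around.
        self._lock = threading.Lock()


def can_pickle(obj) -> bool:
    """Return True if obj survives pickle.dumps, False otherwise."""
    try:
        pickle.dumps(obj)
        return True
    except (pickle.PicklingError, TypeError, AttributeError):
        return False


if __name__ == "__main__":
    print("plain list picklable:", can_pickle([0.1, 0.2, 0.3]))
    print("ToyModel picklable:", can_pickle(ToyModel()))
```

Running a check like this on a model before choosing a launch method is a quick way to see whether a pickle-based approach is even an option.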
