How to do quantization for pretrained model? #4739

EvelinaAleksiutenko · 2022-08-15T15:32:18Z

EvelinaAleksiutenko
Aug 15, 2022

I need to do quantization for the pretrained model. After reading the tutorialI try to run the code
snippet:
model = nemo_nlp.models.PunctuationCapitalizationModel.from_pretrained("punctuation_en_distilbert"). The model was loaded correctly.
quantized_model = torch.quantization.quantize_dynamic(model, {torch.nn.Linear}, dtype=torch.qint8)
Here I faced an error:
NotImplementedError: object proxy must define deepcopy()

How can I fix this? How can I actually do quantization for NLP models from NeMo?

TheFunyBunky · 2022-08-15T15:50:54Z

TheFunyBunky
Aug 15, 2022

Having the same issue

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to do quantization for pretrained model? #4739

{{title}}

{{editor}}'s edit

{{editor}}'s edit

Replies: 1 comment

{{title}}

Select a reply

How to do quantization for pretrained model? #4739

EvelinaAleksiutenko Aug 15, 2022

Replies: 1 comment

TheFunyBunky Aug 15, 2022

EvelinaAleksiutenko
Aug 15, 2022

TheFunyBunky
Aug 15, 2022