Does this library support NLP models such as Transformer #17

ghost · 2020-03-11T00:48:53Z

Hi, I am interested in this work. I want to try this algorithm to accelerate trainning procedure of NLP models. So I want to know if I can directly use this library on NLP models? Thanks!

TimDettmers · 2020-03-11T03:02:15Z

Yes, it should work without any problem. You can just follow the steps of wrapping the transformer into the Masking class and it should work just fine. What is happening in the background is that all weights in the module (and all its sub-modules) are multiplied with a binary mask before each forward pass.

If you apply this to transformers you should make sure though that you keep the layer norm parameters dense. You can achieve this by using the remove_type(torch.nn.LayerNorm) method fo the Masking class.

Let me know if you run into any problems.

nickyi1990 · 2022-10-10T14:40:39Z

Yes, it should work without any problem. You can just follow the steps of wrapping the transformer into the Masking class and it should work just fine. What is happening in the background is that all weights in the module (and all its sub-modules) are multiplied with a binary mask before each forward pass.

If you apply this to transformers you should make sure though that you keep the layer norm parameters dense. You can achieve this by using the remove_type(torch.nn.LayerNorm) method fo the Masking class.

Let me know if you run into any problems.

It's 2022 now, do you get any positive results?

TimDettmers added the enhancement New feature or request label Oct 22, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Does this library support NLP models such as Transformer #17

Does this library support NLP models such as Transformer #17

ghost commented Mar 11, 2020

TimDettmers commented Mar 11, 2020

nickyi1990 commented Oct 10, 2022

Does this library support NLP models such as Transformer #17

Does this library support NLP models such as Transformer #17

Comments

ghost commented Mar 11, 2020

TimDettmers commented Mar 11, 2020

nickyi1990 commented Oct 10, 2022