MCPcopy
hub / github.com/microsoft/Cream / no_weight_decay

Method no_weight_decay

EfficientViT/downstream/efficientvit.py:421–422  ·  view source on GitHub ↗
(self)

Source from the content-addressed store, hash-verified

419
420 @torch.jit.ignore
421 def no_weight_decay(self):
422 return {x for x in self.state_dict().keys() if 'attention_biases' in x}
423
424 def train(self, mode=True):
425 """Convert the model into training mode while keep layers freezed."""

Callers

nothing calls this directly

Calls 1

state_dictMethod · 0.45

Tested by

no test coverage detected