MCPcopy Index your code
hub / github.com/deepspeedai/DeepSpeedExamples / SetTokenizer

Method SetTokenizer

Megatron-LM/data_utils/datasets.py:256–263  ·  view source on GitHub ↗
(self, tokenizer)

Source from the content-addressed store, hash-verified

254 self.Y = binarize_labels(self.Y, hard=binarize_sent)
255
256 def SetTokenizer(self, tokenizer):
257 if tokenizer is None:
258 self.using_tokenizer = False
259 if not hasattr(self, '_tokenizer'):
260 self._tokenizer = tokenizer
261 else:
262 self.using_tokenizer = True
263 self._tokenizer = tokenizer
264
265 def GetTokenizer(self):
266 return self._tokenizer

Callers 6

__init__Method · 0.95
make_datasetFunction · 0.45
SetTokenizerMethod · 0.45
SetTokenizerMethod · 0.45
__init__Method · 0.45
__init__Method · 0.45

Calls

no outgoing calls

Tested by

no test coverage detected