MCPcopy
hub / github.com/zai-org/CogView / get_tokenizer

Function get_tokenizer

data_utils/unified_tokenizer.py:198–206  ·  view source on GitHub ↗
(args=None)

Source from the content-addressed store, hash-verified

196 return ret
197
198def get_tokenizer(args=None):
199 if not hasattr(get_tokenizer, 'tokenizer'):
200 # the first time to load the tokenizer, specify img_tokenizer_path
201 get_tokenizer.tokenizer = UnifiedTokenizer(
202 args.img_tokenizer_path,
203 device=torch.cuda.current_device(),
204 img_tokenizer_num_tokens=args.img_tokenizer_num_tokens
205 )
206 return get_tokenizer.tokenizer
207
208class FakeTokenizer(object):
209 def __init__(self, num_tokens):

Callers 15

test_lmdb.pyFile · 0.90
_parse_and_to_tensorFunction · 0.90
get_contextFunction · 0.90
generate_images_onceFunction · 0.90
super_resolutionFunction · 0.90
post_selectionFunction · 0.90
prepare_tokenizerFunction · 0.90
forward_stepFunction · 0.90
get_train_val_test_dataFunction · 0.90
mainFunction · 0.90

Calls 1

UnifiedTokenizerClass · 0.85

Tested by

no test coverage detected