Code
Hub
Workspaces
Connect
Indexed graphs
Engine
MCP
copy
hub
/
github.com/Morizeyao/GPT2-Chinese
/ encode
Method
encode
tokenizations/bpe_tokenizer.py:79–80 ·
view source on GitHub ↗
(self, text)
Source
from the content-addressed store, hash-verified
77
return
word
78
79
def
encode(self, text):
80
return
[self.encoder.get(token, 1)
for
token in self.tokenize(text)]
81
82
def
decode(self, tokens):
83
text =
''
.join([self.decoder[token]
for
token in tokens])
Callers
nothing calls this directly
Calls
1
tokenize
Method · 0.95
Tested by
no test coverage detected