hub / github.com/InternLM/lmdeploy / decode

Method decode

lmdeploy/tokenizer.py:211–231 · view source on GitHub ↗

De-tokenize. Args: t: a list of token ids. offset: for incrementally decoding. Default to None, which means not applied. skip_special_tokens: Whether or not to remove special tokens in the decoding. Returns:

(self, t: Sequence[int], offset: int | None = None, skip_special_tokens: bool = True)

Source from the content-addressed store, hash-verified

209	return encoded
210
211	def decode(self, t: Sequence[int], offset: int \| None = None, skip_special_tokens: bool = True):
212	"""De-tokenize.
213
214	Args:
215	t: a list of token ids.
216	offset: for incrementally decoding. Default to None, which
217	means not applied.
218	skip_special_tokens: Whether or not to remove special
219	tokens in the decoding.
220
221	Returns:
222	str: text of decoding tokens.
223	"""
224	t = t[offset:]
225	out_string = self.model.decode(t, skip_special_tokens=skip_special_tokens)
226	if offset:
227	logger = get_logger('lmdeploy')
228	logger.warning('For incrementally detokenization, please try '
229	'detokenize_incrementally function instead.')
230	out_string = self._maybe_add_prefix_space(t, out_string)
231	return out_string
232
233	@staticmethod
234	def _convert_tokens_to_string_with_added_encoders(

Callers 1

indexes_containing_tokenMethod · 0.95

Calls 3

_maybe_add_prefix_spaceMethod · 0.95

get_loggerFunction · 0.90

decodeMethod · 0.45

Tested by

no test coverage detected