hub / github.com/hpcaitech/ColossalAI / disable_spec_dec

Method disable_spec_dec

colossalai/inference/core/llm_engine.py:369–375

Disable using speculative decoding for subsequent generations.

(self)

Source from the content-addressed store, hash-verified

        self.use_spec_dec = True

    def disable_spec_dec(self) -> None:
        """Disable using speculative decoding for subsequent generations."""
        self.request_handler.unset_spec_dec_mode()
        # set back to the maximum number of tokens to speculate
        self.n_spec_tokens = self.inference_config.max_n_spec_tokens
        self.use_glide = False
        self.use_spec_dec = False

    def clear_spec_dec(self) -> None:
        """Clear relatable structures of speculative decoding, if exist."""
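
The state changes made by disable_spec_dec can be sketched with stand-in objects. This is a minimal illustration under stated assumptions, not ColossalAI's implementation: FakeConfig, FakeRequestHandler, and FakeEngine are hypothetical names, the starting values are invented for the demonstration, and only the four steps visible in the source above are reproduced.

```python
from dataclasses import dataclass


@dataclass
class FakeConfig:
    # Hypothetical stand-in for the engine's inference_config.
    max_n_spec_tokens: int = 5


class FakeRequestHandler:
    # Hypothetical stand-in; it only records whether spec-dec mode is set.
    def __init__(self) -> None:
        self.spec_dec_mode = True

    def unset_spec_dec_mode(self) -> None:
        self.spec_dec_mode = False


class FakeEngine:
    def __init__(self) -> None:
        self.inference_config = FakeConfig()
        self.request_handler = FakeRequestHandler()
        # Assumed starting state: speculative decoding enabled,
        # with a speculation window smaller than the configured maximum.
        self.n_spec_tokens = 2
        self.use_glide = True
        self.use_spec_dec = True

    def disable_spec_dec(self) -> None:
        # Same four steps as the method shown above.
        self.request_handler.unset_spec_dec_mode()
        self.n_spec_tokens = self.inference_config.max_n_spec_tokens
        self.use_glide = False
        self.use_spec_dec = False


engine = FakeEngine()
engine.disable_spec_dec()
```

After the call in this sketch, engine.n_spec_tokens is back at the configured maximum, both use_glide and use_spec_dec are False, and the handler no longer reports speculative-decoding mode.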

Callers 3

clear_spec_dec · Method · 0.95
check_spec_dec · Function · 0.80
infer · Function · 0.80

Calls 1

unset_spec_dec_mode · Method · 0.80

Tested by 1

check_spec_dec · Function · 0.64