MCPcopy Index your code
hub / github.com/NVIDIA/TensorRT-LLM / has_fp4_kv_cache

Method has_fp4_kv_cache

tensorrt_llm/quantization/mode.py:170–171  ·  view source on GitHub ↗
(self)

Source from the content-addressed store, hash-verified

168 return self._any(self.FP8_KV_CACHE)
169
170 def has_fp4_kv_cache(self):
171 return self._any(self.NVFP4_KV_CACHE)
172
173 def has_kv_cache_quant(self):
174 return (self.has_int8_kv_cache() or self.has_fp8_kv_cache()

Callers 8

has_kv_cache_quantMethod · 0.95
_attn_implMethod · 0.80
update_quant_configMethod · 0.80
__init__Method · 0.80
_create_kv_cache_managerFunction · 0.80
kv_dtypeMethod · 0.80

Calls 1

_anyMethod · 0.95

Tested by

no test coverage detected